BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 022435
(297 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255545672|ref|XP_002513896.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223546982|gb|EEF48479.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 386
Score = 583 bits (1504), Expect = e-164, Method: Compositional matrix adjust.
Identities = 272/297 (91%), Positives = 290/297 (97%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKRLDS GNVIE+RQDGIGAPKI+ PLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIIKKRLDSHGNVIEARQDGIGAPKIENPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+SDEDCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDEDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+LAFQ+DSFNISHKIN+LAFG++FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDSFNISHKINRLAFGDYFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
WTQETPSGMYQYFIKVVPTVYTDVSG+TIQSNQFSVTEHFRS+E GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGYTIQSNQFSVTEHFRSAEAGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKAIKKKMEIGKFS 386
>gi|224082148|ref|XP_002306582.1| predicted protein [Populus trichocarpa]
gi|222856031|gb|EEE93578.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 577 bits (1487), Expect = e-162, Method: Compositional matrix adjust.
Identities = 267/297 (89%), Positives = 289/297 (97%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKRLD GNVIE+RQDGIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIIKKRLDFHGNVIEARQDGIGAPKIEKPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+SDEDCCN+CE+VREAYRKKGWA++NPDL+DQCKREGFLQ+IK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDEDCCNSCEDVREAYRKKGWAVTNPDLMDQCKREGFLQKIKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFNI+HKIN+L FGE+FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNITHKINRLTFGEYFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR ++ GRLQ+LPGVFFFYD
Sbjct: 270 QWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRGTDIGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D FIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDTFIYHGQKAIKKKMEIGKFS 386
>gi|356552872|ref|XP_003544786.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 574 bits (1479), Expect = e-161, Method: Compositional matrix adjust.
Identities = 265/297 (89%), Positives = 288/297 (96%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKRLDS GNVIE+RQ+GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIIKKRLDSHGNVIETRQEGIGAPKIEKPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SD+DCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCN+YGFL
Sbjct: 150 YGAEESDDDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNVYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN+SH IN+LAFGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNLSHHINRLAFGEYFPGVVNPLDNV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR+ + GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRTGDVGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEE+VSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 330 LSPIKVTFTEENVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQRAIKKKMELGKFN 386
>gi|356548103|ref|XP_003542443.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 386
Score = 570 bits (1468), Expect = e-160, Method: Compositional matrix adjust.
Identities = 262/297 (88%), Positives = 287/297 (96%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQ LDVKHDI KKRLDS+GNVIE+RQ+GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQRLDVKHDIIKKRLDSRGNVIETRQEGIGAPKIEKPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG+E SD+DCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCN+YGFL
Sbjct: 150 YGSEVSDDDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNVYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN+SH IN+L FGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNLSHHINRLTFGEYFPGVVNPLDNV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR+ + GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRTGDMGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEE+VSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 330 LSPIKVTFTEENVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQRAIKKKMELGKFN 386
>gi|225459342|ref|XP_002285801.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|302141938|emb|CBI19141.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 568 bits (1464), Expect = e-160, Method: Compositional matrix adjust.
Identities = 261/297 (87%), Positives = 287/297 (96%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDV+HDI KKR+D+ G+VIE+RQDGIG+PKI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVRHDIIKKRIDAHGSVIEARQDGIGSPKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+SD+DCCNNCEEVREAYRKKGWA+SNPDLIDQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDDDCCNNCEEVREAYRKKGWAMSNPDLIDQCKREGFLQRIKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS +HVHD+LAFQ+DSFNISHKIN+LAFG++FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNIHVHDLLAFQKDSFNISHKINRLAFGDYFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q TPSGMYQYFIKVVPTVYT VSGHTI +NQFSVTEHFR++E GRLQ+LPGVFFFYD
Sbjct: 270 QWIQATPSGMYQYFIKVVPTVYTHVSGHTISTNQFSVTEHFRNAELGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYH Q+AIKKKIEIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHSQKAIKKKIEIGKFS 386
>gi|357489473|ref|XP_003615024.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355516359|gb|AES97982.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 386
Score = 562 bits (1449), Expect = e-158, Method: Compositional matrix adjust.
Identities = 257/297 (86%), Positives = 288/297 (96%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDV+HDI KKR+DS GNVIE+RQDGIG+P I+KPLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVRHDIIKKRIDSHGNVIETRQDGIGSPNIEKPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+SDE+CCN+CEEVREAYRKKGWALS+PD IDQCKREGFL+RIKEEEGEGCN+YGFL
Sbjct: 150 YGAEASDEECCNSCEEVREAYRKKGWALSSPDSIDQCKREGFLERIKEEEGEGCNVYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ++SFN+SH IN++AFG++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKESFNLSHHINRIAFGDYFPGVVNPLDRV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
WTQETPSGMYQYFIKVVPT+YTDVSG+TIQSNQFSVTEHFR+++ GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTMYTDVSGNTIQSNQFSVTEHFRTADVGRLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQ+AIKKK+E+GKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQKAIKKKMELGKFS 386
>gi|449465886|ref|XP_004150658.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
gi|449518819|ref|XP_004166433.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 560 bits (1442), Expect = e-157, Method: Compositional matrix adjust.
Identities = 260/297 (87%), Positives = 281/297 (94%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKRLDS GN IE+R DGIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIIKKRLDSHGNAIEARPDGIGAPKIEKPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAES+D+DCCN+CEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK+E+GEGCNIYGFL
Sbjct: 150 FGAESADDDCCNSCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEDGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+LAFQ+DSFNISHKIN+LAFGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDSFNISHKINRLAFGEYFPGVVNPLDSV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W QETPS YQYFIKVVPTVY VSG+TIQSNQFSVTEH R++E GRLQ+LP VFFFYD
Sbjct: 270 QWKQETPSATYQYFIKVVPTVYNSVSGYTIQSNQFSVTEHVRTAEVGRLQSLPAVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+ IKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKVIKKKMEIGKFS 386
>gi|224066933|ref|XP_002302286.1| predicted protein [Populus trichocarpa]
gi|222844012|gb|EEE81559.1| predicted protein [Populus trichocarpa]
Length = 377
Score = 556 bits (1434), Expect = e-156, Method: Compositional matrix adjust.
Identities = 263/297 (88%), Positives = 282/297 (94%), Gaps = 9/297 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKRLDS GNVIESRQDGIGAPKI+KPLQRHGGRLEHNETYC
Sbjct: 90 MDISGEQHLDVKHDIIKKRLDSHGNVIESRQDGIGAPKIEKPLQRHGGRLEHNETYC--- 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
DEDCCN+CEEVREAY+KKGWA++NPDL+DQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 147 ------DEDCCNSCEEVREAYQKKGWAVTNPDLMDQCKREGFLQRIKDEEGEGCNIYGFL 200
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN SHKIN+LAFGE+FPGVVNPLDGV
Sbjct: 201 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNTSHKINRLAFGEYFPGVVNPLDGV 260
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR ++ GRLQ+LPGVFFFYD
Sbjct: 261 QWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRGADIGRLQSLPGVFFFYD 320
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 321 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKAIKKKMEIGKFS 377
>gi|297846654|ref|XP_002891208.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
gi|297337050|gb|EFH67467.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 556 bits (1432), Expect = e-156, Method: Compositional matrix adjust.
Identities = 254/297 (85%), Positives = 282/297 (94%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 386
>gi|238478737|ref|NP_001154394.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|12324714|gb|AAG52317.1|AC021666_6 unknown protein; 24499-21911 [Arabidopsis thaliana]
gi|27808598|gb|AAO24579.1| At1g36050 [Arabidopsis thaliana]
gi|110736190|dbj|BAF00066.1| hypothetical protein [Arabidopsis thaliana]
gi|332193720|gb|AEE31841.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 252/297 (84%), Positives = 280/297 (94%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 90 MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 386
>gi|240254210|ref|NP_564467.5| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332193719|gb|AEE31840.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 489
Score = 544 bits (1401), Expect = e-152, Method: Compositional matrix adjust.
Identities = 248/293 (84%), Positives = 276/293 (94%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 90 MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EI
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEI 382
>gi|38347102|emb|CAE02574.2| OSJNBa0006M15.17 [Oryza sativa Japonica Group]
gi|116309990|emb|CAH67017.1| H0523F07.5 [Oryza sativa Indica Group]
gi|218194960|gb|EEC77387.1| hypothetical protein OsI_16129 [Oryza sativa Indica Group]
Length = 386
Score = 530 bits (1366), Expect = e-148, Method: Compositional matrix adjust.
Identities = 241/297 (81%), Positives = 273/297 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHDIFK+R+D GNVI ++QD +G K+++PLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGGMKVEQPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEGEGCNIYGFL
Sbjct: 150 YGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+SHKINKL+FG+ FPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNVSHKINKLSFGQRFPGVVNPLDGA 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q + GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRSSE GR+Q +PGVFFFYD
Sbjct: 270 QWMQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSSESGRIQAVPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAIKKKMEIGKFN 386
>gi|226494692|ref|NP_001148795.1| LOC100282412 [Zea mays]
gi|194696974|gb|ACF82571.1| unknown [Zea mays]
gi|195622210|gb|ACG32935.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|414586929|tpg|DAA37500.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 386
Score = 524 bits (1350), Expect = e-146, Method: Compositional matrix adjust.
Identities = 237/297 (79%), Positives = 269/297 (90%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDGA 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W Q + GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGVFFFYD
Sbjct: 270 NWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 386
>gi|242076030|ref|XP_002447951.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
gi|241939134|gb|EES12279.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
Length = 386
Score = 524 bits (1349), Expect = e-146, Method: Compositional matrix adjust.
Identities = 237/297 (79%), Positives = 269/297 (90%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDAVGGMKMEAPLQHHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ SD CCN+CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDGQCCNSCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDGA 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W Q + GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGVFFFYD
Sbjct: 270 SWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 386
>gi|6598578|gb|AAF18633.1|AC006228_4 F5J5.4 [Arabidopsis thaliana]
Length = 440
Score = 523 bits (1346), Expect = e-146, Method: Compositional matrix adjust.
Identities = 252/348 (72%), Positives = 280/348 (80%), Gaps = 51/348 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 93 MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 152
Query: 61 YGAES---------------------------SDEDCCNNCEEVREAYRKKGWALSNPDL 93
YGAE+ + DCCN+CE+VREAYRKKGW ++NPDL
Sbjct: 153 YGAEAVIVLSLYLTLWSMVSQLSSEVCFFPVQEEHDCCNSCEDVREAYRKKGWGVTNPDL 212
Query: 94 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
IDQCKREGFLQR+K+EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+D
Sbjct: 213 IDQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKD 272
Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
SFNISHKIN+L +G++FPGVVNPLD V W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSN
Sbjct: 273 SFNISHKINRLTYGDYFPGVVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSN 332
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG----- 268
QFSVTEH +SSE G+LQ+LPGVFFFYDLSPIKVTFTEEH+SFLHFLTNVCAIVGG
Sbjct: 333 QFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGISLIS 392
Query: 269 -------------------VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
VFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 393 IYHNNTCWLTHIKIRNETCVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 440
>gi|225448309|ref|XP_002264644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296085664|emb|CBI29463.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 232/297 (78%), Positives = 271/297 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQH D+KHDI KKR+D+ GNV+ RQDGIG P+I+KPLQRHGGRLEHNE YCGSC
Sbjct: 90 MDISGEQHHDIKHDIVKKRIDAHGNVVAVRQDGIGGPQIEKPLQRHGGRLEHNEKYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE +D+DCCN+C+EVREAYRKKGW ++NPDLIDQCKREGF+Q++KEEEGEGCN+YGFL
Sbjct: 150 YGAEVTDDDCCNSCDEVREAYRKKGWGMTNPDLIDQCKREGFVQKVKEEEGEGCNVYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHF+PGK F+QS +HV+D+LA +D +NISH+INKLAFG+HFPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFSPGKGFYQSNIHVNDLLAISKDGYNISHRINKLAFGDHFPGVVNPLDGA 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q+ P GMYQYFIKVVPT+YTD+ GHTIQSNQFSVTEHFRS+E GR +LPGV+FFYD
Sbjct: 270 QWFQDAPDGMYQYFIKVVPTIYTDIRGHTIQSNQFSVTEHFRSAEPGRPHSLPGVYFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVT EEH SFLHF+TN+CAIVGG+FTVSGIID+F+YHG RAIKKK+E+GKFS
Sbjct: 330 LSPIKVTSKEEHSSFLHFMTNICAIVGGIFTVSGIIDSFVYHGHRAIKKKMELGKFS 386
>gi|224032113|gb|ACN35132.1| unknown [Zea mays]
gi|414586931|tpg|DAA37502.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 391
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 237/302 (78%), Positives = 269/302 (89%), Gaps = 5/302 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ-----CKREGFLQRIKEEEGEGCN 115
YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQ CKREGFLQ IK+EEGEGCN
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQVEPSDCKREGFLQSIKDEEGEGCN 209
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
IYGF+EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVN
Sbjct: 210 IYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVN 269
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
PLDG W Q + GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGV
Sbjct: 270 PLDGANWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGV 329
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGK
Sbjct: 330 FFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGK 389
Query: 296 FS 297
F+
Sbjct: 390 FN 391
>gi|357163897|ref|XP_003579883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 386
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 233/297 (78%), Positives = 263/297 (88%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI ++QD +G K++KPLQ HGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDVFKQRIDANGNVIATKQDAVGGMKVEKPLQMHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE E CCN+CE+VREAYRKKGW +SNPD IDQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAEEPGEQCCNSCEDVREAYRKKGWGVSNPDSIDQCKREGFLQTIKDEEGEGCNIYGFV 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
E+NKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKINKL+FGE FPGVVNPLDG
Sbjct: 210 EINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINKLSFGEPFPGVVNPLDGA 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W Q +P GMYQYF+KVVPTVY+ ++ I SNQFSVTEH RSSE R+Q LPGVFFFYD
Sbjct: 270 HWFQHSPYGMYQYFVKVVPTVYSHINEQIILSNQFSVTEHARSSESVRMQALPGVFFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTE HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAI KK EIGKF+
Sbjct: 330 LSPIKVTFTERHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKREIGKFN 386
>gi|18395087|ref|NP_564162.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|9454530|gb|AAF87853.1|AC073942_7 Contains similarity to a PR00989 protein from Homo sapiens
gi|7959731. EST gb|AI995648 comes from this gene
[Arabidopsis thaliana]
gi|13878151|gb|AAK44153.1|AF370338_1 unknown protein [Arabidopsis thaliana]
gi|21281042|gb|AAM44956.1| unknown protein [Arabidopsis thaliana]
gi|21553754|gb|AAM62847.1| unknown [Arabidopsis thaliana]
gi|332192089|gb|AEE30210.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 386
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 229/297 (77%), Positives = 272/297 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHF PG+SFHQSG HD+L FQ+ ++NISHK+N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHKVNRLAFGDFFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q SG+YQYFIKVVP++YTDV +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKV F E+HV FLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVIFEEQHVEFLHFLTNVCAIVGGIFTVSGIVDSFIYHGQRAIKKKMEIGKFN 386
>gi|297850670|ref|XP_002893216.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
gi|297339058|gb|EFH69475.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
lyrata]
Length = 386
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 228/297 (76%), Positives = 271/297 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHF PG+SFHQSG HD+L FQ+ ++NISH +N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHTVNRLAFGDFFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q SG+YQYFIKVVP++YTDV +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKV F E+HV FLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVIFEEQHVEFLHFLTNVCAIVGGIFTVSGIVDSFIYHGQRAIKKKMEIGKFN 386
>gi|326497521|dbj|BAK05850.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 391
Score = 499 bits (1284), Expect = e-139, Method: Compositional matrix adjust.
Identities = 228/297 (76%), Positives = 262/297 (88%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI ++QD +G K++KPLQ HGGRLEHNETYCGSC
Sbjct: 95 MDISGQEHLDVKHDVFKQRIDAHGNVIATKQDAVGGMKVEKPLQHHGGRLEHNETYCGSC 154
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ S E CCN+CE+VREAYRKKGW +SNPD IDQCK EGFLQ IK+EEGEGCNIYGFL
Sbjct: 155 YGAQESPEQCCNSCEDVREAYRKKGWGVSNPDSIDQCKSEGFLQTIKDEEGEGCNIYGFL 214
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
E+NKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKINKL+FGE FPGV+NPLDG
Sbjct: 215 EINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNLSHKINKLSFGEPFPGVINPLDGA 274
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q + GM QYF+KVVPTVY+ ++ I SNQFSVTEH RS + GR+Q LPGVFFFYD
Sbjct: 275 QWIQHSSYGMAQYFVKVVPTVYSHINEQIILSNQFSVTEHSRSGDSGRVQALPGVFFFYD 334
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LSPIKVTFTE HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAI KK E+GKF+
Sbjct: 335 LSPIKVTFTERHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKRELGKFT 391
>gi|224059030|ref|XP_002299683.1| predicted protein [Populus trichocarpa]
gi|222846941|gb|EEE84488.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 473 bits (1218), Expect = e-131, Method: Compositional matrix adjust.
Identities = 213/297 (71%), Positives = 260/297 (87%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+DISGEQH D++HDI KKR+++ G+VIE RQDGIGAPKIDKPLQ+HGGRLEHNE YCGSC
Sbjct: 90 IDISGEQHHDIRHDITKKRINAHGDVIEVRQDGIGAPKIDKPLQKHGGRLEHNEEYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE SD+ CCN+C+EVREAYRKKGWAL+N DLIDQC REGF+Q IK+EEGEGCNI G L
Sbjct: 150 FGAEMSDDHCCNSCDEVREAYRKKGWALTNMDLIDQCIREGFVQMIKDEEGEGCNINGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVN+VAGNFHF PGKSFHQS + D+L Q++S+NISH+IN+LAFG++FPGVVNPLDG+
Sbjct: 210 EVNRVAGNFHFVPGKSFHQSNFQLLDLLDMQKESYNISHRINRLAFGDYFPGVVNPLDGI 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+ T +G+ Q+FIKVVPT+YTD+ G T+ SNQ+SVTEHF SE RL +LPGV+F YD
Sbjct: 270 QLMHGTQNGVQQFFIKVVPTIYTDIRGRTVHSNQYSVTEHFTKSELMRLDSLPGVYFIYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
SPIKVTF EEH SFLHF+T++CAI+GG+FT++GI+D+FIYHG+RAIKKK+EIGKFS
Sbjct: 330 FSPIKVTFKEEHTSFLHFMTSICAIIGGIFTIAGIVDSFIYHGRRAIKKKMEIGKFS 386
>gi|356512071|ref|XP_003524744.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 431
Score = 473 bits (1216), Expect = e-131, Method: Compositional matrix adjust.
Identities = 214/297 (72%), Positives = 259/297 (87%), Gaps = 2/297 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLD++H+I KKR+D+ GNVIE R+DGIGAPKI++PLQ+HGGRL H+E YCGSC
Sbjct: 137 MDISGEQHLDIRHNIVKKRIDANGNVIEERKDGIGAPKIERPLQKHGGRLGHDEKYCGSC 196
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE SDE CCN+CEEVREAYRKKGWA++N DLIDQC+REG++QR+K+EEGEGCN+ G L
Sbjct: 197 FGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLIDQCQREGYVQRVKDEEGEGCNLQGSL 256
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFA GKSF QS + + D+LA Q + +NISH+INKL+FG HFPG+VNPLDGV
Sbjct: 257 EVNKVAGNFHFATGKSFLQSAIFLADLLALQDNHYNISHRINKLSFGHHFPGLVNPLDGV 316
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q GMYQYFIKVVPT+YTD+ G I SNQ+SVTEHF+SSE G +PGVFFFYD
Sbjct: 317 KWVQGPAHGMYQYFIKVVPTIYTDIRGRVIHSNQYSVTEHFKSSELG--VAVPGVFFFYD 374
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
+SPIKV F EEH+ FLHFLTN+CAI+GGVFTV+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 375 ISPIKVNFKEEHIPFLHFLTNICAIIGGVFTVAGIIDSSIYYGQRTIKRKMELGKFT 431
>gi|363806898|ref|NP_001242045.1| uncharacterized protein LOC100781612 [Glycine max]
gi|255644390|gb|ACU22700.1| unknown [Glycine max]
Length = 384
Score = 472 bits (1215), Expect = e-131, Method: Compositional matrix adjust.
Identities = 214/297 (72%), Positives = 256/297 (86%), Gaps = 2/297 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLD++H+I KKR+D+ GNVIE R+DGIGAPKI+KPLQ+HGGRL H+E YCGSC
Sbjct: 90 MDISGEQHLDIRHNIVKKRIDANGNVIEERKDGIGAPKIEKPLQKHGGRLGHDEKYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE SDE CCN+CEEVREAYRKKGWA++N DLIDQC+REG++QR+K+EEGEGCN+ G L
Sbjct: 150 FGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLIDQCQREGYVQRVKDEEGEGCNLQGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFA GKSF QS + + D+LA Q + +NISH+INKL+FG HFPG+VNPLDGV
Sbjct: 210 EVNKVAGNFHFATGKSFLQSAIFLADVLALQDNHYNISHRINKLSFGHHFPGLVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
RW Q GMYQYFIKVVPT+YTD+ G I SNQ+SVTEHF+SSE G +PGVFFFYD
Sbjct: 270 RWVQGPTHGMYQYFIKVVPTIYTDIRGRVIHSNQYSVTEHFKSSELG--VAVPGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
+SPIKV F EEH FLHFLTN+CAI+GGV V+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 328 ISPIKVNFKEEHTPFLHFLTNICAIIGGVLAVAGIIDSSIYYGQRTIKRKMELGKFT 384
>gi|449449715|ref|XP_004142610.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 385
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 222/298 (74%), Positives = 257/298 (86%), Gaps = 3/298 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKR+D QGNVI+SR DGIG+ +I++PLQ+HGGRL+ NETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIVKKRIDYQGNVIDSRPDGIGSTEIERPLQKHGGRLKQNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA S EDCCN+C++VREAY +KGWALS+PDLIDQCKREGF QR+K EEGEGCNIYGFL
Sbjct: 150 YGA--SGEDCCNSCQDVREAYHRKGWALSHPDLIDQCKREGFFQRVKNEEGEGCNIYGFL 207
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
EVNKVAGNFHFAPG+ F S +H+ LA FQ D+FNISH+IN+L FG+ FPGVVNPLDG
Sbjct: 208 EVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWDAFNISHRINRLTFGDDFPGVVNPLDG 267
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
V+W Q T SGM+QYFIKVVPTVY V+G I+SNQFSVT+H R + Q L GVFFFY
Sbjct: 268 VQWNQGTLSGMFQYFIKVVPTVYKAVNGKAIKSNQFSVTQHLRGIDGESFQALHGVFFFY 327
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
DLSPIKVTFTEEH+SF HFLTNVCAIVGGVFT+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 328 DLSPIKVTFTEEHISFFHFLTNVCAIVGGVFTISGILDSIIYHGQKAIKKKMALGKFT 385
>gi|302790744|ref|XP_002977139.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
gi|302820940|ref|XP_002992135.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300140061|gb|EFJ06790.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
gi|300155115|gb|EFJ21748.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
Length = 386
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 219/297 (73%), Positives = 255/297 (85%), Gaps = 1/297 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESR-QDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD+SGEQHLDVKH+IFKKRLD G V++ Q+ IG PKIDKPLQ+HGGRLEHNETYCGS
Sbjct: 89 MDVSGEQHLDVKHNIFKKRLDPSGKVVQPPVQEDIGGPKIDKPLQKHGGRLEHNETYCGS 148
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
C+GAE SD++CCN+CEEVREAYRK+GWA+ N DLIDQCKREG+L +IKEEEGEGCNIYG
Sbjct: 149 CFGAEQSDDECCNSCEEVREAYRKRGWAIHNADLIDQCKREGWLTKIKEEEGEGCNIYGS 208
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFHFAPGKSF Q VHVHD+ + ++ FN+SH IN+L+FG FPGVVNPLD
Sbjct: 209 LEVNKVAGNFHFAPGKSFSQQHVHVHDVQSLHKEKFNVSHYINELSFGARFPGVVNPLDK 268
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
+ Q+ PS MYQYFIKVVPT YTD++GH I +NQFSVT+HF++ E ++LPGVFFFY
Sbjct: 269 EKRIQKFPSAMYQYFIKVVPTAYTDMTGHKIVTNQFSVTDHFKAVEGLNGRSLPGVFFFY 328
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
+LSPIKV FTE SFLHFLTNVCAI+GGVFTVSGIID+FIYHG RAIKKK+EIGK+
Sbjct: 329 ELSPIKVLFTERKTSFLHFLTNVCAIIGGVFTVSGIIDSFIYHGHRAIKKKMEIGKY 385
>gi|449510462|ref|XP_004163672.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 3-like [Cucumis
sativus]
Length = 385
Score = 469 bits (1207), Expect = e-130, Method: Compositional matrix adjust.
Identities = 221/298 (74%), Positives = 256/298 (85%), Gaps = 3/298 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQHLDVKHDI KKR+D QGNVI+SR DGIG+ +I++PLQ+HGGRL+ NETYCGSC
Sbjct: 90 MDISGEQHLDVKHDIVKKRIDYQGNVIDSRPDGIGSTEIERPLQKHGGRLKQNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA S EDCCN+C++VREAY +KGWALS+PDLIDQCKREGF QR+K EEGEGCNIYGFL
Sbjct: 150 YGA--SGEDCCNSCQDVREAYHRKGWALSHPDLIDQCKREGFFQRVKNEEGEGCNIYGFL 207
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
EVNKVAGNFHFAPG+ F S +H+ LA FQ D+FNISH+IN+L FG+ FPGVVNPLDG
Sbjct: 208 EVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWDAFNISHRINRLTFGDDFPGVVNPLDG 267
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
V+W Q T SGM+QYFIKVVPTVY V+G I+SNQFSVT+H R + Q L G FFFY
Sbjct: 268 VQWNQGTLSGMFQYFIKVVPTVYKAVNGKAIKSNQFSVTQHLRGIDGESFQALHGXFFFY 327
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
DLSPIKVTFTEEH+SF HFLTNVCAIVGGVFT+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 328 DLSPIKVTFTEEHISFFHFLTNVCAIVGGVFTISGILDSIIYHGQKAIKKKMALGKFT 385
>gi|168014180|ref|XP_001759631.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689170|gb|EDQ75543.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 382
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 214/292 (73%), Positives = 250/292 (85%), Gaps = 1/292 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGE HLDVKH+IFKKRLD G VIE +RQ+ I PK+DKPLQ+HGGRLEHNETYCGS
Sbjct: 87 MDISGEAHLDVKHNIFKKRLDVNGKVIEPARQESINQPKLDKPLQKHGGRLEHNETYCGS 146
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
C+GAE+ ++ CCNNCEEVREAYRKKGWAL+NPDLIDQCKREGFLQ+IK+E+GEGCN+YG
Sbjct: 147 CFGAETEEDHCCNNCEEVREAYRKKGWALNNPDLIDQCKREGFLQKIKDEDGEGCNVYGT 206
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LE NKVAGNFHFAPGKSF Q+ +HVHD++AF +DSFN+SHKIN+++FG +PG VNPLD
Sbjct: 207 LEANKVAGNFHFAPGKSFQQANMHVHDLMAFGKDSFNVSHKINEISFGVRYPGAVNPLDK 266
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
+ Q T GMYQYFIKVVPTVYTD G I +NQF+VT+HF+ G LPGVFFFY
Sbjct: 267 LERIQTTTHGMYQYFIKVVPTVYTDTRGRKISTNQFAVTDHFKGVGPGEDHALPGVFFFY 326
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
DLSPIKV FTE+ +SF HFLTNVCAIVGGVF+VSGIIDAF+YHGQ+ IKK++
Sbjct: 327 DLSPIKVKFTEKRMSFFHFLTNVCAIVGGVFSVSGIIDAFVYHGQKQIKKRL 378
>gi|168024878|ref|XP_001764962.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683771|gb|EDQ70178.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 215/298 (72%), Positives = 255/298 (85%), Gaps = 2/298 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGEQHL+V+H+IFKKRLD G V+ + + D I APK+ KPLQ+HGGRLEHNETYCGS
Sbjct: 89 MDISGEQHLNVRHNIFKKRLDVHGKVVNAPKPDAINAPKVQKPLQKHGGRLEHNETYCGS 148
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
C+GAESSD++CCNNCEEVREAYRKKGWAL+N DLIDQC REGF++R+KEE GEGCNIYG
Sbjct: 149 CFGAESSDDECCNNCEEVREAYRKKGWALTNADLIDQCHREGFIERVKEEAGEGCNIYGK 208
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFHFAPGKSF QS +H+ D++ F DSFN+SH IN+L+FG HFPG VNPLD
Sbjct: 209 LEVNKVAGNFHFAPGKSFQQSAMHLLDLMGFITDSFNVSHTINELSFGAHFPGAVNPLDK 268
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
V Q+ +GMYQYFIKVVPTVYTD+ G I +NQFSVTEH+ + + G + +PGVFFFY
Sbjct: 269 VTNIQKDLNGMYQYFIKVVPTVYTDIKGRKISTNQFSVTEHYTAGDHGP-RFVPGVFFFY 327
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
DLSPIKV F+EE SFLHFLTNVCAIVGGV++++GIID+F+YHG RAIKKK+E+GK S
Sbjct: 328 DLSPIKVKFSEERPSFLHFLTNVCAIVGGVYSIAGIIDSFVYHGHRAIKKKMELGKLS 385
>gi|222628979|gb|EEE61111.1| hypothetical protein OsJ_15023 [Oryza sativa Japonica Group]
Length = 369
Score = 466 bits (1198), Expect = e-129, Method: Compositional matrix adjust.
Identities = 221/306 (72%), Positives = 251/306 (82%), Gaps = 25/306 (8%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHDIFK+R+D GNVI ++QD +G N Y G
Sbjct: 80 MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGG----------------NGPYSGMA 123
Query: 61 YGAES---------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
G + SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEG
Sbjct: 124 AGLNTMRPIVALVMSDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEG 183
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGCNIYGFLEVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+SHKINKL+FG+ FP
Sbjct: 184 EGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNVSHKINKLSFGQRFP 243
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
GVVNPLDG +W Q + GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRSSE GR+Q
Sbjct: 244 GVVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSSESGRIQA 303
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
+PGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAIKKK+
Sbjct: 304 VPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAIKKKM 363
Query: 292 EIGKFS 297
EIGKF+
Sbjct: 364 EIGKFN 369
>gi|449438787|ref|XP_004137169.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 386
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 207/296 (69%), Positives = 258/296 (87%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+DISGEQHLD++H+I KKR+D G VIE+R DGIGAPKI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 IDISGEQHLDIRHNIIKKRIDHLGTVIEARPDGIGAPKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE+SD+DCCN+CEEVREAYRKKGWA++N DLIDQC+RE F+Q++K+EEGEGCNI G L
Sbjct: 150 FGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQCQREDFIQKVKDEEGEGCNIEGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAG+FHF PGKSF+QS + +LA Q +N+SH+IN+LAFG H+ G+VNPLDGV
Sbjct: 210 EVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNVSHRINRLAFGNHYDGLVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W + M+QYF+KVVPT+Y ++ G T+ SNQ+SVTEHF+S E G Q++PGVFF+YD
Sbjct: 270 HWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSVTEHFKSVEFGSSQSIPGVFFYYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
LSP+KVT+TEEHV FLHF+T++CAI+GGVF+V+GIIDAFIYHGQR +KKK+EIGKF
Sbjct: 330 LSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIIDAFIYHGQRKMKKKVEIGKF 385
>gi|224073341|ref|XP_002304080.1| predicted protein [Populus trichocarpa]
gi|222841512|gb|EEE79059.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 205/296 (69%), Positives = 255/296 (86%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+DISGEQHLD++HDI KKR+++ G+VIE RQ+GIGAPKID+PLQ HGGRL HNE YCGSC
Sbjct: 90 IDISGEQHLDIRHDISKKRINAHGDVIEVRQEGIGAPKIDRPLQSHGGRLGHNEEYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+G E S +DCCN CEEVREAYR+KGWA++N DLIDQCKREGF+Q IK+EEGEGCNI G L
Sbjct: 150 FGGEMSHDDCCNTCEEVREAYRRKGWAMTNMDLIDQCKREGFIQMIKDEEGEGCNINGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVN+VAG+FHFAP KSFH S + D+L Q+DS+NISH+IN+LAFG++FPGVVNPL G+
Sbjct: 210 EVNRVAGSFHFAPWKSFHLSNFLIQDLLDLQKDSYNISHRINRLAFGDYFPGVVNPLAGI 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+ +TP+G+ Q+FIKVVPT+YTD+ G T+ SNQ+S TEHF+ SE L +LPGV+FFYD
Sbjct: 270 QLMHDTPNGVQQFFIKVVPTIYTDIRGRTVHSNQYSATEHFKKSELTPLDSLPGVYFFYD 329
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV F EEH+SFLHF+T++CAI+GG+FT++GIID+FIY+GQRAI KK+ IGKF
Sbjct: 330 FSPIKVIFKEEHISFLHFMTSICAIIGGIFTIAGIIDSFIYYGQRAITKKVGIGKF 385
>gi|217071774|gb|ACJ84247.1| unknown [Medicago truncatula]
Length = 384
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 206/296 (69%), Positives = 257/296 (86%), Gaps = 2/296 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE+H D+ H+I K+R+D+ G VIE+R++GIGAPKI++PLQ+HGGRLEH+E YCGSC
Sbjct: 90 MDISGERHHDILHNIMKQRIDANGKVIEARKEGIGAPKIERPLQKHGGRLEHDEKYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE SD+ CCNNCEEVREAYRKKGWAL+N DLIDQC+REGF+Q++K+EEGEGCNI+G L
Sbjct: 150 FGAEESDDHCCNNCEEVREAYRKKGWALTNIDLIDQCQREGFVQKVKDEEGEGCNIHGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFA G+SF QS + + D+LA Q + +NISH+INKL+FG H+PG+VNPLDG+
Sbjct: 210 EVNKVAGNFHFATGQSFLQSAIFLTDLLALQDNHYNISHQINKLSFGHHYPGLVNPLDGI 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q GM QYFIKVVPTVYTD+ G I SNQ+SVTEHF+SSE G +PGVFFFYD
Sbjct: 270 KWVQGNDHGMCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELG--AAVPGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
+SPIKV F EEH+ FLHFLTN+CAI+GG+FT++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 328 ISPIKVNFKEEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIGKY 383
>gi|357112459|ref|XP_003558026.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 387
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 211/299 (70%), Positives = 256/299 (85%), Gaps = 4/299 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQH D++HDIFKKR+D GNVIESR+DG+G+PKI++PLQ HGGRL+HNE YCGSC
Sbjct: 89 MDVSGEQHYDIRHDIFKKRIDHLGNVIESRKDGVGSPKIERPLQNHGGRLDHNEAYCGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG+E SD+ CCN+CEEVR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGCNI+GF+
Sbjct: 149 YGSEESDDQCCNSCEEVRDAYRKKGWALTNVESIDQCKREGFVQRLKDEQGEGCNIHGFV 208
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+VNKVAGNFHFAPGK QS + D+L FQ +++NISHKINKL+FG+ FPGVVNPLDGV
Sbjct: 209 DVNKVAGNFHFAPGKHLDQSFNFLQDMLNFQPENYNISHKINKLSFGKEFPGVVNPLDGV 268
Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
W QE +GMYQYF+KVVPT+YTD+ G I SNQFSVTEHFR + G + PGV+F
Sbjct: 269 EWKQEQATGLTGMYQYFVKVVPTIYTDIRGRKIHSNQFSVTEHFREA-IGFPRPPPGVYF 327
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 386
>gi|226498912|ref|NP_001150650.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|194699894|gb|ACF84031.1| unknown [Zea mays]
gi|195640862|gb|ACG39899.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
Length = 387
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 209/299 (69%), Positives = 255/299 (85%), Gaps = 4/299 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQH D++HDI KKR+D GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89 MDVSGEQHYDIRHDIIKKRIDHLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SD+ CCN+CEEVR+AYRKKGWA++N +LIDQCKREG++QR+K+E+GEGC I+GF+
Sbjct: 149 YGAEESDDQCCNSCEEVRDAYRKKGWAVNNVELIDQCKREGYVQRLKDEQGEGCTIHGFV 208
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VNKVAGNFHFAPGKS QS + D+L Q +++NISHKINKL+FGE FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNLQPETYNISHKINKLSFGEEFPGVVNPLDGV 268
Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
W Q+ +GMYQYF+KVVPT+YTD+ G I SNQFSVTEHFR + G + PGV+F
Sbjct: 269 EWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIHSNQFSVTEHFREA-IGYPRPPPGVYF 327
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+E+GK
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMELGKL 386
>gi|108707873|gb|ABF95668.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
Length = 387
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 210/299 (70%), Positives = 257/299 (85%), Gaps = 4/299 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89 MDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG+E SD+ CCN+CE+VR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGC+I+GF+
Sbjct: 149 YGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEIDQCKREGFVQRLKDEQGEGCSIHGFV 208
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VNKVAGNFHFAPGKS QS + D+L FQ++++NISHKINKL+FG FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNFQQENYNISHKINKLSFGVEFPGVVNPLDGV 268
Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
W QE +GMYQYF+KVVPT+YTD+ G I SNQFSVTEHFR + G + PGV+F
Sbjct: 269 EWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKINSNQFSVTEHFREA-IGYPRPPPGVYF 327
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 386
>gi|242035905|ref|XP_002465347.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
gi|241919201|gb|EER92345.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
Length = 387
Score = 450 bits (1158), Expect = e-124, Method: Compositional matrix adjust.
Identities = 206/299 (68%), Positives = 252/299 (84%), Gaps = 4/299 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQH D++HDI KKR+D GNVIESR+D +GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89 MDVSGEQHYDIRHDITKKRIDHLGNVIESRKDRVGAPKIERPLQKHGGRLDHNEVYCGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE +D+ CCN+CEEVR+ YRKKGWA++N +LIDQCKREG++QR+K+E GEGC I+GF+
Sbjct: 149 YGAEETDDQCCNSCEEVRDVYRKKGWAINNVELIDQCKREGYVQRLKDETGEGCTIHGFV 208
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VNKVAGNFHFAPGKS QS + D+L Q +++NISHKINKL+FGE FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNIQPETYNISHKINKLSFGEEFPGVVNPLDGV 268
Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
W Q+ +GMYQYF+KVVPT+YTD+ G I SNQFSVTEHFR + G + PGV+F
Sbjct: 269 EWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIYSNQFSVTEHFREA-IGYPRPPPGVYF 327
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+E+GK
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMELGKL 386
>gi|168004517|ref|XP_001754958.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694062|gb|EDQ80412.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 385
Score = 449 bits (1155), Expect = e-124, Method: Compositional matrix adjust.
Identities = 205/298 (68%), Positives = 249/298 (83%), Gaps = 2/298 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGE HLDV+H+I+KKRLD G +++ + D I APK+ KPLQ+HGGRLE +ETYCGS
Sbjct: 89 MDISGELHLDVRHNIYKKRLDVHGKAVDAPKPDAINAPKVQKPLQKHGGRLEDHETYCGS 148
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
C+GAESSD+ CCN+CEEVREAYRKKGWAL+N DLIDQC REGF++RIKEE GEGCNIYG
Sbjct: 149 CFGAESSDDQCCNSCEEVREAYRKKGWALTNTDLIDQCHREGFIERIKEEAGEGCNIYGK 208
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNF APGKSF QS +H+ D++ F DSFN+SH IN+L+FG +FPG VNPLD
Sbjct: 209 LEVNKVAGNFQIAPGKSFQQSAMHLLDLMGFVTDSFNVSHTINELSFGAYFPGAVNPLDK 268
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
V Q+ +GM+QYFIKVVPTVYTD+ G I +NQFSV EH+ + + G + +PGVFFFY
Sbjct: 269 VTSIQKDQNGMFQYFIKVVPTVYTDIKGRKISTNQFSVMEHYTAGDHGP-RVIPGVFFFY 327
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
DL+PIKV FTEE SFLHFLTNVCAI+GG++T++GI+D+FIYHG RAIKKK+E+GK S
Sbjct: 328 DLTPIKVKFTEERPSFLHFLTNVCAIIGGIYTIAGIVDSFIYHGHRAIKKKMELGKLS 385
>gi|115464597|ref|NP_001055898.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|50080302|gb|AAT69636.1| unknown protein [Oryza sativa Japonica Group]
gi|113579449|dbj|BAF17812.1| Os05g0490200 [Oryza sativa Japonica Group]
gi|218197014|gb|EEC79441.1| hypothetical protein OsI_20422 [Oryza sativa Indica Group]
gi|222632053|gb|EEE64185.1| hypothetical protein OsJ_19017 [Oryza sativa Japonica Group]
Length = 384
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 204/296 (68%), Positives = 248/296 (83%), Gaps = 2/296 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQH D++HDI K+RLD+ GNVIE+R++GIG KI+ PLQ+HGGRL E YCG+C
Sbjct: 90 MDISGEQHHDIRHDIEKRRLDAHGNVIEARKEGIGGAKIESPLQKHGGRLSKGEEYCGTC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++GEGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V+KVAGN HFAPGK F++S ++V ++ A + FNI+HKINKL+FG FPGVVNPLDG
Sbjct: 210 DVSKVAGNLHFAPGKGFYESNINVPELSALEH-GFNITHKINKLSFGTEFPGVVNPLDGA 268
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+WTQ G YQYFIKVVPT+YTD+ G I SNQFSVTEHFR R + PGVFFFYD
Sbjct: 269 QWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNI-RPKPQPGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEENSSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383
>gi|226494401|ref|NP_001141198.1| uncharacterized protein LOC100273285 [Zea mays]
gi|194703210|gb|ACF85689.1| unknown [Zea mays]
gi|238011828|gb|ACR36949.1| unknown [Zea mays]
gi|413945823|gb|AFW78472.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 204/296 (68%), Positives = 245/296 (82%), Gaps = 2/296 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQH D++HDI K RLD+ GNVIE+R+ IG KI++PLQ+HGGRL+ E YCG+C
Sbjct: 90 MDISGEQHQDIRHDIEKIRLDAHGNVIEARKVSIGGAKIERPLQKHGGRLDKGEQYCGTC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V+KVAGNFHFAPGK F++S + V + L+ FNI+HKINKL+FG FPGVVNPLDG
Sbjct: 210 DVSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNITHKINKLSFGTEFPGVVNPLDGA 268
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+WTQ G YQYFIKVVPT+YTD+ GH I SNQFSVTEHFR R + PGVFFFYD
Sbjct: 269 QWTQPASDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNV-RPKPQPGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEESRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383
>gi|242088319|ref|XP_002439992.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
gi|241945277|gb|EES18422.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
Length = 384
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 204/296 (68%), Positives = 247/296 (83%), Gaps = 2/296 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQH D++HDI K+RLDS GNVIE+R++GIG KI++PLQ+HGGRL+ E YCG+C
Sbjct: 90 MDISGEQHHDIRHDIEKRRLDSHGNVIEARKEGIGGAKIERPLQKHGGRLDKGEQYCGTC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V+KVAGNFHFAPGK F++S + V + L+ FNI+HKINKL+FG FPGVVNPLDG
Sbjct: 210 DVSKVAGNFHFAPGKGFYESNIDVPE-LSVLEGGFNITHKINKLSFGTEFPGVVNPLDGA 268
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q G YQYFIKVVPT+YTD+ GH I SNQFSVTEHFR + PGVFFFYD
Sbjct: 269 QWIQPASDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNI-LPKPQPGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEENRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383
>gi|212721670|ref|NP_001132255.1| uncharacterized protein LOC100193691 [Zea mays]
gi|194693892|gb|ACF81030.1| unknown [Zea mays]
gi|223949235|gb|ACN28701.1| unknown [Zea mays]
gi|413949703|gb|AFW82352.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 384
Score = 441 bits (1133), Expect = e-121, Method: Compositional matrix adjust.
Identities = 203/295 (68%), Positives = 246/295 (83%), Gaps = 2/295 (0%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI K+RL+S GNVIE+R++GIG K+++PLQ+HGGRL+ E YCG+CY
Sbjct: 91 DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F+ R+K ++ EGCN+ GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFIDRVKTQQDEGCNVLGFLD 210
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
V+KVAGNFHFAPGK F++S + V + L+ FNISHKINKL+FG FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNISHKINKLSFGTEFPGVVNPLDGAQ 269
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 241
WTQ G YQYFIKVVPT+YTD+ G I SNQFSVTEHFR R ++ PGVFFFYD
Sbjct: 270 WTQPASDGTYQYFIKVVPTIYTDIRGRGIHSNQFSVTEHFRDGNV-RPKSQPGVFFFYDF 328
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 329 SPIKVIFTEENRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383
>gi|326510689|dbj|BAJ87561.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514988|dbj|BAJ99855.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326533080|dbj|BAJ93512.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 383
Score = 439 bits (1130), Expect = e-121, Method: Compositional matrix adjust.
Identities = 206/296 (69%), Positives = 246/296 (83%), Gaps = 5/296 (1%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI KKRLDS GNVIESR++GIG KI+KPLQ+HGGRL E YCG+CY
Sbjct: 91 DISGEQHQDIRHDIEKKRLDSHGNVIESRKEGIGGTKIEKPLQKHGGRLGKGEEYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
V+KVAGNFHFAPGK +++S V + ++ A FNI+HKINKL+FG FPG VNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGYYESNVDMPELSA--EGGFNITHKINKLSFGTEFPGAVNPLDGAQ 268
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
WTQ G YQYFIKVVPT+Y D+ G I SNQFSVTEHFR Q R Q PGVFFFYD
Sbjct: 269 WTQPASDGTYQYFIKVVPTIYNDIRGRKIDSNQFSVTEHFRDGNVQPRPQ--PGVFFFYD 326
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE+ SFLH+LTN+CAIVGG+FTV+GIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 327 FSPIKVIFTEENRSFLHYLTNLCAIVGGIFTVAGIIDSFIYHGQKALKKKMEIGKY 382
>gi|357133202|ref|XP_003568216.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Brachypodium distachyon]
Length = 384
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 205/296 (69%), Positives = 247/296 (83%), Gaps = 4/296 (1%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI KKRL+S GNVIESR++GIG KI++PLQ+HGGRL+ E YCG+CY
Sbjct: 91 DISGEQHQDIRHDIEKKRLNSHGNVIESRKEGIGGAKIERPLQKHGGRLDKGEQYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAE SDE CCN+C+EVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCDEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
V+KVAGNFHFAPG+ F++S V V ++ + + FNI+HKINKL+FG FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGRGFYESNVDVPELSSLE-GGFNITHKINKLSFGTEFPGVVNPLDGAQ 269
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
WTQ G YQYFIKVVPT YTD G I SNQFSVTEHFR R Q PGVFFFYD
Sbjct: 270 WTQPASDGTYQYFIKVVPTNYTDTRGRKIDSNQFSVTEHFRDGNVHPRPQ--PGVFFFYD 327
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
SPIKV FTEE+ SFLH+LTN+CAIVGG+FTVSGIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 328 FSPIKVIFTEENKSFLHYLTNLCAIVGGIFTVSGIIDSFIYHGQKALKKKMEIGKY 383
>gi|168019656|ref|XP_001762360.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686438|gb|EDQ72827.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 380
Score = 437 bits (1123), Expect = e-120, Method: Compositional matrix adjust.
Identities = 204/298 (68%), Positives = 245/298 (82%), Gaps = 7/298 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGEQHL+V+H+IFKKRLD G I++ + D I APK+ +PLQ+HGGRLEHNETYCGS
Sbjct: 89 MDISGEQHLNVRHNIFKKRLDVHGKAIDAPKPDAINAPKVQRPLQKHGGRLEHNETYCGS 148
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
C+GA SSD++CCN+CEEVREAYRKKGWAL N D+IDQC REGF++R+KEE GEGCNIYG
Sbjct: 149 CFGAASSDDECCNSCEEVREAYRKKGWALINIDIIDQCHREGFIERVKEEAGEGCNIYGK 208
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFH APGK F QS +H+ D+L + DSFN+SH +N+L+FG HFPG VNPLD
Sbjct: 209 LEVNKVAGNFHIAPGKLFQQSAMHLLDLLGIRSDSFNVSHIVNELSFGAHFPGRVNPLDK 268
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
+ Q+ +GMYQYFIKVVPTVYTD+ G I +NQFSVTEH+ + + G + +PGVFFFY
Sbjct: 269 ITSIQKDQNGMYQYFIKVVPTVYTDIRGSEIATNQFSVTEHYTAGDHGP-RVVPGVFFFY 327
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
DLSPIKV FTE+ SFLHFLT VCAIVG + IID+FIYHG RA+KKK+E+GKFS
Sbjct: 328 DLSPIKVKFTEKRPSFLHFLTTVCAIVG-----ASIIDSFIYHGHRAVKKKMELGKFS 380
>gi|413945824|gb|AFW78473.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 284
Score = 419 bits (1077), Expect = e-115, Method: Compositional matrix adjust.
Identities = 195/285 (68%), Positives = 235/285 (82%), Gaps = 2/285 (0%)
Query: 12 KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC 71
+HDI K RLD+ GNVIE+R+ IG KI++PLQ+HGGRL+ E YCG+CYGAE SDE CC
Sbjct: 1 RHDIEKIRLDAHGNVIEARKVSIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCC 60
Query: 72 NNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF 131
N+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL+V+KVAGNFHF
Sbjct: 61 NSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHF 120
Query: 132 APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY 191
APGK F++S + V + L+ FNI+HKINKL+FG FPGVVNPLDG +WTQ G Y
Sbjct: 121 APGKGFYESNIDVPE-LSLLEGGFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTY 179
Query: 192 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 251
QYFIKVVPT+YTD+ GH I SNQFSVTEHFR R + PGVFFFYD SPIKV FTEE
Sbjct: 180 QYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNV-RPKPQPGVFFFYDFSPIKVIFTEE 238
Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 239 SRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 283
>gi|79318328|ref|NP_001031077.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|332192090|gb|AEE30211.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 338
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 187/246 (76%), Positives = 224/246 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90 MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHF PG+SFHQSG HD+L FQ+ ++NISHK+N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHKVNRLAFGDFFPGVVNPLDGV 269
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+W Q SG+YQYFIKVVP++YTDV +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329
Query: 241 LSPIKV 246
LSPIKV
Sbjct: 330 LSPIKV 335
>gi|326506194|dbj|BAJ86415.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 363
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 192/277 (69%), Positives = 227/277 (81%), Gaps = 5/277 (1%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI KKRLDS GNVIESR++GIG KI+KPLQ+HGGRL E YCG+CY
Sbjct: 91 DISGEQHQDIRHDIEKKRLDSHGNVIESRKEGIGGTKIEKPLQKHGGRLGKGEEYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
V+KVAGNFHFAPGK +++S V + ++ A FNI+HKINKL+FG FPG VNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGYYESNVDMPELSA--EGGFNITHKINKLSFGTEFPGAVNPLDGAQ 268
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
WTQ G YQYFIKVVPT+Y D+ G I SNQFSVTEHFR Q R Q PGVFFFYD
Sbjct: 269 WTQPASDGTYQYFIKVVPTIYNDIRGRKIDSNQFSVTEHFRDGNVQPRPQ--PGVFFFYD 326
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
SPIKV FTEE+ SFLH+LTN+CAIVGG+FTV+GIID
Sbjct: 327 FSPIKVIFTEENRSFLHYLTNLCAIVGGIFTVAGIID 363
>gi|255578837|ref|XP_002530273.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223530205|gb|EEF32113.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 265
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 183/246 (74%), Positives = 217/246 (88%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDI GEQH D+KH+I KKR+++ G+VIE R++GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 1 MDIMGEQHFDIKHNITKKRINAHGDVIEVRKEGIGAPKIEKPLQRHGGRLEHNETYCGSC 60
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SD+DCCN+C+EVREAYRKKGWAL+ DLIDQCKREGF+Q++K+EEGEGCNIYG L
Sbjct: 61 YGAEMSDDDCCNSCDEVREAYRKKGWALTGVDLIDQCKREGFIQKVKDEEGEGCNIYGSL 120
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHF+PGK HQS + D+L FQ DS+NISH IN+LAFG++FPGVVNPLDGV
Sbjct: 121 EVNKVAGNFHFSPGKGLHQSSFFIQDLLVFQGDSYNISHTINRLAFGDYFPGVVNPLDGV 180
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
W ETP+GM+QYF+KVVPT+YTD+ G T++SNQ+SVTEHF+ SE RL + PGVFFFYD
Sbjct: 181 PWVHETPNGMHQYFLKVVPTIYTDIRGRTVRSNQYSVTEHFKKSEFARLDSPPGVFFFYD 240
Query: 241 LSPIKV 246
SPIKV
Sbjct: 241 FSPIKV 246
>gi|449528843|ref|XP_004171412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Cucumis sativus]
Length = 355
Score = 403 bits (1035), Expect = e-110, Method: Compositional matrix adjust.
Identities = 180/259 (69%), Positives = 226/259 (87%)
Query: 38 KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 97
+I+KPLQ+HGGRLEHNETYCGSC+GAE+SD+DCCN+CEEVREAYRKKGWA++N DLIDQC
Sbjct: 96 EIEKPLQKHGGRLEHNETYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQC 155
Query: 98 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
+RE F+Q++K+EEGEGCNI G LEVNKVAG+FHF PGKSF+QS + +LA Q +N+
Sbjct: 156 QREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNV 215
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH+IN+LAFG H+ G+VNPLDGV W + M+QYF+KVVPT+Y ++ G T+ SNQ+SV
Sbjct: 216 SHRINRLAFGNHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSV 275
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
TEHF+S E G Q++PGVFF+YDLSP+KVT+TEEHV FLHF+T++CAI+GGVF+V+GIID
Sbjct: 276 TEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIID 335
Query: 278 AFIYHGQRAIKKKIEIGKF 296
AFIYHGQR +KKK+EIGKF
Sbjct: 336 AFIYHGQRKMKKKVEIGKF 354
>gi|218192721|gb|EEC75148.1| hypothetical protein OsI_11348 [Oryza sativa Indica Group]
gi|222624836|gb|EEE58968.1| hypothetical protein OsJ_10656 [Oryza sativa Japonica Group]
Length = 355
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 191/299 (63%), Positives = 232/299 (77%), Gaps = 36/299 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89 MDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG+E SD+ CCN+CE+VR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGC+I+GF+
Sbjct: 149 YGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEIDQCKREGFVQRLKDEQGEGCSIHGFV 208
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VNK ISHKINKL+FG FPGVVNPLDGV
Sbjct: 209 NVNK--------------------------------ISHKINKLSFGVEFPGVVNPLDGV 236
Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
W QE +GMYQYF+KVVPT+YTD+ G I SNQFSVTEHFR + G + PGV+F
Sbjct: 237 EWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKINSNQFSVTEHFREA-IGYPRPPPGVYF 295
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK
Sbjct: 296 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 354
>gi|413949704|gb|AFW82353.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 398
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 180/266 (67%), Positives = 217/266 (81%), Gaps = 2/266 (0%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI K+RL+S GNVIE+R++GIG K+++PLQ+HGGRL+ E YCG+CY
Sbjct: 91 DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F+ R+K ++ EGCN+ GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFIDRVKTQQDEGCNVLGFLD 210
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
V+KVAGNFHFAPGK F++S + V + L+ FNISHKINKL+FG FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNISHKINKLSFGTEFPGVVNPLDGAQ 269
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 241
WTQ G YQYFIKVVPT+YTD+ G I SNQFSVTEHFR R ++ PGVFFFYD
Sbjct: 270 WTQPASDGTYQYFIKVVPTIYTDIRGRGIHSNQFSVTEHFRDGNV-RPKSQPGVFFFYDF 328
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVG 267
SPIKV FTEE+ S LH+LTN+CAIVG
Sbjct: 329 SPIKVIFTEENRSLLHYLTNLCAIVG 354
>gi|384252531|gb|EIE26007.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 386
Score = 368 bits (944), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 177/302 (58%), Positives = 231/302 (76%), Gaps = 14/302 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGE HLDV HD++K+RLDS G VI +S + P++D L NET CGS
Sbjct: 90 MDISGEMHLDVDHDVYKRRLDSNGVVIPDSIEKHQVGPELDDTLLHKA-----NETECGS 144
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA + DE+CCNNCEEVR AYR+KGW ++P I QC +EGF+++++ +EGEGC+++G
Sbjct: 145 CYGA-APDEECCNNCEEVRAAYRRKGWGFTDPQQISQCAKEGFVEKLRAQEGEGCHMWGS 203
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
L VNKVAGNFHFAPGKSF Q +HVHD++ FQ +F++SH+I+KL+FG +PG+ NPLD
Sbjct: 204 LAVNKVAGNFHFAPGKSFQQGPMHVHDLVPFQGVTFDLSHRIDKLSFGHEYPGMTNPLDR 263
Query: 180 V---RWTQETPSGM---YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
V ++ P G+ YQYF+KVVPT+Y + HTI SNQ+SVTEHF+ S+ + Q LP
Sbjct: 264 VNLPKFNTRNPQGLPGAYQYFLKVVPTIYVNSHNHTINSNQYSVTEHFKGSQDFQAQ-LP 322
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVFF+YDLSPIKV + E +SFLHFLT+VCAIVGG+FTV+GI+DAFIYHG +AIKKK+++
Sbjct: 323 GVFFYYDLSPIKVKYHETRMSFLHFLTSVCAIVGGIFTVAGIVDAFIYHGHQAIKKKVDL 382
Query: 294 GK 295
GK
Sbjct: 383 GK 384
>gi|215704311|dbj|BAG93745.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 261
Score = 363 bits (931), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 168/259 (64%), Positives = 206/259 (79%), Gaps = 2/259 (0%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQH D++HDI K+RLD+ GNVIE+R++GIG KI+ PLQ+HGGRL E YCG+C
Sbjct: 1 MDISGEQHHDIRHDIEKRRLDAHGNVIEARKEGIGGAKIESPLQKHGGRLSKGEEYCGTC 60
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++GEGCN++GFL
Sbjct: 61 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVHGFL 120
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V+KVAGN HFAPGK F++S ++V ++ A + FNI+HKINKL+FG FPGVVNPLDG
Sbjct: 121 DVSKVAGNLHFAPGKGFYESNINVPELSALEH-GFNITHKINKLSFGTEFPGVVNPLDGA 179
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+WTQ G YQYFIKVVPT+YTD+ G I SNQFSVTEHFR R + PGVFFFYD
Sbjct: 180 QWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNI-RPKPQPGVFFFYD 238
Query: 241 LSPIKVTFTEEHVSFLHFL 259
SPIKV E + + F+
Sbjct: 239 FSPIKVVTMERNSYVVMFI 257
>gi|126291179|ref|XP_001371602.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Monodelphis domestica]
Length = 383
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 166/297 (55%), Positives = 218/297 (73%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H+++K+RLD G + + + ++ K ++ + C SC
Sbjct: 89 MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I +L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRRLSFGEDYPGIVNPLDDT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY VSG ++SNQFSVT H + + G + Q LPGVF
Sbjct: 266 NITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NGLIGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 381
>gi|449265747|gb|EMC76893.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3,
partial [Columba livia]
Length = 330
Score = 353 bits (906), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 172/301 (57%), Positives = 216/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD GN + E + G K+ P R
Sbjct: 36 MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 88
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 89 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 148
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG +PG+VNP
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 208
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LDG T + S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPG
Sbjct: 209 LDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLLGDQGLPG 267
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAIVGG+FTV+G ID+ IYH RAI+KKIE+G
Sbjct: 268 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 327
Query: 295 K 295
K
Sbjct: 328 K 328
>gi|224077228|ref|XP_002191084.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Taeniopygia guttata]
Length = 383
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 170/301 (56%), Positives = 217/301 (72%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++G+Q LDV+H++FK+RLD GN + E + G K+ P R
Sbjct: 89 MDVAGDQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDSIEQCKREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LDG T + S M+QYF+KVVPTVY V G +++NQFSVT+H + + G L Q LPG
Sbjct: 262 LDGTAVTAQQASMMFQYFVKVVPTVYRKVDGEVVRTNQFSVTQHEKIA-NGLLGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HF+T VCAIVGG+FTV+G ID+ IYH RAI+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFVTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|148222292|ref|NP_001091124.1| ERGIC and golgi 3 [Xenopus laevis]
gi|120538715|gb|AAI29573.1| LOC100036873 protein [Xenopus laevis]
Length = 384
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 167/297 (56%), Positives = 216/297 (72%), Gaps = 5/297 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD + S D K+++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADKHELGKLEEHVVLDPKTLDPNR--CESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN+C++VREAYR+KGWA PD I+QCKREGF Q+++E++ EGC IYGFL
Sbjct: 147 YGAETEDFSCCNSCDDVREAYRRKGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQIYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H+I L+FG +PG+VNPLDG
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGRDYPGLVNPLDGT 266
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
S M+QYF+K+VPTVY V G +++NQFSVT H + + G + Q LPGVF
Sbjct: 267 SIVAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLPGVFVL 325
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+ +IDA IYH RAI+KKIE+GK
Sbjct: 326 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVASLIDALIYHSTRAIQKKIELGK 382
>gi|363741418|ref|XP_003642491.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Gallus gallus]
gi|363741445|ref|XP_003642499.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Gallus gallus]
Length = 383
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 170/301 (56%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD GN + E + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LDG T + S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H F HFLT VCAIVGG+FTV+G ID+ IYH RAI+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|13384938|ref|NP_079792.1| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Mus
musculus]
gi|37999778|sp|Q9CQE7.1|ERGI3_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84 homolog
gi|12844094|dbj|BAB26233.1| unnamed protein product [Mus musculus]
gi|12851518|dbj|BAB29073.1| unnamed protein product [Mus musculus]
gi|26341008|dbj|BAC34166.1| unnamed protein product [Mus musculus]
gi|27882157|gb|AAH43720.1| ERGIC and golgi 3 [Mus musculus]
gi|148674217|gb|EDL06164.1| ERGIC and golgi 3, isoform CRA_d [Mus musculus]
Length = 383
Score = 350 bits (898), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|354477966|ref|XP_003501188.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Cricetulus griseus]
gi|344246673|gb|EGW02777.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cricetulus griseus]
Length = 383
Score = 350 bits (898), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVAV-FDPNSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|348564091|ref|XP_003467839.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cavia porcellus]
Length = 383
Score = 350 bits (898), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 168/297 (56%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDLKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 381
>gi|157820783|ref|NP_001100003.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Rattus norvegicus]
gi|149030853|gb|EDL85880.1| ERGIC and golgi 3 (predicted) [Rattus norvegicus]
Length = 383
Score = 350 bits (897), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTV-FDPDSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|284004911|ref|NP_001164802.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Oryctolagus cuniculus]
gi|217038333|gb|ACJ76626.1| serologically defined breast cancer antigen 84 isoform b
(predicted) [Oryctolagus cuniculus]
Length = 383
Score = 348 bits (894), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 167/297 (56%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|359322740|ref|XP_864582.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Canis lupus familiaris]
Length = 383
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 166/297 (55%), Positives = 216/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSL---NPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT+VCAIVGG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTSVCAIVGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|426241390|ref|XP_004014574.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Ovis aries]
Length = 383
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 216/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|395830112|ref|XP_003788179.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Otolemur garnettii]
Length = 383
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 166/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|260815243|ref|XP_002602383.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
gi|229287692|gb|EEN58395.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
Length = 397
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 169/311 (54%), Positives = 222/311 (71%), Gaps = 19/311 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDI+GEQ +DV H++FK+R+D QGN++ E ++ +G P D+ +Q C S
Sbjct: 92 MDIAGEQQIDVDHNLFKRRMDLQGNILDEPEKEDLGDPS-DEFMQAIKKLENKTADVCES 150
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGAE+ D CCN CE+VREAYR+KGWA +NPD I+QCKREG+ +++K+++ EGC +YG+
Sbjct: 151 CYGAETEDLKCCNTCEDVREAYRRKGWAFNNPDTIEQCKREGWSEKLKQQKNEGCQVYGY 210
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVH--------VHDILAFQRDSFNISHKINKLAFGEHFP 171
LEVNKVAGNFHFAPGKSF Q VH VHD+ F + FN+SH +N L+FG P
Sbjct: 211 LEVNKVAGNFHFAPGKSFQQHHVHVSCFYHPIVHDLQPFGGEKFNLSHHVNHLSFGTDIP 270
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQ 226
G VNPLDG + S MYQYF+K+VPT+Y +SG +++NQFSVT+H + S EQ
Sbjct: 271 GRVNPLDGHMVAAKQGSMMYQYFVKIVPTIYKKISGQEVRTNQFSVTKHQKQVTASSGEQ 330
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
G LPGVF Y+LSP+ V FTE+ SF+HFLT VCAIVGGVFTV+G+ID+ IYH RA
Sbjct: 331 G----LPGVFVLYELSPMMVQFTEKQRSFMHFLTGVCAIVGGVFTVAGLIDSLIYHSARA 386
Query: 287 IKKKIEIGKFS 297
I++KI++GK S
Sbjct: 387 IQQKIDLGKAS 397
>gi|417399979|gb|JAA46966.1| Putative copii vesicle protein [Desmodus rotundus]
Length = 383
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKAEMKVFDPNSLDPER------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY + G +++NQFSVT H + + G L Q LPG
Sbjct: 262 LDHTNVTALQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|126291176|ref|XP_001371575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Monodelphis domestica]
Length = 388
Score = 347 bits (890), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 218/302 (72%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H+++K+RLD G + + + ++ K ++ + C SC
Sbjct: 89 MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I +L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRRLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY VSG ++SNQFSVT H + + G + Q LP
Sbjct: 266 PLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NGLIGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKIE+
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIEL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|61555014|gb|AAX46646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 346
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 52 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 104
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 105 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 164
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 165 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 224
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 225 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 283
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 284 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 343
Query: 295 K 295
K
Sbjct: 344 K 344
>gi|296199725|ref|XP_002747286.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Callithrix jacchus]
gi|403281165|ref|XP_003932068.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Saimiri boliviensis boliviensis]
Length = 383
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 166/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|194044515|ref|XP_001929457.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Sus scrofa]
gi|350594868|ref|XP_003483992.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 383
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 216/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVAS-GLMGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|326931697|ref|XP_003211962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Meleagris gallopavo]
Length = 411
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 170/306 (55%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD GN + E + G K+ P R
Sbjct: 112 MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 164
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 165 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 224
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG +P
Sbjct: 225 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGRDYP 284
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLDG T + S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 285 GIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 343
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H F HFLT VCAIVGG+FTV+G ID+ IYH RAI+K
Sbjct: 344 QGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQK 403
Query: 290 KIEIGK 295
KIE+GK
Sbjct: 404 KIELGK 409
>gi|344279905|ref|XP_003411726.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Loxodonta africana]
Length = 386
Score = 347 bits (889), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 92 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 144
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 145 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 204
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 205 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 264
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 265 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 323
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 324 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 383
Query: 295 K 295
K
Sbjct: 384 K 384
>gi|164448602|ref|NP_001029525.2| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
taurus]
gi|75057944|sp|Q5EAE0.1|ERGI3_BOVIN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|59857621|gb|AAX08645.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857623|gb|AAX08646.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|59857741|gb|AAX08705.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
gi|110665562|gb|ABG81427.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 383
Score = 346 bits (888), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|301762088|ref|XP_002916455.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Ailuropoda melanoleuca]
Length = 383
Score = 346 bits (888), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|410953936|ref|XP_003983624.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Felis catus]
Length = 383
Score = 346 bits (888), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|426391505|ref|XP_004062113.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Gorilla gorilla gorilla]
gi|7959731|gb|AAF71038.1|AF116721_14 PRO0989 [Homo sapiens]
Length = 346
Score = 346 bits (888), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 52 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 108
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 109 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 168
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 169 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 228
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 229 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 287
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 288 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 344
>gi|95767501|gb|ABF57305.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 376
Score = 346 bits (888), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 82 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 134
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 135 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 194
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 195 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 254
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 255 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 313
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 314 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 373
Query: 295 K 295
K
Sbjct: 374 K 374
>gi|95767625|gb|ABF57320.1| serologically defined breast cancer antigen 84 [Bos taurus]
Length = 380
Score = 346 bits (887), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 86 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 138
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 139 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 198
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 199 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 258
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 259 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 317
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 318 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 377
Query: 295 K 295
K
Sbjct: 378 K 378
>gi|109092202|ref|XP_001098982.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Macaca mulatta]
Length = 383
Score = 346 bits (887), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLKTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|431894341|gb|ELK04141.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pteropus alecto]
Length = 383
Score = 346 bits (887), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFTQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY + G +++NQFSVT H + + G L Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 321 VFVLYELSPMVVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|335774962|gb|AEH58414.1| endoplasmic reticulum-golgi intermediat compartment protein 3-like
protein [Equus caballus]
Length = 354
Score = 346 bits (887), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 60 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 112
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 113 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 172
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 173 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 232
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 233 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 291
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 292 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 351
Query: 295 K 295
K
Sbjct: 352 K 352
>gi|410262554|gb|JAA19243.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|395510083|ref|XP_003759313.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3, partial [Sarcophilus harrisii]
Length = 335
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 169/306 (55%), Positives = 218/306 (71%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H+++K+RLD G+ + E + G K+ P R
Sbjct: 36 MDVAGEQQLDVEHNLYKQRLDKDGHPVTTEAERHELGKEEEKVFDPSSLDPER------- 88
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 89 CESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 148
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I +L+FGE +P
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRRLSFGEDYP 208
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V+G ++SNQFSVT H + + G +
Sbjct: 209 GIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVNGEVLRSNQFSVTRHEKVA-NGLIGD 267
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 268 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 327
Query: 290 KIEIGK 295
KIE+GK
Sbjct: 328 KIELGK 333
>gi|7706278|ref|NP_057050.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Homo sapiens]
gi|332858219|ref|XP_003316930.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan troglodytes]
gi|397523795|ref|XP_003831904.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Pan paniscus]
gi|37999823|sp|Q9Y282.1|ERGI3_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3; AltName: Full=Serologically defined breast
cancer antigen NY-BR-84
gi|4689108|gb|AAD27763.1|AF077030_1 hypothetical 43.2 kDa protein [Homo sapiens]
gi|4929577|gb|AAD34049.1|AF151812_1 CGI-54 protein [Homo sapiens]
gi|7671663|emb|CAB89412.1| ERGIC and golgi 3 [Homo sapiens]
gi|14602515|gb|AAH09765.1| ERGIC and golgi 3 [Homo sapiens]
gi|15559308|gb|AAH14014.1| ERGIC and golgi 3 [Homo sapiens]
gi|119596605|gb|EAW76199.1| ERGIC and golgi 3, isoform CRA_a [Homo sapiens]
gi|124249802|gb|ABM92879.1| endoplasmic reticulum-localized protein ERp43 [Homo sapiens]
gi|312152490|gb|ADQ32757.1| ERGIC and golgi 3 [synthetic construct]
gi|380785591|gb|AFE64671.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|383419067|gb|AFH32747.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|384947602|gb|AFI37406.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform b [Macaca mulatta]
gi|410342895|gb|JAA40394.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|327271489|ref|XP_003220520.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Anolis carolinensis]
Length = 383
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/301 (55%), Positives = 214/301 (71%), Gaps = 14/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + E + G I P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHIIKHLSFGRDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LDG + + S M+QYF+KVVPT+Y V G +++NQFSVT H + + G + Q LPG
Sbjct: 262 LDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA-NGLIGDQGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|363741420|ref|XP_003642492.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Gallus gallus]
gi|363741447|ref|XP_003642500.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Gallus gallus]
Length = 388
Score = 344 bits (883), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 170/306 (55%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD GN + E + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGRDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLDG T + S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H F HFLT VCAIVGG+FTV+G ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KIE+GK
Sbjct: 381 KIELGK 386
>gi|354477968|ref|XP_003501189.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Cricetulus griseus]
Length = 388
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 169/302 (55%), Positives = 218/302 (72%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVAV-FDPNSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|197100234|ref|NP_001126130.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pongo abelii]
gi|75041559|sp|Q5R8G3.1|ERGI3_PONAB RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|55730450|emb|CAH91947.1| hypothetical protein [Pongo abelii]
Length = 383
Score = 344 bits (882), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 164/297 (55%), Positives = 214/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VRE YR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVRETYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|410218732|gb|JAA06585.1| ERGIC and golgi 3 [Pan troglodytes]
Length = 383
Score = 343 bits (881), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/297 (55%), Positives = 214/297 (72%), Gaps = 6/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++F +RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFNQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381
>gi|47575764|ref|NP_001001226.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus (Silurana) tropicalis]
gi|82185697|sp|Q6NVS2.1|ERGI3_XENTR RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|45708932|gb|AAH67932.1| ERGIC and golgi 3 [Xenopus (Silurana) tropicalis]
Length = 384
Score = 343 bits (881), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 163/297 (54%), Positives = 215/297 (72%), Gaps = 5/297 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD + S D K ++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADRHELGKSEEHVVFDPKSLDPNR--CESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN C++VREAYR++GWA PD I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETDDFSCCNTCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQVYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H+I L+FG +PG+VNPLDG
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIRHLSFGRDYPGLVNPLDGS 266
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
S M+QYF+K+VPTVY V G +++NQFSVT H + + G + Q LPGVF
Sbjct: 267 SVAAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLPGVFVL 325
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+ID+ +Y+ RAI+KKIE+GK
Sbjct: 326 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLVYYSTRAIQKKIELGK 382
>gi|348521804|ref|XP_003448416.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Oreochromis niloticus]
Length = 384
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 163/299 (54%), Positives = 214/299 (71%), Gaps = 5/299 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD + + + K D L+ + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELGKADDGEVFDPSTLDPDR--CESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN C++VREAYR++GWA + D I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIEQCKREGFTQKMQEQKNEGCQVYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG+ +PG+VNPLDG
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHLIKHLSFGKDYPGLVNPLDGT 266
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S MYQYF+K+VPT+Y G +++NQFSVT H + + G + Q LPGVF
Sbjct: 267 DVTAPQASMMYQYFVKIVPTIYMKTDGEVVKTNQFSVTRHEKVA-NGLIGDQGLPGVFVL 325
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+KKIE+GK S
Sbjct: 326 YELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKTS 384
>gi|148225661|ref|NP_001087591.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Xenopus laevis]
gi|82181499|sp|Q66KH2.1|ERGI3_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|51513379|gb|AAH80394.1| MGC83277 protein [Xenopus laevis]
Length = 389
Score = 342 bits (878), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 218/302 (72%), Gaps = 10/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD + S D K ++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDLDKKPVTSEADRHELGKSEEQVVFDPKTLDPNR--CESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN+C++VREAYR+KGWA PD I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETDDFSCCNSCDDVREAYRRKGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQVYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H+I L+FG+ +PG+VN
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHEIKHLSFGKDYPGLVN 266
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLDG S M+QYF+K+VPTVY V G +++NQFSVT H + + G + Q LP
Sbjct: 267 PLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLP 325
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IY+ RAI+KKIE+
Sbjct: 326 GVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYYSTRAIQKKIEL 385
Query: 294 GK 295
GK
Sbjct: 386 GK 387
>gi|387015776|gb|AFJ50007.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3-like
[Crotalus adamanteus]
Length = 372
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 168/297 (56%), Positives = 218/297 (73%), Gaps = 17/297 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD +D +G ++ L + L+ C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLD---------KDELGK---EEELFFNPNSLDPER--CESC 134
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCNNC++VREAYR++GWA NPD I+QCKREGF ++++E++ EGC +YGFL
Sbjct: 135 YGAESEDIKCCNNCDDVREAYRRRGWAFKNPDTIEQCKREGFSEKMQEQKNEGCKVYGFL 194
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ ++ D+ NI+H I L+FG+ +PG+VNPLDG
Sbjct: 195 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSYGLDNINITHFIRHLSFGKDYPGLVNPLDGT 254
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPGVF
Sbjct: 255 IVTAHQASMMFQYFVKVVPTVYMKVDGEMVRTNQFSVTRHEKIA-NGLIGDQGLPGVFVL 313
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH RAI+KKIE+GK
Sbjct: 314 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARAIQKKIELGK 370
>gi|359322742|ref|XP_851879.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Canis lupus familiaris]
Length = 388
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 216/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSL---NPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LP
Sbjct: 266 PLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT+VCAIVGG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|395830114|ref|XP_003788180.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Otolemur garnettii]
gi|197215642|gb|ACH53034.1| ERGIC and golgi 3 (predicted) [Otolemur garnettii]
Length = 388
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|190402265|gb|ACE77675.1| ERGIC and golgi 3 (predicted) [Sorex araneus]
Length = 388
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 167/302 (55%), Positives = 217/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + KI+ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGVPVSSEAERHELGKIEVKV-FDPDSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCQREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LP
Sbjct: 266 PLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|259155256|ref|NP_001158869.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
gi|223647782|gb|ACN10649.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Salmo salar]
Length = 388
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/308 (54%), Positives = 218/308 (70%), Gaps = 19/308 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPK--IDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD GN + E+ + +G + I P + R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGNPVTTEAEKHDLGQEEGEIFDPSKLDPER------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC I
Sbjct: 142 CESCYGAETEDLKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQI 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIKHLSFGRDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLDG S MYQYF+K+VPT+Y G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDGTDVAAPQASMMYQYFVKIVPTIYVKWDGEVVKTNQFSVTRHEKVA-NGLIGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V FTE+ SF HFLT VCAIVGGVFTV+G+ID+ IYH +AI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIVGGVFTVAGLIDSLIYHSAKAIQK 380
Query: 290 KIEIGKFS 297
KIE+GK S
Sbjct: 381 KIELGKAS 388
>gi|426241392|ref|XP_004014575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Ovis aries]
Length = 388
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/306 (54%), Positives = 216/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|296199723|ref|XP_002747285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Callithrix jacchus]
gi|403281167|ref|XP_003932069.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Saimiri boliviensis boliviensis]
gi|166831592|gb|ABY90117.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Callithrix jacchus]
Length = 388
Score = 341 bits (874), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 166/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|440797665|gb|ELR18746.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 383
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/297 (55%), Positives = 211/297 (71%), Gaps = 8/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE LDV+H+IFKKRL + G + + + A P G LE E CGSC
Sbjct: 91 MDVSGEHQLDVEHNIFKKRLAADGRPLGIEKGELEAAATPSP----GQELEPIE--CGSC 144
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG+E CCN C EVRE+YRKKGWA ++P+ I+QC REGF + +++++GEGC +YG +
Sbjct: 145 YGSEQEPGQCCNTCAEVRESYRKKGWAFAHPESIEQCAREGFSENLEKQKGEGCQVYGHI 204
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VNKVAGNFHFAPGKSF +HVHD+ F+ S+NISH+IN+++FG+ FPGV+NPLDGV
Sbjct: 205 LVNKVAGNFHFAPGKSFQAHHMHVHDLQPFRMSSWNISHRINRISFGKEFPGVINPLDGV 264
Query: 181 RWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
T + +G MYQYF+K+VPT+Y + G+ I +NQFSVTEH R G LPG+F
Sbjct: 265 EKTTDPGAGSAMYQYFVKIVPTIYESLDGNVINTNQFSVTEHTRMLPPGDKSGLPGLFVM 324
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
YDLSPI V FTE SF HFLT VCAI+GGVFTV+GIID+ IY+ R + KK+E+GK
Sbjct: 325 YDLSPIMVKFTERTKSFAHFLTGVCAIIGGVFTVAGIIDSLIYNSLRTLGKKMELGK 381
>gi|344279907|ref|XP_003411727.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Loxodonta africana]
Length = 391
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 92 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 144
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 145 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 204
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 205 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 264
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 265 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 323
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 324 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 383
Query: 290 KIEIGK 295
KI++GK
Sbjct: 384 KIDLGK 389
>gi|194044517|ref|XP_001929458.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Sus scrofa]
gi|350594870|ref|XP_003483993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Sus scrofa]
Length = 388
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 216/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-SGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|229368723|gb|ACQ63006.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Dasypus novemcinctus]
Length = 388
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|41055991|ref|NP_957309.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 2 [Danio rerio]
gi|82210123|sp|Q803I2.1|ERGI3_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|28278376|gb|AAH44474.1| ERGIC and golgi 3 [Danio rerio]
gi|182890166|gb|AAI64701.1| Ergic3 protein [Danio rerio]
Length = 383
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 163/308 (52%), Positives = 214/308 (69%), Gaps = 24/308 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESR---------QDGIGAPKIDKPLQRHGGRLE 51
MD++GEQ LDV+H++FK+RLD G + + ++G+ P P +
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGQPVTTEAEKHDLGKEEEGVFDPSTLDPDR------- 141
Query: 52 HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
C SCYGAE+ D CCN C++VREAYR++GWA PD I+QCKREGF Q+++E++
Sbjct: 142 -----CESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTIEQCKREGFSQKMQEQKN 196
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG+ +P
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHFIKHLSFGKDYP 256
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD S MYQYF+K+VPT+Y G +++NQFSVT H + + G +
Sbjct: 257 GIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVKTNQFSVTRHEKIA-NGLIGD 315
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V FTE+ SF HFLT VCAI+GGVFTV+G+ID+ IYH RAI+K
Sbjct: 316 QGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARAIQK 375
Query: 290 KIEIGKFS 297
KIE+GK S
Sbjct: 376 KIELGKAS 383
>gi|281346059|gb|EFB21643.1| hypothetical protein PANDA_004535 [Ailuropoda melanoleuca]
Length = 387
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|410953938|ref|XP_003983625.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Felis catus]
Length = 388
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|109092200|ref|XP_001098885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Macaca mulatta]
Length = 388
Score = 340 bits (872), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 165/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLKTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|38327615|ref|NP_938408.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Homo sapiens]
gi|281182526|ref|NP_001162565.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Papio anubis]
gi|397523797|ref|XP_003831905.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan paniscus]
gi|410055053|ref|XP_003953764.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Pan troglodytes]
gi|57208593|emb|CAI42842.1| ERGIC and golgi 3 [Homo sapiens]
gi|164623746|gb|ABY64672.1| ERGIC and golgi 3, isoform 1 (predicted) [Papio anubis]
gi|380785589|gb|AFE64670.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform a [Macaca mulatta]
Length = 388
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 165/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|301762086|ref|XP_002916454.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Ailuropoda melanoleuca]
Length = 388
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|184185558|gb|ACC68956.1| serologically defined breast cancer antigen 84 isoform a
(predicted) [Rhinolophus ferrumequinum]
Length = 388
Score = 340 bits (871), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLD T S M+QYF+KVVPTVY + G +++NQFSVT H + + G L
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380
Query: 290 KIEIGK 295
KI++GK
Sbjct: 381 KIDLGK 386
>gi|75077200|sp|Q4R8X1.1|ERGI3_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 3
gi|67967936|dbj|BAE00450.1| unnamed protein product [Macaca fascicularis]
Length = 382
Score = 340 bits (871), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 7/297 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + G + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGTPVSSEAERHELGKVEVTV---FGPDSLDPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++G A NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRG-AFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 204
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 205 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 264
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 265 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 323
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 324 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 380
>gi|22760064|dbj|BAC11054.1| unnamed protein product [Homo sapiens]
Length = 388
Score = 340 bits (871), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 165/302 (54%), Positives = 214/302 (70%), Gaps = 11/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D N++H I L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDDINMTHYIQHLSFGEDYPGIVN 265
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384
Query: 294 GK 295
GK
Sbjct: 385 GK 386
>gi|334310895|ref|XP_003339551.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Monodelphis domestica]
Length = 396
Score = 340 bits (871), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 166/310 (53%), Positives = 218/310 (70%), Gaps = 19/310 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H+++K+RLD G + + + ++ K ++ + C SC
Sbjct: 89 MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS--------FNISHKINKLAFG 167
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I +L+FG
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVVLCWYLQINMTHYIRRLSFG 265
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
E +PG+VNPLD T S M+QYF+KVVPTVY VSG ++SNQFSVT H + + G
Sbjct: 266 EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NG 324
Query: 228 RL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
+ Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH R
Sbjct: 325 LIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSAR 384
Query: 286 AIKKKIEIGK 295
AI+KKIE+GK
Sbjct: 385 AIQKKIELGK 394
>gi|432101449|gb|ELK29631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Myotis davidii]
Length = 391
Score = 339 bits (869), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 165/305 (54%), Positives = 215/305 (70%), Gaps = 14/305 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + H C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEMKVFDPDSLDPHR---CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--------FNISHKINKLAFGEHFPG 172
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNVCTRCCLQINMTHYIRHLSFGEDYPG 265
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--Q 230
+VNPLD T S M+QYF+KVVPTVY + G +++NQFSVT H + + G L Q
Sbjct: 266 IVNPLDRTNVTALQASMMFQYFVKVVPTVYMKLDGQVLRTNQFSVTRHEKVA-NGLLGDQ 324
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KK
Sbjct: 325 GLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKK 384
Query: 291 IEIGK 295
I++GK
Sbjct: 385 IDLGK 389
>gi|410926566|ref|XP_003976749.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Takifugu rubripes]
Length = 384
Score = 338 bits (868), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 163/301 (54%), Positives = 215/301 (71%), Gaps = 9/301 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD++GEQ LDV+H++FK+RLD Q E+ + +G D P+ + C
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELGGED-DVPVFDPSTL---DPERCE 144
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGAE+ D CCN+C++VREAYR++GWA N D I+QCKREGF Q+++E++ EGC +YG
Sbjct: 145 SCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQVYG 204
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
LEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FG+ +PG++NPLD
Sbjct: 205 VLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHLIRHLSFGQDYPGLINPLD 264
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVF 236
T S MYQYF+K+VPT+Y G +++NQFSVT H + + G + Q LPGVF
Sbjct: 265 DTNITAPQASMMYQYFVKIVPTIYVKTDGEVLKTNQFSVTRHEKVA-NGLIGDQGLPGVF 323
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+KKIE+GK
Sbjct: 324 VLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKA 383
Query: 297 S 297
S
Sbjct: 384 S 384
>gi|327271491|ref|XP_003220521.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Anolis carolinensis]
Length = 388
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 167/306 (54%), Positives = 214/306 (69%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + E + G I P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHIIKHLSFGRDYP 261
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
G+VNPLDG + + S M+QYF+KVVPT+Y V G +++NQFSVT H + + G +
Sbjct: 262 GIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 320
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQK 380
Query: 290 KIEIGK 295
KIE+GK
Sbjct: 381 KIELGK 386
>gi|291231388|ref|XP_002735646.1| PREDICTED: serologically defined breast cancer antigen 84-like,
partial [Saccoglossus kowalevskii]
Length = 358
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 160/301 (53%), Positives = 212/301 (70%), Gaps = 9/301 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV H+I K R+D G + + + K ++ +L+ + C SC
Sbjct: 59 MDVAGEQQLDVDHNIMKSRIDKNGKPVATPEKEDIGDKSEEAKDFDVNKLDPDR--CESC 116
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN CE+VREAYR+KGWA +N D I QC REG+ ++K + GEGC +YG L
Sbjct: 117 YGAESKDLKCCNTCEDVREAYRRKGWAFNNADGIAQCSREGWSDKLKSQSGEGCQVYGHL 176
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF Q VHVHD+ AF + FN+SH+IN L+FG +PG+ NPLD
Sbjct: 177 EVNKVAGNFHFAPGKSFQQHHVHVHDLQAFSGEKFNLSHRINHLSFGHKYPGMENPLDNS 236
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR------SSEQGRLQTLPG 234
+ T + S MYQYF+K+VPT YT ++G T +SNQ+SVT+H + +S G LPG
Sbjct: 237 KVTSQKASIMYQYFVKIVPTTYTKLNGATTRSNQYSVTKHEKVVSTSLASAAGE-HGLPG 295
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+ +P+ V +TE+H SF+HF+T VCAI+GGVFTV+G+ID+ IYH +AIKKKI++G
Sbjct: 296 VFILYEFAPLMVKYTEKHRSFMHFMTGVCAIIGGVFTVAGLIDSMIYHSSKAIKKKIDLG 355
Query: 295 K 295
K
Sbjct: 356 K 356
>gi|34849462|gb|AAH57130.1| Ergic3 protein [Mus musculus]
Length = 394
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 169/308 (54%), Positives = 218/308 (70%), Gaps = 17/308 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS------FNISHKINKLAFGEH 169
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FGE
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNPSDCLQINMTHYIKHLSFGED 265
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
+PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L
Sbjct: 266 YPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLL 324
Query: 230 --QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI
Sbjct: 325 GDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAI 384
Query: 288 KKKIEIGK 295
+KKI++GK
Sbjct: 385 QKKIDLGK 392
>gi|348521802|ref|XP_003448415.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Oreochromis niloticus]
Length = 389
Score = 337 bits (864), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 163/304 (53%), Positives = 214/304 (70%), Gaps = 10/304 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD + + + K D L+ + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELGKADDGEVFDPSTLDPDR--CESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN C++VREAYR++GWA + D I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIEQCKREGFTQKMQEQKNEGCQVYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
EVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG+ +PG+VN
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIKHLSFGKDYPGLVN 266
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLDG T S MYQYF+K+VPT+Y G +++NQFSVT H + + G + Q LP
Sbjct: 267 PLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVVKTNQFSVTRHEKVA-NGLIGDQGLP 325
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+KKIE+
Sbjct: 326 GVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIEL 385
Query: 294 GKFS 297
GK S
Sbjct: 386 GKTS 389
>gi|57208594|emb|CAI42843.1| ERGIC and golgi 3 [Homo sapiens]
Length = 396
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 165/312 (52%), Positives = 215/312 (68%), Gaps = 21/312 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 87 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 143
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 144 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 203
Query: 121 EVNKVAGNFHFAPGKSFHQSGVH---------------VHDILAFQRDSFNISHKINKLA 165
EVNKVAGNFHFAPGKSF QS VH VHD+ +F D+ N++H I L+
Sbjct: 204 EVNKVAGNFHFAPGKSFQQSHVHGCVCRLKMIARSLACVHDLQSFGLDNINMTHYIQHLS 263
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + +
Sbjct: 264 FGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA- 322
Query: 226 QGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH
Sbjct: 323 NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHS 382
Query: 284 QRAIKKKIEIGK 295
RAI+KKI++GK
Sbjct: 383 ARAIQKKIDLGK 394
>gi|196008679|ref|XP_002114205.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
gi|190583224|gb|EDV23295.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
Length = 369
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 162/303 (53%), Positives = 213/303 (70%), Gaps = 31/303 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++G Q LD+K ++ K+R+D G KP G ++ N+T CGSC
Sbjct: 92 MDVAGMQQLDIKQNLMKRRIDENG----------------KPT---GDAVQKNKTKCGSC 132
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+++ CCN+CE+VREAYRKKGWAL++P+ I+QC+ EG+ Q +KE+E EGCN++G+L
Sbjct: 133 YGAENAEMKCCNSCEDVREAYRKKGWALTSPEGIEQCQEEGWAQMLKEQEKEGCNVFGYL 192
Query: 121 EVNKV-AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
EVNKV AGNFHFAPGKSF Q VHVHD+ +F FN SH I+KL+FGE FPG++NPLDG
Sbjct: 193 EVNKVVAGNFHFAPGKSFQQHRVHVHDLQSFGSRKFNTSHTIHKLSFGEEFPGIINPLDG 252
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPG 234
R + + S MYQYFIKVVPTVY + G ++SNQ+SVT+H + EQG LPG
Sbjct: 253 HRMSSDQDSAMYQYFIKVVPTVYKKLKGEEVKSNQYSVTKHLKYIKLSMGEQG----LPG 308
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ + + E SF HFLT VCAI+GGVFTV+ +IDA +YH + + KIE+G
Sbjct: 309 VFISYELSPMIIRYAERRKSFAHFLTGVCAIIGGVFTVASLIDAMVYHSAKML--KIELG 366
Query: 295 KFS 297
K S
Sbjct: 367 KAS 369
>gi|74315943|ref|NP_001028277.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
isoform 1 [Danio rerio]
gi|72679324|gb|AAI00126.1| ERGIC and golgi 3 [Danio rerio]
Length = 388
Score = 334 bits (856), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 163/313 (52%), Positives = 214/313 (68%), Gaps = 29/313 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESR---------QDGIGAPKIDKPLQRHGGRLE 51
MD++GEQ LDV+H++FK+RLD G + + ++G+ P P +
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGQPVTTEAEKHDLGKEEEGVFDPSTLDPDR------- 141
Query: 52 HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
C SCYGAE+ D CCN C++VREAYR++GWA PD I+QCKREGF Q+++E++
Sbjct: 142 -----CESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTIEQCKREGFSQKMQEQKN 196
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAF 166
EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+F
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHFIKHLSF 256
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G+ +PG+VNPLD S MYQYF+K+VPT+Y G +++NQFSVT H + +
Sbjct: 257 GKDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVKTNQFSVTRHEKIA-N 315
Query: 227 GRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
G + Q LPGVF Y+LSP+ V FTE+ SF HFLT VCAI+GGVFTV+G+ID+ IYH
Sbjct: 316 GLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSA 375
Query: 285 RAIKKKIEIGKFS 297
RAI+KKIE+GK S
Sbjct: 376 RAIQKKIELGKAS 388
>gi|405966014|gb|EKC31342.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Crassostrea gigas]
Length = 397
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 158/304 (51%), Positives = 212/304 (69%), Gaps = 9/304 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGIGAPKIDKPLQRHGGRLEH----- 52
MD+SGEQ LDV H +FK+RL++ G I E ++G I + + +E
Sbjct: 92 MDVSGEQQLDVDHHLFKQRLNADGEKIKDTEPEKEGTMYEPIFELGDKSKDAVEAVTKKL 151
Query: 53 NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
+ C SCYGAE+ D CCN CE+VREAYRKKGWA ++P+ I+QC REG+ ++K ++ E
Sbjct: 152 DPDRCESCYGAETGDLKCCNTCEDVREAYRKKGWAFNSPEGIEQCNREGWTAKMKAQQKE 211
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC +YG+LEVNKV GNFHFAPGKSF Q VHVHD+ AF FN+SH I L+FG+ +PG
Sbjct: 212 GCQVYGYLEVNKVQGNFHFAPGKSFQQHHVHVHDLQAFGGQKFNLSHAIRHLSFGQDYPG 271
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT- 231
++NPLD E M+QY++KVVPT Y DV G T+ +NQ+SV +H ++ G +
Sbjct: 272 IINPLDQTSQISEDEQTMFQYYVKVVPTTYVDVKGKTLYTNQYSVNKHSKTVGNGMGDSG 331
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
LPGVFF Y+LSP+ V +TE+ SF+HFLT VCAI+GG+FTV+G+ID+ IYH RA++KKI
Sbjct: 332 LPGVFFIYELSPMMVKYTEKQRSFMHFLTGVCAIIGGIFTVAGLIDSMIYHSSRALQKKI 391
Query: 292 EIGK 295
E+GK
Sbjct: 392 ELGK 395
>gi|302834369|ref|XP_002948747.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
gi|300265938|gb|EFJ50127.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
nagariensis]
Length = 392
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 170/301 (56%), Positives = 213/301 (70%), Gaps = 6/301 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGN-VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGE HLD+ HD++K+RL + G+ V E + + A K P+ +G CGS
Sbjct: 94 MDISGELHLDLDHDVYKQRLSANGSPVKEVEKHNVEATKKVVPV--NGTENSTATPVCGS 151
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGAE DCCN C+EVR AYR+KGWAL+N D I+QC + + + IKE+ GEGC+++G
Sbjct: 152 CYGAEDRQGDCCNTCDEVRAAYRRKGWALANVDHIEQCAHDLYTESIKEQTGEGCHMWGM 211
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFHFAPG+S+ Q +HVHDI F + H +NKL+FG +PG+ NPLD
Sbjct: 212 LEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAVIDFRHTVNKLSFGAPYPGMKNPLDN 271
Query: 180 VR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGVF 236
+ + +GMYQYF+KVVPT YT + T+ +NQFSVTE+FR S QG +TLPGVF
Sbjct: 272 AKAGYKSAAATGMYQYFLKVVPTSYTGIDNKTLATNQFSVTENFRESSQGGAGKTLPGVF 331
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
FFYDLSPIKV E SFL FLT+VCAIVGGVFTVSGI+DAFIY R I+KK+E+GKF
Sbjct: 332 FFYDLSPIKVRIVEHSSSFLSFLTSVCAIVGGVFTVSGIVDAFIYTSTRLIRKKMELGKF 391
Query: 297 S 297
S
Sbjct: 392 S 392
>gi|327271493|ref|XP_003220522.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 3 [Anolis carolinensis]
Length = 394
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 167/312 (53%), Positives = 214/312 (68%), Gaps = 25/312 (8%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + E + G I P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN C++VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS------FNISHKINKLA 165
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVSILGKINMTHIIKHLS 261
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG +PG+VNPLDG + + S M+QYF+KVVPT+Y V G +++NQFSVT H + +
Sbjct: 262 FGRDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA- 320
Query: 226 QGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH
Sbjct: 321 NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHS 380
Query: 284 QRAIKKKIEIGK 295
R I+KKIE+GK
Sbjct: 381 ARVIQKKIELGK 392
>gi|335304738|ref|XP_003360010.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Sus scrofa]
gi|350594872|ref|XP_003134465.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Sus scrofa]
Length = 398
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 167/316 (52%), Positives = 216/316 (68%), Gaps = 29/316 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS----------FNISHKI 161
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVSTGHRCCLQINMTHYI 261
Query: 162 NKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H
Sbjct: 262 QHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHE 321
Query: 222 RSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+
Sbjct: 322 KVAS-GLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSL 380
Query: 280 IYHGQRAIKKKIEIGK 295
IYH RAI+KKI++GK
Sbjct: 381 IYHSARAIQKKIDLGK 396
>gi|410926568|ref|XP_003976750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 2 [Takifugu rubripes]
Length = 389
Score = 332 bits (851), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 163/306 (53%), Positives = 215/306 (70%), Gaps = 14/306 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD++GEQ LDV+H++FK+RLD Q E+ + +G D P+ + C
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELGGED-DVPVFDPSTL---DPERCE 144
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGAE+ D CCN+C++VREAYR++GWA N D I+QCKREGF Q+++E++ EGC +YG
Sbjct: 145 SCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQVYG 204
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGV 173
LEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H I L+FG+ +PG+
Sbjct: 205 VLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIRHLSFGQDYPGL 264
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QT 231
+NPLD T S MYQYF+K+VPT+Y G +++NQFSVT H + + G + Q
Sbjct: 265 INPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEVLKTNQFSVTRHEKVA-NGLIGDQG 323
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
LPGVF Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH R I+KKI
Sbjct: 324 LPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKI 383
Query: 292 EIGKFS 297
E+GK S
Sbjct: 384 ELGKAS 389
>gi|159470839|ref|XP_001693564.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283067|gb|EDP08818.1| predicted protein [Chlamydomonas reinhardtii]
Length = 388
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 167/302 (55%), Positives = 209/302 (69%), Gaps = 10/302 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE HLD+ +++ + E + GIG + R+ L + CGSC
Sbjct: 92 MDISGELHLDLVVELYTLWRRGAAGLTEGKGGGIGVLSVSVSRSRNATALANG---CGSC 148
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE DCCN C+EVR AYR+KGWALSN D I+QC + + + IKE+ GEGC+I +
Sbjct: 149 YGAEDKQGDCCNTCDEVRAAYRRKGWALSNVDHIEQCAHDLYTEAIKEQAGEGCHIG--V 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPG+S+ Q +HVHDI F + H I+KL+FGE +PG+ NPLDG
Sbjct: 207 EVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAVIDFRHVIHKLSFGEPYPGMKNPLDGA 266
Query: 181 RWTQETP-----SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
+ Q +GM+QYF+KVVPT YTD+S T+ +NQFSVTE+FR ++ G +TLPGV
Sbjct: 267 KAGQAAAAAAAATGMFQYFLKVVPTSYTDLSNKTLSTNQFSVTENFREAQGGAGRTLPGV 326
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FFFYDLSPIKV E SFL FLT+VCAIVGGVFTVSGI+DAF+Y G R IKKK+E+GK
Sbjct: 327 FFFYDLSPIKVKIVEHGSSFLSFLTSVCAIVGGVFTVSGIVDAFVYTGTRMIKKKMELGK 386
Query: 296 FS 297
FS
Sbjct: 387 FS 388
>gi|351702542|gb|EHB05461.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Heterocephalus glaber]
Length = 378
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 166/301 (55%), Positives = 209/301 (69%), Gaps = 19/301 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + K++ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTVFDPESLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAES D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS HVH Q N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQS--HVHGWCCLQ---INMTHYIQHLSFGEDYPGIVNP 256
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPG
Sbjct: 257 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 315
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++G
Sbjct: 316 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 375
Query: 295 K 295
K
Sbjct: 376 K 376
>gi|410953940|ref|XP_003983626.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 3 [Felis catus]
Length = 399
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 167/317 (52%), Positives = 215/317 (67%), Gaps = 30/317 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS-----------FNISHK 160
YGFLEVNKVAGNFHFAPGKSF QS VHV HD+ +F D+ N++H
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNRSRLRCWYCLQINMTHY 261
Query: 161 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H
Sbjct: 262 IRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRH 321
Query: 221 FRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+ + G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+
Sbjct: 322 EKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDS 380
Query: 279 FIYHGQRAIKKKIEIGK 295
IYH RAI+KKI++GK
Sbjct: 381 LIYHSARAIQKKIDLGK 397
>gi|355563183|gb|EHH19745.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
mulatta]
gi|355784539|gb|EHH65390.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
fascicularis]
Length = 401
Score = 330 bits (845), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 164/315 (52%), Positives = 215/315 (68%), Gaps = 24/315 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205
Query: 121 EVNKVAGNFHFAPGKSFHQS-GVH-----------------VHDILAFQRDSFNISHKIN 162
EVNKVAGNFHFAPGKSF QS G + VHD+ +F D+ N++H I
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHGTYLTGCVCRLKMIARSLACVHDLQSFGLDNINMTHYIQ 265
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H +
Sbjct: 266 HLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEK 325
Query: 223 SSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ I
Sbjct: 326 VA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLI 384
Query: 281 YHGQRAIKKKIEIGK 295
YH RAI+KKI++GK
Sbjct: 385 YHSARAIQKKIDLGK 399
>gi|340373749|ref|XP_003385402.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Amphimedon queenslandica]
Length = 386
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/306 (53%), Positives = 211/306 (68%), Gaps = 18/306 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD+SGE LDV+H ++K+RL G VI ES + A + G+ CGS
Sbjct: 90 MDVSGEHQLDVEHTMYKQRLTLDGEVINESPTKSVLARD-----ETQDGKAGAANKTCGS 144
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGAE+ + CCN CE+VREAYRKKGWA S+P I+QC++EG+ +IKE+ EGC +YG
Sbjct: 145 CYGAETPELSCCNTCEQVREAYRKKGWAFSDPSSIEQCEKEGWTTQIKEQMNEGCRVYGL 204
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
++V+KVAGNFHFAPGKSF Q VHVHD+ F FN+SH + KL+FG+ +PG++NPLDG
Sbjct: 205 IDVSKVAGNFHFAPGKSFQQHSVHVHDLQPFGVKHFNMSHTVLKLSFGQEYPGIINPLDG 264
Query: 180 VR-WTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
+ + ET G MYQYFIKVVPT+Y ++ T+ +NQF+VT+H R S E G
Sbjct: 265 HKAFDVETTHGGIMYQYFIKVVPTLYRRLNNETMGTNQFAVTKHQRPVRSASGEHG---- 320
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
LPGVFF YD+SPI V TE S HFLT+VCAIVGGVFTV+G+ID +YH R +KKK+
Sbjct: 321 LPGVFFIYDISPILVYLTEYRHSLTHFLTSVCAIVGGVFTVAGMIDKLLYHSGRVLKKKM 380
Query: 292 EIGKFS 297
E+GK S
Sbjct: 381 ELGKLS 386
>gi|440902508|gb|ELR53293.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
grunniens mutus]
Length = 395
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/313 (52%), Positives = 209/313 (66%), Gaps = 26/313 (8%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVH------------VHDILAFQRDSFNISHKINKL 164
YGFLEVNKVAGNFHFAPGKSF QS VH + + N++H I L
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHGCREEVRVTGARCSEAQGWCCLQINMTHYIRHL 261
Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + +
Sbjct: 262 SFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA 321
Query: 225 EQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
G + Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH
Sbjct: 322 -NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYH 380
Query: 283 GQRAIKKKIEIGK 295
RAI+KKI++GK
Sbjct: 381 SARAIQKKIDLGK 393
>gi|330790779|ref|XP_003283473.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
gi|325086583|gb|EGC39970.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
Length = 383
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/299 (55%), Positives = 209/299 (69%), Gaps = 10/299 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE DV H+IFKKRL S G I Q I +I+K + ++ E++ CGSC
Sbjct: 89 MDVSGEHQFDVAHNIFKKRLSSTGQPI-IEQPPIREEEINKKIVKN----ENDVQGCGSC 143
Query: 61 YGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
YGAE CCN CEEVR AY KKGW L +P + QC REGF + I E+ GEGC +YG
Sbjct: 144 YGAEDPARGIPCCNTCEEVRNAYSKKGWGL-DPSTVSQCLREGFTKNIVEQNGEGCQVYG 202
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
F+ VNKVAGNFHFAPGKSF Q +HVHD+ F+ FN+SH INKLA G FPG+ NPLD
Sbjct: 203 FILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKDGQFNMSHTINKLAVGNEFPGIKNPLD 262
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT-LPGVF 236
V T+ GM+QYFIK+VPT+Y ++G+ I +NQ+SVTEH+R +++G T LPG+F
Sbjct: 263 EVTKTEVAGVGMFQYFIKIVPTIYEGLNGNRIATNQYSVTEHYRLLAKKGEEPTGLPGLF 322
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
F YDLSPI + +E+ SF FLTNVCAI+GGVFTV GI D+FIY+ + +KKKI++GK
Sbjct: 323 FMYDLSPIMMKVSEKGKSFASFLTNVCAIIGGVFTVFGIFDSFIYYSTKNLKKKIDLGK 381
>gi|414586930|tpg|DAA37501.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 268
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 141/179 (78%), Positives = 164/179 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDG 268
>gi|390359988|ref|XP_792057.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Strongylocentrotus purpuratus]
Length = 400
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 156/311 (50%), Positives = 213/311 (68%), Gaps = 20/311 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGG-------RLE- 51
MDISGEQ LDV H+I+K+R+D G I E ++ +G + + + ++E
Sbjct: 92 MDISGEQQLDVDHNIYKRRIDKTGTPISEPEKEELGKKEDQEKKEEEDSEQEDEKKKMEV 151
Query: 52 HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
+ C SCYGAE+ CCN+CE V+EAYR+KGWA S+P I+QCKREGF ++++ ++
Sbjct: 152 LDPNRCESCYGAETPGLKCCNDCEGVQEAYRRKGWAFSDPTSIEQCKREGFSEKMQSQKE 211
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGC +YG+LEVNKVAGNFHFAPGKSF Q VHVHD+ A FN++H + L+FG +P
Sbjct: 212 EGCELYGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQAIAGAKFNMTHHVKTLSFGMEYP 271
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-------FRSS 224
G+ NPLD ++ S M+QYF+K+VPT YT + ++NQ+SVT+H F +
Sbjct: 272 GMENPLDNMKTIDVKGSSMFQYFVKIVPTTYTKLDKSITRTNQYSVTKHEKQVTTSFSTG 331
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
E G LPGVF Y+LSP+ V FTE+H SF+HFLT VCAI+GGVFTV+G+ID+ IYH
Sbjct: 332 EHG----LPGVFVLYELSPLMVKFTEKHRSFMHFLTGVCAIIGGVFTVAGLIDSLIYHSA 387
Query: 285 RAIKKKIEIGK 295
+AI+KKI++GK
Sbjct: 388 KAIQKKIDLGK 398
>gi|328868763|gb|EGG17141.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium fasciculatum]
Length = 335
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 162/303 (53%), Positives = 208/303 (68%), Gaps = 17/303 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES----RQDGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SG+ DV H+IFKKRL G I R+D I +R E+++
Sbjct: 40 MDVSGDHQFDVAHNIFKKRLSPTGMPIADASPQREDTIN--------KRVPAGNENDKVD 91
Query: 57 CGSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CGSCYGAE CC+ CEEVR AY+KKGW++ I QC REGF + I E+ GEGC
Sbjct: 92 CGSCYGAEDPSRGISCCSTCEEVRTAYQKKGWSIQEYSGIAQCVREGFTKNIVEQNGEGC 151
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
+YGF+ VNKVAGNFHFAPGKSF Q +HVHD+ AF + SFN+SH IN+L+FG FPG+
Sbjct: 152 QVYGFINVNKVAGNFHFAPGKSFQQHHMHVHDLQAF-KGSFNLSHSINRLSFGNDFPGIK 210
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQTL 232
NPLDGV T+ SGM+QY+IKVVPT+Y ++G+ I +NQFSVTEH+R + + L
Sbjct: 211 NPLDGVTKTEMVGSGMFQYYIKVVPTLYEGLNGNRISTNQFSVTEHYRLLAKKDEEPSGL 270
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
PG+FF YDLSPI + +E+ SF FLT+VCAIVGGVFTV+GI+D+ IY + +KKKI+
Sbjct: 271 PGLFFMYDLSPIMMKVSEQGKSFASFLTSVCAIVGGVFTVAGILDSMIYKTTKNLKKKID 330
Query: 293 IGK 295
+GK
Sbjct: 331 LGK 333
>gi|66801671|ref|XP_629760.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
gi|74851212|sp|Q54DW2.1|ERGI3_DICDI RecName: Full=Probable endoplasmic reticulum-Golgi intermediate
compartment protein 3
gi|60463164|gb|EAL61357.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Dictyostelium discoideum AX4]
Length = 383
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 162/301 (53%), Positives = 210/301 (69%), Gaps = 13/301 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETY-CG 58
MD+SGE DV H+IFKKRL G I I AP I ++ + + ++N+ CG
Sbjct: 88 MDVSGEHQFDVAHNIFKKRLSPTGQPI------IEAPPIREEEINKKESVKDNNDVVGCG 141
Query: 59 SCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
SCYGAE + CCN CEEVR AY KKGW L +P I QC REGF + + E+ GEGC +
Sbjct: 142 SCYGAEDPSKGIGCCNTCEEVRVAYSKKGWGL-DPSGIPQCIREGFTKNLVEQNGEGCQV 200
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGF+ VNKVAGNFHFAPGKSF Q +HVHD+ F+ SFN+SH IN+L+FG FPG+ NP
Sbjct: 201 YGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKDGSFNVSHTINRLSFGNDFPGIKNP 260
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT-LPG 234
LD V T+ GM+QYF+KVVPT+Y ++G+ I +NQ+SVTEH+R +++G + LPG
Sbjct: 261 LDDVTKTEMVGVGMFQYFVKVVPTIYEGLNGNRIATNQYSVTEHYRLLAKKGEEPSGLPG 320
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF YDLSPI + +E SF FLTNVCAI+GGVFTV GI D+FIY+ + ++KKI++G
Sbjct: 321 LFFMYDLSPIMMKVSERGKSFASFLTNVCAIIGGVFTVFGIFDSFIYYSTKNLQKKIDLG 380
Query: 295 K 295
K
Sbjct: 381 K 381
>gi|156389237|ref|XP_001634898.1| predicted protein [Nematostella vectensis]
gi|156221986|gb|EDO42835.1| predicted protein [Nematostella vectensis]
Length = 386
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/302 (50%), Positives = 203/302 (67%), Gaps = 16/302 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR--LEHNETYCG 58
MD+SGEQ +DV +I K+R+D G +I+ A K D + H + L+ + C
Sbjct: 92 MDVSGEQQIDVSSNILKRRVDLDGKIIDE-----NAEKGDLGDKSHEAKELLDLDPNRCE 146
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGAE+ D+ CCN C++VREAYR+KGWALSN D + QC REG+ +++E++ EGC + G
Sbjct: 147 SCYGAETPDKKCCNTCDDVREAYRRKGWALSNVDDVKQCMREGWKDKLQEQKNEGCEVTG 206
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
+LEVNKVAGNFHFAPGKSF Q VHVHD+ F FN++H I L+FG +PG PLD
Sbjct: 207 YLEVNKVAGNFHFAPGKSFQQHHVHVHDLQPFGSTQFNLTHNIKHLSFGHDYPGKTYPLD 266
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLP 233
MYQYF+K+VPT Y +SG + ++QFSVT+H R S E G LP
Sbjct: 267 NTFVPAMEAGSMYQYFVKIVPTTYRKLSGEILHTHQFSVTKHKRVIRQMSGEHG----LP 322
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+ SP+ V +TE SF+HFLT VCAIVGG+FTV+G++D+ IYH RA++KKI++
Sbjct: 323 GVFVLYEFSPMMVQYTESRRSFMHFLTGVCAIVGGIFTVAGLVDSMIYHSSRALQKKIDL 382
Query: 294 GK 295
GK
Sbjct: 383 GK 384
>gi|355686517|gb|AER98082.1| ERGIC and golgi 3 [Mustela putorius furo]
Length = 304
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 152/277 (54%), Positives = 194/277 (70%), Gaps = 14/277 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 36 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 88
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 89 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 148
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 208
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
LD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 209 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 267
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
VF Y+LSP+ V TE+H SF HFLT VCAI+GG+FT
Sbjct: 268 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFT 304
>gi|428183328|gb|EKX52186.1| hypothetical protein GUITHDRAFT_65491 [Guillardia theta CCMP2712]
Length = 425
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 163/298 (54%), Positives = 199/298 (66%), Gaps = 18/298 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGG---RLEHN 53
MDISGEQH+DV H+++K+RLD GNVI + + L+ H G L
Sbjct: 122 MDISGEQHIDVHHEVYKQRLDVDGNVILLLSRACLNVTNGSGDFTTLRAHAGFDAPLTGG 181
Query: 54 ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
E CGSCYGAE S ++CCN C+ VREAYR++GWA N D I QCK EGFL +++EE EG
Sbjct: 182 E--CGSCYGAEESPDECCNTCDSVREAYRRRGWAFVNSDGIVQCKTEGFLLKMQEERHEG 239
Query: 114 CNIYGFL-------EVNKVAGNFHFAPGKSF-HQSGVHVHDILAFQRDSFNISHKINKLA 165
C + G L +VNKVAGNFHF+PGKSF Q GVH D+L ++ +N+SH IN L+
Sbjct: 240 CRVVGTLQARLTREQVNKVAGNFHFSPGKSFSQQVGVHFQDLLVLRKTDYNVSHAINHLS 299
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG +PG VNPLDGV E S MYQYF+KVVPT Y +G + +NQFS TE+ R E
Sbjct: 300 FGRKYPGRVNPLDGVVRICEFRSAMYQYFVKVVPTQYQYRNGTILSTNQFSTTENTRQLE 359
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
G + LPGVFFFYDLSPIK T E + SFLHFLT +CAI+GGVFTV GIID+ IY G
Sbjct: 360 -GFTRGLPGVFFFYDLSPIKATLAERNNSFLHFLTGLCAIIGGVFTVMGIIDSTIYTG 416
>gi|320167013|gb|EFW43912.1| Ergic3 protein [Capsaspora owczarzaki ATCC 30864]
Length = 392
Score = 308 bits (788), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 149/301 (49%), Positives = 202/301 (67%), Gaps = 10/301 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD++GE LDV H + K RL + G V+ + + +G +P R + + + CG
Sbjct: 94 MDVAGEHQLDVLHTLVKTRLSASGEVVREPTPVEALG----QQPPSDAAERRDLDNSKCG 149
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYGA++ CCN+CEEV+ AYR+KGW + +PD I+QC++EGF +R++ EGC + G
Sbjct: 150 DCYGAQTEKRPCCNSCEEVQAAYREKGWGMMDPDSIEQCRQEGFSERMRSIANEGCKVQG 209
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
F+ VNKVAGNFHFAPGKS VHVHD+ F+ +F+++H I+ L+FG +PG VNPLD
Sbjct: 210 FMYVNKVAGNFHFAPGKSSQHQHVHVHDLQQFKTTTFDMTHTIHLLSFGTEYPGQVNPLD 269
Query: 179 GVRWT--QETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
V + TP S M+QYFIKVVPT Y ++G T Q++QFS T H + + LPG
Sbjct: 270 AVSKVPPENTPGSAMFQYFIKVVPTEYVKLNGETEQTSQFSATSHVKMINHAAGENGLPG 329
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VFF Y+ SP+ V TE SF+HFLT VCAIVGGVFTV+G++DA IYH R+IKKK+E+G
Sbjct: 330 VFFMYEPSPMLVKITERRKSFMHFLTGVCAIVGGVFTVAGLVDATIYHSYRSIKKKMELG 389
Query: 295 K 295
K
Sbjct: 390 K 390
>gi|326434226|gb|EGD79796.1| intermediate compartment protein 3 [Salpingoeca sp. ATCC 50818]
Length = 396
Score = 306 bits (783), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 206/312 (66%), Gaps = 24/312 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-----------ESRQDGIGAPKIDKPLQRHGGR 49
MD+SGE LDV+HDIFK+RL G I + +GA K+ K
Sbjct: 92 MDVSGENELDVEHDIFKQRLTETGTPIYEEPEEVDDLGDESDSAVGALKMMKE------G 145
Query: 50 LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
L+ N C SCYGAES CCN CE VREAYR+KGWAL++ I+QC+REG+ +++K +
Sbjct: 146 LDPNR--CESCYGAESEQNKCCNTCEAVREAYRRKGWALTDIQGIEQCEREGWTEKLKAQ 203
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAF 166
EGC IYG LEVNKVAGNFH APGKSF Q +H HD+ +F R++ FN+SH IN L+F
Sbjct: 204 AKEGCRIYGHLEVNKVAGNFHIAPGKSFQQHSIHFHDLNSFGREALGKFNMSHTINHLSF 263
Query: 167 GEHFPGVVNPLDGVRWTQE-TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
G +PGVVNPLDG T + + MYQY++K+VPT Y G + +NQ+SVT H R +
Sbjct: 264 GIEYPGVVNPLDGHSETADKLGATMYQYYVKIVPTRYRKARGQELNTNQYSVTMHQRHID 323
Query: 226 QGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
QT LPG+F +++SPI V +E SF HFLT V AI+GG+F+V+G+ID+F+YHG
Sbjct: 324 HKAGQTGLPGMFVMFEISPILVQLSERTHSFFHFLTGVLAIIGGIFSVAGMIDSFVYHGL 383
Query: 285 RAIKKKIEIGKF 296
R++KKK E+GK
Sbjct: 384 RSLKKKQELGKL 395
>gi|307105810|gb|EFN54058.1| hypothetical protein CHLNCDRAFT_25376, partial [Chlorella
variabilis]
Length = 312
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 155/307 (50%), Positives = 206/307 (67%), Gaps = 15/307 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET----Y 56
MDISGE L+V HD++K+RL G + D G P+ G E + T Y
Sbjct: 9 MDISGEVQLEVDHDVYKRRLSPDGTPL----DEGGCPRAGWLKPVPGNDSEADPTKAPGY 64
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
CGSCYG+ES CCN C EVR+AYR KGWAL + + ++QC EG+ + I E++GEGC++
Sbjct: 65 CGSCYGSESRAGQCCNTCAEVRDAYRTKGWALLDVEKVEQCHHEGYKEEIDEQKGEGCHV 124
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG---- 172
+G L++NKVAGNFH APG+S+ Q +H+HD+ F +F+ SH I+KLAFG +PG
Sbjct: 125 WGELQINKVAGNFHIAPGRSYQQGNMHIHDLSPFAGQAFDFSHTIHKLAFGREYPGTRGQ 184
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQ 230
++ T+ G+YQYF+KVVPT Y+D+ +TI +NQFSVTEHFR +S
Sbjct: 185 ALSTFCLSVGTRRERMGLYQYFLKVVPTSYSDLRNNTIYTNQFSVTEHFRETASPTAGGG 244
Query: 231 TLPGVFFFYDLSPIKVTFT-EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LPGVF FYDLSPIK + +SFL FLT++CAI+GGVFTVSGIIDA +YHGQ+AIKK
Sbjct: 245 QLPGVFLFYDLSPIKASLEGRARLSFLSFLTSLCAIIGGVFTVSGIIDATVYHGQQAIKK 304
Query: 290 KIEIGKF 296
K+++GK
Sbjct: 305 KLDLGKL 311
>gi|281211641|gb|EFA85803.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Polysphondylium pallidum PN500]
Length = 388
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 157/314 (50%), Positives = 211/314 (67%), Gaps = 29/314 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIE---SRQDGIG-APKIDKPLQRHGGRLEHNETY 56
MD+SGE DV H+IFK+RL G I R+D + PK++ E++
Sbjct: 87 MDVSGEHQFDVAHNIFKRRLSPTGEFIPDAPKREDNVNIKPKVN----------ENDRPE 136
Query: 57 CGSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CGSC GAE+ + +CCN CEEVR AY+K GW +P QC REGF + + E+ GEGC
Sbjct: 137 CGSCMGAENPSKGINCCNTCEEVRVAYQKMGWGF-DPSDTPQCVREGFTKNVVEQNGEGC 195
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
+YGFL VNKVAGNFHFAPGKSF Q +HVHD+ +F + FN+SH I++L+FG FPG+
Sbjct: 196 QVYGFLLVNKVAGNFHFAPGKSFQQHHMHVHDLQSF-KGQFNLSHTISRLSFGNDFPGIK 254
Query: 175 NPLDGVRWTQETP---------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SS 224
NPLDGV T+ SGM+QY++K+VPT+Y ++G+ I +NQ+SVTEH+R +
Sbjct: 255 NPLDGVSKTEANQYQYHNLVVGSGMFQYYVKIVPTIYEGLNGNLINTNQYSVTEHYRLLA 314
Query: 225 EQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
++G T LPG+FF YDLSPI + E SF F+T+VCAIVGGVFTV+GI D+FIY
Sbjct: 315 KKGEEMTGLPGLFFMYDLSPIMMKVVERSKSFASFITSVCAIVGGVFTVAGIFDSFIYQT 374
Query: 284 QRAIKKKIEIGKFS 297
+++K+KI++GK S
Sbjct: 375 TKSLKRKIDLGKAS 388
>gi|332248939|ref|XP_003273622.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 1 [Nomascus leucogenys]
Length = 380
Score = 301 bits (770), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/302 (50%), Positives = 201/302 (66%), Gaps = 19/302 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC G LQR + E C++
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCPARG-LQRTQPENERECSL---- 200
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVH-----DILAFQRDSFNISHKINKLAFGEHFPGVVN 175
+VAGNFHFAPGKSF QS VHVH D+ +F D+ N++H I L+FGE +PG+VN
Sbjct: 201 ---QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 257
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LP
Sbjct: 258 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 316
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++
Sbjct: 317 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 376
Query: 294 GK 295
GK
Sbjct: 377 GK 378
>gi|383864675|ref|XP_003707803.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Megachile rotundata]
Length = 385
Score = 293 bits (750), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 197/308 (63%), Gaps = 21/308 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
MD +GEQHL ++H+I+K+RLD QG IE Q K D K L + + + T
Sbjct: 88 MDTTGEQHLQIEHNIYKRRLDLQGKPIEDPQ------KTDITDTKALSKTTAKSVESTTV 141
Query: 57 --CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CG CYGA S CCN CE+VR+AY K WA +P I QC+ + ++++K +GC
Sbjct: 142 ETCGDCYGAASEKIKCCNTCEDVRKAYSDKNWAPPDPGSIKQCQNDKSVEKMKTAFTQGC 201
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
IYG++EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG
Sbjct: 202 QIYGYMEVNRVGGSFHIAPGNSFSVNHVHVHDVQPYMSTQFNMTHKIRHLSFGLNIPGKT 261
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRL 229
NP+D + M+ ++IK+VPT Y G T+ +NQFSVT H R S E G
Sbjct: 262 NPIDDTTMVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLLSGESG-- 319
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+PG+FF Y+LSP+ V +TE+ SF HF TN+CAI+GGVFTV+G+ID+F+YH RAI+K
Sbjct: 320 --MPGIFFSYELSPLMVKYTEKAKSFGHFATNMCAIIGGVFTVAGLIDSFLYHSVRAIQK 377
Query: 290 KIEIGKFS 297
KIE+GK+S
Sbjct: 378 KIELGKYS 385
>gi|167535515|ref|XP_001749431.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163772059|gb|EDQ85716.1| predicted protein [Monosiga brevicollis MX1]
Length = 394
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 150/306 (49%), Positives = 201/306 (65%), Gaps = 11/306 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQ---DGIGAPKIDKPLQRH-GGRLEHNETY 56
MDISGE ++ HD+F++RLD+ GN I + Q D +G D + G + +
Sbjct: 89 MDISGENEQNIDHDVFRQRLDASGNKIYNGQEEIDELGESHADNVADKALDGLKDLDPNR 148
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE ++ CCN C +V+EAYRKKGWA + I QC+REG+ ++ +E EGC +
Sbjct: 149 CESCYGAEDTEGQCCNTCAQVQEAYRKKGWAFRSGQGIAQCEREGYDAMMEAQEREGCQL 208
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLAFGEHFPGV 173
YG LEVNKVAGNFH APG+SF Q +H+HD+ +F R+ FN++H IN L+FG +P
Sbjct: 209 YGHLEVNKVAGNFHIAPGRSFEQHNMHIHDMQSFGREKLAKFNLTHVINHLSFGIDYPDR 268
Query: 174 VNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS--SEQGRLQ 230
VN LDG V E + MYQYF+KVVPT Y +S I +NQ+SVT H R +QG
Sbjct: 269 VNSLDGHVEVPNEYGAIMYQYFLKVVPTRYRFLSQTEIDTNQYSVTMHQREIRPDQG-TS 327
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
LPG+FF YD+SP+K+ T+ SF HFLT +CAI+GGV+TV+G+ID F+YHG R +K K
Sbjct: 328 GLPGLFFMYDISPMKIQLTQSSRSFFHFLTGLCAIIGGVYTVAGMIDGFLYHGIRTLKAK 387
Query: 291 IEIGKF 296
+GK
Sbjct: 388 QNMGKL 393
>gi|441638772|ref|XP_004090166.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 isoform 2 [Nomascus leucogenys]
Length = 393
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 201/315 (63%), Gaps = 32/315 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC G LQR + E C++
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCPARG-LQRTQPENERECSL---- 200
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVH-----DILAFQRDS-------------FNISHKIN 162
+VAGNFHFAPGKSF QS VHVH D+ +F D+ N++H I
Sbjct: 201 ---QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVQLWMSSGWCCLQINMTHYIQ 257
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G +++NQFSVT H +
Sbjct: 258 HLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEK 317
Query: 223 SSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G L Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ I
Sbjct: 318 VA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLI 376
Query: 281 YHGQRAIKKKIEIGK 295
YH RAI+KKI++GK
Sbjct: 377 YHSARAIQKKIDLGK 391
>gi|332024433|gb|EGI64631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Acromyrmex echinatior]
Length = 386
Score = 290 bits (743), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 138/302 (45%), Positives = 193/302 (63%), Gaps = 8/302 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD +GEQHL ++H+IFK+RLD GN IE Q I K + CG
Sbjct: 88 MDTTGEQHLHIEHNIFKRRLDLNGNPIEDPQRTNITDAKAMSKTTEKAVEIGSTTELCGD 147
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA + CCN CE+V EAYR+K WA +P + QC+ + + ++K +GC IYG+
Sbjct: 148 CYGATTDTMKCCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMDKLKHAFTQGCQIYGY 207
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
+EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP+DG
Sbjct: 208 MEVNRVGGSFHIAPGASFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPGKTNPMDG 267
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGV 235
+ + M+ ++IK+VPT Y G T+ +NQFSVT H S++ L T +PG+
Sbjct: 268 MTVVDMDAAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRH---SKKVSLLTGESGMPGI 324
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI++KIE+GK
Sbjct: 325 FFNYELSPLMVKYTEKANSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELGK 384
Query: 296 FS 297
++
Sbjct: 385 YN 386
>gi|307179776|gb|EFN67966.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Camponotus floridanus]
Length = 385
Score = 287 bits (734), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 143/303 (47%), Positives = 197/303 (65%), Gaps = 13/303 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNET-YC 57
MD +GEQHL ++H+IFK+RLD G IE R + + ++K ++ LE T C
Sbjct: 88 MDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTNITDSKAVNKTAEK---ALEIGSTESC 144
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
G CYGA + CCN CEEVREAY+ K WA +P I QCK + +++IK +GC IY
Sbjct: 145 GDCYGAATETLRCCNTCEEVREAYKLKKWAPPDPANIKQCKDDKSMEKIKHAFTQGCQIY 204
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G++EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP+
Sbjct: 205 GYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTHFNMTHKIRHLSFGLNIPGKTNPM 264
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LP 233
D + M+ ++IK+VPT Y G T+ +NQFSVT H ++Q L T +P
Sbjct: 265 DDTTVIATEGAMMFYHYIKIVPTTYVRTDGSTLFTNQFSVTRH---AKQVSLFTGESGMP 321
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
G+FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKIE+
Sbjct: 322 GIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQKKIEL 381
Query: 294 GKF 296
GK+
Sbjct: 382 GKY 384
>gi|340721521|ref|XP_003399168.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus terrestris]
Length = 385
Score = 286 bits (731), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 140/302 (46%), Positives = 187/302 (61%), Gaps = 9/302 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +GEQHL ++H+IFK+RLD G IE Q + E CG C
Sbjct: 88 MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKARSKTTEKTVESTTEKACGDC 147
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA CCN CE+VREAYR K WA +I QCK + +++IK +GC IYG++
Sbjct: 148 YGAAGDIIKCCNTCEDVREAYRLKNWAPPALGMIKQCKNDKSVEKIKTAFTQGCQIYGYM 207
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP+D
Sbjct: 208 EVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTSTQFNMTHKIRHLSFGLNIPGKTNPMDDT 267
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGV 235
+ M+ ++IK+VPT Y G T+ +NQFSVT H R S E G +PG+
Sbjct: 268 TVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----MPGI 323
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKIE+GK
Sbjct: 324 FFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSVRAIQKKIELGK 383
Query: 296 FS 297
++
Sbjct: 384 YN 385
>gi|350404831|ref|XP_003487234.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Bombus impatiens]
Length = 385
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 140/302 (46%), Positives = 188/302 (62%), Gaps = 9/302 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +GEQHL ++H+IFK+RLD G IE Q + E CG C
Sbjct: 88 MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKARSKTTTKTVESTTEKACGDC 147
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA CCN CE+VREAYR K WAL +I QCK + ++++K +GC IYG++
Sbjct: 148 YGAAGDIIKCCNTCEDVREAYRLKNWALPALGMIKQCKNDKSVEKMKTAFIQGCQIYGYM 207
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP+D
Sbjct: 208 EVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTSTQFNMTHKIRHLSFGLNIPGKTNPMDDT 267
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGV 235
+ M+ ++IK+VPT Y G T+ +NQFSVT H R S E G +PG+
Sbjct: 268 TVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----MPGI 323
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKIE+GK
Sbjct: 324 FFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSVRAIQKKIELGK 383
Query: 296 FS 297
++
Sbjct: 384 YN 385
>gi|380016121|ref|XP_003692037.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Apis florea]
Length = 385
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 143/305 (46%), Positives = 192/305 (62%), Gaps = 15/305 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHN-ETYC 57
MD +GEQHL ++H+IFK+RLD G IE R D + K + LE E C
Sbjct: 88 MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKALSKTTAK---TLESTTEKIC 144
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
G CYGA S CCN CE+VREAYR K WA I QC+ + ++++K +GC IY
Sbjct: 145 GDCYGAASEIIKCCNTCEDVREAYRLKNWAPPVLGNIKQCQNDKSVEKMKTAFTQGCQIY 204
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G++EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP+
Sbjct: 205 GYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTQFNMTHKIRHLSFGLNIPGKTNPM 264
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTL 232
D + M+ ++IK+VPT Y G T+ +NQFSVT H R S E G +
Sbjct: 265 DDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----M 320
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
PG+FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKIE
Sbjct: 321 PGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSLRAIQKKIE 380
Query: 293 IGKFS 297
+GK++
Sbjct: 381 LGKYN 385
>gi|328786822|ref|XP_393819.4| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like isoform 1 [Apis mellifera]
Length = 383
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 145/306 (47%), Positives = 194/306 (63%), Gaps = 19/306 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHN-ETYC 57
MD +GEQHL ++H+IFK+RLD G IE R D + K + LE E C
Sbjct: 88 MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKALSKTTAK---TLESTTEKIC 144
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWA-LSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
G CYGA S CCN CE+VREAYR K WA L N I QC+ + ++++K +GC I
Sbjct: 145 GDCYGAASEIIKCCNTCEDVREAYRLKNWAVLGN---IKQCQNDKSVEKMKTAFTQGCQI 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YG++EVN+V G+FH APG SF + VHVHD+ + FN++HKI L+FG + PG NP
Sbjct: 202 YGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTQFNMTHKIRHLSFGLNIPGKTNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
+D + M+ ++IK+VPT Y G T+ +NQFSVT H R S E G
Sbjct: 262 MDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG---- 317
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
+PG+FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKI
Sbjct: 318 MPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSLRAIQKKI 377
Query: 292 EIGKFS 297
E+GK++
Sbjct: 378 ELGKYN 383
>gi|297602842|ref|NP_001052965.2| Os04g0455900 [Oryza sativa Japonica Group]
gi|255675519|dbj|BAF14879.2| Os04g0455900 [Oryza sativa Japonica Group]
Length = 253
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 125/157 (79%), Positives = 144/157 (91%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG++HLDVKHDIFK+R+D GNVI ++QD +G K+++PLQRHGGRLEHNETYCGSC
Sbjct: 90 MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGGMKVEQPLQRHGGRLEHNETYCGSC 149
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEGEGCNIYGFL
Sbjct: 150 YGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEGEGCNIYGFL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
EVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+
Sbjct: 210 EVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNV 246
>gi|270007946|gb|EFA04394.1| hypothetical protein TcasGA2_TC014693 [Tribolium castaneum]
Length = 385
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/305 (48%), Positives = 193/305 (63%), Gaps = 19/305 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR--LEHNETYCG 58
MD SGEQHL + H+I+K+RLD QG IE + K D ++R N+T CG
Sbjct: 90 MDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK------KEDITIKRKNSTEVATVNKTECG 143
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWAL-SNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
SCYGA + CCN CE+VREAYR++ WA NP+ I QCK E F +++K +GC IY
Sbjct: 144 SCYGASFDPKRCCNTCEDVREAYRERRWAFPENPENITQCKEERFSEKLKTAFAQGCQIY 203
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV-NP 176
G L VN+V+G+FH APGKSF + VHVHD+ F FN +HKI L+FG NP
Sbjct: 204 GSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSSTEFNTTHKIRHLSFGASIDSDTHNP 263
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
L E + M+QY IK+VPT Y + G I +NQFSVT+H R S E G
Sbjct: 264 LKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFISANQFSVTKHRRVISLMSGESG---- 319
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
+PG+FF Y+LSP+ V +TE+ SF HF TNVCAI+GGV+TV+G+ID +YH + I+KKI
Sbjct: 320 MPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGVYTVAGLIDTMLYHSVKLIQKKI 379
Query: 292 EIGKF 296
E+GKF
Sbjct: 380 ELGKF 384
>gi|189237821|ref|XP_974331.2| PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
Length = 395
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 193/307 (62%), Gaps = 21/307 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR----LEHNETY 56
MD SGEQHL + H+I+K+RLD QG IE + K D ++R N+T
Sbjct: 98 MDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK------KEDITIKRKNSTEVSVATVNKTE 151
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWAL-SNPDLIDQCKREGFLQRIKEEEGEGCN 115
CGSCYGA + CCN CE+VREAYR++ WA NP+ I QCK E F +++K +GC
Sbjct: 152 CGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPENITQCKEERFSEKLKTAFAQGCQ 211
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV- 174
IYG L VN+V+G+FH APGKSF + VHVHD+ F FN +HKI L+FG
Sbjct: 212 IYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSSTEFNTTHKIRHLSFGASIDSDTH 271
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRL 229
NPL E + M+QY IK+VPT Y + G I +NQFSVT+H R S E G
Sbjct: 272 NPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFISANQFSVTKHRRVISLMSGESG-- 329
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+PG+FF Y+LSP+ V +TE+ SF HF TNVCAI+GGV+TV+G+ID +YH + I+K
Sbjct: 330 --MPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGVYTVAGLIDTMLYHSVKLIQK 387
Query: 290 KIEIGKF 296
KIE+GKF
Sbjct: 388 KIELGKF 394
>gi|444729170|gb|ELW69597.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Tupaia chinensis]
Length = 393
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 196/334 (58%), Gaps = 70/334 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-------------------- 40
MD++GEQ LDV+H++FK+RLD G + + + KI+
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSTEAERHELGKIEVKVFDPNSLDPDRCESCYGA 148
Query: 41 -----KPLQRHG----GRLE--------HNETYCGSCYGAESSDEDCCNNCEEVREAYRK 83
KP G++E + C SCYGAES D CCN CE+VREAYR+
Sbjct: 149 ESEDIKPCLEAADLELGKIEVKVFDPNSLDPDRCESCYGAESEDIKCCNTCEDVREAYRR 208
Query: 84 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
+GWA NPD I+QC+REGF Q+++E++ EGC +YGFLEVNK+
Sbjct: 209 RGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKI------------------ 250
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 251 ------------NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 298
Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT
Sbjct: 299 KVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 357
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 358 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 391
>gi|242007856|ref|XP_002424735.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212508228|gb|EEB11997.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 376
Score = 276 bits (706), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 150/300 (50%), Positives = 195/300 (65%), Gaps = 20/300 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD SGEQHL ++H+I+K LD G I+ + KP+ E E CGSC
Sbjct: 92 MDSSGEQHLQIEHNIYKVSLDKNGIPIKEPEK----ETFVKPVN------ETKEKKCGSC 141
Query: 61 YGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
YGAES + CCN C +V++AY K+GW L+N +LI+QCK L + EGC IYG
Sbjct: 142 YGAESETLNITCCNTCADVKDAYMKRGWGLNNLELIEQCKN---LSQ-NNIFNEGCFIYG 197
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
+EVN+V G+FH APG+SF + VHVHD+ F +FN SHKI+ L+FG + PG NPLD
Sbjct: 198 TMEVNRVGGSFHIAPGQSFSINHVHVHDVQPFSSKAFNTSHKIDHLSFGYNIPGKTNPLD 257
Query: 179 GVRWTQETPSGMYQYFIKVVPTV--YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
G+ + M+QY+IK+VPT+ Y D SG TI +NQFSVT H +S + + PG+F
Sbjct: 258 GIVALTHEGATMFQYYIKIVPTIYYYYDKSG-TILTNQFSVTRHQKSGSE-TIGVPPGIF 315
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
F Y+L+PI V +TE SF HF TNVCAI+GGVFTV+ +IDAF+Y +A KKKIEIGKF
Sbjct: 316 FNYELAPIMVKYTERKRSFGHFATNVCAIIGGVFTVASLIDAFLYRSVQAFKKKIEIGKF 375
>gi|307193219|gb|EFN76110.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Harpegnathos saltator]
Length = 386
Score = 275 bits (704), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 190/304 (62%), Gaps = 12/304 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD +G Q+L ++H+IF++RLD G IE R + + KP ++ CG
Sbjct: 88 MDTTGVQYLQIEHNIFQRRLDLNGKPIEDPQRTNITKTKAVVKPTDEET-QISSTTKVCG 146
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYGA + +CCN C++V+ AYR K WA+ + I QC+ + + K +GC IYG
Sbjct: 147 DCYGAATETLECCNTCDDVQMAYRLKKWAMPDLAKIKQCQNDKSADKYKHAFTQGCQIYG 206
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
++EVN+V G+FH APG S+ + VHVHD+ + + FN++HKI L+FG + PG NP+D
Sbjct: 207 YMEVNRVGGSFHIAPGDSYSVNHVHVHDVQPYNSNHFNMTHKIRHLSFGLNIPGKTNPMD 266
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-----SEQGRLQTLP 233
+ M+ Y+IK+VPT Y G T+ +NQFSVT H + S+ G +P
Sbjct: 267 DTTTVATEGAMMFYYYIKIVPTTYVRADGSTLLTNQFSVTRHSKRMPLYMSDSG----MP 322
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
G+FF Y+LSP+ V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI+KKIE+
Sbjct: 323 GIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQKKIEL 382
Query: 294 GKFS 297
GK++
Sbjct: 383 GKYN 386
>gi|170031960|ref|XP_001843851.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
gi|167871431|gb|EDS34814.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Culex quinquefasciatus]
Length = 391
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 197/305 (64%), Gaps = 12/305 (3%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
D +GEQHL + H+IFK+RLD +GN IE+ +++ I APK K + CGS
Sbjct: 90 DATGEQHLHIDHNIFKRRLDLKGNPIEAPKKEDIQAPKPRKDATEAPVVNSSTTANPCGS 149
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL--IDQCKREGFLQRIKEEE---GEGC 114
CYGA+ + CCN C++V +AYR+K W NP L +QCK E + ++ E EGC
Sbjct: 150 CYGAQKNSSHCCNTCQDVIDAYREKQW---NPTLEEFEQCKTEVAIGKLSLEAKAFNEGC 206
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GV 173
IYG++EVN+V G+FH APGKSF S +HVHD+ F FN++H IN L+FGE F G
Sbjct: 207 QIYGYMEVNRVGGSFHIAPGKSFSISHIHVHDVQPFSSSRFNMTHHINTLSFGEEFGFGQ 266
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTL 232
+PLDG E + M+QY+IK+VPT + +SG + +NQFSVT H +S S +
Sbjct: 267 TSPLDGTDVIAEEGAMMFQYYIKIVPTEFVPLSGPKLHTNQFSVTTHRKSVSLMSGDSGM 326
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
PG+F Y+LSP+ V FTE+ SF HF TN+CAI+GG+FTVSGI+D ++ A+K+KIE
Sbjct: 327 PGIFVNYELSPLMVKFTEKRSSFSHFATNLCAIIGGIFTVSGIVDTLLFTSIHALKRKIE 386
Query: 293 IGKFS 297
+GK S
Sbjct: 387 LGKAS 391
>gi|156552683|ref|XP_001599365.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Nasonia vitripennis]
Length = 328
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 134/303 (44%), Positives = 192/303 (63%), Gaps = 16/303 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPK--IDKPLQRHGGRLEHNETYC 57
MD +GE HL+++H+IFK+RLD G IE ++ GI PK +KP + + C
Sbjct: 36 MDTTGETHLEIQHNIFKRRLDLDGKPIEDPKKTGIADPKKTTEKPAENATAK-------C 88
Query: 58 GSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
G CYGA S + CCN CEEV+EAYRK+ WA+ + QCK + + +E GC
Sbjct: 89 GDCYGAASEELGIKCCNTCEEVKEAYRKRKWAVHDTSRFAQCKNDKSREMTFKE---GCQ 145
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
IYGF+EVN+V G+FH APG S +HVHD+ + FN++H+I L+FG + PG N
Sbjct: 146 IYGFMEVNRVGGSFHIAPGDSITIDHLHVHDVQPYSSSQFNLTHRIRHLSFGTNIPGKTN 205
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
P+D + M+ ++IK+VPT + + G + +NQFS+T+H RS +Q ++ +PG
Sbjct: 206 PIDNTTVIASEGATMFHHYIKIVPTTFMRLDGSILHTNQFSLTKHSRSIKQYSGESGMPG 265
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF Y+LSP+ V +T+ S H +TN CAI+GG FTV+ IIDAF+YH RAI+KK+E+G
Sbjct: 266 LFFSYELSPLMVKYTQTVKSLGHLMTNTCAIIGGTFTVASIIDAFLYHSVRAIQKKMELG 325
Query: 295 KFS 297
K S
Sbjct: 326 KLS 328
>gi|157118753|ref|XP_001653244.1| ptx1 protein [Aedes aegypti]
gi|108875623|gb|EAT39848.1| AAEL008391-PA [Aedes aegypti]
Length = 384
Score = 273 bits (697), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 199/304 (65%), Gaps = 17/304 (5%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
D +GEQHL ++H I+K+R+D QGN IE ++++ I APK L++ E N C SC
Sbjct: 90 DATGEQHLHIEHTIYKRRMDLQGNPIEEAKKEDISAPK--PRLEKK----EENVKKCRSC 143
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEE---GEGCN 115
YGAE + CC C++V +AYR+K W NP+L D QC+ E L + E EGC
Sbjct: 144 YGAEKNSTHCCETCQDVIDAYREKQW---NPNLDDFEQCQNEVLLGKKSLESKAFSEGCQ 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVV 174
IYG ++VN+V G+FH APGKSF S +HVHD+ F FN SH+IN L+FGE F G
Sbjct: 201 IYGSMQVNRVGGSFHIAPGKSFSISHIHVHDVQPFSSSRFNTSHRINTLSFGEEFGYGQT 260
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLP 233
PLD T + M+QY+IK+VPT + ++G T+ +NQFSVT+H +S S +P
Sbjct: 261 RPLDFTEKTAHEGAIMFQYYIKIVPTEFVPLNGPTLHTNQFSVTKHQKSVSVMSGESGMP 320
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
G+F Y+LSP+ V FTE+ SF HF TN+CAI+GG+FTV+GIID+ ++ A+K+KIE+
Sbjct: 321 GIFVNYELSPLMVRFTEKRNSFSHFATNLCAIIGGIFTVAGIIDSLLFTSIHALKRKIEL 380
Query: 294 GKFS 297
GKFS
Sbjct: 381 GKFS 384
>gi|412994036|emb|CCO14547.1| predicted protein [Bathycoccus prasinos]
Length = 436
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 152/334 (45%), Positives = 193/334 (57%), Gaps = 40/334 (11%)
Query: 1 MDISGEQHLDV-KHDIFKKRLDSQG-------NVIESRQDGIGAPKIDKPLQRHGGRLEH 52
MD+SGE HLDV H++ K R D G N +++ + D L
Sbjct: 100 MDVSGETHLDVVDHEMRKIRYDRYGVKLADALNDEHGKEEVVNEKAFDSNETETASSLRK 159
Query: 53 NET------------------YCGSCYGAESS------DEDCCNNCEEVREAYRKKGWAL 88
N+T YCGSCYGA+ S ++ CC CEEVREAY + GWA
Sbjct: 160 NKTKKTAKELIPRYMEDGKTKYCGSCYGADVSGANRGREQRCCQTCEEVREAYIEVGWAF 219
Query: 89 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
+ ++QCKREGF + + EGC GFL+VNKV GNFH APGKSF Q HVHD+
Sbjct: 220 TGASSMEQCKREGFSEVLGNVHEEGCEFKGFLDVNKVQGNFHIAPGKSFQQGEQHVHDLS 279
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETP--SGMYQYFIKVVPTVYTDVS 206
F FN SH++ L+FGE +PG V+PLDG + T + P +G+YQYF ++VPT YT ++
Sbjct: 280 PFPDGKFNFSHEVRHLSFGEGYPGKVDPLDGTKRTLKLPAETGVYQYFFRIVPTTYTYLN 339
Query: 207 --GHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
I +NQ+SV +HF+ +S QG LPGVFFFYDLSPIKV E S FL
Sbjct: 340 PFKKDISTNQYSVVDHFKPVDAASIQGGSSDLPGVFFFYDLSPIKVDIAEYRTSVWKFLA 399
Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
VCA VGGVF VSGI+D +Y G AIKKKI++G
Sbjct: 400 EVCASVGGVFAVSGIVDKVVYKGSLAIKKKIQLG 433
>gi|198425065|ref|XP_002127888.1| PREDICTED: similar to ERGIC and golgi 3 [Ciona intestinalis]
Length = 385
Score = 268 bits (684), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 134/300 (44%), Positives = 192/300 (64%), Gaps = 16/300 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
+D+SG++ +DV+H + K+ L+S G+ + A K+D KP+ Y
Sbjct: 95 IDVSGQRDIDVQHTLVKQPLNSDGSWVAE-----AAEKVDLVGTKPVL--NATEPPPADY 147
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
CGSC+GAE+ D CCN C +++EAYR+KGWA I C E KE G GC +
Sbjct: 148 CGSCFGAETKDMTCCNTCSDIKEAYRRKGWAFPRDGSITPCIGE---DDDKEPVGSGCYL 204
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAFGEHFPGVVN 175
+G LEVN+VAGNFH +PGKS+ +HVHD+ + N+SH N L+FG +PG V+
Sbjct: 205 HGHLEVNRVAGNFHISPGKSYEVGHMHVHDMARMGKYKESNVSHVFNHLSFGSTYPGQVH 264
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
PLD + S +QY++K+VPT Y +SG T +NQFSVT H + ++ R ++LPG+
Sbjct: 265 PLDNLEVIASESSVAFQYYVKIVPTTYEKLSGDTFHTNQFSVTRHQKRNKDSR-ESLPGM 323
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
F Y+LSP+ V + E SF+HFLT+VCAI+GG+FTV+G+ D+FIYHG +A++KKIE+GK
Sbjct: 324 FVSYELSPMMVRYVERRRSFVHFLTSVCAIIGGIFTVAGLFDSFIYHGSKALQKKIELGK 383
>gi|145340712|ref|XP_001415464.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144575687|gb|ABO93756.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 379
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 138/304 (45%), Positives = 194/304 (63%), Gaps = 15/304 (4%)
Query: 1 MDISGEQHLDV-KHDIFKKRLDSQGNVIE--SRQDGIGAPKIDKPLQRH--GGRLEHNET 55
MD++GE LDV + ++ R+D++G I S + + A +R GGR +
Sbjct: 82 MDVTGETRLDVSRSEVRTTRVDARGRAIAMTSERTAVNAKTEAGEREREATGGR-----S 136
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CG CYGA + CC++C+ VREAYR KGWAL + + QC +E + ++ E EGC+
Sbjct: 137 ACGDCYGAAEAGT-CCDDCDSVREAYRVKGWALPDLRRVTQCTKEYDVVAMRNEHKEGCH 195
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISHKINKLAFGEHFPGVV 174
G EVNKVAGNFH APGKS++ G HVHD+ F +SFN SH I+KL+FGE FPGVV
Sbjct: 196 FSGHFEVNKVAGNFHIAPGKSYNNLGQHVHDLSPFAGVESFNFSHIIHKLSFGEEFPGVV 255
Query: 175 NPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVS--GHTIQSNQFSVTEHFRSSEQGRLQT 231
NPLDGV R + +G+YQY + VVP Y + ++SN +SVT+HFR + +
Sbjct: 256 NPLDGVTRTMDDANAGVYQYRLSVVPARYKYLGFRARVVESNDYSVTDHFRGFDVTKNPG 315
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
LPG+FFFYDLSP++V + E + F +L+NV AI+GGV V I+D +Y GQRA+++K+
Sbjct: 316 LPGLFFFYDLSPLRVEYEERRIGFFQYLSNVAAIIGGVSAVVNIVDGLVYRGQRALREKV 375
Query: 292 EIGK 295
++GK
Sbjct: 376 DLGK 379
>gi|358334909|dbj|GAA53334.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Clonorchis sinensis]
Length = 323
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 130/298 (43%), Positives = 182/298 (61%), Gaps = 13/298 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +GEQ +DV I+K R+DS G+ I + + G P G + + YCGSC
Sbjct: 28 MDSTGEQKIDVSQQIYKTRIDSTGSPISATRRDDGNPS-------KGQVVTKDPDYCGSC 80
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES CCN C+E++ AY+++ W + N + +QC+ E + + EGC I G L
Sbjct: 81 YGAESETRKCCNTCKEIQLAYQERHWVVKNLSVFEQCREEQWDDTLANLGSEGCRIQGSL 140
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+VNKVAG+FH PG S+ VHVH++ F N+SHKI+KLAFG +PG NPLDG
Sbjct: 141 QVNKVAGSFHITPGNSYASDQVHVHNLQGFDGQKLNMSHKIDKLAFGNMYPGQTNPLDGT 200
Query: 181 RWTQETPSGMYQYFIKVVPTVY-----TDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
P+ M Y++K+VPT+Y T S T+ +NQ+SVT H + S + +PG
Sbjct: 201 TMNVVEPAQMVTYYMKLVPTMYVSYNTTTRSLSTVHTNQYSVTWHSKGSPLTSDSSGIPG 260
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
+FF Y+LSP+ V + EH SFLHFLTN CAI+GGVFTV+ ++DAFIY ++K++
Sbjct: 261 LFFNYELSPLLVKISYEHKSFLHFLTNTCAIIGGVFTVASLLDAFIYQSTCVVRKRLS 318
>gi|150036309|emb|CAO03349.1| ERGIC and golgi 3 [Homo sapiens]
Length = 325
Score = 263 bits (673), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 128/240 (53%), Positives = 167/240 (69%), Gaps = 6/240 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 90 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 146
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 206
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 266
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 267 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 325
>gi|328770814|gb|EGF80855.1| hypothetical protein BATDEDRAFT_19389 [Batrachochytrium
dendrobatidis JAM81]
Length = 409
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 195/301 (64%), Gaps = 9/301 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE ++ H + K R+D GN++E +Q +G +++ + + YCGSC
Sbjct: 110 MDVSGEHQNNLPHSMHKVRIDQLGNLLE-KQKKLGNTN-SSGVKKEIRDMALDPKYCGSC 167
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG + + CCN CE+V+EAY + GW+ ++PD I+QC REG+ +R++ + E CNIYG +
Sbjct: 168 YGGVAPESKCCNTCEQVQEAYERSGWSFTDPDSIEQCVREGWSKRMETQINEACNIYGHI 227
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHFPGVVNPLD 178
EVNKV GN HFAPG SF Q+ +HVHD+ + SFN H I++L+FGE VNPLD
Sbjct: 228 EVNKVQGNIHFAPGHSFQQNALHVHDLHDYNAPNGSFNFKHTIHELSFGES-SSFVNPLD 286
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ--GRLQT-LPGV 235
V T T YQY+IKVV T + ++G + +NQFSVTEH + G L +PG
Sbjct: 287 TVTKTPPTKYFSYQYYIKVVGTDISYLNGSQLTTNQFSVTEHEQDVTPLFGALPIGMPGK 346
Query: 236 FFF-YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
FF +++SP+ V F E F HFLT++CAI+GGVFTV+G+IDA ++ QR+I+ K+EIG
Sbjct: 347 LFFNFEISPMLVKFKEFRKPFTHFLTDLCAIIGGVFTVAGMIDALLFATQRSIQAKVEIG 406
Query: 295 K 295
K
Sbjct: 407 K 407
>gi|158300475|ref|XP_320382.3| AGAP012144-PA [Anopheles gambiae str. PEST]
gi|157013177|gb|EAA00591.3| AGAP012144-PA [Anopheles gambiae str. PEST]
Length = 386
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 193/308 (62%), Gaps = 23/308 (7%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET------ 55
D +GEQHL ++H+I+K+RLD QGN IE PK + +Q R+ E
Sbjct: 90 DSTGEQHLHIEHNIYKRRLDLQGNQIEE-------PK-KEDIQASTKRISSTEAPATTTV 141
Query: 56 --YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEEG 111
CGSCYGA + CCN C+EV +AYR++ W NP++ D QCK +
Sbjct: 142 KPACGSCYGAAKNASQCCNTCQEVIDAYRERKW---NPNVEDFEQCKNGNGGSVEGKAFS 198
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGC+IYG +EVN+V G FH APGKSF + +HVHD+ + FN +H+IN L+FGE F
Sbjct: 199 EGCHIYGTMEVNRVEGRFHIAPGKSFSINHIHVHDVQPYSSSRFNTTHRINTLSFGEQFG 258
Query: 172 -GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
G PLDG+ + M+QY+IK+VPT++ ++G T+ +NQFSVT+H +S +
Sbjct: 259 FGTTRPLDGLMVEATEGAMMFQYYIKIVPTMFVPLNGPTLYTNQFSVTKHQKSVTAMSGE 318
Query: 231 T-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
T +PG+F Y+LSP+ V FTE+ S HF TNVCAI+GG+FTV+GIID+ ++ IK+
Sbjct: 319 TGMPGIFVNYELSPLMVKFTEKRNSLGHFATNVCAIIGGIFTVAGIIDSLLFTSIHVIKR 378
Query: 290 KIEIGKFS 297
KIE+GK S
Sbjct: 379 KIELGKAS 386
>gi|357612408|gb|EHJ67977.1| hypothetical protein KGM_08440 [Danaus plexippus]
Length = 385
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 133/298 (44%), Positives = 181/298 (60%), Gaps = 8/298 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SGEQHL + H++ K+RLD G I E ++ I K E CGS
Sbjct: 91 MDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKEDISLSSTVKQ-----NSSEIAIVTCGS 145
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA +D CCN CE+V+EAYR + WAL + ++QCK + L+R EGC IYG+
Sbjct: 146 CYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLATVEQCKDDDSLERTNLALKEGCQIYGY 205
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLD 178
+EVN+V G+FH APGKSF + VHVHD+ F FN +H I L+FG PLD
Sbjct: 206 MEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSSVFNTTHIIRHLSFGSDIESANTAPLD 265
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFF 237
G+ + + M+QY++K+VPT+Y + G + +NQFSVT H +S +++ +PG FF
Sbjct: 266 GITGLAKEGAVMFQYYLKIVPTMYVKLDGTILHTNQFSVTRHQKSVSNINVESGMPGAFF 325
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y+LSP+ V +T + S HF TNVCAIVGGVFTV+GI D +YH A + K+ +GK
Sbjct: 326 SYELSPLMVKYTAKGRSIGHFATNVCAIVGGVFTVAGIFDTLLYHSLNAFQNKVVLGK 383
>gi|296481082|tpg|DAA23197.1| TPA: endoplasmic reticulum-Golgi intermediate compartment protein 3
[Bos taurus]
Length = 306
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 124/224 (55%), Positives = 157/224 (70%), Gaps = 11/224 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
LD T S M+QYF+KVVPTVY V G +++NQFSVT H
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRH 305
>gi|291000812|ref|XP_002682973.1| predicted protein [Naegleria gruberi]
gi|284096601|gb|EFC50229.1| predicted protein [Naegleria gruberi]
Length = 416
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 196/326 (60%), Gaps = 40/326 (12%)
Query: 1 MDISGEQHLDVK-HDIFKKRLDSQGN-VIESRQDGIGAPKIDKP----LQRHGGRLEHN- 53
MD+SGE H+ + H ++K RL G +IE + + + DKP L+ G ++H+
Sbjct: 85 MDVSGEHHVHLDYHTVYKMRLTLDGKPIIEQQAEQVSD---DKPTLDILKPPPGAVKHDL 141
Query: 54 -------------------ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
YCGSCYG+ CCN C++VRE+YR+ GWA S + I
Sbjct: 142 VNNAELDKIRAERAKKVKDPKYCGSCYGSNRDANQCCNTCDDVRESYRRVGWAFSPNEDI 201
Query: 95 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
+QC E +++K + EGCN++G+ VNKVAGNFHFAPGKSF ++ H+HD ++ D
Sbjct: 202 EQCYEEILERKMKYSKQEGCNLHGYFLVNKVAGNFHFAPGKSFVRAQQHMHDYTNYEVDH 261
Query: 155 FNISHKINKLAFGEHFPGVVNPLDG----VRWTQET------PSGMYQYFIKVVPTVYTD 204
FN SH IN L FGE PG++NPLDG + + ET S ++QYF+KVVPT+Y
Sbjct: 262 FNTSHIINYLGFGEKIPGLINPLDGTSKIIGYNAETGQRVEGESALFQYFVKVVPTIYEK 321
Query: 205 V-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
S ++I +NQ+SVT+H R + +PGVFF YDLSPI V TE SF+ FLT++C
Sbjct: 322 YGSSNSIITNQYSVTQHSRPKNRLHPNVVPGVFFIYDLSPIMVHITENKKSFVQFLTSLC 381
Query: 264 AIVGGVFTVSGIIDAFIYHGQRAIKK 289
AI+GGVFTVS ++D IY ++ + +
Sbjct: 382 AIIGGVFTVSALLDRVIYGVEKKMNR 407
>gi|345319994|ref|XP_001507420.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Ornithorhynchus anatinus]
Length = 203
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 121/204 (59%), Positives = 150/204 (73%), Gaps = 3/204 (1%)
Query: 70 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
CCN CE+VREAYR++GWA NPD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNF
Sbjct: 1 CCNTCEDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 60
Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG 189
HFAPGKSF QS VH + L N++H I L+FGE +PG+VNPLDG + S
Sbjct: 61 HFAPGKSFQQSHVHGKERLRIHPRPINMTHYIEHLSFGEDYPGIVNPLDGTDVSAPQASM 120
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 247
M+QYF+KVVPTVY G +++NQFSVT H + + G + Q LPGVF Y+LSP+ V
Sbjct: 121 MFQYFVKVVPTVYVKADGEVVRTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVK 179
Query: 248 FTEEHVSFLHFLTNVCAIVGGVFT 271
TE+H SF HFLT VCAI+GGVFT
Sbjct: 180 LTEKHRSFTHFLTGVCAIIGGVFT 203
>gi|321463520|gb|EFX74535.1| hypothetical protein DAPPUDRAFT_226626 [Daphnia pulex]
Length = 381
Score = 257 bits (657), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 129/297 (43%), Positives = 189/297 (63%), Gaps = 10/297 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
+D SGEQ V+H+IFK+RL+ G +++ + +I+K + E + + C S
Sbjct: 91 VDSSGEQQFGVEHNIFKQRLNLLGEPLQAAE----LEEINKTHNKTETSTEESASKPCNS 146
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA+ E CC C EVREAYR+K WA P+ +QC+ E L R EGC +YG+
Sbjct: 147 CYGAK---EGCCETCAEVREAYRQKNWAF-RPEEFEQCRNEKNLTRDYSAFKEGCKLYGY 202
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVN+V+G+FH APGKS+ + VHVHD+ + + FN++H IN L+FG G NPLDG
Sbjct: 203 LEVNRVSGSFHIAPGKSYAINHVHVHDVQPYSSEDFNVTHHINSLSFGTSLIGKENPLDG 262
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFF 238
T + + M+QY+IKVVPT Y + G +NQ+SVT H + S G +PGVFF
Sbjct: 263 FLTTADKGAMMFQYYIKVVPTWYVKLDGEEFHTNQYSVTRHQKVVSSYGGESGVPGVFFT 322
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
Y++SP+++++ E S HF T+VC I+GGVFTV+GIID+ +Y + +++K+++GK
Sbjct: 323 YEMSPLQISYKESKRSIGHFATDVCTIIGGVFTVAGIIDSLLYRSSKLLQQKLQLGK 379
>gi|226486462|emb|CAX74360.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 125/293 (42%), Positives = 175/293 (59%), Gaps = 10/293 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +G Q L+V H+++K + GN + + D L + YCGSC
Sbjct: 90 MDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRH--TVNDDSALTT-----TRDPNYCGSC 142
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+S CCN CEEV+ AY + W N +QC+ E + + EGC I+G L
Sbjct: 143 YGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSL 202
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VN+V G FH APG S+ ++ HVH I + FN+SH I +L FG+ +PG +N LDG
Sbjct: 203 TVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQFNVSHSITELRFGDAYPGQINSLDGT 262
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFF 237
+ T + PS M+ Y++K+VPT+YT VS + T+ +NQ+S T H R S G Q LPGVFF
Sbjct: 263 KMTVDKPSQMFNYYLKLVPTMYTSVSNNESTLITNQYSATWHSRGSPLSGDGQGLPGVFF 322
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
Y+++P+ V TEE SF+HFLTN CAI+GGVFTV+ ++DAFIY ++ +
Sbjct: 323 NYEIAPLLVKITEERKSFVHFLTNTCAIIGGVFTVASLLDAFIYQSSCVLRNR 375
>gi|56753075|gb|AAW24747.1| SJCHGC09363 protein [Schistosoma japonicum]
gi|226486460|emb|CAX74359.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
gi|226486464|emb|CAX74361.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Schistosoma japonicum]
Length = 379
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 125/293 (42%), Positives = 175/293 (59%), Gaps = 10/293 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +G Q L+V H+++K + GN + + D L + YCGSC
Sbjct: 90 MDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRH--TVNDDSALTT-----TRDPNYCGSC 142
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+S CCN CEEV+ AY + W N +QC+ E + + EGC I+G L
Sbjct: 143 YGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSL 202
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VN+V G FH APG S+ ++ HVH I + FN+SH I +L FG+ +PG +N LDG
Sbjct: 203 TVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQFNVSHSITELRFGDAYPGQINSLDGT 262
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFF 237
+ T + PS M+ Y++K+VPT+YT VS + T+ +NQ+S T H R S G Q LPGVFF
Sbjct: 263 KMTVDKPSQMFNYYLKLVPTMYTSVSNNESTLITNQYSATWHSRGSPLSGDGQGLPGVFF 322
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
Y+++P+ V TEE SF+HFLTN CAI+GGVFTV+ ++DAFIY ++ +
Sbjct: 323 NYEIAPLLVKITEERKSFVHFLTNTCAIIGGVFTVASLLDAFIYQSSCVLRNR 375
>gi|358054679|dbj|GAA99605.1| hypothetical protein E5Q_06306 [Mixia osmundae IAM 14324]
Length = 424
Score = 249 bits (637), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 185/318 (58%), Gaps = 35/318 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE D+ HDI K RLD G ++++ +D + L+R G ++ YCGSC
Sbjct: 93 MDISGEHQNDIHHDILKNRLDKSGALVQATRD----STLKGELERAVG-VKREPGYCGSC 147
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG D CCN C+EVRE+Y ++GW+ NPD IDQC REGF ++IKE+ EGCN+ G +
Sbjct: 148 YGGAPGDSGCCNTCDEVRESYVRRGWSFVNPDGIDQCVREGFSEKIKEQSEEGCNVAGQV 207
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFG----------- 167
+VNKV GNFH +PGKSF + HVHD++ + + H IN+ +F
Sbjct: 208 KVNKVIGNFHLSPGKSFQSNMHHVHDLVPYLAAGQQHDFGHIINRFSFAAEGDDGFNRET 267
Query: 168 ---EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+ + +PL GVR E + M+QYF+KVV T + + G T+ S+Q+SVT++ R
Sbjct: 268 ARLKQSLNIEDPLTGVRAHTEQSNYMFQYFVKVVSTKFKTLDGRTLSSHQYSVTQYERDL 327
Query: 225 EQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+G +PG+FF Y++SP+ V EE SF HF+T+ CAIVGG+
Sbjct: 328 SKGNKPGKDEDGHQTSHGYAGVPGLFFNYEISPMLVVHREERQSFAHFITSTCAIVGGIL 387
Query: 271 TVSGIIDAFIYHGQRAIK 288
TV+G+ID +Y Q ++
Sbjct: 388 TVAGLIDTLVYSSQTRLQ 405
>gi|389612123|dbj|BAM19583.1| ptx1 protein [Papilio xuthus]
Length = 285
Score = 249 bits (636), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 129/292 (44%), Positives = 179/292 (61%), Gaps = 16/292 (5%)
Query: 11 VKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED 69
+ H+I K+RLD GN IE +++ I I ++++ L CGSCYGA +D
Sbjct: 1 MDHNIHKRRLDLDGNPIEEPKKEEIA---ISSTVKQNTSELA--TVTCGSCYGAAFNDSQ 55
Query: 70 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
CCN CE+V+EAYR + WAL + I QCK + L++ EGC IYG++EVN+V G+F
Sbjct: 56 CCNTCEDVKEAYRIRRWALPDLATIVQCKDDESLEKANLALKEGCQIYGYMEVNRVGGSF 115
Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLDGVRWTQETPS 188
H APGKSF + VHVHD+ + +FN +H I L+FG PLDGV+ + +
Sbjct: 116 HIAPGKSFTINHVHVHDVQPYSSSAFNTTHXIQHLSFGSDIKSANTAPLDGVKGIAQEGA 175
Query: 189 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-----SEQGRLQTLPGVFFFYDLSP 243
M+QY+IK+ PT+Y + + +NQFSVT H +S SE G +PG FF Y+LSP
Sbjct: 176 VMFQYYIKIGPTMYVKLDKTVLHTNQFSVTRHQKSVSNINSESG----MPGAFFSYELSP 231
Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+ V +TE+ S HF TN+CAI+GGVFTV+GI+D +YH A KI +GK
Sbjct: 232 LMVKYTEKERSIGHFATNICAIIGGVFTVAGILDTLLYHSLNAFHNKIVLGK 283
>gi|349804919|gb|AEQ17932.1| putative ergic and golgi 3 [Hymenochirus curtipes]
Length = 228
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 120/231 (51%), Positives = 162/231 (70%), Gaps = 5/231 (2%)
Query: 6 EQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES 65
EQ LDV+H++FK RLD + S + K ++P+ L+ + C SCYGAE+
Sbjct: 1 EQQLDVEHNLFKLRLDKDRQPVSSEAERHDLGKAEEPVIFDPKSLDPDR--CESCYGAET 58
Query: 66 SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV 125
D CCN+C++VREAYR++GWA PD I+QCKREGF Q+++E++ EGC +YGFLEVNKV
Sbjct: 59 DDFRCCNSCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCRVYGFLEVNKV 118
Query: 126 AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 185
AGNFHFAPGKSF QS VHVHD+ +F D+ N++H+I L+FG +PG+VNPLDG +
Sbjct: 119 AGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGMDYPGLVNPLDGTSVSAV 178
Query: 186 TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
S M+QYF+K+VPTVY V G +++NQFSVT H + + G + Q LPG
Sbjct: 179 QSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKVT-NGLIGDQGLPG 228
>gi|195378906|ref|XP_002048222.1| GJ11466 [Drosophila virilis]
gi|194155380|gb|EDW70564.1| GJ11466 [Drosophila virilis]
Length = 372
Score = 246 bits (629), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
MD SG+ HL V HD+FK RLD +G P + P++ N+ + CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLEGQ-----------PLKETPIKEIVAVSPPNKNSTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRVRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+R QE+ S M+ Y++K+VPT+Y S G I +NQFSVT H R R + +PG+
Sbjct: 252 DGLRVEVQESKSEMFNYYLKIVPTLYERHSDGQPIYTNQFSVTRH-RKDLTDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E HVSF HF TN C+IVGGVFTV+GI+ + + A+++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHVSFGHFATNCCSIVGGVFTVAGILAVLLNNSWEALQRKLEVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|331241265|ref|XP_003333281.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309312271|gb|EFP88862.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 421
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 187/323 (57%), Gaps = 46/323 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-----HNET 55
MDISGE DV HD+ K RL G P Q+ G LE +
Sbjct: 93 MDISGEHQNDVAHDLAKTRLGLDG-----------VPLSTNTTQKLQGELETIIASRAKD 141
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
YCGSCYG E CCN+CEEVRE+Y ++GW+ +NPD I+QC +E + +RIKE+ EGCN
Sbjct: 142 YCGSCYGGEPGPSGCCNSCEEVRESYVRRGWSFNNPDGIEQCVQEHWSERIKEQSKEGCN 201
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGE----- 168
I G L+VNKV GNFH +PG+SF VHVHD++ + +DS + H I+ AF +
Sbjct: 202 INGVLKVNKVIGNFHLSPGRSFQTHQVHVHDLVPYLQDSNLHDFGHVIHNFAFMDANQPT 261
Query: 169 ---------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
G+VNPLDGV+ E + M+QYF+KVV T + + G +++Q+SVT+
Sbjct: 262 ETAHTLRLKKTLGIVNPLDGVKAHTEASNYMFQYFLKVVGTQFQLLDGQVAKTHQYSVTQ 321
Query: 220 HFR---------SSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ R + E G L + +PGVFF Y++SP++V E SF HF T+ CAI
Sbjct: 322 YERDLDNSDKSDADELGHLTSHGHSGVPGVFFNYEISPMQVVHQEYRQSFAHFATSTCAI 381
Query: 266 VGGVFTVSGIIDAFIYHGQRAIK 288
VGGV TV+G++D+F+Y Q +K
Sbjct: 382 VGGVLTVAGLLDSFVYGAQNRMK 404
>gi|195126511|ref|XP_002007714.1| GI12235 [Drosophila mojavensis]
gi|193919323|gb|EDW18190.1| GI12235 [Drosophila mojavensis]
Length = 372
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
MD SG+ HL V HD+FK RLD GN P + P++ N+ + CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLDGN-----------PLKETPIKEIVAVSPPNKNSTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+R +E+ S M+ Y++K+VPT+Y S G I +NQFSVT H R R + +PG+
Sbjct: 252 DGLRVDVEESKSEMFNYYLKIVPTLYERHSDGKPIYTNQFSVTRH-RKDLTDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E HVSF HF TN C+I+GGVFTV+GI+ + + AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHVSFGHFATNCCSIIGGVFTVAGILAVVLNNSLEAIQRKLEVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|195021391|ref|XP_001985385.1| GH17030 [Drosophila grimshawi]
gi|193898867|gb|EDV97733.1| GH17030 [Drosophila grimshawi]
Length = 372
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 186/302 (61%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
MD SG+ HL V HD+FK RLD QG P + P++ N+ + CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLQGE-----------PLKETPIKEIVAVSPPNKNSTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+R +E+ S M+ Y++K+VPT+Y S G I +NQFSVT H R R + +PG+
Sbjct: 252 DGIRVDVEESKSEMFNYYLKIVPTLYERHSDGEPIYTNQFSVTRH-RKDLTDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E H SF HF TN C+IVGGVFTV+GI+ + + AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHNSFGHFATNCCSIVGGVFTVAGILAVLLNNSWEAIQRKLEVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|58264656|ref|XP_569484.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134109945|ref|XP_776358.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50259032|gb|EAL21711.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57225716|gb|AAW42177.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 422
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 188/326 (57%), Gaps = 40/326 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE + +H + K R++ GNVI Q G +++ L + YCGSC
Sbjct: 92 MDISGEHQTEFEHQVTKTRMNKDGNVISKVQGGQLKGDVER------ANLNQDPNYCGSC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA + CCN+CEEVR+AY +KGW+ S+P+ I+QC EG++ ++KE+ EGC I G +
Sbjct: 146 YGALPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIDGHI 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
VNKV GN HF+PG+SF + + + +++ + RD + H ++K FG
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGADMTKAEELTV 265
Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
G+ +PL G++ E + M+QYF+KVV T + +SG I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLRDPLQGIKAHTEVSNYMFQYFLKVVSTNFISLSGEEISSHQYSVTQY 325
Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R G + +PGVFF Y++SP+KV TEE SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKKIE 292
GGV TV+ ++D+ I++ + +KKK E
Sbjct: 386 GGVLTVASLVDSLIFNSSKRLKKKSE 411
>gi|443732120|gb|ELU16969.1| hypothetical protein CAPTEDRAFT_192533 [Capitella teleta]
Length = 304
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 124/255 (48%), Positives = 164/255 (64%), Gaps = 6/255 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQ +DV HDIFK+RLD G +++ G L + SC
Sbjct: 52 MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEGQSSESCALNHALSSFLFSRF---SC 108
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES CCN C EVREAYR+KGWA + I+QC REG++ +++E + EGC IYGFL
Sbjct: 109 YGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGCRIYGFL 168
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKVAGNFH APG+SF Q H+HD+ A Q FN+SH+I L+FG+ +PG VNPLD
Sbjct: 169 EVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQVNPLDAS 228
Query: 181 -RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFF 237
+ T++ M+ Y++KVVPT Y +G + SNQ+SVT+H + G L Q LPGVF
Sbjct: 229 EQVTEQADFVMFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFV 288
Query: 238 FYDLSPIKVTFTEEH 252
Y+LSP+ V +TE++
Sbjct: 289 TYELSPMMVKYTEKN 303
>gi|194751543|ref|XP_001958085.1| GF10736 [Drosophila ananassae]
gi|190625367|gb|EDV40891.1| GF10736 [Drosophila ananassae]
Length = 372
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/302 (45%), Positives = 184/302 (60%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HDIFK RLD +G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CEEV +AYR + W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEEVLDAYRLRKWNV-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+ +E S M+ Y++K+VPT+Y D G I +NQFSVT H R R + +PG+
Sbjct: 252 DGMHVEVEEKKSEMFNYYLKIVPTLYMRDSDGKPIYTNQFSVTRH-RKDLSDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+ + + AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSLEAIQRKLEVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|321253192|ref|XP_003192660.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317459129|gb|ADV20873.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 435
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 126/327 (38%), Positives = 190/327 (58%), Gaps = 40/327 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE + +H + K R+D G +I Q G ++ L+R L + YCGSC
Sbjct: 92 MDISGEHQTEFEHQVTKTRIDKNGKIISKVQGG----QLKGDLER--ANLNQDPNYCGSC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA + CCN+CEEVR+AY +KGW+ S+P+ I+QC EG++ ++KE+ EGC I G +
Sbjct: 146 YGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIGGHI 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
VNKV GN HF+PG+SF + + + +++ + RD + H ++K FG
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGGDMTKAEELTV 265
Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
G+ +PL G++ E + M+QYF+KVV T + ++G I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLKDPLQGIKVHTEVSNYMFQYFLKVVSTNFISLNGEEIPSHQYSVTQY 325
Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R G + +PGVFF Y++SP+KV TEE SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKKIEI 293
GGV TV+ ++D+FI++ + +KK E+
Sbjct: 386 GGVLTVASLLDSFIFNSSKRLKKTSEV 412
>gi|194374867|dbj|BAG62548.1| unnamed protein product [Homo sapiens]
Length = 321
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 119/224 (53%), Positives = 153/224 (68%), Gaps = 21/224 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLE-------- 51
MD++GEQ LDV+H++FK+RLD G + S GA +RH G++E
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSS-----GA-------ERHELGKVEVTVFDPDS 136
Query: 52 HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
+ C SCYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++
Sbjct: 137 LDPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKN 196
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +P
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYP 256
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
G+VNPLD T S M+QYF+KVVPTVY V G Q +
Sbjct: 257 GIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVSQGAPY 300
>gi|195327731|ref|XP_002030571.1| GM24497 [Drosophila sechellia]
gi|195590409|ref|XP_002084938.1| GD12569 [Drosophila simulans]
gi|194119514|gb|EDW41557.1| GM24497 [Drosophila sechellia]
gi|194196947|gb|EDX10523.1| GD12569 [Drosophila simulans]
Length = 373
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/303 (44%), Positives = 185/303 (61%), Gaps = 24/303 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HD+FK RLD G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W ++ D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
DG+R ET S M+ Y++K+VPT+Y + G I +NQFSVT +R R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+ + + AI++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370
Query: 295 KFS 297
K S
Sbjct: 371 KLS 373
>gi|21357439|ref|NP_648758.1| CG7011 [Drosophila melanogaster]
gi|7294304|gb|AAF49653.1| CG7011 [Drosophila melanogaster]
gi|16768234|gb|AAL28336.1| GH25868p [Drosophila melanogaster]
gi|220946650|gb|ACL85868.1| CG7011-PA [synthetic construct]
Length = 373
Score = 241 bits (616), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/303 (44%), Positives = 184/303 (60%), Gaps = 24/303 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HD+FK RLD G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W ++ D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
DG+R ET S M+ Y++K+VPT+Y + G I +NQFSVT +R R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV+GI+ + + AI++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAERHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370
Query: 295 KFS 297
K S
Sbjct: 371 KLS 373
>gi|195441336|ref|XP_002068468.1| GK20487 [Drosophila willistoni]
gi|194164553|gb|EDW79454.1| GK20487 [Drosophila willistoni]
Length = 372
Score = 241 bits (614), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 136/302 (45%), Positives = 186/302 (61%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
MD SG+ HL V HD+FK RLD +G P + P++ N+ + CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLKGE-----------PLKETPIKEIVAVSPANKNSTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AY K W++ D ++QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYHLKKWSV-QVDKLEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+R +E+ S M+ Y+IK+VPT+Y S G I +NQFSVT +R R + +PG+
Sbjct: 252 DGLRVNVEESKSEMFNYYIKIVPTLYERNSDGQPIYTNQFSVTR-YRKDLTDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV+GI+ + + AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHNSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|195495133|ref|XP_002095138.1| GE19855 [Drosophila yakuba]
gi|194181239|gb|EDW94850.1| GE19855 [Drosophila yakuba]
Length = 373
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 185/303 (61%), Gaps = 24/303 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HD+FK RLD G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR + W ++ D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWNVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
DG+R ET S M+ Y++K+VPT+Y + G I +NQFSVT +R R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+ + + A+++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370
Query: 295 KFS 297
K S
Sbjct: 371 KLS 373
>gi|125978263|ref|XP_001353164.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
gi|54641917|gb|EAL30666.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
Length = 372
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/302 (45%), Positives = 184/302 (60%), Gaps = 23/302 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HDIFK RLD +G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLHKWNV-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG+R ET S M+ Y++K+VPT+Y S G I +NQFSVT +R R + +PG+
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRQSDGQPIYTNQFSVTR-YRKDLTDRERGMPGI 310
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+ + + AI++K+++GK
Sbjct: 311 FFSYELSPLMVKYAEKHNSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLDVGK 370
Query: 296 FS 297
S
Sbjct: 371 LS 372
>gi|194872681|ref|XP_001973062.1| GG13555 [Drosophila erecta]
gi|190654845|gb|EDV52088.1| GG13555 [Drosophila erecta]
Length = 373
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/303 (44%), Positives = 184/303 (60%), Gaps = 24/303 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HD+FK RLD G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CEEV +AYR + W ++ D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEEVLDAYRLRKWNVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
DG+R ET S M+ Y++K+VPT+Y + G I +NQFSVT +R R + +PG
Sbjct: 252 DGLRVEVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
+FF Y+LSP+ V + E+ SF HF TN C+I+GGVFTV+GI+ + + A+++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKRSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370
Query: 295 KFS 297
K S
Sbjct: 371 KLS 373
>gi|301106576|ref|XP_002902371.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262098991|gb|EEY57043.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 393
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 121/295 (41%), Positives = 173/295 (58%), Gaps = 18/295 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GE +++ + K RLD+ G I + D + K D P YCGSC
Sbjct: 112 MDVAGELQVNMHQTVVKTRLDANGRSISTTADELA--KTDLP-----------AGYCGSC 158
Query: 61 YGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
YG + ++CCN CEEV+EA+ +L + +QC RE ++GEGC G
Sbjct: 159 YGTRHPAGKECCNTCEEVKEAFIHSDLSLEEAEQKEQCVRESIDTEKLAQDGEGCRFTGK 218
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
+ VN+VAGNFH A G++FH+ G VH Q +FN SH I+ L+FGE PG +PLDG
Sbjct: 219 MFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTFNSSHIIHSLSFGEPIPGATSPLDG 278
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFF 238
V E G++QY+IK+VPT+Y+D+ I S QFSVT+ + +G++ +LPG FF
Sbjct: 279 VSKIAEQSGGVFQYYIKIVPTIYSDIDESAIHSYQFSVTQQSNYLNPRGQMTSLPGTFFV 338
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY---HGQRAIKKK 290
+DLSP V + V F HFLT +CAIVGGV +++G +D+F+Y H +R + K
Sbjct: 339 FDLSPFMVKVENDRVPFTHFLTKICAIVGGVISIAGFVDSFMYNSLHVRRRVSSK 393
>gi|3860008|gb|AAC72954.1| unknown [Homo sapiens]
Length = 198
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 111/185 (60%), Positives = 138/185 (74%), Gaps = 3/185 (1%)
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGF
Sbjct: 8 CYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGF 67
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 68 LEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDH 127
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFF 237
T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L Q LPGVF
Sbjct: 128 TNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFA 186
Query: 238 FYDLS 242
LS
Sbjct: 187 HLPLS 191
>gi|328858670|gb|EGG07782.1| hypothetical protein MELLADRAFT_105603 [Melampsora larici-populina
98AG31]
Length = 422
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 190/327 (58%), Gaps = 38/327 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MDISGE DV HD+ K RL+ G ++ S G+ R G YCGS
Sbjct: 93 MDISGEHQNDVNHDMTKTRLNPDGTLVSASVSKGLKGELDTIAATRAPG-------YCGS 145
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYG + CCN CEEVRE+Y ++GW+ SNPD I+QC +E + +IKE+E EGCN+ G
Sbjct: 146 CYGGTPPESGCCNTCEEVRESYVRRGWSFSNPDGIEQCVQEHWSDKIKEQEKEGCNMNGQ 205
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF-GEHFP----- 171
++VNKV GNFH +PG+SF + +HVHD++ + + +S + H I+K AF EH
Sbjct: 206 VKVNKVIGNFHMSPGRSFQTNAMHVHDLVPYLQTGNSHDFGHIIHKFAFLAEHQSPDDDE 265
Query: 172 --------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR- 222
G+VNPLDG++ E + M+QYF+KVV T + + ++++Q+SVT++ R
Sbjct: 266 TRRIKTSLGIVNPLDGIKAHTEESNYMFQYFLKVVGTEFHLLDQRVVKTHQYSVTQYERD 325
Query: 223 --SSEQGRLQTL-----------PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
S +G L PG+FF Y++SP++V E SF HF T+ CAI+GGV
Sbjct: 326 LTKSSRGGTDELGHQTSHGYAGVPGLFFNYEISPMQVIHKEYRQSFAHFATSTCAIIGGV 385
Query: 270 FTVSGIIDAFIYHGQRAIKKKIEIGKF 296
TV+G+ID+ +Y + IK + G F
Sbjct: 386 LTVAGLIDSAVYGARNRIKLQSSDGGF 412
>gi|348680250|gb|EGZ20066.1| CopII vesicle protein [Phytophthora sojae]
Length = 409
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 122/301 (40%), Positives = 179/301 (59%), Gaps = 10/301 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GE +++ + K RLD+ GN I G I + E YCGSC
Sbjct: 113 MDVAGELQVNMHQTVVKTRLDADGNTI-----GRPISMITDEGAEEQAKTALPEGYCGSC 167
Query: 61 YGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+GA+ + ++CCN CE+V+EA+ ++L + + +QC RE ++GEGC G
Sbjct: 168 HGAQHPAGKECCNTCEDVKEAFIYSDFSLEDAEQKEQCVREIMEAEKLAQDGEGCRFTGK 227
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
+ VN+VAGNFH A G++FH+ G VH Q ++N SH I+ L+FGE PGV PLDG
Sbjct: 228 MFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTYNSSHIIHSLSFGEPMPGVAGPLDG 287
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFF 238
V E G++QY+IK+VPT+Y+D+ +TI S QFSVT+ + +G++ +LPG FF
Sbjct: 288 VSKIAEQSGGVFQYYIKIVPTIYSDIDENTIHSYQFSVTQQGNYLNPRGQMTSLPGTFFV 347
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY---HGQRAIKKKIEIGK 295
+DLSP V + + F HFLT VCAIVGGV +++G +D+F+Y H +R + K
Sbjct: 348 FDLSPFMVKVENDRMPFTHFLTKVCAIVGGVISIAGFVDSFMYNSLHVRRRVSTNSGATK 407
Query: 296 F 296
F
Sbjct: 408 F 408
>gi|409079094|gb|EKM79456.1| hypothetical protein AGABI1DRAFT_120853 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1000
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 182/323 (56%), Gaps = 42/323 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETYCG 58
MDISGE D+ H+I K RL++ G ++ + ++DK +Q+ G YCG
Sbjct: 673 MDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG--------YCG 724
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYG CCN C+EVR+AY +GW+ S+PD I+QCKREG+ +++K++ EGCN+ G
Sbjct: 725 SCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMKDQADEGCNVSG 784
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAF---------- 166
L VNKV GN H +PG+SF + ++++++ + RD + SH+I+ AF
Sbjct: 785 RLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFAFEGDDEYVYWK 844
Query: 167 ---GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
G +NPLDG ++ M+QYF+KVV T + + G + ++Q+SVT
Sbjct: 845 ASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGKIVNTHQYSVTH 904
Query: 220 HFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R E Q Q LPG FF Y++SPI V + SF HFLT+ CAIV
Sbjct: 905 FERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSFAHFLTSTCAIV 964
Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
GGV TV+ ++D+ ++ RA+KK
Sbjct: 965 GGVLTVASLVDSLLFATTRALKK 987
>gi|426196003|gb|EKV45932.1| hypothetical protein AGABI2DRAFT_207344 [Agaricus bisporus var.
bisporus H97]
Length = 1000
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 182/323 (56%), Gaps = 42/323 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETYCG 58
MDISGE D+ H+I K RL++ G ++ + ++DK +Q+ G YCG
Sbjct: 673 MDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG--------YCG 724
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYG CCN C+EVR+AY +GW+ S+PD I+QCKREG+ +++K++ EGCN+ G
Sbjct: 725 SCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMKDQADEGCNVSG 784
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAF---------- 166
L VNKV GN H +PG+SF + ++++++ + RD + SH+I+ AF
Sbjct: 785 RLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFAFEGDDEYVYWK 844
Query: 167 ---GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
G +NPLDG ++ M+QYF+KVV T + + G + ++Q+SVT
Sbjct: 845 ASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGKIVNTHQYSVTH 904
Query: 220 HFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R E Q Q LPG FF Y++SPI V + SF HFLT+ CAIV
Sbjct: 905 FERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSFAHFLTSTCAIV 964
Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
GGV TV+ ++D+ ++ RA+KK
Sbjct: 965 GGVLTVASLVDSLLFATTRALKK 987
>gi|302688477|ref|XP_003033918.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
gi|300107613|gb|EFI99015.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
Length = 415
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 125/321 (38%), Positives = 177/321 (55%), Gaps = 39/321 (12%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISG+ DV H++ K RLD G I +IDK ++ G YCGSCY
Sbjct: 92 DISGDVQRDVSHNMLKTRLDKDGKAIRGAHTAELRNEIDKQNEQRGA------DYCGSCY 145
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
G CCN CEEVR AY +GW+ +NPD I+QCK EG+ +++E+ EGCNI G L
Sbjct: 146 GGLPPASGCCNTCEEVRTAYVNRGWSFNNPDSIEQCKNEGWADKLREQANEGCNIAGRLR 205
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF------------ 166
+NKVAGN H +PG+SF G +V++++ + RD N SH I+ L+F
Sbjct: 206 INKVAGNIHLSPGRSFQTGGRNVYELVPYLRDDGNRHDFSHTIHSLSFEGDDAYDNRKRE 265
Query: 167 -----GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
+ NPLDG M+QYF+KVV T + ++G T+ S+ +SVT
Sbjct: 266 TSKEMRQRMGLSSNPLDGTVRVTNKAQYMFQYFVKVVSTKFRPLNGRTVNSHSYSVTHFE 325
Query: 222 RS-SEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
R ++ G+ QT LPG F +D+SPI++ TE SF HF+T+ CAIVGG
Sbjct: 326 RDLTDGGQAQTGQNVQVQHGVTGLPGAFINFDVSPIQLVHTEWRQSFAHFVTSTCAIVGG 385
Query: 269 VFTVSGIIDAFIYHGQRAIKK 289
V TV+ ++D+ ++ +A+KK
Sbjct: 386 VLTVASLLDSVLFATSKALKK 406
>gi|405123077|gb|AFR97842.1| COPII-coated vesicle component Erv46 [Cryptococcus neoformans var.
grubii H99]
Length = 422
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 183/315 (58%), Gaps = 40/315 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE + +H + K R++ GNVI Q ++ ++R L + YCGSC
Sbjct: 92 MDISGEHQTEFEHQVTKTRMNKDGNVISKVQ----GSQLKGDVER--ANLNQDPNYCGSC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA + CCN+CEEVR+AY +KGW+ S+P+ I+QC EG++ ++KE+ EGC I G +
Sbjct: 146 YGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIDGHI 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
VNKV GN HF+PG+SF + + + +++ + RD + H ++K FG
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGGDMTKAEELTV 265
Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
G+ +PL G++ E + M+QYF+KVV T + ++G I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLRDPLQGMKAHTEVSNYMFQYFLKVVSTNFISLNGEEIPSHQYSVTQY 325
Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R G + +PGVFF Y++SP+KV TEE SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385
Query: 267 GGVFTVSGIIDAFIY 281
GGV TV+ ++D+FI+
Sbjct: 386 GGVLTVASLVDSFIF 400
>gi|225708964|gb|ACO10328.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Caligus rogercresseyi]
Length = 385
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 126/303 (41%), Positives = 180/303 (59%), Gaps = 17/303 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKID-KPLQRHGGRLEHNETYC 57
MD+SGE H+D+ H+I+K+RL +G+ +E R+ +G K P ++ E + C
Sbjct: 90 MDVSGESHVDIVHNIYKRRLSLEGSPMEEPRRETEVGQKKTTHAPSPKN----ETSTPPC 145
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
GSCYGAE+ CCN+C EV+EAYR+KGW + +QC+ + + I+ EGC IY
Sbjct: 146 GSCYGAETPGSPCCNSCGEVKEAYRRKGWTIVAAKF-EQCEMD--TEGIERVYKEGCQIY 202
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF---PGVV 174
G L VN+V G+FH PGKSF + +H+HD+ F FN SH+I L+FG PG
Sbjct: 203 GSLLVNRVGGSFHIVPGKSFTLNHLHIHDLQPFSSGEFNTSHRIRHLSFGSKTALDPGG- 261
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--L 232
N LD V MYQY++K+VPT Y+ G T NQ+SVT L + +
Sbjct: 262 NALDAVSALSPKGGLMYQYYLKIVPTTYSRSDGGTFTGNQYSVT-RLEKDVSSSLDSGGM 320
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
PGVFF Y+L+P+ V ++E+ SF HF T +CAI+GGVFT++ D FIY + +++K
Sbjct: 321 PGVFFNYELAPLMVKYSEKEKSFGHFATGLCAIIGGVFTLASAFDKFIYSSSKILEEKFG 380
Query: 293 IGK 295
+GK
Sbjct: 381 LGK 383
>gi|71021625|ref|XP_761043.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
gi|46100607|gb|EAK85840.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
Length = 435
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 187/329 (56%), Gaps = 51/329 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
MDISGE D++HD+ + R++ G +IE + K L+ R+ + + YCG
Sbjct: 92 MDISGEHVNDIQHDVERTRINHDGKIIEQGK---------KSLKGDAARIANTKGKDYCG 142
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYG + CCN C+EVREAY +KGW+ ++PD +DQC EG+ ++IKE+ EGC I G
Sbjct: 143 DCYGGQPPASKCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGWSEKIKEQNKEGCRISG 202
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
L VNKV G+FH +PGK+F ++ +H+HD++ + + + H I++ +FG
Sbjct: 203 KLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGTGSEHHDFGHIIHEFSFGSEQEYHG 262
Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
GV +PL+GVR + M+QYF+KVV T + +SG T+++ Q+SVT
Sbjct: 263 LTSAKERAVKAKLGVKDPLEGVRAQTQQSQFMFQYFVKVVSTEFRPLSGETLKTQQYSVT 322
Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ R S +G +PGVFF Y++SP+K +E S HF
Sbjct: 323 TYERDLSPGANAAALAGLSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHSEYRQSLSHF 382
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
LT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLVYNSRRRL 411
>gi|390603136|gb|EIN12528.1| endoplasmic reticulum-derived transport vesicle ERV46 [Punctularia
strigosozonata HHB-11173 SS5]
Length = 419
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 180/322 (55%), Gaps = 39/322 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGEQ D+ H+I K RLDS G +I Q +++ R + + YCGSC
Sbjct: 92 MDISGEQQRDISHNILKTRLDSTGKLIPGSQRS----ELESEFDRQNKPMP--DGYCGSC 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE S+ CCN+C+ VR+AY +GW+ NPD I+QC +E + +++K++ EGCNI G +
Sbjct: 146 YGAEPSEGACCNSCDAVRQAYVNRGWSFGNPDSIEQCVKENWSEKLKDQASEGCNIAGRV 205
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---GEHFPGV- 173
VNKV GN H +PG+SF G +++++ + R+ N SH I++ AF E+ P
Sbjct: 206 RVNKVIGNIHLSPGRSFQSQGRSMYELVPYLREDGNRHDFSHTIHEFAFEGDDEYLPDKY 265
Query: 174 -------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
PLDG M+QYF+KVV T + + G T+ S+Q+S T
Sbjct: 266 KVSKEMRAKMGLEAGPLDGAVGRTIKAQYMFQYFLKVVSTQFRTLDGQTVNSHQYSATHF 325
Query: 221 FRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
R ++G +PG FF +++SPI + +E SF HFLT+ CAIVG
Sbjct: 326 ERDLDKGSEDNTAEGVHISHTTYGVPGAFFNFEISPILIVHSETRQSFAHFLTSTCAIVG 385
Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
GV T++ I+D+ ++ +A+KK
Sbjct: 386 GVLTIASIVDSVLFATTKALKK 407
>gi|392591676|gb|EIW81003.1| ER-derived vesicles protein ERV46 [Coniophora puteana RWD-64-598
SS2]
Length = 419
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 181/324 (55%), Gaps = 41/324 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE DV H++ K+RLD G I + G +IDK + G YCGSC
Sbjct: 91 MDISGETQRDVSHNVVKQRLDKTGKGIAGSRSGDLRNEIDKLAELRG------PDYCGSC 144
Query: 61 YGA-ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
YG S+D CCN+CEEVR+AY KGW+ NP+ I+QC +EG+ ++K++ EGCNI G
Sbjct: 145 YGGYTSTDNGCCNSCEEVRQAYVNKGWSFGNPEGIEQCTQEGWTDKVKDQADEGCNISGR 204
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAF---GEHFPGV 173
+ VNKV GN + +PG+SF + +D + + ++ + +H I++L F E+ P
Sbjct: 205 IRVNKVVGNINISPGRSFQTGSRNFYDFVPYLKEDGGQHDFTHYIDELTFLADDEYNPNK 264
Query: 174 V--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
+ NPLDG + + MYQYF+KVV T + ++G TI ++Q+S T
Sbjct: 265 MKHGKELKQRMGLDSNPLDGFKASTTKKMFMYQYFLKVVSTQFRTLNGRTINTHQYSATH 324
Query: 220 HFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
R +G PG +F +++SPI+V E SF HFLT+ CAI
Sbjct: 325 FERDLSRGMGGGENNQGVYVQHGAGGAPGAYFNFEISPIQVVHAETRQSFAHFLTSTCAI 384
Query: 266 VGGVFTVSGIIDAFIYHGQRAIKK 289
VGGV TV+ ++D+F++ RA+KK
Sbjct: 385 VGGVLTVAALLDSFLFATSRALKK 408
>gi|389744843|gb|EIM86025.1| ER-derived vesicles protein ERV46 [Stereum hirsutum FP-91666 SS1]
Length = 419
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 177/322 (54%), Gaps = 40/322 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
MDISGE D+ H+I K RL+S G + + + ++DK QR G YCGS
Sbjct: 91 MDISGETQRDISHNIVKTRLNSDGTQVPNSANMQLRNELDKLNAQRQDG-------YCGS 143
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYG + CCN C++VREAY ++GW+ NPD I+QC +E + +++ E+ EGCNI G
Sbjct: 144 CYGGTPPEGGCCNTCDQVREAYVQRGWSFGNPDSIEQCVQEHWSEKLHEQSSEGCNISGR 203
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
+ VNKV GN H +PGKSF S +++++ + +D N SH ++ L FG
Sbjct: 204 VRVNKVIGNIHLSPGKSFQNSASSIYELVPYLKDDKNRHDFSHIVHSLTFGADDEYDSRK 263
Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
+ NPLDG PS M+QYF+K V T + + G + ++Q+ VT
Sbjct: 264 TKIANEMKQRMGLDSNPLDGYHARTSQPSTMFQYFLKAVSTQFRTIDGKVVNTHQYQVTH 323
Query: 220 HFRSSEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
+ R + + +T +PG FF Y++SPIKV E SF HFLT+ CAIVG
Sbjct: 324 YNRDAGNPQDKTNQGVNVMHGITGVPGAFFNYEISPIKVIHEETRQSFAHFLTSTCAIVG 383
Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
GV TV+ I+D+ ++ + +KK
Sbjct: 384 GVLTVTSILDSVLFAANQRLKK 405
>gi|443894052|dbj|GAC71402.1| hypothetical protein PANT_3d00017 [Pseudozyma antarctica T-34]
Length = 461
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 123/331 (37%), Positives = 184/331 (55%), Gaps = 51/331 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
MDISGE D++HDI + R+ G I + K L+ R+ + YCG
Sbjct: 119 MDISGEHVNDIQHDIERTRVTHDGKPITQGK---------KNLKGDAARIAATKGKDYCG 169
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYG + CCN C+EVREAY +KGW+ ++PD +DQC EG+ +IKE+ EGC I G
Sbjct: 170 DCYGGQPPASGCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGWSDKIKEQNKEGCRISG 229
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFG------- 167
L VNKV G+FH +PGK+F ++ VH+HD++ + + + H I+ +FG
Sbjct: 230 KLHVNKVVGSFHLSPGKAFQRNSVHIHDLVPYLSGTGAEHHDFGHIIHDFSFGSEQQYHG 289
Query: 168 ---------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
+ GV +PL+GVR + M+QYF+KVV T + +SG T+++ Q+SVT
Sbjct: 290 LTTAKEREVKQKLGVKDPLEGVRAQTQQSQFMFQYFLKVVSTEFRPLSGDTLKTQQYSVT 349
Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ R S +G +PGVFF Y++SP+K +E S HF
Sbjct: 350 TYERDLSPGANAAAMAGMSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHSEHRQSLSHF 409
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LT+ CAIVGG+ TV+GI+D+ +Y+ +R +++
Sbjct: 410 LTSTCAIVGGILTVAGIVDSLVYNSRRRLRR 440
>gi|324511490|gb|ADY44781.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Ascaris suum]
Length = 382
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 184/301 (61%), Gaps = 12/301 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-HNETYCGS 59
MD+SG+ DV+ D++K+RLD QGN I G A ++ + + E CGS
Sbjct: 90 MDVSGDNQDDVQDDVYKQRLDQQGNNIT----GQAAVRLGVNVNTSTPASQLTTEPKCGS 145
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA + CCN CE+V+EAY +GW + + + ++QCK + +++ I + +GEGC +YG
Sbjct: 146 CYGAS---DRCCNTCEDVKEAYSARGWQMLDIESVEQCKSDAWVRTINDFKGEGCRVYGK 202
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
++V KVAGNFH APG H HD+ + F+ +H IN L+FG FPG PLDG
Sbjct: 203 VQVAKVAGNFHIAPGDPLRSLRSHFHDLHSIAPAKFDTAHIINHLSFGTPFPGKNYPLDG 262
Query: 180 VRW-TQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
+ T + SG M+QY++KVVPT+Y + S + I S+QFSVT H + G LPG F
Sbjct: 263 KSFGTNKDSSGIMFQYYMKVVPTMYEFLDSSNNIFSHQFSVTTHQKDIGMGA-SGLPGFF 321
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
Y+ SP+ V + E FL ++CAI+GGVFTV+ +ID+ IYH RAI+ K+E+ K+
Sbjct: 322 VQYEFSPLMVKYEERRQPLSTFLVSLCAIIGGVFTVASLIDSLIYHSSRAIQHKVEMNKY 381
Query: 297 S 297
+
Sbjct: 382 N 382
>gi|343425773|emb|CBQ69306.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 435
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 119/329 (36%), Positives = 181/329 (55%), Gaps = 51/329 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
MDISGE D++HDI + R+ G V+E + K L+ R+ + + YCG
Sbjct: 92 MDISGEHVNDIQHDIERTRISHDGKVVEQGK---------KHLKGDAARIANTKGKDYCG 142
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYG + CCN C+EVREAY ++GW+ ++PD +DQC EG+ +IK++ EGC I G
Sbjct: 143 DCYGGQPPASGCCNTCDEVREAYVRRGWSFADPDHVDQCVAEGWSDKIKQQNKEGCRISG 202
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
L VNKV G+FH +PGK+F ++ +H+HD++ + + + H I++ +FG
Sbjct: 203 KLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGTGAEHHDFGHIIHEFSFGSEQEYHG 262
Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
GV +PL GVR + M+QYF+KVV T + ++G T+++ Q+SVT
Sbjct: 263 LTTAKERAVKAKLGVKDPLAGVRAQTQQSQFMFQYFVKVVATEFRPLAGETLKTQQYSVT 322
Query: 219 EHFRSSEQGR--------------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ R G +PGVFF Y++SP+K E S HF
Sbjct: 323 TYERDLSPGASAAALAGMSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHAEYRQSLAHF 382
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
LT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLVYNSRRRL 411
>gi|312075860|ref|XP_003140604.1| hypothetical protein LOAG_05019 [Loa loa]
Length = 365
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 119/300 (39%), Positives = 174/300 (58%), Gaps = 27/300 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SG+ D++ D++K ++ N+ S + A ++ CGSC
Sbjct: 90 MDLSGDNQDDIRDDVYKIKV----NINTSTASSVPASQV----------------LCGSC 129
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ E CCN CEEV+EAY +KGW L N + ++QCK + +++++ E + EGC +YG +
Sbjct: 130 YGAK---EGCCNTCEEVKEAYMRKGWELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKV 186
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V KVAGNFH APG H HD+ + F+ SH +N +FG FPG V PLDG
Sbjct: 187 QVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSPSKFDTSHTVNHFSFGNSFPGKVYPLDGK 246
Query: 181 RW--TQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
+ + + MYQY +K+VPT Y + S I S+ FSVT + + QG LPG F
Sbjct: 247 FFGSARNSDGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFFV 305
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+ SP+ V + E S FL ++CAI+GG+FTV+ +IDAFIY R I +KI + K++
Sbjct: 306 QYEFSPLMVKYEERQQSLSTFLVSICAIIGGIFTVASLIDAFIYRSGRIISQKIALNKYT 365
>gi|393907059|gb|EFO23462.2| hypothetical protein LOAG_05019 [Loa loa]
Length = 378
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 119/300 (39%), Positives = 174/300 (58%), Gaps = 14/300 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SG+ D++ D++K L ++G G + + ++ CGSC
Sbjct: 90 MDLSGDNQDDIRDDVYKISL-------LDGKEGNGVRQEVNINTSTASSVPASQVLCGSC 142
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ E CCN CEEV+EAY +KGW L N + ++QCK + +++++ E + EGC +YG +
Sbjct: 143 YGAK---EGCCNTCEEVKEAYMRKGWELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKV 199
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V KVAGNFH APG H HD+ + F+ SH +N +FG FPG V PLDG
Sbjct: 200 QVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSPSKFDTSHTVNHFSFGNSFPGKVYPLDGK 259
Query: 181 RW--TQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
+ + + MYQY +K+VPT Y + S I S+ FSVT + + QG LPG F
Sbjct: 260 FFGSARNSDGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFFV 318
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+ SP+ V + E S FL ++CAI+GG+FTV+ +IDAFIY R I +KI + K++
Sbjct: 319 QYEFSPLMVKYEERQQSLSTFLVSICAIIGGIFTVASLIDAFIYRSGRIISQKIALNKYT 378
>gi|341884797|gb|EGT40732.1| CBN-ERV-46 protein [Caenorhabditis brenneri]
Length = 379
Score = 225 bits (574), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/299 (41%), Positives = 174/299 (58%), Gaps = 11/299 (3%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCG 58
MD+S E ++ DI++ RLD+ G NV E+ Q KI+ + E E CG
Sbjct: 90 MDVSSEAQDNINDDIYRLRLDADGKNVSETAQ------KIEINQNKTVDATELIQEVKCG 143
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGA ++D CCN CE+V+ AY KGW + N + ++QCK + +++ E + EGC +YG
Sbjct: 144 SCYGA-AADGICCNTCEDVKNAYAIKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCRVYG 201
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
++V KVAGNFH APG HVHD+ F+ SH +N ++FG+ FPG PLD
Sbjct: 202 TVKVAKVAGNFHLAPGDPHQSMRSHVHDLHNLDPVKFDASHTVNHISFGKSFPGKNYPLD 261
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
G T+ MYQY++KVVPT Y + G QS+QFSVT H + R LPG F
Sbjct: 262 GKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGFFLQ 320
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+ SP+ V + E S FL ++CAIVGGVF ++ ++D IYH R +K +I GK +
Sbjct: 321 YEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLVDITIYHSSRYMKNRIAGGKLT 379
>gi|388856238|emb|CCF50047.1| uncharacterized protein [Ustilago hordei]
Length = 435
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 123/329 (37%), Positives = 185/329 (56%), Gaps = 51/329 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
MDISGE D++HDI + R+ QDG + + K L+ R+ + + YCG
Sbjct: 92 MDISGEHVNDIQHDIERTRIS---------QDGKVSIQGTKSLKGDAARIANTKGKDYCG 142
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYG + CCN C+EVREAY +KGW+ S+PD ++QC EG+ ++IKE+ EGC I G
Sbjct: 143 DCYGGQPPASGCCNTCDEVREAYVRKGWSFSDPDHVEQCVAEGWSEKIKEQNKEGCRISG 202
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
L VNKV G+FH +PG++F ++ +H+HD++ + S + H I++ +FG
Sbjct: 203 KLHVNKVVGSFHLSPGRAFQRNSMHIHDLVPYLSGSGAEHHDFGHIIHEFSFGSEQEYHG 262
Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
GV +PL+GVR + M+QYF+KVV T + ++G T+++ Q+SVT
Sbjct: 263 LTTAKERAVKDKLGVKDPLEGVRARTKESQYMFQYFLKVVSTEFRPLAGETLKTQQYSVT 322
Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ R S +G +PGVFF Y++SP+K +E S HF
Sbjct: 323 TYERDLSPGANAAALAGLSNEGSGARISHGFAGVPGVFFNYEISPLKTIHSEYRQSLSHF 382
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
LT+ CAIVGG+ TV+GI+D+ IY+ R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLIYNSGRRL 411
>gi|393212588|gb|EJC98088.1| endoplasmic reticulum-derived transport vesicle ERV46 [Fomitiporia
mediterranea MF3/22]
Length = 421
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 121/322 (37%), Positives = 181/322 (56%), Gaps = 39/322 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE D+ H+I K RLD+ G V+ + K+D + + YCGSC
Sbjct: 91 MDISGEAQRDISHNIVKARLDANGAVVPNSHSAELRNKLDVMND------QTQDNYCGSC 144
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YG + + CCN CEEVR+AY KGW+ SNPD I+QC RE + +++ E+ EGCNI G L
Sbjct: 145 YGGVAPEGGCCNTCEEVRQAYVNKGWSFSNPDSIEQCVREHWSEKLHEQSTEGCNISGRL 204
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF----------G 167
VNKV GN H +PG+SF + +++H+++ + ++ N H +++L+F
Sbjct: 205 RVNKVIGNIHLSPGRSFQTNYMNIHELVPYLKEDKNRHDFGHIVHELSFEGDDEYNFRKK 264
Query: 168 EHFPGV-------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
E G+ NPLDG + M+QYF+KVV T + + G T++++Q+S T
Sbjct: 265 ERSKGIKKKLGIEANPLDGAVGKAASLQYMFQYFVKVVSTKFELMDGQTVKTHQYSATHF 324
Query: 221 FRSSEQGRL-QT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
R G + QT +PGVF Y++SP+ V +E SF HFLT+ CAI+G
Sbjct: 325 ERDLTTGAIGQTKEGVHIAHTNVGMPGVFINYEISPLLVVHSETRQSFAHFLTSTCAIIG 384
Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
GV T++ I+D+ ++ R +KK
Sbjct: 385 GVLTIATIVDSVVFATGRRLKK 406
>gi|17568835|ref|NP_510575.1| Protein ERV-46 [Caenorhabditis elegans]
gi|3878494|emb|CAB01889.1| Protein ERV-46 [Caenorhabditis elegans]
Length = 380
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 125/302 (41%), Positives = 177/302 (58%), Gaps = 16/302 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEHN----ET 55
MD+S E ++ DI++ RLD +G N+ ES Q KI+ + ++ +E E
Sbjct: 90 MDVSSEAQENINDDIYRLRLDPEGRNISESAQ------KIE--INQNKTSVETTDVIQEV 141
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CGSCYGA ++D CCN C++V+ AY KGW + N + ++QCK + +++ E + EGC
Sbjct: 142 KCGSCYGA-AADGICCNTCDDVKSAYAVKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCR 199
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
+YG ++V KVAGNFH APG HVHD+ F+ SH +N ++FG+ FPG
Sbjct: 200 VYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHVSFGKSFPGKNY 259
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
PLDG T MYQY++KVVPT Y + G QS+QFSVT H + R LPG
Sbjct: 260 PLDGKVNTDNRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGF 318
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
F Y+ SP+ V + E SF FL ++CAIVGGVF ++ ++D IYH R +K +I GK
Sbjct: 319 FLQYEFSPLMVQYEEFRQSFASFLVSLCAIVGGVFAMAQLVDITIYHSSRYMKSRIAGGK 378
Query: 296 FS 297
+
Sbjct: 379 LT 380
>gi|268581953|ref|XP_002645960.1| C. briggsae CBR-ERV-46 protein [Caenorhabditis briggsae]
Length = 380
Score = 224 bits (572), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/300 (42%), Positives = 173/300 (57%), Gaps = 12/300 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEH--NETYC 57
MD+S E ++ DI++ RLD+ G NV ES Q KI+ + G E C
Sbjct: 90 MDVSSEAQENINDDIYRLRLDADGRNVSESAQ------KIEINQNKTIGEPTELVQEVKC 143
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
GSCYGA +D CCN CE+V+ AY KGW + N + ++QCK + +++ E + EGC +Y
Sbjct: 144 GSCYGA-VADGICCNTCEDVKNAYAVKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCRVY 201
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G ++V KVAGNFH APG HVHD+ F+ SH +N ++FG+ FPG PL
Sbjct: 202 GTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHISFGKSFPGKNYPL 261
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
DG T+ MYQY++KVVPT Y + G QS+QFSVT H + R LPG F
Sbjct: 262 DGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQAGLPGFFL 320
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+ SP+ V + E S FL ++CAIVGGVF ++ ++D IYH R +K +I GK +
Sbjct: 321 QYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLVDITIYHTSRYMKSRIAGGKLT 380
>gi|170089933|ref|XP_001876189.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164649449|gb|EDR13691.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 421
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 119/322 (36%), Positives = 171/322 (53%), Gaps = 39/322 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE D+ H++ K RLD+ G + + +DK E YCGSC
Sbjct: 91 MDISGELQRDISHNVMKVRLDTHGKEVPNSHSAELRNDLDKMND------AKRENYCGSC 144
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+G + CCN CE+VR AY +GW+ SNP+ I+QCK EG+ ++KE+ EGCNI G +
Sbjct: 145 FGGLEPEGGCCNTCEDVRLAYVNRGWSFSNPEAIEQCKNEGWADKLKEQADEGCNISGRI 204
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF----------- 166
VNKV GN H +PG+SF + ++++++ + RD N SH I+ LAF
Sbjct: 205 RVNKVIGNIHLSPGRSFQTNARNLYELVPYLRDDGNRHDFSHTIHHLAFEGDDEYDYWKA 264
Query: 167 ------GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
+ NPLDG M+QYF+KVV T + + G + ++Q+S T+
Sbjct: 265 AAGSAMRQRMGLTENPLDGAIARTAKAQYMFQYFLKVVSTQFRTLDGRKVNTHQYSTTQF 324
Query: 221 FRSSEQGR-------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
R +G + LPG FF +++SPI V E SF HFLT+ CAI+G
Sbjct: 325 ERDLTEGAAGETAGGIHVQHGVSGLPGAFFNFEISPILVVHAETRQSFAHFLTSTCAIIG 384
Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
GV TV+ IID+ ++ R +KK
Sbjct: 385 GVLTVASIIDSILFATNRRLKK 406
>gi|336369994|gb|EGN98335.1| hypothetical protein SERLA73DRAFT_109778 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382751|gb|EGO23901.1| hypothetical protein SERLADRAFT_450196 [Serpula lacrymans var.
lacrymans S7.9]
Length = 988
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 123/324 (37%), Positives = 182/324 (56%), Gaps = 42/324 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
MDISGEQ DV H+I K R+ +G + ++G +IDK QR G YCGS
Sbjct: 662 MDISGEQQRDVSHNIHKTRITPEGGPVPGARNGELRNEIDKLNDQRSNG-------YCGS 714
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYG + CCN+CE+VR+AY +GW+ +NPD I+QC EG+ +++K++ EGCNI G
Sbjct: 715 CYGGVEPEGGCCNSCEDVRQAYVNRGWSFNNPDNIEQCVAEGWSEKLKDQAEEGCNISGR 774
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---------- 166
L VNKV GN + +PG+SF S + ++++ + R+ N SH I++ +F
Sbjct: 775 LRVNKVIGNINVSPGRSFQSSSRNFYELVPYLREDNNRHDFSHVIHEFSFMTDDEYNLHK 834
Query: 167 ------GEHFPGVV-NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
+ G+ NPLDG+ M+QYF+KVV T + + G TI ++Q+S T
Sbjct: 835 AKLGKDMKQRMGIAENPLDGLNAKTNKAQYMFQYFLKVVSTQFRTIDGKTINTHQYSATH 894
Query: 220 HFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
R +G + +PG FF +++SPI V +E SF HFLT+ CAI
Sbjct: 895 FERDLSKGSQGGDNGEGVVTQHGVSGVPGAFFNFEISPILVVHSEGRQSFAHFLTSTCAI 954
Query: 266 VGGVFTVSGIIDAFIYHGQRAIKK 289
VGGV TV+ ++D+F++ R +KK
Sbjct: 955 VGGVLTVAALLDSFLFATGRRLKK 978
>gi|308483051|ref|XP_003103728.1| CRE-ERV-46 protein [Caenorhabditis remanei]
gi|308259746|gb|EFP03699.1| CRE-ERV-46 protein [Caenorhabditis remanei]
Length = 380
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 172/300 (57%), Gaps = 12/300 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQD-GIGAPK-IDKPLQRHGGRLEHNETYC 57
MD+S E ++ DI++ RLD+ G N+ ES Q I K I P + E C
Sbjct: 90 MDVSSEAQDNINDDIYRLRLDADGRNISESAQKIEINQNKTIADPTELT------QEVKC 143
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
GSCYGA ++D CCN CE+V+ AY KGW + N + ++QCK + +++ E + EGC +Y
Sbjct: 144 GSCYGA-AADGICCNTCEDVKSAYAIKGWQV-NIEEVEQCKNDKWVKEFTEHKNEGCRVY 201
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G ++V KVAGNFH APG HVHD+ F+ SH +N L FG+ FPG PL
Sbjct: 202 GTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHLTFGKSFPGKHYPL 261
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
DG T+ MYQY++KVVPT Y + G QS+QFSVT H + R LPG F
Sbjct: 262 DGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGFFV 320
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
Y+ SP+ V + E S FL ++CAIVGGVF ++ +ID IY R +K +I GK +
Sbjct: 321 QYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLIDITIYQTHRYMKNRIAGGKLT 380
>gi|384483831|gb|EIE76011.1| hypothetical protein RO3G_00715 [Rhizopus delemar RA 99-880]
Length = 408
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 118/297 (39%), Positives = 179/297 (60%), Gaps = 23/297 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD SGE + HD++K+RLD G VI + + + K + H + + YCGSC
Sbjct: 112 MDESGEHISNYDHDVYKERLDPNGEVITAEKSNDLSNSQAKNAREHSMNVP--DDYCGSC 169
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ S+E CCN CEE++ AY + GW + +PD +QC REG+ ++I+ + EGC ++G L
Sbjct: 170 YGAKGSNE-CCNTCEEIQNAYSELGWNV-DPDNFEQCIREGWKEKIESQSREGCRMHGTL 227
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFGEH--------- 169
VNK+ GNFHF+ GK+F QSG H+HD+ F + + N H I L FG H
Sbjct: 228 LVNKIRGNFHFSAGKAFKQSGSHIHDMSTFLHNDKNQNFMHTIQHLQFGNHDYNSEKQKR 287
Query: 170 --FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSE 225
+++PL+ ++ + MYQYF+K+VPT + ++G I++ Q+SV+ +H S
Sbjct: 288 TKSRELIHPLENIKSGNSETAIMYQYFLKIVPTEFNFLNGKRIRTFQYSVSKQDHIVSYL 347
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
G LPGVFF D SP+++ ++E S +LT++CAI+GG+FTV+ +ID I H
Sbjct: 348 GG----LPGVFFMLDHSPMRIIYSETKTSLASYLTSLCAIIGGIFTVASVIDGSIQH 400
>gi|402218655|gb|EJT98731.1| ER to Golgi transport-related protein [Dacryopinax sp. DJM-731 SS1]
Length = 455
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 131/350 (37%), Positives = 186/350 (53%), Gaps = 68/350 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG-APKIDKPLQ-RHGGRLEHNETYCG 58
MDISGE+ DV H++ + RL QG I G + +I+K ++ R GG CG
Sbjct: 92 MDISGERQHDVTHNMQRVRLSPQGIPIPDVLPESGLSNEIEKVIEAREGGE-------CG 144
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYG + CCN CE+VREAY ++GW+ S+P+ I QC EG+ +++K + EGCNI G
Sbjct: 145 SCYGGDPPASGCCNTCEDVREAYMRRGWSFSSPEDIKQCVNEGWTEKVKSQSEEGCNISG 204
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF---GEHFPGV 173
+ VNKV GNFHF+PGKSF + +HVHD++ + +D+ + H+I+ F GE V
Sbjct: 205 RVRVNKVIGNFHFSPGKSFQTNAMHVHDLVPYLKDANRHDFGHEIHYFGFESDGEQQAEV 264
Query: 174 --------------VNPLDGVRW---------TQETPSG-----------------MYQY 193
NPLDG+R T+ P M+QY
Sbjct: 265 GRLSKSIKTKLGIDKNPLDGLRAHVRSLSRRETRRVPGMSSNRRSYRPEQTEKSNYMFQY 324
Query: 194 FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFY 239
F+KVV T Y + G + S+Q+SVT + R QG + +PG FF +
Sbjct: 325 FLKVVSTKYEMLRGTVVNSHQYSVTSYERDLSQGDKAQRDEHGTMTSHGVSGIPGAFFNF 384
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+ V E SF HFLT+ CAIVGGV TV+ I D+ ++ +R +KK
Sbjct: 385 EISPMVVVHQETRQSFAHFLTSTCAIVGGVLTVAAIFDSMLFSAERKLKK 434
>gi|345569114|gb|EGX51983.1| hypothetical protein AOL_s00043g717 [Arthrobotrys oligospora ATCC
24927]
Length = 397
Score = 219 bits (559), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 174/316 (55%), Gaps = 38/316 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SG+ V H I K RLD G +IES K L+ H +H + +YCG
Sbjct: 89 MDVSGDLQPSVSHGIGKHRLDKSGGIIES-----------KFLELHPEHPKHLDPSYCGE 137
Query: 60 CYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CYGA + D CC C++VREAY KGWA + + QC+ EG+ + +KE+ GEGC
Sbjct: 138 CYGAVAPDTSKKAGCCQTCDDVREAYAAKGWAFGDGTGVHQCEEEGYKEMLKEQAGEGCR 197
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
I G L VNKV GNFH APGKSF + +HVHD+ + + + +H IN L+FG P
Sbjct: 198 IDGHLWVNKVVGNFHIAPGKSFSNAQMHVHDLANYLQGDVHHDFTHTINALSFGPPLPTD 257
Query: 174 V--------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSS 224
+ NPLD + Y YF+K+V T Y + G+TI ++Q+SVT H RS
Sbjct: 258 LLHENHHQQNPLDATSKKTSDRNYNYLYFLKIVSTSYEHLDHGYTIHTHQYSVTSHERSL 317
Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
E G+ +PG+FF YD+SP+KV E SF FLT++CAI+GG TV+
Sbjct: 318 EGGKDDVHPGTVHARGGIPGIFFSYDISPMKVVNREIRTKSFSGFLTSICAIIGGTLTVA 377
Query: 274 GIIDAFIYHGQRAIKK 289
+D +Y G R I K
Sbjct: 378 AALDRGLYEGARRIGK 393
>gi|299743758|ref|XP_002910702.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
gi|298405804|gb|EFI27208.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
okayama7#130]
Length = 416
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/321 (36%), Positives = 173/321 (53%), Gaps = 42/321 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN--ETYCG 58
MDISGE D+ H++ K RLD G + + ++K L H E YCG
Sbjct: 91 MDISGEVQRDISHNVLKVRLDRSGKEVPGSHTADLSADVEK--------LSHTKKEGYCG 142
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYG + CCN CE+VR AY +GW+ +NPD I+QC+ EG+ +++++ EGCNI G
Sbjct: 143 SCYGGLEPESGCCNTCEDVRMAYVNRGWSFTNPDAIEQCRNEGWADKLRDQADEGCNISG 202
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF--------- 166
+ VNKV GN H +PG+SF + ++++++ + RD N SH I+ F
Sbjct: 203 RIRVNKVIGNIHMSPGRSFQSNSRNIYELVPYLRDDQNRHDFSHIIHHFGFEGDDEYDYW 262
Query: 167 ----GEHFPGVV----NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
G+ + NPLDG+ M+QYF+KVV T + + G T+ ++Q+S T
Sbjct: 263 KAEAGQKMRRRMGLTENPLDGIEARTWKSQYMFQYFLKVVSTRFRTLDGQTVNTHQYSTT 322
Query: 219 EHFRSSEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R +G Q LPG FF Y++SPI+V E SF HFLT+ CA++
Sbjct: 323 SFERDLGEGMNQDDGGIRVQHGVSGLPGAFFNYEISPIQVVHAESRQSFAHFLTSTCAVI 382
Query: 267 GGVFTVSGIIDAFIYHGQRAI 287
GGV TV+ ++D+ ++ +AI
Sbjct: 383 GGVLTVAALVDSALFVTAKAI 403
>gi|313231322|emb|CBY08437.1| unnamed protein product [Oikopleura dioica]
Length = 386
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 180/305 (59%), Gaps = 19/305 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRL-DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET---- 55
MD++G++ D +H +FK R+ D Q + + + I A K+ H + E ET
Sbjct: 89 MDLTGDR-ADAEHQLFKVRMKDGQEVALSEKVEEINAEKL------HDEKQEEEETGLAV 141
Query: 56 --YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS-NPDLIDQCKREGF--LQRIKEEE 110
C SCYGAE+ ++ CCN+CEEV++AYR KGWA + QC E F + +++ E
Sbjct: 142 KDECQSCYGAETEEQPCCNSCEEVQQAYRNKGWAFDHSAQQFSQCVNEHFDLNEELQKTE 201
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
GE C ++G LEVN+V+G+ +PGK+ G VHDI + SF+ SH I+ L+FGE F
Sbjct: 202 GESCRVHGHLEVNRVSGSLQISPGKTLVLDGSVVHDIRGMKHMSFDTSHTIHHLSFGEVF 261
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
PG NPLD E+ + + Y KV+PT + + G +NQFSVT H ++ Q +
Sbjct: 262 PGQENPLDNTEHEAESMNMAWHYNFKVIPTEFRKLDGSRTATNQFSVTRHEKALSQMSSR 321
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
LPG+ F ++++PI V E S +HF T+VCAI+GGV+T+S I+D+FI H + K
Sbjct: 322 -LPGINFHFEIAPIAVIKMETRRSAVHFATSVCAIIGGVWTISSILDSFI-HKTNKLLIK 379
Query: 291 IEIGK 295
E+GK
Sbjct: 380 TELGK 384
>gi|449684240|ref|XP_002157414.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like, partial [Hydra magnipapillata]
Length = 311
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 109/224 (48%), Positives = 147/224 (65%), Gaps = 19/224 (8%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD+SGEQ D++H+IFKKR D +GN I++ +++ +G K ++ ++ L+ ++ C
Sbjct: 89 MDVSGEQQTDLEHNIFKKRYDEKGNPIDTVEKKEELGD-KSEEAVKVLNSTLD-DKPKCE 146
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGAE++D CCN CE+VR AYRKKGW +PD I+QCKRE + +++ EGC IYG
Sbjct: 147 SCYGAETTDHPCCNTCEDVRVAYRKKGWGFHDPDSIEQCKREHWKDTFQQQSNEGCQIYG 206
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHV---------------HDILAFQRDSFNISHKINK 163
++EV+KVAGNFH APGKSF Q +HV HD+ F FN+SH I
Sbjct: 207 YIEVSKVAGNFHIAPGKSFQQQHIHVQTIRFGKDGTISLNMHDLQPFGAKQFNVSHNIWS 266
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
L+FGE PGV NPLDG + E S MYQYF+K+VPTVY +SG
Sbjct: 267 LSFGEPIPGVENPLDGTNVSAEAGSLMYQYFVKIVPTVYKKLSG 310
>gi|393233667|gb|EJD41236.1| endoplasmic reticulum-derived transport vesicle ERV46 [Auricularia
delicata TFB-10046 SS5]
Length = 419
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 117/323 (36%), Positives = 175/323 (54%), Gaps = 41/323 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
MDISGE+ DV H+I K R+D+ +RQ I LQ ++ YCG
Sbjct: 91 MDISGERQADVTHNILKTRIDA------NRQR-IADQTTTYDLQNEAEKVVAARGANYCG 143
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYG + CC CE VR+AY +GWA S+PD I+QCK+EG+ ++I+ + EGCN+ G
Sbjct: 144 SCYGGLEPEGGCCQTCEAVRQAYINRGWAFSDPDAIEQCKQEGWKEKIQAQMNEGCNVEG 203
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-------------------DSFNISH 159
+ VNKV G+ F+ G+SF + + +HD++ + R D FNI
Sbjct: 204 RVRVNKVVGSIQFSFGRSFQMNQMSLHDLVPYLRDENVHDWRHRVQHFYFSSDDEFNIYK 263
Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
+ + NPLDG E+ M+QYF+KVV T + + G I ++Q+S T
Sbjct: 264 AGISSSMKQRLGIAANPLDGNYGHTESTEYMFQYFLKVVSTQFRTIGGEVINTHQYSATH 323
Query: 220 HFRSSEQGR-------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R +G +Q LPGVFF +++SP+++ +E SF HF+T+ CAIV
Sbjct: 324 FDRDLAEGVRGKTEDGVVVTHGVQGLPGVFFNFEISPMRIIHSETRQSFAHFITSTCAIV 383
Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
GGV T++ I+D+ ++ Q+A+KK
Sbjct: 384 GGVLTIASIVDSLLFTTQQALKK 406
>gi|409042254|gb|EKM51738.1| hypothetical protein PHACADRAFT_150385 [Phanerochaete carnosa
HHB-10118-sp]
Length = 422
Score = 217 bits (552), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 179/329 (54%), Gaps = 53/329 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGN------VIESRQDGIGAPKIDK-PLQRHGGRLEHN 53
MDISGE D+ H++ K RL+ QGN ++E R D IDK QR G
Sbjct: 91 MDISGETQTDIVHNVIKTRLNEQGNPVPANKIVELRND------IDKLNEQRQDG----- 139
Query: 54 ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
YCGSCYG CCN CE+VR+AY +GW+ + PD I+QC +EG+ +++++ EG
Sbjct: 140 --YCGSCYGGVEPAGGCCNTCEDVRQAYVNRGWSFTAPDSIEQCAQEGWADKLRDQANEG 197
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--- 167
CN G L VNKV GN H +PG+SF +++DI+ + ++ N SH ++ AF
Sbjct: 198 CNAAGKLRVNKVVGNIHLSPGRSFRSGSHNIYDIVPYLKEDGNRHDFSHTVHAFAFAGDD 257
Query: 168 -------------EHFPGVVN-PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
+ G+ + PLDG + M+QYF+KVV T + + G +I+++
Sbjct: 258 EFNFQKADHGNSLKRRLGIADGPLDGTTQKTSKQAYMFQYFLKVVSTQFITLDGKSIKTH 317
Query: 214 QFSVTEHFR--------SSEQGR-----LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
Q S T R +S+QG + +PG FF Y++SPI V E SF HFLT
Sbjct: 318 QHSATHFERDLSKGIAENSQQGMHVMHGMTGIPGAFFNYEISPILVVHRETRQSFAHFLT 377
Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+ CA+VGGV TV+ +ID+ ++ + +KK
Sbjct: 378 STCAVVGGVLTVASLIDSMLFATSKKLKK 406
>gi|395324643|gb|EJF57079.1| endoplasmic reticulum-derived transport vesicle ERV46 [Dichomitus
squalens LYAD-421 SS1]
Length = 423
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 177/323 (54%), Gaps = 41/323 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
MDISGE D+ H+I K RLD +G + +DK QR G YCGS
Sbjct: 91 MDISGETQSDITHNILKTRLDEKGKPVSHSLIAELQNDLDKLNEQRQSG-------YCGS 143
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYG + CCN CEEVR+AY +GW+ + PD I+QC +EG+ ++KE+ EGCNI G
Sbjct: 144 CYGGIEPEGGCCNTCEEVRQAYVNRGWSFNRPDSIEQCVKEGWSDKLKEQAHEGCNIAGR 203
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---GEHFP-- 171
+ VNKV GN H +PG+SF S ++++++ + R N +H+I+ AF E+ P
Sbjct: 204 VRVNKVVGNIHLSPGRSFRTSAHNLYELVPYLRTDGNRHDFTHQIHHFAFEGDDEYDPRN 263
Query: 172 -----------GV-VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
G+ NPLDG + M+QYF+KVV T + + G + ++Q+S T
Sbjct: 264 AKLGKELKNRLGIDANPLDGTQGRTIKQQYMFQYFLKVVSTQFQTIDGKKVGTHQYSATH 323
Query: 220 HFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R ++G + +PG FF Y++SP+ + E SF HFLT+ CAIV
Sbjct: 324 FERDLDKGPSEDSPAGLHVAHGNGGIPGAFFNYEISPLLIRHVETRQSFAHFLTSTCAIV 383
Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
GGV TV+ +ID+ ++ ++A KK
Sbjct: 384 GGVLTVASLIDSLLFATRKAFKK 406
>gi|401888400|gb|EJT52358.1| ER to golgi family transport-related protein [Trichosporon asahii
var. asahii CBS 2479]
gi|406696432|gb|EKC99721.1| ER to transport-related protein [Trichosporon asahii var. asahii
CBS 8904]
Length = 378
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 177/337 (52%), Gaps = 49/337 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE+ D+ HD+ K RL + G +E + G + ++ Q + YCGSC
Sbjct: 47 MDISGERQNDITHDMAKHRLSASGEELEVTRSGQLKGEAERAAQ------NRDPNYCGSC 100
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA++ + CCN+C++VR+AY + GW NP I+QC E + + + ++ EGC I G +
Sbjct: 101 YGAQAPESGCCNSCDDVRKAYSESGWQFPNPSTIEQCVEENWAENMAQQNTEGCRIVGQV 160
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAFGEHFP------ 171
+VNKV GN F G F + D+L + RD + H INK F P
Sbjct: 161 KVNKVVGNLQFTHGNVFTRGHT---DLLPYLRDGNVHHDFGHIINKFRFTGEMPGQLYHR 217
Query: 172 --------------GVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQF 215
G+ +PL GVR E MYQYF+KVV T + ++G I +NQ+
Sbjct: 218 SQIQKKEDETRKELGIHDPLQGVRSHAENDGSNIMYQYFVKVVSTAFVYLNGQNINTNQY 277
Query: 216 SVTEHFRSSEQGRLQT--------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
S TE+ R + G L T +PGVF Y++SP+KV TE SF HF+T+
Sbjct: 278 SATEYERDLKHGNLPTKDQHGHVTTHYTNAIPGVFINYEISPMKVVHTETRQSFAHFVTS 337
Query: 262 VCAIVGGVFTVSGIIDAFIYHG-QRAIKKKIEIGKFS 297
CAIVGGV TV+ +IDA I++ +R + +K G S
Sbjct: 338 TCAIVGGVLTVASLIDAAIFNSRKRLMGEKESYGALS 374
>gi|403417426|emb|CCM04126.1| predicted protein [Fibroporia radiculosa]
Length = 419
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 113/319 (35%), Positives = 177/319 (55%), Gaps = 36/319 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK-IDKPLQRHGGRLEHNETYCGS 59
MDISGE D+ H+I K RL+ +G ++S +DK ++ G + YCGS
Sbjct: 92 MDISGESQADITHNILKTRLNEKGIPLQSLAKSAELRNDLDKINEQRG------DNYCGS 145
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYG ++ CCN C++VR+AY +GW+ + PD I+QC EG+ +++KE+ EGCNI G
Sbjct: 146 CYGGQAPPGGCCNTCDQVRQAYIDRGWSFTRPDSIEQCTNEGWSEKLKEQASEGCNIAGK 205
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
+ VNKV GN +PG+SF + +++D++ + ++ N SH I++ AF
Sbjct: 206 VRVNKVIGNIQLSPGRSFRTAAQNMYDLVPYLKEDKNRHDFSHTIHQFAFESDQEKERHR 265
Query: 168 ----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
+ G+ +PLD M+QYF+KVV T + + +++Q+S T R
Sbjct: 266 ARDFQKRVGIESPLDNTERKTSKQQYMFQYFLKVVSTHFAMLDNKVYKTHQYSATHFERD 325
Query: 224 SEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+G+ + +PGVF YD+SP+ + +E SF HFLT+ CAIVGGV
Sbjct: 326 LTKGQQEDNKEGVHIAHTATGIPGVFINYDISPMLILHSETRQSFAHFLTSTCAIVGGVL 385
Query: 271 TVSGIIDAFIYHGQRAIKK 289
TV+ +ID+ ++ RA+KK
Sbjct: 386 TVASLIDSVLFATTRALKK 404
>gi|388581981|gb|EIM22287.1| endoplasmic reticulum-derived transport vesicle ERV46 [Wallemia
sebi CBS 633.66]
Length = 407
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 181/319 (56%), Gaps = 38/319 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQ D++H I + RL +G I DG+ + L E CGSC
Sbjct: 88 MDVSGEQVRDLRHAIVRTRLSEKGETI----DGMKTAGMSGYLNEVAKPRE-----CGSC 138
Query: 61 YG-AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
YG ++E CC C++VRE+Y K+GW+ NPD + QC E + +R+KE+ EGCN+ G
Sbjct: 139 YGGVPPNEEKCCYTCDDVRESYVKQGWSFVNPDGVKQCLDEHWAERVKEQSSEGCNVAGL 198
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
++VNKV GNFH +PG+SF + H+HD++ + +++ N H ++ +F
Sbjct: 199 VDVNKVVGNFHISPGRSFQSNAHHIHDLVPYLKNANNHHDFGHILHHFSFKSSNEPADTD 258
Query: 168 --EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-- 223
+ + +PL + E + M+QYF+KVV T + ++G + S+Q+S T + R+
Sbjct: 259 NLKEMLNINDPLSNTKAHTEVSNYMFQYFLKVVSTDFDFLNGEKLNSHQYSATAYERNLD 318
Query: 224 -----SEQGRLQTL-------PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
++ G QT+ PGVFF YD+SP++V +TE SF FLT+ CAIVGGV T
Sbjct: 319 EKGIYAQDGHGQTILHGVEGFPGVFFNYDISPLRVIYTESRRSFASFLTSTCAIVGGVLT 378
Query: 272 VSGIIDAFIYHGQRAIKKK 290
V+ IIDA ++ ++ + K
Sbjct: 379 VASIIDAGVFGARQKLTGK 397
>gi|407418919|gb|EKF38246.1| hypothetical protein MOQ_001547 [Trypanosoma cruzi marinkellei]
Length = 406
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 173/312 (55%), Gaps = 30/312 (9%)
Query: 11 VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
V+ D K R+ + + E+R KI K L +G E+ C SCYGAE
Sbjct: 100 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPNGAEKEN----CPSCYGAEPEPG 155
Query: 69 DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
CC+ C++VR AY + W + D+ ++QC E + EGCN++ +V +V G
Sbjct: 156 ACCHTCDDVRRAYSLRRWVFNEDDISVEQCAGERLRKAAILISQEGCNLFVKYKVARVTG 215
Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------V 180
N HF PG+ F+ G H+HD N+SH ++ L FGE FPG VNP+DG V
Sbjct: 216 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLCFGERFPGQVNPMDGLVNSRGAV 275
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQT----- 231
T+E +G + YF+KVVPT Y S G ++SNQ+SVT HF +S L T
Sbjct: 276 DATEEV-NGRFSYFVKVVPTQYQAASILGVGSVVESNQYSVTHHFTASPSAELSTTTPES 334
Query: 232 ----LPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
+PGVF YDLSPIKV E+H S LH + +CA+ GGVFTV+G++D+ I+HG R
Sbjct: 335 TPVIVPGVFITYDLSPIKVFVMEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVR 394
Query: 286 AIKKKIEIGKFS 297
+++K++ GK S
Sbjct: 395 RVQRKMQQGKQS 406
>gi|164655211|ref|XP_001728736.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
gi|159102620|gb|EDP41522.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
Length = 427
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 181/323 (56%), Gaps = 45/323 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
+D+ GE +DV HD+ ++RLD G + + ++ + L+ R+ E YCG
Sbjct: 92 VDVVGETQMDVHHDVERRRLDETGKPV--------SEEVIRELESEAKRVIAERGPDYCG 143
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
CYGA+ + CCN+C+ VREAY W+ ++PD I+QC +E + + ++E+ EGCNI G
Sbjct: 144 DCYGADPPEGGCCNSCDAVREAYMLHNWSFTSPDDIEQCAQEHWSEHVREQNHEGCNIAG 203
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFG------- 167
+ VNKV GN HF PG++FH++ +H HD++ + D + HKI++ +FG
Sbjct: 204 EVRVNKVVGNLHFIPGRTFHRNDIHTHDLVPYLHGTGDDVHHFGHKIHRFSFGMEDEFAI 263
Query: 168 ------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
++ G+ N L+G + + M+QYF+KVVP ++GH + + Q+
Sbjct: 264 ERTSRGRRQGPLKNRMGIKNALEGRSAKTLSSNYMFQYFLKVVPVEVHKLNGHEMSTYQY 323
Query: 216 SVTEHFRSSEQ------------GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
S T + R+ E ++ +PGV+F Y++SP++V TE H S H ++N+
Sbjct: 324 SATSYERNLEDFDRGGQMSGHIVRMIEGIPGVYFNYEISPLRVIQTEWHHSIWHLVSNLF 383
Query: 264 AIVGGVFTVSGIIDAFIYHGQRA 286
A++GG+ TV+G+ID IY +R
Sbjct: 384 ALIGGIVTVAGLIDGAIYRSRRT 406
>gi|358391585|gb|EHK40989.1| ER-derived vesicle Erv46-like protein [Trichoderma atroviride IMI
206040]
Length = 422
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/339 (38%), Positives = 175/339 (51%), Gaps = 61/339 (17%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGEQ V H I K RL G VIES L + + EH N Y
Sbjct: 89 MDVSGEQQHGVAHGITKLRLQPPSRGGGVIESNS-----------LAQLHEKAEHLNPDY 137
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CCN C+EVREAY + WA + ++QC+RE + +R+ ++ E
Sbjct: 138 CGGCYGATAPANAEKPGCCNTCDEVREAYAQASWAFGRGEGVEQCEREHYSERLDQQREE 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-----LAFQRDSFNISHKINKLAFG 167
GC I G L+VNKV GNFH APG+SF +HVHD+ L + + +H I+ L FG
Sbjct: 198 GCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDLPNGMKAHDFTHVIHSLRFG 257
Query: 168 EHFPGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------T 203
P V NPLDG+ P+ Y YF+K+VPT Y
Sbjct: 258 PQLPPEVIARMGRRTAWTNHHLNPLDGIHQETSDPNFNYMYFVKIVPTSYLPLGWEQKSA 317
Query: 204 DVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEE 251
S +++++Q+SVT H RS G RL + +PGVFF YD+SP+KV EE
Sbjct: 318 SASDGSVETHQYSVTSHKRSLMGGDDAKEGHAERLHSKGGIPGVFFSYDISPMKVINREE 377
Query: 252 HV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+FL FL+ +CAIVGG TV+ ID ++ G +KK
Sbjct: 378 RAKTFLGFLSGLCAIVGGTLTVAAAIDRGLFEGATRLKK 416
>gi|387219467|gb|AFJ69442.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Nannochloropsis gaditana CCMP526]
Length = 432
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 114/306 (37%), Positives = 169/306 (55%), Gaps = 31/306 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-----NET 55
MD++G+ + V+H++ K+RL SQG + IG P ++ P + +
Sbjct: 117 MDVAGDNQMQVEHNMLKQRLSSQG-------ERIGFPFLEDPTDFDSKKADALLGAAPWD 169
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE---EEGE 112
YCGSC+ A + CCN+C+++ +AY +G + GF ++GE
Sbjct: 170 YCGSCFQARTHTGACCNSCQDLEQAYLTQGLPMGKIKTTAPQCLPGFQAPAPSGPMQKGE 229
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GCN+ GF+ VNKVAGNFH A G S + G H+H + + FN+SH I ++FG+ +PG
Sbjct: 230 GCNLKGFMSVNKVAGNFHIAFGDSVVKDGRHIHQFIPSEAPFFNVSHTIQHVSFGDEYPG 289
Query: 173 VVNPLDG-VRWTQETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS------- 223
VNPLDG V++ T +G++QYFIKV+PT Y +G I++N+ SVTE F+
Sbjct: 290 RVNPLDGKVKYVSSTVGTGLFQYFIKVIPTHYKGRAGEAIRTNRISVTERFKPLHKEGEA 349
Query: 224 -------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ + LPGVFF YDLSP V + V F HFL +CAI GGVF++S ++
Sbjct: 350 RLTGDSHAHNDQTSVLPGVFFIYDLSPFNVEVSTVSVPFSHFLVKLCAIAGGVFSISRLL 409
Query: 277 DAFIYH 282
D Y+
Sbjct: 410 DNVFYY 415
>gi|322693278|gb|EFY85144.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Metarhizium acridum CQMa 102]
Length = 356
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 128/344 (37%), Positives = 174/344 (50%), Gaps = 64/344 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + RL R + G ID K ++ H EH + +YCG
Sbjct: 16 MDVSGEQQHGVSHGVKNVRL---------RPESQGGGVIDIKSMKVHDDPAEHLDPSYCG 66
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA + CCN C+EVREAY +GWA + ++QC RE + +R+ E+ EGC
Sbjct: 67 ECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCTREHYAERLDEQREEGC 126
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFGEHF 170
+ G LEVNKV GNFH APG+SF +HVHD+ + + +H I++L FG
Sbjct: 127 RVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQHDFTHTIHQLRFGPQL 186
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------TDV 205
P V NPLDG R P+ Y YF+K+VPT Y +
Sbjct: 187 PAAVSDRLGKGSMPWTNHHINPLDGTRQETGDPAFNYMYFVKIVPTSYLPLGWEKRFKNA 246
Query: 206 SGHT-------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFYDLSPIKV 246
+G T ++++Q+SVT H RS E G +PGVFF YD+SP+KV
Sbjct: 247 AGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQGGIPGVFFSYDISPMKV 306
Query: 247 TFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 307 INREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 350
>gi|51214107|emb|CAH17876.1| hypothetical protein (22C8.0001), conserved [Pneumocystis carinii]
Length = 388
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 167/310 (53%), Gaps = 28/310 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE DV H++ K RLDS G I S + +P + YCGSC
Sbjct: 92 MDVSGELETDVSHNVVKNRLDSNGIFINST--SLNTLNFQQPAKTRP------PDYCGSC 143
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ E CCN C++V +AY W + + +QCK + E EGCN G +
Sbjct: 144 YGAK---EGCCNTCQQVIDAYASNNWPVPDTKAFEQCKEK---YNNLNEFDEGCNFVGRI 197
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFPG--VVNP 176
EVNKV GNFHFAPG S H+HDI + DS + SH INKL+FG G + NP
Sbjct: 198 EVNKVVGNFHFAPGHSSQIMRNHIHDIYDYMTDSSPHDFSHTINKLSFGPEVEGRSLQNP 257
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS----------SEQ 226
LD V+ + P+ Y YFIK V + +S ++ +N++SVT H RS +
Sbjct: 258 LDNVKKETDNPTLRYSYFIKCVAYRFEYLSKPSLDTNKYSVTVHERSISGDSDPNYPTHI 317
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
+PGVFF YD+SPIK+ E +F FLT+ I+ GV T++GI+D +Y +R
Sbjct: 318 SPKDGIPGVFFSYDISPIKIIERETRGNFSTFLTSTVIIISGVLTIAGIVDRILYETERQ 377
Query: 287 IKKKIEIGKF 296
I+KK+ GKF
Sbjct: 378 IEKKLREGKF 387
>gi|322708973|gb|EFZ00550.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Metarhizium anisopliae ARSEF 23]
Length = 429
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/344 (36%), Positives = 174/344 (50%), Gaps = 64/344 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + RL R + G ID K ++ H +H + +YCG
Sbjct: 89 MDVSGEQQHGVSHGVKNVRL---------RPESQGGGVIDIKSMKVHDDPADHLDPSYCG 139
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA + CCN C+EVREAY +GWA + ++QC RE + +R+ E+ EGC
Sbjct: 140 ECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCTREHYAERLDEQREEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFGEHF 170
+ G LEVNKV GNFH APG+SF +HVHD+ + + +H I++L FG
Sbjct: 200 RVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQHDFTHTIHQLRFGPQL 259
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------TDV 205
P V NPLDG R P+ Y YF+K+VPT Y +
Sbjct: 260 PAAVSDRLGKGSMPWTNHHLNPLDGTRQEIGDPAFNYMYFVKIVPTSYLPLGWEKRFKNA 319
Query: 206 SGHT-------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFYDLSPIKV 246
+G T ++++Q+SVT H RS E G +PGVFF YD+SP+KV
Sbjct: 320 AGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQGGIPGVFFSYDISPMKV 379
Query: 247 TFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 380 INREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 423
>gi|224000966|ref|XP_002290155.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973577|gb|EED91907.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 396
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 115/304 (37%), Positives = 172/304 (56%), Gaps = 50/304 (16%)
Query: 8 HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRH-----GGRL---EHNETYCGS 59
H+D KH I+K RL+ G KP+ R GG L +H+E CGS
Sbjct: 101 HIDKKHRIWKHRLNKDG----------------KPIGRKSRFELGGTLTSSDHDEEECGS 144
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
CYGA E CCN C++V+ AYR K W +++ I QC L R+K+E+GEGCNI+G+
Sbjct: 145 CYGAGGEGE-CCNTCDDVKRAYRTKQWHITDMTKITQCAH---LVRVKDEDGEGCNIHGY 200
Query: 120 LEVNKVAGNFHFAPGKSFHQSG------------VHVHDILAFQRDS---FNISHKINKL 164
+ ++ GN HFAP + + + G +++ I+ D+ FN++H +NKL
Sbjct: 201 VALSTGGGNLHFAPDRQWEKEGDKQNGLMIMGGFINLDSIVEMFNDAYEQFNVTHTVNKL 260
Query: 165 AFGEHFP-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
+FG + P + + LDG T GM+Q+++++VPTVY ++G TI++ Q+SV
Sbjct: 261 SFGPYMPKHVKNSLNLTSQLDGATRTVTDGYGMFQFYLQIVPTVYRFLNGTTIETFQYSV 320
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
TEH R + G + +PGVFFFY++S + V F E + HF T VCA VGG FTV G++D
Sbjct: 321 TEHVRHVDPGSNRGMPGVFFFYEVSALHVEFEEYRRGWTHFFTGVCAAVGGAFTVMGMLD 380
Query: 278 AFIY 281
++
Sbjct: 381 RLVF 384
>gi|407852879|gb|EKG06122.1| hypothetical protein TCSYLVIO_002790, partial [Trypanosoma cruzi]
Length = 472
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 120/309 (38%), Positives = 169/309 (54%), Gaps = 28/309 (9%)
Query: 11 VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
V+ D K R+ + + E+R KI K L G E+ C SCYGAE
Sbjct: 166 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPG 221
Query: 69 DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
CC+ CE+VR AY + W + D+ ++QC E + EGCN++ +V +V G
Sbjct: 222 ACCHTCEDVRRAYSLRRWVFNEDDISVEQCAEERLRKAATLSSQEGCNLFVNYKVARVTG 281
Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ--- 184
N HF PG+ F+ G H+HD N+SH ++ L FGE FPG VNP+DG+ ++
Sbjct: 282 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNSRGAV 341
Query: 185 ---ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQ------- 230
E +G + YF+KVVPT Y S G ++SNQ+SVT HF S L
Sbjct: 342 DATEEVNGRFSYFVKVVPTQYQSASVLGVGSVVESNQYSVTRHFTPSPSAELSAAAAESS 401
Query: 231 --TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
+PGVF YDLSPIKV E+H S LH + +CA+ GGVFTV+G++D+ I+HG R
Sbjct: 402 PVVVPGVFITYDLSPIKVFVIEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRR 461
Query: 287 IKKKIEIGK 295
+++K++ GK
Sbjct: 462 VQRKMQQGK 470
>gi|340053482|emb|CCC47775.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 207 bits (526), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 119/316 (37%), Positives = 172/316 (54%), Gaps = 35/316 (11%)
Query: 3 ISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
I + + V D + +++G V+E RQ A GG C SCYG
Sbjct: 101 IRSTRKMRVHADTLQPISEARGLVVEKRQSSTNADS--------GG-----AEGCPSCYG 147
Query: 63 AESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLE 121
AE + DCCN C++VR A++ KGW+ + D+ I QC E EGCNIY
Sbjct: 148 AEKNPGDCCNTCDDVRNAFKDKGWSFNEDDIGIAQCAEERLRHAESSSSREGCNIYAKFS 207
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
++V GN HF PG F G H+H + N+SH I++L FGE FPG NPLDG+
Sbjct: 208 ASRVKGNIHFVPGSMFDYYGQHMHVLKGEIIRKMNLSHIIHQLDFGERFPGQKNPLDGMV 267
Query: 182 WTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS--EQGRL 229
++ E+ +G + YF++VVPT Y VS G +++NQ+SVT +F S GR
Sbjct: 268 NSRGVVDKSESTNGRFSYFVQVVPTQYQHVSIFGTGRLLETNQYSVTHYFTESWNATGRD 327
Query: 230 QT-------LPGVFFFYDLSPIK--VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
++ +PG+F YD+SPIK V T + S +H + +CA+ GGVF V+ +ID+F+
Sbjct: 328 KSANDAPSVVPGIFILYDISPIKTSVKATHPYPSVVHLVLQLCAVGGGVFNVASLIDSFL 387
Query: 281 YHGQRAIKKKIEIGKF 296
+HG R ++KKI GK+
Sbjct: 388 FHGTRQVQKKIRQGKY 403
>gi|193627365|ref|XP_001948436.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Acyrthosiphon pisum]
Length = 404
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 119/308 (38%), Positives = 165/308 (53%), Gaps = 23/308 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNET--- 55
+D SGE HL V H+I+K+RL+ +G I + D +G+ K P L+ NET
Sbjct: 95 VDNSGETHLQVDHNIYKRRLNLEGQPISDPEKSDDVGSKKTLNP----PSMLKSNETDDA 150
Query: 56 -----YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CGSCYGAESS CCN C++V+ AY+ K W P I+QCK + + ++
Sbjct: 151 NNTEDICGSCYGAESSTIPCCNTCDDVKRAYKMKNWDF-RPSSIEQCKNQSSQNEMYDKA 209
Query: 111 -GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
EGC +YG L VN+V+G+FH APG SF + +HVHD+ F SFN +H I L+FG+
Sbjct: 210 FKEGCQLYGTLLVNRVSGSFHIAPGMSFSFNHMHVHDVHPFSSSSFNTTHTIRHLSFGQK 269
Query: 170 FPGVV-----NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+ NPLD + M+QY+IK+VPT+Y +NQFSVT+H +
Sbjct: 270 LESINTSHGGNPLDSTESIAGEGATMFQYYIKIVPTLYQRRDLSIFSTNQFSVTKHKVQA 329
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
PG+FF Y+ SPI + TE+ H T + GVF IID F+Y
Sbjct: 330 FDKGPSGAPGIFFSYEFSPIMIKLTEKPRLLGHLFTQFLCNISGVFICFWIIDIFMYKVS 389
Query: 285 RA--IKKK 290
+ I+KK
Sbjct: 390 KVYNIRKK 397
>gi|50548631|ref|XP_501785.1| YALI0C13112p [Yarrowia lipolytica]
gi|49647652|emb|CAG82095.1| YALI0C13112p [Yarrowia lipolytica CLIB122]
Length = 401
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 119/314 (37%), Positives = 172/314 (54%), Gaps = 26/314 (8%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D SGE V HD+ K LD +GN++ S +G K + + + YCGSC
Sbjct: 87 IDSSGEVQQSVDHDMTKVTLDERGNILSSEALTLGENPDSKAVAKR--TFLDDPNYCGSC 144
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES + CCN CE+VR AY KGWA ++ ++QC+ GF +++K + +GCNI G
Sbjct: 145 YGAESEPDQCCNTCEQVRAAYATKGWAFTDGSGVEQCEVIGFKEQLKAQYNQGCNIAGKF 204
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF-------- 170
V KVAGNFHFAPG S H+ H+HD+ F+ F SH I+ L+FGE
Sbjct: 205 TVQKVAGNFHFAPGVSSHRDEQHLHDLSHFKDPEAPFTFSHIIHDLSFGEQVDVSGLDWD 264
Query: 171 PGV---VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
GV +PL+ + + YF KVV T + + G I++NQ++ T H R + G
Sbjct: 265 KGVAMETSPLENTPHHTDNKWFRFNYFTKVVSTRFEFLDGKKIETNQYAATAHERPLQGG 324
Query: 228 RLQT----------LPGVFFFYDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGII 276
R + LPGVFF YD+SP+++ +E+ S F F+ V A +GGV TV+ ++
Sbjct: 325 RDEDHQNTRHMRGGLPGVFFSYDISPMRIVNKQEYRSHFGAFVMQVVATIGGVLTVAAVL 384
Query: 277 DAFIYHGQRAIKKK 290
D IY + +K+K
Sbjct: 385 DRGIYEVDQVLKRK 398
>gi|71407913|ref|XP_806393.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870127|gb|EAN84542.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 406
Score = 206 bits (524), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 123/312 (39%), Positives = 170/312 (54%), Gaps = 30/312 (9%)
Query: 11 VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
V+ D K R+ + + E+R KI K L G E+ C SCYGAE
Sbjct: 100 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPG 155
Query: 69 DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
CC+ CE+VR AY + W + D+ ++QC E + EGCN++ +V +V G
Sbjct: 156 ACCHTCEDVRRAYSLRRWVFNEDDVSVEQCAEERLRKAAILSSQEGCNLFVNYKVARVTG 215
Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------V 180
N HF PG+ F+ G H+HD N+SH ++ L FGE FPG VNP+DG V
Sbjct: 216 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNLRGAV 275
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQ------ 230
T+E +G + YF+KVVPT Y S G ++SNQ+SVT HF S L
Sbjct: 276 DATEEV-NGRFSYFVKVVPTQYQSASILGVGSVVESNQYSVTHHFTPSPSAELSAAAAES 334
Query: 231 ---TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
+PGVF YDLSPIKV E+H S LH + +CA+ GGVFTV+G++D+ I+HG R
Sbjct: 335 SPVMVPGVFITYDLSPIKVFVFEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVR 394
Query: 286 AIKKKIEIGKFS 297
+++K++ GK S
Sbjct: 395 RVQRKMQQGKQS 406
>gi|296417040|ref|XP_002838173.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634087|emb|CAZ82364.1| unnamed protein product [Tuber melanosporum]
Length = 399
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 164/316 (51%), Gaps = 36/316 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGEQ + H I RL ES+ P L H H + YCG
Sbjct: 89 MDVSGEQQSSITHGIHLTRLTP---FPESK------PVSTTSLNVHEDTASHLDPAYCGK 139
Query: 60 CYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
CYGA ++D CC CE+VREAY GWA + ++QC+RE + +R+ E EGCNI
Sbjct: 140 CYGAPGPEKDKGCCQTCEDVREAYASIGWAFGKGEGVEQCEREHYAERLDEMREEGCNIA 199
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFPGVV- 174
G L VNKV GNFH APGKSF + +HVHD+ + +H I+ L+FG P V
Sbjct: 200 GHLSVNKVIGNFHIAPGKSFSSAQMHVHDLNQYFASTKEHTFTHTIHHLSFGPDLPANVK 259
Query: 175 ---NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-------TIQSNQFSVTEHFRSS 224
NPLD R + S + YFIKVV T Y + I+++Q+SVT H RS
Sbjct: 260 VQRNPLDDSRQVTQERSFNFMYFIKVVSTSYLPLGTSENSYIPGAIETHQYSVTSHKRSL 319
Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
G + +PGVFF YD+SP+KV E SF FLT VCA++GG TV+
Sbjct: 320 MGGADKEHASTIHARGGIPGVFFSYDISPMKVINREVRAKSFAGFLTGVCAVIGGTLTVA 379
Query: 274 GIIDAFIYHGQRAIKK 289
ID +Y G +KK
Sbjct: 380 AAIDRGLYEGGMRVKK 395
>gi|340520521|gb|EGR50757.1| predicted protein [Trichoderma reesei QM6a]
Length = 430
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 176/345 (51%), Gaps = 65/345 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H I K RL +G +I+ K L + + EH + YCG
Sbjct: 89 MDVSGEQQHGVAHGITKIRLQPAA---------LGGGEIESKSLSQLHEKAEHLDPNYCG 139
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA + CCN C+EVREAY WA + ++QC+RE + +R+ ++ EGC
Sbjct: 140 GCYGAIAPSTAQKPGCCNTCDEVREAYALASWAFGRGEGVEQCEREHYAERLDQQREEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
I G L+VNKV GNFH APG+SF +HVHD+ + + S + +H I+ L FG
Sbjct: 200 RIEGLLQVNKVIGNFHLAPGRSFSNGNMHVHDLKNYWDLPEGKSHDFTHIIHSLRFGPQL 259
Query: 171 PGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV---------- 205
P V NPLD R + P+ Y YF+K+VPT Y +
Sbjct: 260 PDTVIERLGGKNTWSNHHLNPLDNTRQDTKDPNFNYMYFVKIVPTSYLPLGWEKRKPSTT 319
Query: 206 --------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIK 245
S +I+++Q+SVT H RS G RL +PGVFF YD+SP+K
Sbjct: 320 NGGVTTFYSDGSIETHQYSVTSHKRSLMGGDDAKEGHPERLHARNGIPGVFFSYDISPMK 379
Query: 246 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
V EE +FL FL+ +CAIVGG TV+ +D ++ G +KK
Sbjct: 380 VINREERAKTFLGFLSGLCAIVGGTLTVAAAVDRGLFEGATRLKK 424
>gi|358378080|gb|EHK15763.1| hypothetical protein TRIVIDRAFT_86970 [Trichoderma virens Gv29-8]
Length = 420
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/333 (36%), Positives = 170/333 (51%), Gaps = 51/333 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGEQ V H I K RL G G + + Q H YCG C
Sbjct: 89 MDVSGEQQHGVAHGISKIRLRPAAQ-------GGGEIESNTLTQLHEKAEHLAPDYCGGC 141
Query: 61 YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
YGA + CCN C+EVREAY + WA + ++QC+RE + +R+ ++ EGC I
Sbjct: 142 YGATAPANAEKPGCCNTCDEVREAYAQMSWAFGRGEGVEQCEREHYAERLDQQREEGCRI 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHFPG 172
G L+VNKV GNFH APG+SF +HVHD+ + + + +H I+ L FG P
Sbjct: 202 EGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKTYWDFPEGKPHDFTHIIHSLRFGPQLPD 261
Query: 173 VV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------T 209
V NPLD + P+ Y YF+K+VPT Y + +
Sbjct: 262 TVIERMGGKNTWTNHHLNPLDATHQETKDPNFNYMYFVKIVPTSYLPLGWEKRTPGYDGS 321
Query: 210 IQSNQFSVTEHFRS------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFL 256
I+++Q+SVT H RS S++G + L PGVFF YD+SP+KV EE +FL
Sbjct: 322 IETHQYSVTSHKRSLMGGDDSQEGHPERLHARNGIPGVFFSYDISPMKVINREERAKTFL 381
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FL+ +CAIVGG TV+ +D ++ G +KK
Sbjct: 382 GFLSGLCAIVGGTLTVAAAVDRGLFEGASRLKK 414
>gi|134054958|emb|CAK36967.1| unnamed protein product [Aspergillus niger]
Length = 406
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 173/328 (52%), Gaps = 53/328 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V H I K RL S G VI+ + A ++ K L + YC
Sbjct: 89 MDVSGEQQTGVVHGINKVRLTSAAEGGRVID-----VKALELAKHL---------DPDYC 134
Query: 58 GSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + S CCN C+EVREAY ++ WA + ++QC+ EG+ +RI + EG
Sbjct: 135 GECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRREG 194
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFG 167
C + G L VNKV GNFH APG+SF +HVHD+ F + ++H+I++L FG
Sbjct: 195 CRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLANFFDADLPDAEKHTMTHEIHQLRFG 254
Query: 168 EHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQ 214
P + NPLDG + P Y YF+KVV T Y + I+++Q
Sbjct: 255 PQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLIETHQ 314
Query: 215 FSVTEHFRS------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTN 261
+SVT H RS S++G + L PGVF YD+SP+KV E +F FLT
Sbjct: 315 YSVTSHKRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTG 374
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VCAI+GG TV+ +D +Y G +KK
Sbjct: 375 VCAIIGGTLTVAAALDRGLYEGVSRMKK 402
>gi|348667280|gb|EGZ07106.1| hypothetical protein PHYSODRAFT_319656 [Phytophthora sojae]
Length = 398
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 156/290 (53%), Gaps = 8/290 (2%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
D++G D++H+I K LD G + E D IG + + HG E ++ CGSC
Sbjct: 95 DMAGNVQHDIEHNIRKIPLDHTGQALAEGMHDVIGG-ALTNNTELHG---ETDKPACGSC 150
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
Y A E CC+ CE V+ AY +K W + + I QC+ + ++ E EGC I G L
Sbjct: 151 YSAGEPGE-CCDTCESVKAAYARKSWMMPSLHTIAQCQEVEIEKVLRGEVNEGCRIQGSL 209
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
V+KVAG +FAP K F + D++ F+ SH I L+FGE +P + NPLD
Sbjct: 210 VVSKVAGKLYFAPSKFFRSGYLSSKDLVDATFKVFDTSHTIRSLSFGEAYPDMKNPLDNR 269
Query: 181 R--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
+ E G +QYF+KVVPT YT +S I +NQFS TEHFR + LP V F
Sbjct: 270 KKELPDEKTRGSFQYFLKVVPTEYTFLSASRIITNQFSATEHFRQLTPVSDKGLPMVTFS 329
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 288
Y SPI + V FL FLT+VCAIVGGVFT + D +Y GQ K
Sbjct: 330 YTFSPIMFRIEQYRVGFLQFLTSVCAIVGGVFTRTATADESVYRGQVGAK 379
>gi|346979363|gb|EGY22815.1| ER-derived vesicles protein ERV46 [Verticillium dahliae VdLs.17]
Length = 435
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 130/350 (37%), Positives = 174/350 (49%), Gaps = 70/350 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEHNET---Y 56
MD+SGEQ + I K RL SQ +DG G ID K L H Y
Sbjct: 89 MDVSGEQQHGIVSGISKVRLRSQ-------KDGGGV--IDTKALSLHAADEAATHLAPDY 139
Query: 57 CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA++ + CCN CEEVREAY + WA + ++QC RE + +R+ E+ E
Sbjct: 140 CGDCYGAKAPANAVKQGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLDEQRAE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF +HVHD+ + + + +H+I+ L FG
Sbjct: 200 GCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDGDITHDFTHQIHALRFGPQL 259
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
P + NPLDG PS + YF+K+VPT Y +
Sbjct: 260 PESITKNLGNKATPWTNHHLNPLDGTSQITTDPSFNFMYFVKIVPTSYLPLGWDSKRSPQ 319
Query: 206 -------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYD 240
S +I+++Q+SVT H RS G RL T +PGVFF YD
Sbjct: 320 DHDGGLLGSFGQGSDGSIETHQYSVTSHKRSLSGGDDSAEGHAERLHTRGGIPGVFFSYD 379
Query: 241 LSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+SP+KV EE SF FLT +CA++GG TV+ +D ++ G +KK
Sbjct: 380 ISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAAAVDRGMFEGSLRLKK 429
>gi|219111025|ref|XP_002177264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411799|gb|EEC51727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 404
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 116/307 (37%), Positives = 179/307 (58%), Gaps = 29/307 (9%)
Query: 8 HLDVKHDIFKKRLDSQGN-----VIESRQDGIGAPKI-DKPLQRHGGRLEHNE------- 54
HLD H ++K R+ N + E + +G+ + +K L+ L++ +
Sbjct: 99 HLDTDHHVWKHRITLLPNGHRQLLGERSKLELGSTLLTEKDLEVKAEELQNAKDNSESRT 158
Query: 55 --TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
T CG CYGA E CC +CE+V+ AY+++GW+L + + QC+RE I E EGE
Sbjct: 159 EMTPCGDCYGAGEEGE-CCKSCEDVKRAYKRRGWSLRDTSGVSQCRRE---SGIAEAEGE 214
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQS---GVHVHDILAFQRDSFNISHKINKLAFGEH 169
GCN++G + ++ GN H APG+ + G+++ D L +N+SH+I+KL FG+
Sbjct: 215 GCNVHGVVALSSGGGNLHIAPGRDTEANFPGGMNIFDALLQSFHQWNVSHQIHKLRFGKD 274
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
+P V LDG T GMYQY+ +VVPT YT ++G TIQ++Q+SVTEH R G
Sbjct: 275 YPAGVYQLDGETRTITDGYGMYQYYFQVVPTRYTFLNGTTIQTHQYSVTEHLRHVSPGSN 334
Query: 230 Q------TLPGVFFFYDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
+ +PG+FFFY++SP+ V E + ++ FLT+VCAIVGGV T++G+ID I+
Sbjct: 335 RGYSLNSRMPGIFFFYEVSPLHVDIMEVYQKGWIAFLTSVCAIVGGVVTIAGLIDHVIFS 394
Query: 283 GQRAIKK 289
Q + ++
Sbjct: 395 RQHSSRE 401
>gi|346324387|gb|EGX93984.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Cordyceps militaris CM01]
Length = 423
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 174/338 (51%), Gaps = 58/338 (17%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + K RL R +G G ID L H EH + +YCG
Sbjct: 89 MDVSGEQQHGVAHGVHKVRL---------RPEGEGGGVIDVSSLNLHNDAAEHLDPSYCG 139
Query: 59 SCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C GA + + CCN CEE+REAY + WA + +QC+RE + +R++E+ EGC
Sbjct: 140 DCGGAPAPTTVTKAGCCNTCEEIREAYAQVSWAFGDGKAFEQCEREHYAERLEEQRHEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHF 170
I G L+VNKV GNFH APG+SF +HVHD+ + + + +H I+ L FG
Sbjct: 200 RIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETTDDKKHDFTHHIHHLRFGPQL 259
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH------ 208
P V NPLD + P+ + YF+K+VPT + +
Sbjct: 260 PETVVQKLGKGATPWTNHHGNPLDSTKQLTNDPNFNFMYFVKIVPTSFLPLGWEKMARTM 319
Query: 209 ----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEH 252
+++++Q+SVT H RS G RL + +PGVFF YD+SP+KV EE
Sbjct: 320 NVDASVETHQYSVTSHKRSLTGGDDSAEGHAERLHSRGGIPGVFFSYDISPMKVINREEK 379
Query: 253 -VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
SFL F+ +CA+VGG TV+ +D ++ G +KK
Sbjct: 380 GKSFLGFVAGLCAVVGGTLTVAAAVDRGLFEGTTRLKK 417
>gi|400602673|gb|EJP70275.1| endoplasmic reticulum-golgi intermediate compartment protein 3
[Beauveria bassiana ARSEF 2860]
Length = 423
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 174/338 (51%), Gaps = 58/338 (17%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + K RL R + G ID L H EH + +YCG
Sbjct: 89 MDVSGEQQHGVAHGVHKVRL---------RPEAEGGGVIDVSSLDLHNDAAEHLDPSYCG 139
Query: 59 SCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C GA + CCN CEE+REAY + WA + +QC+RE + +R++E+ EGC
Sbjct: 140 DCGGAPAPSNVKKAGCCNTCEEIREAYAQVSWAFGDGKAFEQCEREHYAERLEEQRHEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHF 170
I G L+VNKV GNFH APG+SF +HVHD+ + + + +H I+ L FG
Sbjct: 200 RIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETTDDKKHDFTHYIHHLRFGPQL 259
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
P V NPLD + + P+ + YF+K+VPT + +
Sbjct: 260 PEAVVKKMGKGATPWTNHHANPLDNTKQLTDDPNYNFMYFVKIVPTSFLPLGWEKMSRAM 319
Query: 206 -SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEH 252
+ +++++Q+SVT H RS G RL + +PGVFF YD+SP+KV EE
Sbjct: 320 NTDGSVETHQYSVTSHKRSLTGGDDAAEGHAERLHSRGGIPGVFFSYDISPMKVINREEQ 379
Query: 253 -VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
SFL F+ +CA+VGG TV+ +D ++ G +KK
Sbjct: 380 GKSFLGFIAGLCAVVGGTLTVAAAVDRGLFEGTTRLKK 417
>gi|157873507|ref|XP_001685262.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68128333|emb|CAJ08503.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 467
Score = 199 bits (507), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
+DI G DV+ + K+R+D+ G VI + + + K+ K + G E+ C
Sbjct: 151 VDIFGVFANDVEGNTVKQRIDAATGQVISAARAMVDEKKVMTKAIDADGAEKEN----CP 206
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
SCYGAE + DCC+ CE+VR+AY ++GW L ++ ++QC + EGCN+Y
Sbjct: 207 SCYGAERNPGDCCHTCEDVRQAYARRGWKLDIDEISVEQCAEDRINMAAAASGKEGCNLY 266
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
++ G+ F PG+ + G +HD++ ++SH ++ L FG+ FPG NPL
Sbjct: 267 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSTTRKLDLSHTVHTLEFGDPFPGQQNPL 326
Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVSGHT-----IQSNQFSVTEHFRSSE 225
DG ++ +G + YF+K+VPT Y S T ++SNQ+S T HF SE
Sbjct: 327 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDVVESNQYSATHHFTPSE 386
Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ + +PGVF YDLSP+++ E H S HF+ +CA+ GGV TV+G+
Sbjct: 387 AAKAASQAPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLAHFVLQLCAVCGGVLTVAGL 446
Query: 276 IDAFIYHGQRAIKK 289
+D+ +H R I+K
Sbjct: 447 VDSLCFHSARKIRK 460
>gi|392566201|gb|EIW59377.1| endoplasmic reticulum-derived transport vesicle ERV46 [Trametes
versicolor FP-101664 SS1]
Length = 423
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 170/327 (51%), Gaps = 49/327 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQG-----NVIESRQDGIGAPKIDKPLQRHGGRLEHNET 55
MDISGE D+ H+I K R+D +G VI Q+ + + G E
Sbjct: 91 MDISGETQSDITHNILKTRMDERGFPVPTTVITELQNDLDKINSQREGGYCGSCYGGVEP 150
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
G CCN CE+VR+AY +GW+ + PD I+QC +EG+ +++KE+ EGCN
Sbjct: 151 EGG-----------CCNTCEDVRQAYVNRGWSFNRPDSIEQCVQEGWSEKLKEQATEGCN 199
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF------ 166
I G + VNKV GN H +PG+SF S +++++ + + N +H I+ LAF
Sbjct: 200 IAGRVRVNKVVGNIHLSPGRSFRTSSHSLYELVPYLKTDGNRHDFTHTIHHLAFEGDDEW 259
Query: 167 -----------GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
+ NPLDG M+QYF+KVV T + +SG TI ++Q+
Sbjct: 260 DLAKAKLGKELKQRLGIAANPLDGTTGRTIKQQYMFQYFLKVVATQFRTLSGKTINTHQY 319
Query: 216 SVTEHFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
S T R ++G + +PG FF Y++SP+++ E SF HFLT+
Sbjct: 320 SATHFERDLDKGSQENTPTGVHVAHGNGGIPGAFFNYEISPLRIVHAETRQSFAHFLTST 379
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKK 289
CAIVGGV TV+ +ID+ ++ ++A+KK
Sbjct: 380 CAIVGGVLTVASLIDSALFATRKALKK 406
>gi|389602486|ref|XP_001567299.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|322505471|emb|CAM42729.2| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 541
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPK-IDKPLQRHGGRLEHNETYCG 58
+D+ G DV+ + K+R+D+ G VI + + + K I K + G E+ C
Sbjct: 225 VDVFGVFANDVEDNTVKQRIDAATGQVISAARAVVDEKKVITKAIDADGVEKEN----CP 280
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
SCYGAE S DCC+ CE+VR+AY +KGW L+ D+ ++QC + EGCN+Y
Sbjct: 281 SCYGAERSPGDCCHTCEDVRQAYAQKGWRLNVDDISVEQCAEDRIKMATAAFGKEGCNLY 340
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
++ G+ F PG+ + G +HD++ ++SH ++ L FGE FPG NPL
Sbjct: 341 ATFAASRATGSLQFIPGRMYQMLGRRMHDLMGSAARKLDLSHTVHTLEFGERFPGQQNPL 400
Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVS-----GHTIQSNQFSVTEHFRSSE 225
DG ++ +G + YF+KV+PT Y S T++SNQ++ T HF S
Sbjct: 401 DGTAQGSALSGDAKDAMNGRFSYFVKVIPTTYQRYSLITGLQDTVESNQYTATHHFTPSA 460
Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ + +PGVF YDLSP+++ E H S +HF+ +CA+ GGV TV G+
Sbjct: 461 ATKAASQTPTMQEIVPGVFMTYDLSPVRILAQERHPYPSVIHFVLQLCAVCGGVLTVVGL 520
Query: 276 IDAFIYHGQRAIKK 289
+D+ +H R ++K
Sbjct: 521 VDSMCFHSVRKVRK 534
>gi|255941116|ref|XP_002561327.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211585950|emb|CAP93687.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 412
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/329 (37%), Positives = 171/329 (51%), Gaps = 49/329 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SGEQ + V H + K RL G VI+ + D + + K L Y
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLSPHNEGGKVIDVQALDLHSSSEAAKHLA---------PDY 139
Query: 57 CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG C GA CC CEEVREAY +K WA + I+QCKREG+ +++ E+ E
Sbjct: 140 CGECGGATPPANVIKPGCCTTCEEVREAYAEKQWAFGDGSNIEQCKREGYAEKLAEQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC I G L+VNKV GNFH APG+SF +HVHD+ A+ + +SH +++L F
Sbjct: 200 GCRIEGVLKVNKVVGNFHIAPGRSFTTGNMHVHDLDAYVVPNAGPAEQHTMSHLVHELRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSN 213
G P + NPLD + + P+ + YF+KVV T Y + I+++
Sbjct: 260 GPQLPTELAGRWGWTDHHHTNPLDDTKQETDEPAYNFMYFVKVVSTSYLPLGWDPHIEAH 319
Query: 214 QFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLT 260
Q+SVT H R G R+ +PGVFF YD+SP+KV E +F +FLT
Sbjct: 320 QYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPGVFFNYDISPMKVINREARPKTFTNFLT 379
Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VCAI+GG TV+ +D +Y G +KK
Sbjct: 380 GVCAIIGGTLTVAAALDRGLYEGAMRVKK 408
>gi|146095510|ref|XP_001467598.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398020411|ref|XP_003863369.1| hypothetical protein, conserved [Leishmania donovani]
gi|134071963|emb|CAM70660.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322501601|emb|CBZ36681.1| hypothetical protein, conserved [Leishmania donovani]
Length = 467
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
+DI G DV+ + K+R+D+ G VI + + + K+ K + G E+ C
Sbjct: 151 VDIFGVFANDVEGNTVKQRIDAATGQVISAARAMVDEKKVMTKAIDADGAEKEN----CP 206
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
SCYGAE + DCC+ CE+VR+AY ++GW L ++ ++QC + EGCN+Y
Sbjct: 207 SCYGAERNPGDCCHTCEDVRQAYARRGWKLDIDEISVEQCAEDRIKMAAAASGKEGCNLY 266
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
++ G+ F PG+ + G +HD++ ++SH ++ L FG+ FPG NPL
Sbjct: 267 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSTTRKLDLSHTVHTLEFGDPFPGQQNPL 326
Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVSGHT-----IQSNQFSVTEHFRSSE 225
DG ++ +G + YF+K+VPT Y S T ++SNQ+S T HF SE
Sbjct: 327 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDAVESNQYSATHHFTPSE 386
Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ + +PGVF YDLSP+++ E H S +HF+ +CA+ GGV TV G+
Sbjct: 387 AAKAVSQTPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLVHFVLQLCAVCGGVLTVVGL 446
Query: 276 IDAFIYHGQRAIKK 289
+D+ +H R I+K
Sbjct: 447 VDSMCFHSVRKIRK 460
>gi|320592791|gb|EFX05200.1| copii-coated vesicle membrane protein [Grosmannia clavigera kw1407]
Length = 440
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/355 (35%), Positives = 172/355 (48%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V+H + RL+ Q G +I+ K L H H + YCG
Sbjct: 89 MDVSGEQQHGVQHGVRMVRLEPQSR---------GGSEIEVKTLDLHADAASHLDPEYCG 139
Query: 59 SCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA CCN C+EVREAY WA + ++QC+RE + +RI E+ EGC
Sbjct: 140 PCYGATPPQHAIKTGCCNTCDEVREAYASSSWAFGKGENVEQCQREHYAERIDEQRHEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
I G L VNKV GNFH APG+SF +HVHD+ + + + +H ++ L FG
Sbjct: 200 RIEGGLRVNKVVGNFHIAPGRSFSNGNMHVHDLKNYWDMPTPNLHSFTHTVHSLRFGPQL 259
Query: 171 PGV-------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY--------- 202
P +NPLDGV P+ Y YFIK+VPT Y
Sbjct: 260 PESLQKTLAGGGAKGQPWTNHHINPLDGVMQQTSDPNFNYMYFIKIVPTSYLALGWEKTF 319
Query: 203 ---------TDVSGH------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
DV + +++++Q+SVT H RS + G RL +PGV
Sbjct: 320 RGFVDDHDSADVGSYGLLADGSVETHQYSVTSHKRSLQGGDDAAEGHQERLHARGGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV EE +F FL +CAI+GG TV+ +D ++ G +KK
Sbjct: 380 FFSYDISPMKVVNREERAKTFAGFLAGLCAIIGGTLTVAAAVDRTVFEGTIRLKK 434
>gi|380489161|emb|CCF36889.1| hypothetical protein CH063_08353 [Colletotrichum higginsianum]
Length = 437
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 125/352 (35%), Positives = 175/352 (49%), Gaps = 72/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGEQ V H + K RL SQ G VI+ + +D L EH + Y
Sbjct: 89 MDVSGEQQHGVIHGVNKVRLRSQKEGGGVIDMK-------ALD--LHSREATAEHLDPNY 139
Query: 57 CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG+CYGA++ CCN CEEVREAY + WA + ++QC RE + +R++E+ E
Sbjct: 140 CGACYGAQAPANAQKAGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLEEQRQE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
GC + G L VNKV GNFH APG+SF +HVHD+ + + +H I+ L FG
Sbjct: 200 GCRLEGNLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPDDAQHDFTHTIHSLRFGP 259
Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH---- 208
P V NPLD P+ + YF+K+VPT Y ++
Sbjct: 260 QLPDQVTKKMGKRAYAWTNHHGNPLDNTHQETTDPNYNFMYFVKIVPTSYLALNWQKSSS 319
Query: 209 ------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
+++++Q+SVT H RS G RL + +PGVFF
Sbjct: 320 YQDEENSGLGLLGQGNDGSVETHQYSVTSHKRSLAGGDDAAEGHKERLHSRGGIPGVFFS 379
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV EE +F FLT +CAI+GG TV+ +D ++ G +KK
Sbjct: 380 YDISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGLRLKK 431
>gi|119496763|ref|XP_001265155.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
gi|119413317|gb|EAW23258.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
fischeri NRRL 181]
Length = 438
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 132/355 (37%), Positives = 173/355 (48%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SGEQ + V H + K RL S G V++ + D +I K L + Y
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALDLHSKEEIAKHL---------DPNY 139
Query: 57 CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG C GA+ S E CCN C+EVREAY K WA I+QC+REG+ RI + E
Sbjct: 140 CGDCGGADPLPGSMKEGCCNTCDEVREAYAAKNWAFGKGSNIEQCEREGYAARIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC + G L VNKV GNFH APG+SF VH HD+ + + ++H I++L F
Sbjct: 200 GCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQNYLDLELPDNEKHTMTHHIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G P V NPLD P+ + YF+KVV T Y +
Sbjct: 260 GPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
SG +I+++Q+SVT H RS S++G + L PGV
Sbjct: 320 AHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV E SF FLT VCAI+GG TV+ ID +Y G +KK
Sbjct: 380 FFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434
>gi|401426616|ref|XP_003877792.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494038|emb|CBZ29334.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 406
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 167/314 (53%), Gaps = 29/314 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
+DI G DV+ + K+R+D+ G VI + + + K+ K + G E+ C
Sbjct: 90 VDIFGVFANDVEGNTVKQRIDTATGQVISAARAIVDEKKVVTKAIDADGAEKEN----CP 145
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
SCYGAE DCC+ CE+VR+AY ++GW L ++ ++QC + EGCN+Y
Sbjct: 146 SCYGAERHPGDCCHTCEDVRQAYVRRGWKLDIDEISVEQCAEDRIKMATAAFGKEGCNLY 205
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
++ G+ F PG+ + G +HD++ ++SH ++ L FG+ FPG NPL
Sbjct: 206 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSATRKLDLSHTVHTLEFGDPFPGQQNPL 265
Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVS-----GHTIQSNQFSVTEHFRSSE 225
DG ++ +G + YF+K+VPT Y S T++SNQ+S T HF SE
Sbjct: 266 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDTVESNQYSATHHFTPSE 325
Query: 226 QGRLQT--------LPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ ++ +PGVF YDLSP+++ E H S HF+ VCA+ GGV TV G+
Sbjct: 326 AAKAESQAPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLAHFVLQVCAVCGGVLTVVGL 385
Query: 276 IDAFIYHGQRAIKK 289
+D+ +H R I+K
Sbjct: 386 VDSLCFHSVRKIRK 399
>gi|194224360|ref|XP_001916465.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Equus caballus]
Length = 342
Score = 197 bits (501), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 120/303 (39%), Positives = 163/303 (53%), Gaps = 59/303 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FK+RLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNC--EEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C SCYGAE+ D C + + + KG R +EE
Sbjct: 142 CESCYGAETEDIKPPYFCLQDHLHSSLAGKGLPWG---------------RDQEE----- 181
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
+ H V +HD+ +F D+ N++H I L+FGE +PG+V
Sbjct: 182 ---------------------ALH--AVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIV 218
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTL 232
NPLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G + Q L
Sbjct: 219 NPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGL 277
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
PGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI+
Sbjct: 278 PGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKID 337
Query: 293 IGK 295
+GK
Sbjct: 338 LGK 340
>gi|70990824|ref|XP_750261.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus fumigatus
Af293]
gi|66847893|gb|EAL88223.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus Af293]
gi|159130735|gb|EDP55848.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
fumigatus A1163]
Length = 438
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 132/355 (37%), Positives = 173/355 (48%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SGEQ + V H + K RL S G V++ + D +I K L + Y
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALDLHSKEEIAKHL---------DPNY 139
Query: 57 CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG C GA+ S E CCN C+EVREAY K WA I+QC+REG+ RI + E
Sbjct: 140 CGDCGGADPLPGSIKEGCCNTCDEVREAYAAKNWAFGKGTNIEQCEREGYAARIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC + G L VNKV GNFH APG+SF VH HD+ + + ++H I++L F
Sbjct: 200 GCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQNYLDSELPDNEKHTMTHHIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G P V NPLD P+ + YF+KVV T Y +
Sbjct: 260 GPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
SG +I+++Q+SVT H RS S++G + L PGV
Sbjct: 320 AHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV E SF FLT VCAI+GG TV+ ID +Y G +KK
Sbjct: 380 FFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434
>gi|429853391|gb|ELA28466.1| copii-coated vesicle membrane protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 437
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 125/351 (35%), Positives = 174/351 (49%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGG--RLEH-NETYC 57
MD+SGEQ V H + K RL Q ++G G + K L H EH + YC
Sbjct: 89 MDVSGEQQHGVMHGVNKVRLRPQ-------KEGGGVIDV-KALSLHSSDEAAEHLDPNYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + CCN CEEVREAY + WA + ++QC RE + ++++E+ EG
Sbjct: 141 GPCYGAPAPPNAQKAGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAEKLEEQRREG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD----SFNISHKINKLAFGEH 169
C I G L VNKV GNFH APG+SF +HVHD+ + + +H I+ L FG
Sbjct: 201 CRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPDDAQHDFTHVIHTLRFGPQ 260
Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
P + NPLD P+ + YF+K+VPT Y ++
Sbjct: 261 LPDTITKKMTKRAYAWTNHHGNPLDSTHQETNDPNYNFMYFVKIVPTSYLALNWQKSASI 320
Query: 207 -----------GH----TIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFFFY 239
GH +++++Q+SVT H RS Q RL + +PGVFF Y
Sbjct: 321 QDEESSGLGLLGHLSDGSVETHQYSVTSHKRSLAGGDDSAEGHQERLHSRGGIPGVFFSY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV EE +F FLT +CAI+GG TV+ +D ++ G +KK
Sbjct: 381 DISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGLRLKK 431
>gi|389632999|ref|XP_003714152.1| hypothetical protein MGG_01245 [Magnaporthe oryzae 70-15]
gi|351646485|gb|EHA54345.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae 70-15]
Length = 439
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 127/353 (35%), Positives = 172/353 (48%), Gaps = 72/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V+H + K RL Q G VI+++ + A L+ N YC
Sbjct: 89 MDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAKTLALHAE------DEAATHLDPN--YC 140
Query: 58 GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + CCN C+EVREAY + WA + ++QC RE + +R+ E+ EG
Sbjct: 141 GGCYGAPAPANAKKAGCCNTCDEVREAYAQASWAFGRGENVEQCTREHYAERLDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGEH 169
C I G L VNKV GNFH APG+SF +HVHD+ + + SH I+ L FG
Sbjct: 201 CQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPVEGGHSFSHTIHSLRFGPQ 260
Query: 170 FPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--- 208
P +NPLDGV T P+ Y YF+K+VPT Y +
Sbjct: 261 LPPSALEKLGNKDKNMPWTNHHINPLDGVIQTTVDPNFNYMYFVKIVPTSYLPLGWEKRT 320
Query: 209 -------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
+++++Q+SVT H RS G R+ + +PGVFF
Sbjct: 321 HLATMHDHGVGTYGYSGDGSVETHQYSVTSHKRSLAGGDDGEDGHKERMHSRGGIPGVFF 380
Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT +CAI+GG TV+ ID + G IKK
Sbjct: 381 SYDISPMKVINREVRTKTFAGFLTGLCAILGGTLTVAAAIDRMTFEGVTRIKK 433
>gi|310800359|gb|EFQ35252.1| hypothetical protein GLRG_10396 [Glomerella graminicola M1.001]
Length = 437
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHG--GRLEH-NETYC 57
MD+SGEQ V H + K RL R++G G I K L H EH + YC
Sbjct: 89 MDVSGEQQHGVMHGVNKVRL-------RPRKEGGGVIDI-KALDLHSRDDSAEHLDPNYC 140
Query: 58 GSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA++ CCN C+EVREAY + WA + ++QC RE + +R++E+ EG
Sbjct: 141 GPCYGAQAPPNAQKPGCCNTCDEVREAYAQASWAFGKGEGVEQCTREHYAERLEEQRQEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEH 169
C I G L VN+V GNFH APG+SF +HVHD+ + + +H I+ L FG
Sbjct: 201 CRIEGNLRVNRVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPADAQHDFTHTIHSLRFGPQ 260
Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----- 208
P V NPLD P+ + YF+K+VPT Y ++
Sbjct: 261 LPDQVTKKMGKRAYAWTNHHGNPLDNTHQDTNDPNYNFMYFVKIVPTSYLALNWQKSTAY 320
Query: 209 -----------------TIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFFFY 239
+++++Q+SVT H RS Q RL + +PGVFF Y
Sbjct: 321 QDDDSSSLGLLGQGNDGSVETHQYSVTSHKRSLAGGDDAAEGHQERLHSRGGIPGVFFSY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV EE +F FLT +CAI+GG TV+ +D ++ G +KK
Sbjct: 381 DISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGMRLKK 431
>gi|72393511|ref|XP_847556.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62175086|gb|AAX69235.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70803586|gb|AAZ13490.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
gi|261330829|emb|CBH13814.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 169/319 (52%), Gaps = 29/319 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D GE +V D K R+DS + G +D Q G NE C +C
Sbjct: 90 IDAFGEYVENVVTDTAKVRVDSS----TLKPLGKARQLVDLKKQPTNGNETGNEN-CPTC 144
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
YGAE + +CC+ C++VR A+ ++ W D+ I QC E EGCN++
Sbjct: 145 YGAEKNPGECCHTCDDVRRAFAERQWEFHEDDVSIAQCAHERLKVAADSASAEGCNLHAS 204
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD- 178
V +V GN HF PG+ F+ G H+H N+SH ++ L FGE FPG NP+D
Sbjct: 205 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIRKLNLSHIVHALEFGERFPGQNNPMDG 264
Query: 179 -----GVRWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS----E 225
GV+ E G + YF+KVVPT+Y VS G+ ++SNQ+SVT HF S +
Sbjct: 265 MVNARGVKDPSEPLIGRFTYFVKVVPTLYQVVSMANTGNLVESNQYSVTHHFTPSWAAPK 324
Query: 226 QGRLQ-------TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
+G +PGVF YD+SPI+V+ T H S +H + +CA+ GGV+TV+G+I
Sbjct: 325 EGETDNPNSDPLVVPGVFISYDISPIRVSVTRTHPYPSIVHLVLQLCAVGGGVYTVTGLI 384
Query: 277 DAFIYHGQRAIKKKIEIGK 295
D+ +HG + +++KI GK
Sbjct: 385 DSLFFHGIKRVQEKINRGK 403
>gi|212540034|ref|XP_002150172.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
gi|210067471|gb|EEA21563.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
marneffei ATCC 18224]
Length = 440
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 127/355 (35%), Positives = 170/355 (47%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGR---LEHNETY 56
MD+SGEQ + V H + K RL S + G ID L+ H + + Y
Sbjct: 89 MDVSGEQQMGVVHGLNKVRLSSVAD---------GGRVIDVSKLELHSQNEVAIHLDPEY 139
Query: 57 CGSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG C GA + CCN CEEVREAY K WA + I+QC+REG+ RI + E
Sbjct: 140 CGECGGASPPENAKKPGCCNTCEEVREAYALKSWAFGKGENIEQCQREGYADRIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC I G + VNKV GNFH APG+SF +HVHD+ + + +SH I++L F
Sbjct: 200 GCRIEGDIRVNKVIGNFHIAPGRSFSSGNMHVHDLDTYLDRELADYEKHTMSHIIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G V NPLD + P+ Y Y+IKVV T Y +
Sbjct: 260 GPQLSDEVSQRWQWTDHHHTNPLDSTQQLTNEPAYNYNYYIKVVSTSYLPLGWDSARSDQ 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRS---------SEQGRLQT---LPGV 235
+ +I+++Q+SVT H RS Q R+ +PGV
Sbjct: 320 LHGDDQFTPLGLHGAAHGTAGSIETHQYSVTSHKRSLHGGNDAAEGHQERIHAEGGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV E +F FLT VCA++GG TV+ +D F+Y G R I+K
Sbjct: 380 FFNYDISPMKVVNREARAKTFTGFLTGVCAVIGGTLTVAAAVDRFLYEGSRRIRK 434
>gi|449549110|gb|EMD40076.1| hypothetical protein CERSUDRAFT_132878 [Ceriporiopsis subvermispora
B]
Length = 1001
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 114/323 (35%), Positives = 170/323 (52%), Gaps = 41/323 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
MDISGE D+ H+I K RL +G + + IDK QR GG
Sbjct: 669 MDISGETQTDISHNIIKTRLTEKGLPVPNAASSELRNDIDKLNEQRQGGYCGSCYGGVEP 728
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
G CCN+CE+VR+AY +GW+ + P+ I+QC EG+ +++K++ EGCNI G
Sbjct: 729 AGG-------CCNSCEDVRQAYVNRGWSFNRPEGIEQCVDEGWSEKLKDQANEGCNIAGR 781
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF-GEHFPGVV- 174
+ VNKV GN H +PG+SF +++D++ + +D N SH I++ AF G+ ++
Sbjct: 782 VRVNKVVGNIHLSPGRSFRSGSQNLYDLVPYLKDDGNRHDFSHTIHEFAFEGDDEYDILK 841
Query: 175 ---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
NPLDG M+QYF+KVV T + + G ++ +NQ+S T
Sbjct: 842 AKSGKEMRRRMGIEGNPLDGAIGRTSKQQYMFQYFLKVVSTQFRTLDGMSVNTNQYSATH 901
Query: 220 HFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R G+ + +PG FF Y++SPI ++ E SF HFLT+ CAIV
Sbjct: 902 FERDLTAGQQEKDQAGLHVAHTSVGIPGAFFNYEISPILISHAESRQSFAHFLTSTCAIV 961
Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
GGV TV+ +ID+ ++ R +KK
Sbjct: 962 GGVLTVASLIDSVLFVAGRTLKK 984
>gi|317025332|ref|XP_001388859.2| COPII-coated vesicle membrane protein Erv46 [Aspergillus niger CBS
513.88]
gi|350638031|gb|EHA26387.1| hypothetical protein ASPNIDRAFT_196625 [Aspergillus niger ATCC
1015]
Length = 438
Score = 194 bits (493), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 128/355 (36%), Positives = 174/355 (49%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG--RLEH-NETY 56
MD+SGEQ V H I K RL S G ID K L+ H +H + Y
Sbjct: 89 MDVSGEQQTGVVHGINKVRLTSAAE---------GGRVIDVKALELHSKDESAKHLDPDY 139
Query: 57 CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + S CCN C+EVREAY ++ WA + ++QC+ EG+ +RI + E
Sbjct: 140 CGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC + G L VNKV GNFH APG+SF +HVHD+ F + ++H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLANFFDADLPDAEKHTMTHEIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G P + NPLDG + P Y YF+KVV T Y +
Sbjct: 260 GPQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLFSSS 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
+ +I+++Q+SVT H RS S++G + L PGV
Sbjct: 320 IHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 380 FVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKK 434
>gi|302923326|ref|XP_003053651.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256734592|gb|EEU47938.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 437
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 126/352 (35%), Positives = 174/352 (49%), Gaps = 72/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + K RL Q G ID K L H H + +YCG
Sbjct: 89 MDVSGEQQHGVMHGVNKVRLQPQSK---------GGADIDSKSLSLHDDAAAHLDPSYCG 139
Query: 59 SCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA+ + CC C+EVREAY + WA + ++QC+RE + +++ + EGC
Sbjct: 140 GCYGAQPPANARKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYAEKLDAQREEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL----AFQRDSFNISHKINKLAFGEHF 170
I G L VNKV GNFHFAPG+SF +HVHD+ A + + + +H I+ L FG
Sbjct: 200 RIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDAPKGKAHDFTHIIHSLRFGPQL 259
Query: 171 PGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
P V NPLDG R + P+ + YF+K+VPT Y
Sbjct: 260 PDEVARKVGKGTPWTNHHQNPLDGTRQDIKDPNFNFMYFVKIVPTSYLPLGWDSKGLKIA 319
Query: 203 ------TDVSGH------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
T + + +++++Q+SVT H RS G R T +PGVFF
Sbjct: 320 GLLQDDTSLGAYGYAEDGSVETHQYSVTSHKRSLAGGNDAAEGHAERQHTSGGIPGVFFS 379
Query: 239 YDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 380 YDISPMKVVNREEKGKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 431
>gi|367052857|ref|XP_003656807.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
gi|347004072|gb|AEO70471.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
Length = 436
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 128/352 (36%), Positives = 172/352 (48%), Gaps = 73/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
MD+SGEQ V+H + K RL R G ID K L H + + +Y
Sbjct: 89 MDVSGEQQHGVQHGVTKTRL---------RPLSEGGGDIDSKALALHAADEAAIHLDPSY 139
Query: 57 CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA+ + CCN C+EV+EAY ++ WA D I+QC+RE + +R+ E+ E
Sbjct: 140 CGPCYGAKPPTTAKKPGCCNTCDEVKEAYAQQAWAFGRGDGIEQCEREHYGERLDEQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF VHVHD+ + +H I+ L FG
Sbjct: 200 GCRIEGGLRVNKVVGNFHIAPGRSFSNGNVHVHDLKNYWDTPTKHTFTHIIHHLRFGPQL 259
Query: 171 PGV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY------------ 202
P +NPLDG + + Y YFIK+VPT Y
Sbjct: 260 PDSLHKKLGTKHLPWTNHHLNPLDGTSQETDDVNFNYMYFIKIVPTSYLPLGWEKTWAGF 319
Query: 203 ------------TDVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
T G +++++Q+SVT H RS G RL +PGVFF
Sbjct: 320 REEHQAELGSFGTSADG-SVETHQYSVTSHKRSLAGGDDAAEGHRERLHAKGGIPGVFFS 378
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV EE +FL F+ +CAIVGG TV+ +D ++ G +KK
Sbjct: 379 YDISPMKVINREERSKTFLGFIAGLCAIVGGTLTVAAAVDRALFEGTVRLKK 430
>gi|325191973|emb|CCA26442.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 401
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 17/284 (5%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GE +D+ I RLD++GN I + +D + L N YCGSC
Sbjct: 115 MDVTGELQMDLHRSIGMTRLDAKGNPINT---------LDSAKEE---VLPAN--YCGSC 160
Query: 61 Y-GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
Y + CCN C+EV+EA+ L + D +QC RE ++ + + GEGC + G+
Sbjct: 161 YETVHPLGKTCCNTCDEVKEAFVANDLRLFDADQKEQCVREMTEEQRQAQAGEGCRLKGY 220
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
+ VN+VAGNFH G++FH+ G +H L Q FN S ++ L+FG + V N LDG
Sbjct: 221 MMVNRVAGNFHVGLGRTFHRKGKLIHQFLPGQESVFNASFLLHSLSFGTPYANVKNGLDG 280
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFF 238
++ + G+ +YF+K+VPT+Y+D+S ++ S Q+S T+ + + G++ LPG +F
Sbjct: 281 TQYITKKKGGVMKYFLKIVPTIYSDISS-SVHSYQYSHTKQEKYMNAMGQISGLPGAYFM 339
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
++ SP V E + F HF+ + AI+GG+ +++G +D+ I+H
Sbjct: 340 FEFSPFMVKIDSEQIPFTHFVIRIFAILGGMISIAGFVDSVIFH 383
>gi|340923948|gb|EGS18851.1| hypothetical protein CTHT_0054620 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 436
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 173/351 (49%), Gaps = 71/351 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V+H + K RL + G I+ ++ + + ++ L+ N YC
Sbjct: 89 MDVSGEQQHGVQHGVTKTRLRPWEEGGGDIDKKELALHS------IEESATHLDPN--YC 140
Query: 58 GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
GSCYGA + CC C+EVREAY + WA + I+QC+RE + +R+ ++ EG
Sbjct: 141 GSCYGANPPPNAVKPGCCQTCDEVREAYAQAAWAFGRGENIEQCQREHYAERLDQQRREG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKV GNFH APGKSF +HVHD+ + +H I+ L FG P
Sbjct: 201 CRIEGGLRVNKVVGNFHIAPGKSFSNGNMHVHDLKNYWESPVRHTFTHIIHHLRFGPQLP 260
Query: 172 GV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
VNPLD + + Y YFIK+VPT Y
Sbjct: 261 ESLHQKLGNKALPWSNHHVNPLDNTHQETDEVNFSYMYFIKIVPTSYLPLGWEKTWDQFR 320
Query: 203 -----------TDVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
T G +++++Q+SVT H RS G RL + +PGVFF Y
Sbjct: 321 EQHHAELGSFGTSADG-SVETHQYSVTSHRRSLSGGDDAAEGHSERLHSKGGIPGVFFSY 379
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV EE SFL FL +CAIVGG TV+ ID ++ G +KK
Sbjct: 380 DISPMKVINREERAKSFLGFLAGLCAIVGGTLTVAAAIDRALFEGTVRLKK 430
>gi|443734706|gb|ELU18587.1| hypothetical protein CAPTEDRAFT_139951 [Capitella teleta]
Length = 285
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 97/191 (50%), Positives = 123/191 (64%), Gaps = 15/191 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPKID----KPLQRHGGRLEHNE 54
MD+SGEQ +DV HDIFK+RLD G + E ++ +G D PL+ +
Sbjct: 92 MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGDKSKDFAVKNPLK---------D 142
Query: 55 TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C SCYGAES CCN C EVREAYR+KGWA + I+QC REG++ +++E + EGC
Sbjct: 143 DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGC 202
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
IYGFLEVNKVAGNFH APG+SF Q H+HD+ A Q FN+SH+I L+FG+ +PG V
Sbjct: 203 RIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQV 262
Query: 175 NPLDGVRWTQE 185
NPLD E
Sbjct: 263 NPLDASEQVTE 273
>gi|169770949|ref|XP_001819944.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus oryzae
RIB40]
gi|238486566|ref|XP_002374521.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|83767803|dbj|BAE57942.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|220699400|gb|EED55739.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
flavus NRRL3357]
gi|391874294|gb|EIT83200.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 436
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 127/352 (36%), Positives = 175/352 (49%), Gaps = 71/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V H I K RL S G+VI+ + + + Q L+ N YC
Sbjct: 89 MDVSGEQQTGVVHGINKVRLSSPAEGGHVIDVKALELHSE------QEAAKHLDPN--YC 140
Query: 58 GSCYGAESS--DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
G C G ++ CCN CEEVREAY ++ WA + I+QC+REG+ QR+ + EGC
Sbjct: 141 GDCGGVPQPGGEKRCCNTCEEVREAYAQQQWAFGKGENIEQCEREGYAQRLDAQRREGCR 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFGEH 169
+ G L VNKV GNFH APG+SF VHVHD+ + + ++H I++L FG
Sbjct: 201 LEGVLRVNKVVGNFHIAPGRSFTSGNVHVHDLENYFEGDLPDAEKHTMTHIIHQLRFGPQ 260
Query: 170 FPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------ 205
P + NPLD + P+ + YF+KVV T Y +
Sbjct: 261 LPDELSDRWQWTDHHHTNPLDSTQQETSDPAYNFMYFVKVVSTSYLPLGWDPLFSSAVHS 320
Query: 206 ---------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGVFFF 238
S +I+++Q+SVT H RS S++G + L PGVFF
Sbjct: 321 AYEDSPLGSHGIAYGSQSSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGVFFN 380
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 381 YDISPMKVINKEARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGALRVKK 432
>gi|443734710|gb|ELU18591.1| hypothetical protein CAPTEDRAFT_139954 [Capitella teleta]
Length = 285
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 97/191 (50%), Positives = 123/191 (64%), Gaps = 15/191 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPKID----KPLQRHGGRLEHNE 54
MD+SGEQ +DV HDIFK+RLD G + E ++ +G D PL+ +
Sbjct: 92 MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGDKSKDFAVKNPLK---------D 142
Query: 55 TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C SCYGAES CCN C EVREAYR+KGWA + I+QC REG++ +++E + EGC
Sbjct: 143 DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGC 202
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
IYGFLEVNKVAGNFH APG+SF Q H+HD+ A Q FN+SH+I L+FG+ +PG V
Sbjct: 203 RIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQV 262
Query: 175 NPLDGVRWTQE 185
NPLD E
Sbjct: 263 NPLDASEQVTE 273
>gi|402083890|gb|EJT78908.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 444
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 124/358 (34%), Positives = 172/358 (48%), Gaps = 77/358 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V+H + K RL Q G VI+ + + A D+ H + YC
Sbjct: 89 MDVSGEQQHGVQHGVVKVRLQPQSEGGGVIDVKALSLHA---DEDSATH-----LDPKYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA ++ CC+ C+EVREAY + WA + ++QC RE + +R+ E+ EG
Sbjct: 141 GPCYGAPAPSNAAKAGCCSTCDEVREAYAQASWAFGRGENVEQCLREHYAERLDEQRQEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEH 169
C I G L VNKV GNFH APG+SF +HVHD+ + + SH ++ L+FG
Sbjct: 201 CQIAGSLRVNKVIGNFHLAPGRSFSNGNMHVHDLKNYWDTPVDGGHSFSHVVHSLSFGPQ 260
Query: 170 FPGVV-------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-- 208
P V NPLDG P+ + YF+K+VPT Y +
Sbjct: 261 LPLEVQKRLDRGRSLPWADHSHQLNPLDGTSQETADPNFSFMYFLKIVPTSYLPLGWEGR 320
Query: 209 ------------------------TIQSNQFSVTEHFRS---------SEQGRLQT---L 232
++++Q+SVT H RS Q RL + +
Sbjct: 321 RAKIATGNHDKDSWVGTYGYSPDGAVETHQYSVTSHKRSLAGGDDAAEGHQERLHSKGGI 380
Query: 233 PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
PGVFF YD+SP+KV EE +F FLT +CAI+GG TV+ +D Y G +KK
Sbjct: 381 PGVFFSYDISPMKVINREERPKTFAGFLTGLCAILGGTLTVAAAVDRTFYEGATRLKK 438
>gi|358372047|dbj|GAA88652.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus kawachii
IFO 4308]
Length = 438
Score = 193 bits (491), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 127/355 (35%), Positives = 174/355 (49%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG--RLEH-NETY 56
MD+SGEQ V H I K RL S G ID K L+ H +H + Y
Sbjct: 89 MDVSGEQQTGVVHGINKVRLTSAAE---------GGRVIDVKALELHSKDESAKHLDPDY 139
Query: 57 CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + S CCN C+EVREAY ++ WA + ++QC+ EG+ +RI + E
Sbjct: 140 CGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC + G L VNKV GNFH APG+SF +HVHD+ F + + ++H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLATFFDAELPESERHTMTHEIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G P + NPLD + P Y YF+KVV T Y +
Sbjct: 260 GPQLPDELSDRWQWTDHHHTNPLDNTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLFSSS 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
+ +I+++Q+SVT H RS S++G + L PGV
Sbjct: 320 IHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 380 FVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKK 434
>gi|440299607|gb|ELP92159.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 361
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 104/287 (36%), Positives = 168/287 (58%), Gaps = 29/287 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---C 57
+D +GE +D++ ++ KKRL+ + + ++ Y C
Sbjct: 86 LDTTGEVSIDIESNVNKKRLNPHS------------------MTESSNKATAHKVYGIEC 127
Query: 58 GSCYGAESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
+C ES D++ CC C+E++E+Y+K G + P+ + QC+ + + +GEGC++
Sbjct: 128 PAC--EESVDKNKCCFTCDELKESYKKAGKEVP-PNAV-QCQLKNIQKMALALDGEGCHM 183
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YG + VN+V+GNFH APG S Q H H A S N++H N L+FG++FPG++ P
Sbjct: 184 YGSVFVNRVSGNFHIAPGMSEQQGEGHRHS--AEWIGSLNLTHTWNSLSFGDNFPGMIKP 241
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGV 235
+D ++ T + MYQYF++VVP Y + +++N +SVTEH+RS + Q +PGV
Sbjct: 242 MDSIQKVDVTNNSMYQYFVQVVPMTYFGLDKKVVKTNGYSVTEHYRSGNLKTMEQGVPGV 301
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
F Y++S ++V +TEE SF H LT +C IVGG+FT+ ++DAFI+H
Sbjct: 302 FVLYEISSMEVLYTEETGSFGHLLTGICGIVGGIFTIFSLLDAFIFH 348
>gi|189203047|ref|XP_001937859.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187984958|gb|EDU50446.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 437
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 175/356 (49%), Gaps = 78/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGN---VIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGE + V H I K RL + + VIE+ K L H H Y
Sbjct: 89 MDVSGELQMGVTHGINKVRLSPEADGSKVIET-----------KALDLHADEASHLAPDY 137
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + +CCN C+EVR+AY W+ + ++QC+RE + + + ++ E
Sbjct: 138 CGQCYGAPPPTNAKKPNCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQE 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
GC + G ++VNKV GNFHFAPGKSF +HVHD+ + +D + +H+I++L FG
Sbjct: 198 GCRLEGSIKVNKVVGNFHFAPGKSFSNGNLHVHDLENYFKDDYAHTFTHRIHQLRFGPQL 257
Query: 171 PGV---------------------VNPLDGVRWTQETPSGMYQYFIKVV----------- 198
V VNPLD + + Y YFIKVV
Sbjct: 258 SDVVVRDMQKKHLDSGHNGWSNHHVNPLDNTVQHTDEKAYNYMYFIKVVSTAYLPLGWEQ 317
Query: 199 ----PTVYTDVSGHT--------IQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
P+ Y+D+ G T I+++Q+SVT H RS + G R+ +PG
Sbjct: 318 EFPHPSKYSDILGTTIDESYKGSIETHQYSVTSHKRSLQGGTDEKDGHKERIHARGGIPG 377
Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VFF YD+SP+KV E SF FL +CA++GG TV+ ID +Y G IKK
Sbjct: 378 VFFSYDISPMKVVNREVREKSFSGFLVGLCAVIGGTLTVAAAIDRALYEGVNRIKK 433
>gi|325189930|emb|CCA24410.1| hypothetical protein BRAFLDRAFT_63528 [Albugo laibachii Nc14]
Length = 699
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 104/282 (36%), Positives = 157/282 (55%), Gaps = 8/282 (2%)
Query: 4 SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---CGSC 60
SGE H D++H + K+ +D G ++ + G+ I K + +T CGSC
Sbjct: 408 SGEIHHDIQHSVHKQAIDLNGKILSA---GMKLDSIGKAWTNQSDTVAEEKTVKVECGSC 464
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA +S E CCN CE+V++AY + W + + I+QC++ + + EGC IYG +
Sbjct: 465 YGAGASGE-CCNTCEDVQQAYASRRWNIPSLHTIEQCQKSEIEKLLHSTVEEGCRIYGSI 523
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG- 179
V KV G FAP K+ + +IL F+ SHKIN L FGE +P + +PL+G
Sbjct: 524 AVTKVHGKVLFAPAKALLSGYISTEEILDKTIKIFDTSHKINYLDFGERYPEMKSPLNGH 583
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
+ G YQYF++VVPT Y ++G I +NQ+SVT+H++ Q LP + F Y
Sbjct: 584 NTILPKGTRGTYQYFLQVVPTAYYYLNGGIIDTNQYSVTQHYQELTPLGEQQLPMITFQY 643
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
SPI + +L FLT++CAI+GGVFT+ G +D+ ++
Sbjct: 644 KFSPIMFQIEQRRRGYLQFLTSLCAILGGVFTMVGAVDSILF 685
>gi|336465550|gb|EGO53790.1| hypothetical protein NEUTE1DRAFT_151014 [Neurospora tetrasperma
FGSC 2508]
gi|350295150|gb|EGZ76127.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 444
Score = 193 bits (490), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 126/359 (35%), Positives = 176/359 (49%), Gaps = 79/359 (22%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
MD+SGEQ V+H + K RL Q G +ID K L H + +Y
Sbjct: 89 MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKILSLHAADESATHLDPSY 139
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CC+ CEEVREAY + WA + ++QC+RE + +R+ E+ E
Sbjct: 140 CGPCYGAPAPYNAKKPGCCSTCEEVREAYAQASWAFGDGATMEQCQREHYTERLAEQRHE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGE 168
GC I G L VNKV GNFH APG+SF +HVHD+ + + SH I+ L FG
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWSTPVPGGHSFSHIIHSLRFGP 259
Query: 169 HFPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----- 205
P +NPLD + + P+ + YF+K+VPT Y +
Sbjct: 260 QLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQETDDPNYNFMYFVKIVPTSYLPLGWEKQ 319
Query: 206 ----------------------SGHTIQSNQFSVTEHFRS------SEQG---RLQT--- 231
S +++++Q+SVT H RS S++G RL +
Sbjct: 320 AAQNKATWEQDHSVGLGAYGYGSDGSMETHQYSVTSHKRSLTGGDDSKEGHGERLHSRGG 379
Query: 232 LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+PGVFF YD+SP+KV EE SFL FL +CA+VGG TV+ +D ++ G +KK
Sbjct: 380 IPGVFFSYDISPMKVVNREERAKSFLGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 438
>gi|213409826|ref|XP_002175683.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
gi|212003730|gb|EEB09390.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
yFS275]
Length = 394
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 171/311 (54%), Gaps = 28/311 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG+ DV+H + K RLD GN+I IG+ + + + G E CG C
Sbjct: 89 MDISGDFQQDVQHSVTKTRLDKYGNIIAVIDSDIGSATDESAMDKDG------EVTCGDC 142
Query: 61 YGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
YGA + CCNNC+ VR+AY +K WA+ + D QC+ E + ++GEGCNI
Sbjct: 143 YGAGDAAPPETPGCCNNCKAVRDAYARKQWAIGDYDAFQQCRDENYKAEHASQKGEGCNI 202
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHF-PGV 173
G L VN+VAGNFHFAPG+SF H+HD+ + ++++ +++H I++L+FG P
Sbjct: 203 AGHLFVNRVAGNFHFAPGRSFQTQQGHLHDLRGYEEEQEAHDMTHMIHQLSFGPPIKPSA 262
Query: 174 --VNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
+PLDG + Y YFIK V V D + TI +N+FSVT+H RS GR
Sbjct: 263 EHTDPLDGHFKNTDDALHNYAYFIKCVAHKFVPLDPADPTINTNEFSVTQHERSVTGGRE 322
Query: 230 QT----------LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+PGVFF D+SP+ V + +F F++NV + +GG T++ ++D
Sbjct: 323 NDNPSHLNRRGGIPGVFFNIDISPMLVIQRQIRGNTFGGFISNVLSFLGGFITLTTLVDR 382
Query: 279 FIYHGQRAIKK 289
+Y + +KK
Sbjct: 383 GLYAAELKMKK 393
>gi|85115136|ref|XP_964815.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
gi|28926610|gb|EAA35579.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
Length = 444
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/359 (35%), Positives = 175/359 (48%), Gaps = 79/359 (22%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
MD+SGEQ V+H + K RL Q G +ID K L H + +Y
Sbjct: 89 MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKVLSLHAADESATHLDPSY 139
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CC+ CEEVREAY + WA + ++QC+RE + +R+ E+ E
Sbjct: 140 CGPCYGAPAPYNAKKPGCCSTCEEVREAYAQASWAFGDGATMEQCQREHYTERLAEQRHE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGE 168
GC I G L VNKV GNFH APG+SF +HVHD+ + + SH I+ L FG
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWSTPVPGGHSFSHIIHSLRFGP 259
Query: 169 HFPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----- 205
P +NPLD + P+ + YF+K+VPT Y +
Sbjct: 260 QLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQETNDPNYNFMYFVKIVPTSYLPLGWEKQ 319
Query: 206 ----------------------SGHTIQSNQFSVTEHFRS------SEQG---RLQT--- 231
S +++++Q+SVT H RS S++G RL +
Sbjct: 320 AAQNKAAWEQDHSVGLGAYGYGSDGSMETHQYSVTSHKRSLTGGDDSKEGHGERLHSRGG 379
Query: 232 LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+PGVFF YD+SP+KV EE SFL FL +CA+VGG TV+ +D ++ G +KK
Sbjct: 380 IPGVFFSYDISPMKVVNREERAKSFLGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 438
>gi|323449476|gb|EGB05364.1| hypothetical protein AURANDRAFT_30967 [Aureococcus anophagefferens]
Length = 368
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/295 (38%), Positives = 164/295 (55%), Gaps = 20/295 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++G+ H ++ + K+RLD +G+ I R A + + HG E C SC
Sbjct: 81 MDVAGDYHPYMEQHMTKQRLDGRGSPIPHRAIPERANEYE-----HGP--EDTGAGCQSC 133
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSN-----PDLIDQCKREGFLQRIKEEEGEGCN 115
+GAE++++ CCN C+E+ AY KGW+ P +D R+ ++ IK+ GEGCN
Sbjct: 134 FGAETAEQPCCNTCDELLRAYGNKGWSAQEIKKEAPQCVDD-TRDDSIRAIKK--GEGCN 190
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
+ G+LEVNKVAGN H A G+S Q+G VH + FN+SH I+ LAFGE + G+
Sbjct: 191 LAGWLEVNKVAGNVHVAMGESAIQNGRFVHQFDPTRAPEFNVSHVIHDLAFGETYDGMAL 250
Query: 176 PLDGVRWTQE--TPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRS--SEQGRLQ 230
PL G + T +G++QYFIK+VPT+Y +++ ++S T+ FR ++
Sbjct: 251 PLSGTSRIVDAATGTGLFQYFIKLVPTIYRAAPDAAPVRTVRYSYTQRFRPLHNQPPPTA 310
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
LPG+F YD S V T S HFL VCAIVGGV TV +D + +R
Sbjct: 311 MLPGIFLVYDFSAFMVEVTRHRSSLAHFLVRVCAIVGGVSTVVAFVDWAVVRAKR 365
>gi|258565913|ref|XP_002583701.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237907402|gb|EEP81803.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 435
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 171/351 (48%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ + H I K RL G+V++++ + K D+ + + YC
Sbjct: 89 MDVSGEQQSGLIHGIKKVRLGPASEGGHVLDAQT--LDLHKKDEVA------VHLDPEYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
GSCY + + CCN C+EVREAY +GWA + + QC+REG+ RI + EG
Sbjct: 141 GSCYDGVPPPNAQKQGCCNTCDEVREAYASRGWAFGRGEGVAQCEREGYGARIDAQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C + G L VNKV GNFH APG+SF +H HD+ + ++H I++L FG P
Sbjct: 201 CRLEGILRVNKVIGNFHIAPGRSFTNGYMHAHDLKIYHETPVKHTMAHIIHQLRFGPQLP 260
Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
+ NPLD T E P + YF+KVV T Y +
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKYNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320
Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
+I+++Q+SVT H RS E G R+ T +PGVFF Y
Sbjct: 321 ASDAPLGKQGIQLGRHGSIETHQYSVTSHKRSVEGGDDSAEGHKERIHTAGGIPGVFFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV E SF FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 381 DISPMKVINREARTKSFSGFLTGVCAVIGGTLTVAAAIDRMLYEGAVRVKK 431
>gi|363752862|ref|XP_003646647.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
gi|356890283|gb|AET39830.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
DBVPG#7215]
Length = 399
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 168/314 (53%), Gaps = 35/314 (11%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SGE LD++ F K RLD G I + + +G+ K L + YCGS
Sbjct: 88 MDTSGEVQLDLQDAGFTKTRLDHSGTPIRTEKLEVGSNK--------AVHLPDDPNYCGS 139
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYG++S D + CC CEEVREAY +KGWA + I+QC REG++++I +
Sbjct: 140 CYGSKSQDNNDALPKEQKVCCQTCEEVREAYSEKGWAFFDGQKIEQCIREGYVEKINSQL 199
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDS-FNISHKINKLAFGE 168
EGC + G ++N++ GN HFAPG++ + H HD+ + S N +H I+KL+FG
Sbjct: 200 HEGCRVKGSAKLNRIQGNIHFAPGRTTNSGKRTHTHDVSLYDTHSHLNFNHIIHKLSFGS 259
Query: 169 HFPGVV-NPLDG---VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
G + NPLDG + + + YF K+VPT Y + G +++ QFSVT H R
Sbjct: 260 DADGALSNPLDGHKNIIQGDDAHFSTFSYFTKIVPTRYEYLDGRKLETTQFSVTTHSRPL 319
Query: 225 EQGRLQTLP----------GVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVS 273
+ G+ P GV F+++SP+KV +E+H +++ F+ N +G V V
Sbjct: 320 KGGKDDDHPNTIHHRGGIAGVTIFFEMSPLKVINSEKHAITWSGFVLNCITSIGSVLAVG 379
Query: 274 GIIDAFIYHGQRAI 287
+ID Y QR+I
Sbjct: 380 TVIDKITYRAQRSI 393
>gi|440636941|gb|ELR06860.1| hypothetical protein GMDG_08151 [Geomyces destructans 20631-21]
Length = 441
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/355 (35%), Positives = 175/355 (49%), Gaps = 74/355 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGG--RLEH-NETYC 57
MD+SGE VKH + K RL+S D G K L H + H + +YC
Sbjct: 89 MDVSGEMQTGVKHGVSKVRLNSP--------DAGGGAIDVKALDLHSTEEKAAHLDPSYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + CCN C+EVR+AY WA + ++QC+RE + +R+ E+ EG
Sbjct: 141 GQCYGATPPPNAQKAGCCNTCDEVRDAYASASWAFGRGENVEQCEREHYSERLDEQRKEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-----RDSFNISHKINKLAFGE 168
C I G + VNKV GNFH APG+S+ +HVHD+ + + +H I+ + FG
Sbjct: 201 CRIEGGVRVNKVIGNFHIAPGRSYSNGNMHVHDLANYWDTPSLERGHSFAHTIHHVRFGP 260
Query: 169 HFP-GV---------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY---------- 202
P G+ +NPLDG + P+ Y YF+KVV T Y
Sbjct: 261 QLPEGLSKKFGGKNQPWTNHHLNPLDGTQQHTRDPAFNYMYFVKVVSTSYLPLGWNSKSA 320
Query: 203 --TDVS---------GH----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
T +S GH +++++Q+SVT H RS G RL + +PGV
Sbjct: 321 AKTQISEENIGLGAYGHAVDGSVETHQYSVTSHKRSLSGGDDGAEGHKERLHSRTGIPGV 380
Query: 236 FFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV EE L F+T +CAIVGG TV+ +D +Y G IKK
Sbjct: 381 FFSYDISPMKVINREERTKTLSGFITGLCAIVGGTLTVAAAVDRGLYEGVSRIKK 435
>gi|378732932|gb|EHY59391.1| hypothetical protein HMPREF1120_07381 [Exophiala dermatitidis
NIH/UT8656]
Length = 437
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/356 (36%), Positives = 170/356 (47%), Gaps = 78/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRH-----GGRLEHNE 54
MD+SGEQ V H + K RL S G+ ID + LQ H L+ +
Sbjct: 89 MDVSGEQQSGVVHGVNKVRLTSVAE---------GSRVIDTQALQLHQQAEVSSHLDPD- 138
Query: 55 TYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
YCGSCY A + CCN C+EVREAY WA + ++QC+REG+ R+ E+
Sbjct: 139 -YCGSCYSAPAPPNAKKPGCCNTCDEVREAYAANSWAFGRGEGVEQCEREGYGARLDEQR 197
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAF 166
EGC I G + VNKV GNFH APG+SF +HVHD+ F +H+I+ L F
Sbjct: 198 HEGCRIEGVIRVNKVVGNFHIAPGRSFSNGNMHVHDLNNFFDTPIEGGHTFTHEIHSLRF 257
Query: 167 GEHFPGV------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G NPLDG+R + P + YFIKVV T Y +
Sbjct: 258 GPQLSDQEAKWTGADHHLNANPLDGLRQETDEPGYNFMYFIKVVSTSYLPLGWDEDKSIQ 317
Query: 206 -------------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
S +I+++Q+SVT H RS G RL +PG
Sbjct: 318 QHSSLSDLIPLGMHGKGAGSQGSIETHQYSVTSHKRSLAGGNDAAEGHKERLHAHGGIPG 377
Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VFF YD+SP+KV E SF +FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 378 VFFSYDISPMKVINREVRPKSFANFLTGVCAVIGGTLTVAAAIDRGLYEGATRLKK 433
>gi|171696240|ref|XP_001913044.1| hypothetical protein [Podospora anserina S mat+]
gi|170948362|emb|CAP60526.1| unnamed protein product [Podospora anserina S mat+]
Length = 437
Score = 191 bits (484), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V+H + K RL G VIE++ + A L+ N YC
Sbjct: 89 MDVSGEQQHGVQHGVVKTRLRPLSEGGGVIEAKALALHA------RDEEAAHLDPN--YC 140
Query: 58 GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + +CC C+EV+EAY + WA + I+QC+RE + +++ E+ EG
Sbjct: 141 GPCYGAAPPVHAQKPNCCQTCDEVKEAYAAQAWAFGRGEGIEQCEREHYAEKLDEQRNEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G + VNKV GNFH APGKSF +HVHD+ + +H+I+ L FG P
Sbjct: 201 CRIEGNVRVNKVIGNFHIAPGKSFSNGNMHVHDLKNYWDTPVKHTFTHEIHHLRFGPQLP 260
Query: 172 -GV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G+ VNPLD + + + YFIK+VPT Y +
Sbjct: 261 DGLAKKLGKNKALPWTNHHVNPLDNTHQETDDVNYNFMYFIKIVPTSYLPLGWEKTWQGF 320
Query: 206 --------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
+ +++++Q+SVT H RS G RL +PGVFF Y
Sbjct: 321 KDQHHKELGSFGQSADGSLETHQYSVTSHRRSLSGGDDGSEGHKERLHAKGGIPGVFFSY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV EE SFL FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 381 DISPMKVINREERPKSFLGFLAGLCAIVGGTLTVAAAVDRALFEGGMKLKK 431
>gi|406606433|emb|CCH42207.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 405
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 112/322 (34%), Positives = 173/322 (53%), Gaps = 40/322 (12%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLEHNETYCG 58
MD+SG+ LDV + F K RL G I + IG HG + YCG
Sbjct: 89 MDVSGDLQLDVTNYGFTKIRLTETGEEIGEEEMKIG--------DDHGHADADIPADYCG 140
Query: 59 SCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
CYGA++ D++ CCN+C+ VR+AY GWA + ++QC+REG++++I +
Sbjct: 141 PCYGAKNQDKNENKPQEEKVCCNDCDSVRKAYASVGWAFFDGKNVEQCEREGYVKKINDR 200
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFG- 167
GEGC + G ++N++ GN HFAPG S+ HVHD+ + ++ FN H IN +FG
Sbjct: 201 LGEGCRVKGTAKLNRINGNIHFAPGASYSAPNRHVHDLSLYGKNKDFNFRHVINHFSFGP 260
Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
E +PLDG Q + +Y YF+KVVPT Y ++G +++NQFS T
Sbjct: 261 DVNSKYTAETLELSSHPLDGTNAIQGSRDHLYSYFLKVVPTRYEYLNGTKVETNQFSSTY 320
Query: 220 HFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGG 268
H R GR + +PG+FF +++SP+K+ E + S+ FL NV + +GG
Sbjct: 321 HDRPLTGGRDEDHPNTFHARGGIPGLFFHFEMSPLKIINKETYGTSWSGFLLNVISAIGG 380
Query: 269 VFTVSGIIDAFIYHGQRAIKKK 290
+ TV ++D ++ + I++K
Sbjct: 381 ILTVGAVVDRTVFVADKVIRRK 402
>gi|396471326|ref|XP_003838845.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
gi|312215414|emb|CBX95366.1| similar to endoplasmic reticulum-golgi intermediate compartment
protein 3 [Leptosphaeria maculans JN3]
Length = 439
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 174/356 (48%), Gaps = 76/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGE + + H I K RL + + G+ ID KPL H H + +YCG
Sbjct: 89 MDVSGELQMGITHGINKVRLSPEVD---------GSKVIDAKPLDLHQDEASHLDPSYCG 139
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
+CYGA + CCN C+EVR+AY W+ + ++QC+RE + + + E+ EGC
Sbjct: 140 NCYGAPPPTNAIKHGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF-- 170
+ G ++VNKV GNFH APGKSF +HVHD+ + RD + +HKI+ L FG
Sbjct: 200 RLEGSIKVNKVVGNFHIAPGKSFSNGNLHVHDLENYFRDEYAHTFTHKIHHLRFGPQLSQ 259
Query: 171 --------------PGV-----VNPLDGVRWTQETPSGMYQYFIKVVPTVYT-------- 203
PG VNPLD + + Y YFIKVV T Y
Sbjct: 260 AVVQDMAKKHMATGPGGWTNHHVNPLDHTEQRTDEKAFNYMYFIKVVSTAYLPLGWEKSA 319
Query: 204 ---------DVSGHTIQS--------NQFSVTEHFRSSEQG---------RLQT---LPG 234
D+ G TI S +Q+SVT H RS + G R+ +PG
Sbjct: 320 DGSSSGGYDDLLGTTIHSVNKGSIETHQYSVTSHKRSLQGGSDEKEGHKERIHARGGIPG 379
Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VFF YD+SP+KV E +F FL +CA++GG TV+ +D +Y G IKK
Sbjct: 380 VFFSYDISPMKVINREMREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKK 435
>gi|242803029|ref|XP_002484091.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
gi|218717436|gb|EED16857.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
stipitatus ATCC 10500]
Length = 440
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 122/353 (34%), Positives = 174/353 (49%), Gaps = 71/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLD--SQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD+SGEQ + V H + K RL ++G + I K++ Q + N YCG
Sbjct: 89 MDVSGEQQMGVVHGLNKVRLSPVAEGGKV------IDVAKLELHAQNEVA-VHLNPEYCG 141
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C GA ++ CCN CEEVREAY K WA + I+QC+REG+ ++I + EGC
Sbjct: 142 QCGGAPPPPNTNKPGCCNTCEEVREAYALKSWAFGKGENIEQCQREGYAEKINAQRREGC 201
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ------RDSFNISHKINKLAFGE 168
I G + VNKV GNFH APG+SF +HVHD+ + + +SH I++L FG
Sbjct: 202 RIEGDIRVNKVIGNFHIAPGRSFSTGNMHVHDLDTYMDRELSDNEKHTMSHIIHQLRFGP 261
Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
+ NPLD + + P+ Y Y+IKVV T Y +
Sbjct: 262 QLSDELSRRWQWTDHHHTNPLDDTQQFTDEPAYNYNYYIKVVSTSYLPLGWDSSQSDQLH 321
Query: 206 ----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
+ +++++Q+SVT H RS G R+ +PGVFF
Sbjct: 322 GDDQSTPLGLHGAVHGAAGSLETHQYSVTSHKRSLHGGNDAAEGHKERVHAEGGIPGVFF 381
Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCA++GG TV+ +D F+Y G R ++K
Sbjct: 382 NYDISPMKVVNREVRPKTFTGFLTGVCAVIGGTLTVAAAVDRFLYEGSRRMRK 434
>gi|121702771|ref|XP_001269650.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
gi|119397793|gb|EAW08224.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
clavatus NRRL 1]
Length = 438
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 127/355 (35%), Positives = 169/355 (47%), Gaps = 75/355 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SGEQ + V H + K RL S G+V++ R D ++ K L + Y
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLSSPAEGGHVLDIRSLDLHSKDEVAKHL---------DPNY 139
Query: 57 CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG C GA+ + CCN C+EVREAY K WA I+QC+REG+ RI + E
Sbjct: 140 CGDCGGADPLPGAIKPGCCNTCDEVREAYAAKNWAFGKGANIEQCEREGYTARIDAQRRE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
GC + G L VNKV GNFH APG+SF +HVHD A+ + H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTNGNIHVHDTQAYFDLDLPDDAKHTMEHEIHQLRF 259
Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
G P + NPLD P+ + YF+KVV T Y +
Sbjct: 260 GPQLPDELSARWQWTDHHHTNPLDNTHQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319
Query: 206 ------------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
+ +I+++Q+SVT H RS G RL +PGV
Sbjct: 320 LHSTYEKAPLGAHGIGYGASGSIETHQYSVTSHKRSLRGGDAEDEGHKERLHAANGIPGV 379
Query: 236 FFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
FF YD+SP+KV E L FLT VCAI+GG TV+ ID +Y G +KK
Sbjct: 380 FFNYDISPMKVINREARPKTLSSFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434
>gi|440473660|gb|ELQ42442.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae Y34]
gi|440486294|gb|ELQ66175.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Magnaporthe oryzae P131]
Length = 444
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 172/358 (48%), Gaps = 77/358 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V+H + K RL Q G VI+++ + A L+ N YC
Sbjct: 89 MDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAKTLALHAE------DEAATHLDPN--YC 140
Query: 58 GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + CCN C+EVREAY + WA + ++QC RE + +R+ E+ EG
Sbjct: 141 GGCYGAPAPANAKKAGCCNTCDEVREAYAQASWAFGRGENVEQCTREHYAERLDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGEH 169
C I G L VNKV GNFH APG+SF +HVHD+ + + SH I+ L FG
Sbjct: 201 CQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPVEGGHSFSHTIHSLRFGPQ 260
Query: 170 FPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--- 208
P +NPLDGV T P+ Y YF+K+VPT Y +
Sbjct: 261 LPPSALEKLGNKDKNMPWTNHHINPLDGVIQTTVDPNFNYMYFVKIVPTSYLPLGWEKRT 320
Query: 209 -------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
+++++Q+SVT H RS G R+ + +PGVFF
Sbjct: 321 HLATMHDHGVGTYGYSGDGSVETHQYSVTSHKRSLAGGDDGEDGHKERMHSRGGIPGVFF 380
Query: 238 FY-----DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Y D+SP+KV E +F FLT +CAI+GG TV+ ID + G IKK
Sbjct: 381 SYPFCPQDISPMKVINREVRTKTFAGFLTGLCAILGGTLTVAAAIDRMTFEGVTRIKK 438
>gi|256078219|ref|XP_002575394.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
gi|353230384|emb|CCD76555.1| serologically defined breast cancer antigen ny-br-84-related
[Schistosoma mansoni]
Length = 338
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 102/255 (40%), Positives = 145/255 (56%), Gaps = 12/255 (4%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD +G Q L+V H+++K + G + D + D + YCGSC
Sbjct: 90 MDTTGAQQLNVMHEVYKTSVSVDGTPVS---DSVRHAVNDAS----ALTTTRDPNYCGSC 142
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG-EGCNIYGF 119
YGAES CCN CEEV+ AY + W N +QC++E + IK++ G EGC I+G
Sbjct: 143 YGAESPSRKCCNTCEEVQMAYNEMRWIFVNISAFEQCRKENW-NEIKQKIGNEGCRIHGN 201
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
L VN+V G FH APG S+ ++ H H + FN+SH I +L FGE +PG VNPLDG
Sbjct: 202 LTVNRVGGAFHIAPGHSYTENHAHFHSFQSLGPVQFNVSHSIGELRFGESYPGQVNPLDG 261
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVF 236
+ +T S M Y++K+VPT+Y + + T+ +NQ+S T H + + G Q LPGVF
Sbjct: 262 TKLAVQTHSQMVIYYLKLVPTMYISLRRNESTVITNQYSATWHSKGTPLTGDGQGLPGVF 321
Query: 237 FFYDLSPIKVTFTEE 251
F Y+++P+ V TEE
Sbjct: 322 FNYEIAPLLVKITEE 336
>gi|336265645|ref|XP_003347593.1| hypothetical protein SMAC_04901 [Sordaria macrospora k-hell]
gi|380096460|emb|CCC06508.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 428
Score = 190 bits (482), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 170/347 (48%), Gaps = 71/347 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
MD+SGEQ V+H + K RL Q G +ID K L H + +Y
Sbjct: 89 MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKVLALHAADESATHLDPSY 139
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CC+ CEE+REAY + WA + ++QC+RE + +R+ E+ E
Sbjct: 140 CGPCYGAPAPYNAKKAGCCSTCEEIREAYAQASWAFGDGSTMEQCQREHYTERLAEQRHE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKI--------N 162
GC I G L VNKV GNFH APG+SF +HVHD+ + ++ K+ N
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWNSPLPDDLVRKLGGGKDGKRN 259
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------------- 205
L H NPLD R + P+ + YF+K+VPT Y +
Sbjct: 260 TLWTNHHL----NPLDNTRQETDDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKASWDQDH 315
Query: 206 ----------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSP 243
S +++++Q+SVT H RS G RL + +PGVFF YD+SP
Sbjct: 316 SVGLGVFGQGSDGSMETHQYSVTSHKRSLAGGDDAKEGHGERLHSRGGIPGVFFSYDISP 375
Query: 244 IKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+KV EE SF+ FL +CA+VGG TV+ +D ++ G +KK
Sbjct: 376 MKVVNREERAKSFIGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 422
>gi|425772976|gb|EKV11354.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum PHI26]
gi|425782132|gb|EKV20058.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
digitatum Pd1]
Length = 438
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 173/354 (48%), Gaps = 73/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ + V H + K RL + G VI+ + + +P +H + YC
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLSPRNEGGKVIDVQALDLHSPS---EAAKH-----LDPEYC 140
Query: 58 GSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G C GA CC CEEVR+AY +K WA + I+QC REG+ +R+ E+ EG
Sbjct: 141 GECGGATPPPNVIKPGCCTTCEEVRQAYAEKQWAFGDGSNIEQCTREGYAERLAEQRREG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFG 167
C I G L+VNKV GNFH APG+SF +HVHD+ + + +SH +++L FG
Sbjct: 201 CRIEGVLKVNKVIGNFHIAPGRSFTTGNMHVHDLDTYIDPNAGPAEQHTMSHLVHELRFG 260
Query: 168 EHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV---------- 205
P + NPLD + + P+ + YF+KVV T Y +
Sbjct: 261 PQLPAELAGRWGWTDHHHTNPLDDTKQETDEPAYNFLYFVKVVSTSYLPLGWDPQFSTAI 320
Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
+ +I+++Q+SVT H R G R+ +PGVF
Sbjct: 321 HNAYDKAPLGYHGLAYGTQGSIEAHQYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPGVF 380
Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F +FLT VCAI+GG TV+ +D +Y G +KK
Sbjct: 381 FNYDISPMKVVNREARPKTFTNFLTGVCAIIGGTLTVAAALDRGVYEGAMRVKK 434
>gi|342874382|gb|EGU76396.1| hypothetical protein FOXB_13074 [Fusarium oxysporum Fo5176]
Length = 439
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 172/354 (48%), Gaps = 74/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ V H + K RL G ID K L H +H + +YCG
Sbjct: 89 MDVSGEQQHGVMHGVNKVRLQPANQ---------GGAVIDIKSLALHDESADHLDPSYCG 139
Query: 59 SCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CYGA+ + CC C+EVREAY + WA + ++QC+RE + +++ + EGC
Sbjct: 140 GCYGAQPPANARKAGCCQTCDEVREAYAQSSWAFGRGEGVEQCEREHYGEKLDAQREEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
I G L VNKV GNFHFAPG+SF +HVHD+ + + S + +H I+ L FG
Sbjct: 200 RIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGKSHDFTHYIHSLRFGPQL 259
Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----------- 203
P + NPLD R P+ + YF+K+VPT Y
Sbjct: 260 PDNIAKKVGTKSSLWTNHHQNPLDNTRQEIHDPNFNFMYFVKIVPTSYLPLGWDSKGIKI 319
Query: 204 ------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
D +G +++++Q+SVT H RS G R T +PGVF
Sbjct: 320 AGLLQDDNAGLGAYGYSEDGSVETHQYSVTSHKRSLAGGNDAAEGHAERQHTSGGIPGVF 379
Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV EE +F FL +CAIVGG TV+ +D ++ G IKK
Sbjct: 380 FSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARIKK 433
>gi|119189667|ref|XP_001245440.1| hypothetical protein CIMG_04881 [Coccidioides immitis RS]
gi|392868334|gb|EAS34105.2| COPII-coated vesicle membrane protein Erv46 [Coccidioides immitis
RS]
Length = 435
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/351 (34%), Positives = 168/351 (47%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V H + K RL + G+ ++ + +DK R L + YC
Sbjct: 89 MDVSGEQQSGVIHGVNKVRLSAASEGGHALD-----VETLDLDK---RDQAPLHLDPAYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
GSCY + CCN C+EVREAY + WA + ++QC++EG+ +I + EG
Sbjct: 141 GSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWAFGRGEGVEQCEQEGYGSKIDSQRNEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C + G L VNKV GNFH APG+SF +H HD+ + +SH I++L FG P
Sbjct: 201 CRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDLKTYYETPVKHTMSHIIHQLRFGPQLP 260
Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
+ NPLD T E P + YF+KVV T Y +
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320
Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
+I+++Q+SVT H RS E G R+ T +PGVFF Y
Sbjct: 321 SSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIEGGDDSAEGHKERVHTAGGIPGVFFNY 380
Query: 240 DLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV E L FLT VCA++GG TV+ +D +Y G +KK
Sbjct: 381 DISPMKVINREARTKSLSGFLTGVCAVIGGTLTVAAAVDRALYEGSVRVKK 431
>gi|67524561|ref|XP_660342.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|40743850|gb|EAA63036.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
gi|259486349|tpe|CBF84116.1| TPA: COPII-coated vesicle membrane protein Erv46, putative
(AFU_orthologue; AFUA_1G05120) [Aspergillus nidulans
FGSC A4]
Length = 437
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/354 (35%), Positives = 167/354 (47%), Gaps = 74/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGEQ + V H + K RL G +D + LQ H +H + YCG
Sbjct: 89 MDVSGEQQVGVAHGVNKVRLAPAAE---------GGRVLDVQALQLHAEEAKHLDPDYCG 139
Query: 59 SCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C GA CC+ C+EVREAY +K W I+QC+RE + +RI + EGC
Sbjct: 140 ECGGAPPPPNAIKPGCCSTCDEVREAYAQKQWGFGKGTNIEQCEREHYSERIDAQRREGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN------ISHKINKLAFGE 168
+ G + VNKV GNFH APG+SF + VH+HDI ++ + +SH I+ L FG
Sbjct: 200 RLEGVIRVNKVVGNFHIAPGRSFSSNNVHIHDIANYEERGLSPAEQHTMSHIIHSLRFGP 259
Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
P + NPLD P+ + YFIKVV T Y +
Sbjct: 260 QLPDELSDRWQWTDHHHTNPLDSTSQEAPEPAYSFMYFIKVVSTSYLPLGWDPLYSASLH 319
Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
S +I+++Q+SVT H RS G R+ +PGVF
Sbjct: 320 AAADTNTPLGAQGLSAGSQGSIETHQYSVTSHKRSLRGGDASDEAHKERIHAAGGIPGVF 379
Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F FLT VCAIVGG TV+ ID +Y G ++K
Sbjct: 380 FNYDISPMKVINREARPKTFTGFLTGVCAIVGGTLTVAAAIDRTLYEGVSRVRK 433
>gi|452842116|gb|EME44052.1| hypothetical protein DOTSEDRAFT_71753 [Dothistroma septosporum
NZE10]
Length = 436
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 169/353 (47%), Gaps = 73/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE V H + K RL + G IE + +G + + L + YC
Sbjct: 89 MDVSGEVQTGVMHGVNKVRLRPEAEGGGEIEKKALDLGVEEAAQHL---------DPDYC 139
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA ++ CCN C EVREAY W+ + ++QC+RE + + + + EG
Sbjct: 140 GECYGAPAPSNAAKPGCCNTCAEVREAYAGVSWSFGRGENVEQCEREHYSEHLDAQRKEG 199
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI----SHKINKLAFGEH 169
C I G + VNKV GNFHFAPGKSF +HVHD+ F I +HKI+ L FG
Sbjct: 200 CRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENFFNSPEGIQHTFTHKIHSLRFGPQ 259
Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
P V NPLDG E S + YF+KVV T Y ++
Sbjct: 260 LPDDVVNKVGKRGIAWSEHHLNPLDGTSQVTEEKSYNFMYFVKVVSTAYLPLAWKPSGSL 319
Query: 207 -----------------GHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
G +I+++Q+SVT H RS + G RL +PGVFF
Sbjct: 320 LDLPHELVELGGYGKGEGGSIETHQYSVTSHKRSLQGGDANEEGHKERLHARGGIPGVFF 379
Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT V A++GG TV+ +D +Y G + ++K
Sbjct: 380 SYDISPMKVVNREARTKTFTGFLTGVAAVIGGTLTVAAAVDRLMYEGGQRVRK 432
>gi|408400673|gb|EKJ79750.1| hypothetical protein FPSE_00030 [Fusarium pseudograminearum CS3096]
Length = 439
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 78/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETY 56
MD+SGEQ V H + K RL +SQG + ID K L H H + +Y
Sbjct: 89 MDVSGEQQHGVMHGVNKVRLQPESQGGAV-----------IDTKSLSLHDDAAHHLDPSY 137
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CC C+EVREAY + WA + ++QC+RE + +++ + E
Sbjct: 138 CGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYGEKLDAQRSE 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
GC I G L VNKV GNFHFAPG+SF +HVHD+ + + S + +H ++ L FG
Sbjct: 198 GCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGFSHDFTHIVHSLRFGP 257
Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
P + NPLD R P+ + YF+K+VPT Y
Sbjct: 258 QLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPNYNFMYFVKIVPTSYLPLGWDKKGI 317
Query: 204 --------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
D +G +++++Q+SVT H RS G R T +PG
Sbjct: 318 KIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRRSLAGGNDAAEGHAERQHTSGGIPG 377
Query: 235 VFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VFF YD+SP+KV EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 378 VFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 433
>gi|330919615|ref|XP_003298687.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
gi|311327999|gb|EFQ93219.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
Length = 437
Score = 188 bits (478), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/353 (34%), Positives = 172/353 (48%), Gaps = 72/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGE + V H I K RL + DG A +I K + H H YCG
Sbjct: 89 MDVSGELQMGVTHGINKVRLSPEA-------DGSKAIEI-KAVDLHTDEASHLAPDYCGQ 140
Query: 60 CYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CYGA + CCN C+EVR+AY W+ + ++QC+RE + + + ++ EGC
Sbjct: 141 CYGAPAPSNAKKPTCCNTCDEVRDAYASVSWSFGRGEGVEQCEREHYAEHLDQQRQEGCR 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
+ G ++VNKV GNFHFAPGKSF +HVHD+ + +D + +H I++L FG V
Sbjct: 201 LEGNIKVNKVVGNFHFAPGKSFSNGNLHVHDLENYFKDEYTHTFTHHIHQLRFGPQLSDV 260
Query: 174 V---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------- 202
V NPLD + + Y YFIKVV TVY
Sbjct: 261 VVQNMQKKHQESGIGGWSNHHINPLDETMQHTDEKAYNYMYFIKVVTTVYLPLGWEKVFP 320
Query: 203 -----TDVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
+D+ G T I+++Q+SVT H RS + G + +PGVFF
Sbjct: 321 HPSKFSDILGATIDESYKGSIETHQYSVTSHKRSLQGGNDEKDGHKERIHARGGIPGVFF 380
Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP++V E +F FL +CA++GG TV+ ID +Y G IKK
Sbjct: 381 SYDISPMEVINREVREKTFSGFLVGLCAVIGGTLTVAAAIDRALYEGVNRIKK 433
>gi|46105482|ref|XP_380545.1| hypothetical protein FG00369.1 [Gibberella zeae PH-1]
Length = 444
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 78/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETY 56
MD+SGEQ V H + K RL +SQG + ID K L H H + +Y
Sbjct: 89 MDVSGEQQHGVMHGVNKVRLQPESQGGAV-----------IDTKSLSLHDDAAHHLDPSY 137
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + CC C+EVREAY + WA + ++QC+RE + +++ + E
Sbjct: 138 CGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYGEKLDAQRSE 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
GC I G L VNKV GNFHFAPG+SF +HVHD+ + + S + +H ++ L FG
Sbjct: 198 GCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGFSHDFTHIVHSLRFGP 257
Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
P + NPLD R P+ + YF+K+VPT Y
Sbjct: 258 QLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPNYNFMYFVKIVPTSYLPLGWDKKGI 317
Query: 204 --------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
D +G +++++Q+SVT H RS G R T +PG
Sbjct: 318 KIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRRSLAGGNDAAEGHAERQHTSGGIPG 377
Query: 235 VFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
VFF YD+SP+KV EE +F FL +CAIVGG TV+ +D ++ G +KK
Sbjct: 378 VFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 433
>gi|261327856|emb|CBH10834.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 405
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/320 (36%), Positives = 163/320 (50%), Gaps = 31/320 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-LEHNETYCGS 59
+D GE +V D + R++ V G P +D Q G EH + C S
Sbjct: 90 IDAFGEHVENVLTDTARVRVNPDTLV----PLGEARPLMDMKKQPADGNGAEHGK--CPS 143
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYG 118
CYGAES+ DCC+ C++VR A+ ++ W D I QC E EGCN++
Sbjct: 144 CYGAESNPGDCCHTCDDVRRAFAERQWEFHEDDASIVQCVHERLKMAAASASTEGCNLHA 203
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
V +V GN HF PG+ F+ G H+H N+SH ++ L FGE FPG NP+D
Sbjct: 204 SFSVPRVTGNIHFIPGRMFNFFGQHLHSFKGETIQKLNLSHIVHSLEFGERFPGQSNPMD 263
Query: 179 GVRWTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS---- 224
G+ + E G + YF+KVVPTVY S G ++SNQ+SVT HF S
Sbjct: 264 GMANVRGATDPSEPLIGRFSYFVKVVPTVYRIESLVGGGRVVESNQYSVTHHFTPSWETP 323
Query: 225 -------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ +PGVF YDLSPI+V+ H S +H + +CA+ GGV+TV+G+
Sbjct: 324 KGGENNNAKHDPSVVPGVFISYDLSPIRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGL 383
Query: 276 IDAFIYHGQRAIKKKIEIGK 295
ID+ +H R ++ K+ GK
Sbjct: 384 IDSLFFHSIRRMQIKMNRGK 403
>gi|72388468|ref|XP_844658.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|62360135|gb|AAX80555.1| hypothetical protein, conserved [Trypanosoma brucei]
gi|70801191|gb|AAZ11099.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 405
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 116/320 (36%), Positives = 163/320 (50%), Gaps = 31/320 (9%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-LEHNETYCGS 59
+D GE +V D + R++ V G P +D Q G EH + C S
Sbjct: 90 IDAFGEHVENVLTDTARVRVNPDTLV----PLGEARPLMDMKKQPADGNGAEHGK--CPS 143
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYG 118
CYGAES+ DCC+ C++VR A+ ++ W D I QC E EGCN++
Sbjct: 144 CYGAESNPGDCCHTCDDVRRAFAERQWEFHEDDASIVQCVHERLKMAAASASTEGCNLHA 203
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
V +V GN HF PG+ F+ G H+H N+SH ++ L FGE FPG NP+D
Sbjct: 204 SFSVPRVTGNIHFIPGRMFNFFGQHLHSFKGETIQKLNLSHIVHSLEFGERFPGQSNPMD 263
Query: 179 GVRWTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS---- 224
G+ + E G + YF+KVVPTVY S G ++SNQ+SVT HF S
Sbjct: 264 GMANVRGATDPSEPLIGRFSYFVKVVPTVYRIESLVGGGRVVESNQYSVTHHFTPSWETP 323
Query: 225 -------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
+ +PGVF YDLSPI+V+ H S +H + +CA+ GGV+TV+G+
Sbjct: 324 KGGENNNAKHDPSVVPGVFISYDLSPIRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGL 383
Query: 276 IDAFIYHGQRAIKKKIEIGK 295
ID+ +H R ++ K+ GK
Sbjct: 384 IDSLFFHSIRRMQIKMNRGK 403
>gi|353237029|emb|CCA69011.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 428
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 171/319 (53%), Gaps = 35/319 (10%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISG+ ++ H + K RLD + + DGI + L + ++ YCGSCY
Sbjct: 92 DISGDVVREITHHVVKTRLDPAAH--QPIPDGIYRTDLKSDLSKQ--LTATSKGYCGSCY 147
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
G + + CCN C++VR AY +GWA NPD IDQC E + ++I + EGCNI G +
Sbjct: 148 GGQPPEGGCCNTCDDVRRAYTDRGWAFGNPDQIDQCVSENWTEKIMAMQREGCNIEGRVR 207
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN-ISHKINKLAFGEH---------FP 171
VNKV GN F+PG+SF + V+ ++ + +DS + H I+ L ++ P
Sbjct: 208 VNKVTGNMQFSPGRSFVVNRPEVYALVPYLKDSNHFFGHHIHSLEIYDYEEDTWTRRNLP 267
Query: 172 GVVN--------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR- 222
+ PL+ V E+ M+QYF+KVV + Y + G ++Q+S + R
Sbjct: 268 EQIKERLGITKPPLEDVYAHTESADYMFQYFLKVVKSSYKGLDGKAYSTHQYSTSSFERD 327
Query: 223 -------SSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+E G Q +PGVFF +++SP++V E+ S+ HF+T++ AI+GGV
Sbjct: 328 LATMSHGKNEDGIEIVHERQGVPGVFFNFEISPMEVIHIEQRQSWAHFITSMAAIIGGVL 387
Query: 271 TVSGIIDAFIYHGQRAIKK 289
TV+ ++DA +++ Q IKK
Sbjct: 388 TVATLVDALLFNTQGLIKK 406
>gi|67479077|ref|XP_654920.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472012|gb|EAL49533.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449701866|gb|EMD42605.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 354
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 108/283 (38%), Positives = 160/283 (56%), Gaps = 20/283 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D +GE +D+ +I K+RL N++ +D I K K + +G T C C
Sbjct: 86 LDTTGEVIIDISKNIKKERL----NLV--NEDEISKKKFAKTV--YG-------TECPPC 130
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
ES + CC CEE+ E+Y+K + P QC+ + GEGC I G +
Sbjct: 131 -NNESDKDKCCFTCEELTESYQKLNKEV--PKGSPQCEIRNIHKMTTFYNGEGCRISGTV 187
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VN+ +GNFH APG S + H+H + + N++H N L+FG+ FPG++NP+DG+
Sbjct: 188 FVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGINLTHTWNFLSFGDSFPGMINPMDGI 246
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFY 239
T + MYQYF++VVP YT + I +N +SVTEH+R S + Q +PGVF Y
Sbjct: 247 VKVDRTNNSMYQYFVQVVPMTYTSLDNKVIHTNGYSVTEHYRPGSLKSPEQGIPGVFVIY 306
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
D+S I+V + EE SF H LT++C I+GGVF + ++D FI+H
Sbjct: 307 DISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 349
>gi|303322923|ref|XP_003071453.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|240111155|gb|EER29308.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
delta SOWgp]
gi|320033474|gb|EFW15422.1| COPII-coated vesicle membrane protein Erv46 [Coccidioides posadasii
str. Silveira]
Length = 435
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 121/351 (34%), Positives = 168/351 (47%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ V H + K RL + G+ ++ + +DK Q L + YC
Sbjct: 89 MDVSGEQQSGVIHGVNKVRLSAASEGGHALD-----VETVDLDKKDQ---APLHLDPGYC 140
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
GSCY + CCN C+EVREAY + WA + ++QC++EG+ +I + EG
Sbjct: 141 GSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWAFGRGEGVEQCEQEGYGSKIDSQRNEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C + G L VNKV GNFH APG+SF +H HD+ + +SH I++L FG P
Sbjct: 201 CRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDLKTYYETPVKHTMSHIIHQLRFGPQLP 260
Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
+ NPLD T E P + YF+KVV T Y +
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320
Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
+I+++Q+SVT H RS E G R+ T +PGVFF Y
Sbjct: 321 SSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIEGGDDSAEGHKERVHTAGGIPGVFFNY 380
Query: 240 DLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV E L FLT VCA++GG TV+ +D +Y G +KK
Sbjct: 381 DISPMKVINREARTKSLSGFLTGVCAVIGGTLTVAAAVDRALYEGSVRVKK 431
>gi|300123299|emb|CBK24572.2| unnamed protein product [Blastocystis hominis]
Length = 376
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 103/276 (37%), Positives = 148/276 (53%), Gaps = 20/276 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISG+Q + V I + LD + + K P CGSC
Sbjct: 111 MDISGQQQMGVTSRIVQLDLDENHKPVNMALSSVLYEKNIDPA-------------CGSC 157
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGW-ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+GA S+ CCN C++V AY ++GW QC++ + +GC ++G
Sbjct: 158 FGASLSNV-CCNTCDDVLSAYERRGWDTWFVSKYSPQCRKNNDEVKKPRVNSQGCMMWGV 216
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVNKVAGNFH A G + ++ H+H FN++H I KL+FGEH PG+ NPLDG
Sbjct: 217 LEVNKVAGNFHIAVGHAANRDSHHIHSFNPLMISKFNVTHHIEKLSFGEHIPGIQNPLDG 276
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ---GRLQTLPGVF 236
E+ + Y++KV+PTVY++ + T+ SN+ SV E R E G++ +LPG+F
Sbjct: 277 HDMVAESLTSQ-NYYLKVMPTVYSNRTS-TVVSNELSVNEVSRRVEMTPFGQITSLPGIF 334
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
F YD++P TE ++F HFL VCA++GGV V
Sbjct: 335 FIYDITPFMHVVTESRIAFAHFLVRVCAVIGGVAAV 370
>gi|453082617|gb|EMF10664.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 432
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 169/350 (48%), Gaps = 71/350 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGE V H + K RLD+ G I G A ++ Q + H + YCG
Sbjct: 89 MDVSGEVQSGVMHGVNKVRLDANGKEI-----GKEALTVNSEEQ-----VPHLDPDYCGD 138
Query: 60 CYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CYGA ++ CCNNC EVREAY W+ + ++QC RE + + + E+ EGC
Sbjct: 139 CYGAPAPETATKAGCCNNCAEVREAYAGVSWSFGRGEGVEQCTREHYAEHLDEQRKEGCR 198
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAFGEHFPG 172
I G + VNKV GNFHFAPGKSF +HVHD+ + + + +HKI+ L FG P
Sbjct: 199 IEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYFQSGEVQHSFTHKIHHLRFGPELPD 258
Query: 173 VV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGH---- 208
V NPLD + + + YF+KVV T Y D SG
Sbjct: 259 DVVKAVGKKGMAWSNHHLNPLDDTEQVTDEVAYNFMYFVKVVSTAYLPLGWDGSGSLLDI 318
Query: 209 ----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYD 240
+I+++Q+SVT H RS G RL +PGVFF YD
Sbjct: 319 PHELIALGGYGKGEQGSIETHQYSVTSHKRSLTGGDAKAEGHEERLHAKGGIPGVFFSYD 378
Query: 241 LSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+SP+KV E SF FL VCA++GG TV+ +D +Y G ++K
Sbjct: 379 ISPMKVINREARAKSFSGFLVGVCAVIGGTLTVAAAVDRLLYEGGSKLRK 428
>gi|261188384|ref|XP_002620607.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239593207|gb|EEQ75788.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis SLH14081]
gi|239609349|gb|EEQ86336.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ER-3]
gi|327354450|gb|EGE83307.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
dermatitidis ATCC 18188]
Length = 435
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 124/354 (35%), Positives = 169/354 (47%), Gaps = 76/354 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHG---GRLEHNE 54
MDISGE +V H + K RL + G V++ I A LQ H + +
Sbjct: 89 MDISGEYQTEVVHGVNKLRLSPAEEGGQVLD-----ITA------LQLHSKTDNAKDLDP 137
Query: 55 TYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
YCGSCYGA + CCN C+EVREAY K W+ + ++QC++EG+ + +
Sbjct: 138 NYCGSCYGAPAPPNAQKPGCCNTCDEVREAYAAKRWSFGRGENVEQCEKEGYSANLDAQR 197
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGE 168
EGC + G + VNKV GNFH APG+SF +H HD+ + N+ HKI+ L FG
Sbjct: 198 KEGCRVEGVIRVNKVIGNFHIAPGRSFTNGNMHAHDLNNYYNTPIPHNVGHKIHYLRFGP 257
Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
P V NPLD P + YF+KVV T Y +
Sbjct: 258 QLPDEVSRRWKWTDHHHTNPLDNTEQHTTNPRLNFAYFVKVVATSYLPLGWDDDWSSTVH 317
Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
SG +I+++Q+SVT H RS + G RL + +PGVF
Sbjct: 318 SKVSNNVPLGKQGVSLGSGGSIETHQYSVTSHKRSVDGGNDAEEGHKERLHSQGGIPGVF 377
Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 378 VNYDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAIDRALYEGSVRVKK 431
>gi|407044387|gb|EKE42566.1| hypothetical protein ENU1_017250 [Entamoeba nuttalli P19]
Length = 354
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 108/283 (38%), Positives = 160/283 (56%), Gaps = 20/283 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D +GE +D+ +I K+RL N++ +D I K K + +G T C C
Sbjct: 86 LDTTGEVIIDISKNIKKERL----NLV--NEDEISKKKFAKTV--YG-------TECPPC 130
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
E + CC CEE+ E+Y+K + P QC+ + + GEGC I G +
Sbjct: 131 -NNEIDKDKCCFTCEELTESYQKLNKEV--PKGSPQCEIKNIHKMTTFYNGEGCRISGTV 187
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
VN+ +GNFH APG S + H+H + + N++H N L+FG+ FPG++NPLDG+
Sbjct: 188 FVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGINLTHTWNFLSFGDSFPGMINPLDGI 246
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFY 239
T + MYQYF++VVP YT + I +N +SVTEH+R S + Q +PGVF Y
Sbjct: 247 VKVDRTNNSMYQYFVQVVPMTYTSLDNKVINTNGYSVTEHYRPGSLKSPEQGIPGVFVIY 306
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
D+S I+V + EE SF H LT++C I+GGVF + ++D FI+H
Sbjct: 307 DISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 349
>gi|167376738|ref|XP_001734125.1| endoplasmic reticulum-golgi intermediate compartment protein
[Entamoeba dispar SAW760]
gi|165904489|gb|EDR29705.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba dispar SAW760]
Length = 361
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/292 (33%), Positives = 164/292 (56%), Gaps = 18/292 (6%)
Query: 4 SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
SGE +D++ ++ K R+ G+++ + K +Q H+ C SCYGA
Sbjct: 86 SGESMIDIEQNVTKIRIHHDGSLVTESEM--------KAIQSKLSTETHDPKECRSCYGA 137
Query: 64 ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
E+ ++ CC C++V+EAY+KKGW L + +++ QC+ +Q + + EGC + G +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQNHEKIQMARLTKDEGCRVIGDFLLN 196
Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
K+ GNFH APG S G H H++ + ++SHK N+L+FGEH +
Sbjct: 197 KIGGNFHIAPGSSEQSWGRHSHNLEWTGKTQIDLSHKWNELSFGEHSKKFTTEKKDTQM- 255
Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
+ M+QY++ ++P ++G T +S+ E+ RS G + PGVF +YD+SP
Sbjct: 256 ----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRS---GEGEGSPGVFVYYDVSP 307
Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+ + TE + FLHFL +C+IVGG+FT + DA ++ +++KK+E+GK
Sbjct: 308 MVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHSLEKKVELGK 359
>gi|298708525|emb|CBJ49158.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 467
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 176/351 (50%), Gaps = 66/351 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++G+ +D+ H ++K+RLD G+ I + D P Q E YCGSC
Sbjct: 127 MDVAGDNQIDIDHGMWKQRLDPDGSAIGEAFMEVPGEVDDDPAQ------SLPEDYCGSC 180
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+GA+ + CCN C +V +AY KGW++ + +QC R+ ++ GEGCN+ GF
Sbjct: 181 FGAK---KGCCNMCRDVVDAYTAKGWSVQDIRRTAEQCIRDNHIE-TPIVNGEGCNLSGF 236
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLD 178
+ VNKV+GNFH A G+ + G HVH Q FN SH IN L+F E +PG+ NPLD
Sbjct: 237 MSVNKVSGNFHVATGEGVMREGRHVHLYTLEQAVGFNTSHSINLLSFWEPYPGMKPNPLD 296
Query: 179 GVRWT--QETPSGMYQYFIKVVPTVY-----TDVSGHTIQ---------------SNQFS 216
++ +G +QY+IK+VPT++ ++ SG + ++QF+
Sbjct: 297 RTSRIIDEDVGTGAFQYYIKLVPTMHSLSPQSEASGSPLPKGKGEEAERQQQSSLTSQFT 356
Query: 217 VTEHFRS--------------------SEQGRLQ-----------TLPGVFFFYDLSPIK 245
T FRS +E+G Q LPGVFF YD+SP
Sbjct: 357 YTYKFRSLKGLTEYHTDHEEGEEQAKEAEKGLTQDGGVNSIVNSALLPGVFFVYDVSPFM 416
Query: 246 V-TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
V E F H L +CA+ GG F +SGI+D+ ++H +++ +GK
Sbjct: 417 VEVVPAEQPPFSHLLIRLCAVAGGAFAISGIVDSAVFHLSNRLRRHGVLGK 467
>gi|407034208|gb|EKE37117.1| hypothetical protein ENU1_208770 [Entamoeba nuttalli P19]
Length = 361
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/292 (33%), Positives = 164/292 (56%), Gaps = 18/292 (6%)
Query: 4 SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
SGE +D++ ++ K R+ G+++ + K +Q H+ C SCYGA
Sbjct: 86 SGESMIDIEQNVTKIRIHHDGSLVTENEM--------KAIQSKLSTETHDPKECRSCYGA 137
Query: 64 ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
E+ ++ CC C++V+EAY+KKGW L + +++ QC+ +Q K + EGC + G +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLN 196
Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
K+ GNFH APG S G H H++ + ++SHK N+L+FGE+ +
Sbjct: 197 KIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGENSKKFTTEKKDTQM- 255
Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
+ M+QY++ ++P ++G T +S+ E+ RS G+ + PGVF +YD+SP
Sbjct: 256 ----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENTRS---GKGEGQPGVFVYYDVSP 307
Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+ + TE + FLHFL +C+IVGG+FT + DA ++ +KKK+E+GK
Sbjct: 308 MVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 359
>gi|115388503|ref|XP_001211757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114195841|gb|EAU37541.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 438
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 177/354 (50%), Gaps = 73/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGEQ + V H I K RL S G+V++ + + + ++ + +H L+ N YC
Sbjct: 89 MDVSGEQQVGVAHGINKVRLASPAEGGHVLDVQALELHS---EQEVAKH---LDPN--YC 140
Query: 58 GSCYGAESSD---EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
G C G + CCN CEEVREAY + WA + I+QC+REG+ RI + EGC
Sbjct: 141 GECGGIPQQPGEPKRCCNTCEEVREAYAEHQWAFGKGENIEQCEREGYAARIDAQRREGC 200
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFGE 168
+ G L VNKV GNFH APG+SF +HVHD+ + + ++H I++L FG
Sbjct: 201 RLEGVLRVNKVVGNFHIAPGRSFSSGNIHVHDLENYFELDQPASEKHTMTHHIHQLRFGP 260
Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---------- 206
P + NPLD + + Y YF+KVV T Y +
Sbjct: 261 QLPDELSDRWQWTDHHHTNPLDDTVQETDLAAFNYMYFVKVVSTAYLPLGWDPRVSSYIH 320
Query: 207 ----------------GH--TIQSNQFSVTEHFR------SSEQGRLQTL------PGVF 236
GH +I+++Q+SVT H R ++++G + L PGVF
Sbjct: 321 SASSHNVPLGRHGIGYGHDGSIETHQYSVTSHKRPLMGGNAADEGHKERLHAAAGIPGVF 380
Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F FLT VCAI+GG TV+ ID +Y G +KK
Sbjct: 381 FNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAAIDRGLYEGAIRVKK 434
>gi|298706631|emb|CBJ29569.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 453
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/332 (35%), Positives = 166/332 (50%), Gaps = 56/332 (16%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVI-------------------ESRQDGIGAPKIDKP 42
D G D++HD+ + RLDS G + E +Q A D+
Sbjct: 112 DALGIPQEDLRHDVTRTRLDSIGRALDDGEKHEMGNTLKAVIAKEEEKQAEADASPGDED 171
Query: 43 L---QRHG----GRLEH----------NETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 85
L R G G +E E C +CYGA + E CC CE+VR+AYR+KG
Sbjct: 172 LDSKSRAGDGGDGDVEQRALEDTATTGQEDEC-NCYGAGAEGE-CCRTCEDVRKAYRRKG 229
Query: 86 WALSNPDLIDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
W L NP I C E E EGC + G LEV++ GNFHFAPG H+
Sbjct: 230 WRL-NPAEIPACAGEALSANSANTMESPPVENEGCRLAGHLEVSRTEGNFHFAPGHRLHR 288
Query: 140 SGVHVH--DILAFQRDSFNISHKINKLAFGEHFP-GVVNP--------LDGVRWTQETPS 188
+ D + +SFN +H IN L FG+ P G +P L+G + T +
Sbjct: 289 HANELSFVDRIQVALESFNTTHTINTLTFGDQPPPGHASPKHAVASTVLEGHQKTVQDTH 348
Query: 189 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 248
M+QYF+++VPTVY +G T+ SNQ+S TEH + G + LPGV+F+Y++SP++
Sbjct: 349 AMHQYFLQLVPTVYRLDNGETVHSNQYSATEHLKHVHDGTSRGLPGVYFYYEVSPVQALV 408
Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
E+ FL FLT C +VGGV+T+ G+++ I
Sbjct: 409 EEKRKGFLAFLTGACGVVGGVYTILGLVNTGI 440
>gi|342183032|emb|CCC92512.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 166/319 (52%), Gaps = 33/319 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D GE D+ D K R+DS D + +PL + + C SC
Sbjct: 90 IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
YGAE + DCC+ C++VR A+ ++ W D+ I QC +E EGCN++
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
V +V GN HF PG+ F+ G H+H N+SH I+ L FGE FPG NPLDG
Sbjct: 201 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIQRLNLSHIIHTLEFGERFPGQKNPLDG 260
Query: 180 VRWTQ--ETPS----GMYQYFIKVVPTVY----TDVSGHTIQSNQFSVTEHFRSS----- 224
+ T+ E PS G + YF+KVVPT+Y SG ++SNQ+SVT HF +S
Sbjct: 261 MVNTRGVENPSEDLIGRFAYFVKVVPTLYQVRTLMSSGRVVESNQYSVTHHFTASWDAAD 320
Query: 225 ------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
+ +PGVF YD+SPI+V+ H S +H + +CA+ GGV+TV G+I
Sbjct: 321 QNNQTNRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLI 380
Query: 277 DAFIYHGQRAIKKKIEIGK 295
D+ +H R +++KI GK
Sbjct: 381 DSMFFHSIRRVQEKINRGK 399
>gi|342183042|emb|CCC92522.1| unnamed protein product [Trypanosoma congolense IL3000]
gi|343474271|emb|CCD14057.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 401
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 117/319 (36%), Positives = 166/319 (52%), Gaps = 33/319 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D GE D+ D K R+DS D + +PL + + C SC
Sbjct: 90 IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
YGAE + DCC+ C++VR A+ ++ W D+ I QC +E EGCN++
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
V +V GN HF PG+ F+ G H+H N+SH I+ L FGE FPG NPLDG
Sbjct: 201 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIQRLNLSHIIHTLEFGERFPGQKNPLDG 260
Query: 180 VRWTQ--ETPS----GMYQYFIKVVPTVY----TDVSGHTIQSNQFSVTEHFRSS----- 224
+ T+ E PS G + YF+KVVPT+Y SG ++SNQ+SVT HF +S
Sbjct: 261 MVNTRGVENPSEDLIGRFAYFVKVVPTLYQVKTLMSSGRVVESNQYSVTHHFTASWDAAD 320
Query: 225 ------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
+ +PGVF YD+SPI+V+ H S +H + +CA+ GGV+TV G+I
Sbjct: 321 QNNQTNRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLI 380
Query: 277 DAFIYHGQRAIKKKIEIGK 295
D+ +H R +++KI GK
Sbjct: 381 DSMFFHSIRRVQEKINRGK 399
>gi|367019108|ref|XP_003658839.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
gi|347006106|gb|AEO53594.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
42464]
Length = 436
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/357 (33%), Positives = 164/357 (45%), Gaps = 83/357 (23%)
Query: 1 MDISGEQHLDVKHDIFKKRL----------DSQGNVIESRQDGIGAPKIDKPLQRHGGRL 50
MD+SGEQ V+H I K RL DS+ V+ SR + +
Sbjct: 89 MDVSGEQQHGVQHGITKTRLRPLSEGGGDIDSKEIVLHSRDEAA---------------V 133
Query: 51 EHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI 106
+ YCG CYGA + CCN C+EVR+AY + WA + I QC+RE + +++
Sbjct: 134 HLDPNYCGECYGAPPPNNAKKPGCCNTCDEVRDAYAQASWAFGRGEGIVQCEREHYSEKL 193
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKL 164
+ EGC I G L VNKV GNFH APG+SF +HVHD+ + +H I+ L
Sbjct: 194 DAQRNEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVHDLKNYWDSPTKHTFTHTIHHL 253
Query: 165 AFGEHFPGV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
FG P VNPLD + + Y YF+K+VPT Y +
Sbjct: 254 RFGPQLPESLTQKLGTKNLPWTNHHVNPLDDTHQQTDDVNYNYMYFLKIVPTSYLPLGWE 313
Query: 209 -----------------------TIQSNQFSVTEHFRSSEQGRLQT------------LP 233
+++++Q+SVT H RS G +P
Sbjct: 314 KTWAGFRERHSAELGSFGTSPDGSVETHQYSVTSHKRSLAGGNDAAEGHQERQHARGGIP 373
Query: 234 GVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
GVFF YD+SP+KV EE SFL FL +CAIVGG TV+ ID ++ G +KK
Sbjct: 374 GVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLTVAAAIDRALFEGTVRLKK 430
>gi|449299159|gb|EMC95173.1| hypothetical protein BAUCODRAFT_529716 [Baudoinia compniacensis
UAMH 10762]
Length = 435
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 171/351 (48%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGE V H + K RL G R+ G A ++ K ++ ++H + YCG
Sbjct: 89 MDVSGEVQTGVMHGVNKVRLGEDG-----REVGREALELGKEVEE---SMKHMDPEYCGE 140
Query: 60 CYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CYGA + CCN C EVREAY W+ + ++QC+RE + + + E+ EGC
Sbjct: 141 CYGAPAPGNAIRAGCCNTCAEVREAYASVSWSFGRGENVEQCEREHYSEHLDEQRREGCR 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI----SHKINKLAFGEHFP 171
I G + VNKV GNFHFAPGKSF +HVHD+ + I SH I+ L FG P
Sbjct: 201 IEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYFAGGEGIDHTFSHTIHHLRFGPQLP 260
Query: 172 GVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
V NPLD + + Y YF+KVV T Y
Sbjct: 261 EDVVRRIGRRGMAWSNHHLNPLDETEQKTDEKAYNYMYFVKVVSTAYLPLGWERTGSILD 320
Query: 203 -----TDVSGH------TIQSNQFSVTEHFRS------SEQGRLQTL------PGVFFFY 239
++ G+ +++++Q+SVT H RS E+G + L PGVFF Y
Sbjct: 321 IPHELVELGGYGKGEAGSVETHQYSVTSHKRSLAGGDGGEEGHKERLHARGGIPGVFFSY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV E SF FL VCA++GG TV+ ID +Y G + +KK
Sbjct: 381 DISPMKVINREARSKSFSGFLVGVCAVIGGTLTVAAAIDRALYEGGQRVKK 431
>gi|156844136|ref|XP_001645132.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
gi|156115789|gb|EDO17274.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
70294]
Length = 405
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 166/324 (51%), Gaps = 44/324 (13%)
Query: 1 MDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SG+ LDV ++ K RLD G V+E+ D + + G + YCG
Sbjct: 89 MDDSGDLQLDVLEYGFTKTRLDPDGKVLETD---------DFDMYKQDGAPSTDPNYCGP 139
Query: 60 CYGA---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYG+ E+S+ CC CE+VR+AY K GWA + I+QC++EG++++I
Sbjct: 140 CYGSIDQSKNDEVEASERVCCQTCEDVRKAYVKAGWAFYDGKGIEQCEQEGYVKKINSHL 199
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
EGC + G +N++ GN HFAPGKSF H HD ++R+ N +H I+ +FG+
Sbjct: 200 NEGCRVAGSASLNRIQGNIHFAPGKSFQTVRGHFHDQSLYERNPQLNFNHIIHHFSFGKE 259
Query: 170 FP---------GVVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
P +VNPLDG E + ++Q Y+ K+VPT + ++ + + QFS T
Sbjct: 260 IPTKLASRHSKNIVNPLDGRSVAPERDTHLHQFSYYTKIVPTRFEYLNKAVVDTAQFSAT 319
Query: 219 EHFRSSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEHV--SFLHFLTNVCAIV 266
H R G +PGVFFF+D SPIKV +E++ S+ F N +
Sbjct: 320 YHDRPLRGGADDDHPNTFHFRSGIPGVFFFFDASPIKV-INKEYISGSWSSFFLNCITSI 378
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GGV V ++D +Y QR+ K
Sbjct: 379 GGVLAVGSMLDRLMYKAQRSFLGK 402
>gi|347842451|emb|CCD57023.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Botryotinia fuckeliana]
Length = 439
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 127/354 (35%), Positives = 165/354 (46%), Gaps = 74/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGR---LEHNETY 56
MD+SGEQ + V H + K RL Q G ID K L H + Y
Sbjct: 89 MDVSGEQQVGVMHGVKKVRLGPQEE---------GGKVIDIKALDLHNAEDSATHLDPNY 139
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG+CYGA + CCN C+EVREAY WA + ++QC+RE + +R+ + E
Sbjct: 140 CGACYGATPPPNAQKPGCCNTCDEVREAYASVSWAFGRGENVEQCEREHYGERLDSQRKE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGE 168
GC I G L VNKV GNFH APG+SF +HVHD+ F SH I+ L FG
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLNNFFDTPVPGGHVFSHHIHSLRFGP 259
Query: 169 HFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS----- 206
P V NPLD + + YF+KVV T Y +
Sbjct: 260 ELPEEVFKKLGSDSIIPWTNHHLNPLDNTEQITHEAAYNFMYFVKVVSTSYLPLGWETNY 319
Query: 207 --------------GH----TIQSNQFSVTEHFRS------SEQGRLQTL------PGVF 236
GH +I+++Q+SVT H RS S +G + L PGVF
Sbjct: 320 NSRPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSLNGGDDSAEGHKEKLHARGGIPGVF 379
Query: 237 FFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV EE L FLT +CAIVGG TV+ +D +Y G ++K
Sbjct: 380 FSYDISPMKVINKEERTKTLAGFLTGLCAIVGGTLTVAAAVDRGVYEGATRLRK 433
>gi|225562998|gb|EEH11277.1| COPII coated vesicle component Erv46 [Ajellomyces capsulatus
G186AR]
gi|240279818|gb|EER43323.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H143]
gi|325092948|gb|EGC46258.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H88]
Length = 435
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 166/348 (47%), Gaps = 64/348 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE V H + K RL S +E + + Q + G + + YCG C
Sbjct: 89 MDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDITALQLHSQTNKG-TDVDPDYCGQC 143
Query: 61 YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
YGA + CCN CEEVR+AY KGWA + ++QC++EG+ + + EGC +
Sbjct: 144 YGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGENVEQCEKEGYSANLDAQRKEGCRV 203
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV 174
G + VNKV GNFH APG+SF +H HD+ + N+ H+I+ L FG P +
Sbjct: 204 EGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYHTPVQHNMGHRIHYLRFGPQLPEQL 263
Query: 175 ------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------SGH------ 208
NPLD P + YF+KVV T Y + S H
Sbjct: 264 SSRWKWTDNHHTNPLDNTEQHTTNPRFNFMYFVKVVSTSYLPLGWDPDASSSAHSQYSKN 323
Query: 209 --------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLS 242
+I+++Q+SVT H RS + G RL + +PGVF YD+S
Sbjct: 324 APLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDSAEGHKERLHSQGGIPGVFVNYDIS 383
Query: 243 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
P+KV E +F FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 384 PMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAIDRVLYEGAVRVKK 431
>gi|225680824|gb|EEH19108.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb03]
Length = 413
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 168/352 (47%), Gaps = 72/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGE V H I K RL + G+VI++ L +H + Y
Sbjct: 67 MDVSGEMQSGVIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 117
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA ++ CC+ CEEVREAY + WA + ++QC+REG+ + + + E
Sbjct: 118 CGPCYGAPPPSHATKPGCCSTCEEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 177
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF +H HD+ + ++SHKI++L FG
Sbjct: 178 GCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAHDLDTYYHTPVPHHMSHKIHQLRFGPQL 237
Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
+ NPLD P + YF+KVV T Y +
Sbjct: 238 SDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 297
Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
S +I+++Q+SVT H RS + G RL + +PGVF
Sbjct: 298 TLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSHGGIPGVFVN 357
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCA++GG TV+ +D +Y G +KK
Sbjct: 358 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGAARVKK 409
>gi|295672798|ref|XP_002796945.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226282317|gb|EEH37883.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 435
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 168/352 (47%), Gaps = 72/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGE V H I K RL + G+VI++ L +H + Y
Sbjct: 89 MDVSGEMQSGVIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 139
Query: 57 CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA ++ CC+ CEEVREAY + WA + ++QC+REG+ + + + E
Sbjct: 140 CGPCYGAPPPPHATKPGCCSTCEEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN--ISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF +H HD+ + ++HKI++L FG
Sbjct: 200 GCRIEGVLRVNKVIGNFHIAPGRSFSNGNLHAHDLDTYYHTPVPHYMAHKIHQLRFGPQL 259
Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
P + NPLD P + YF+KVV T Y +
Sbjct: 260 PDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 319
Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
S +I+++Q+SVT H RS + G RL + +PGVF
Sbjct: 320 TLRDTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSQGGIPGVFVN 379
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCA++GG TV+ +D +Y G +KK
Sbjct: 380 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGAVRVKK 431
>gi|154280410|ref|XP_001541018.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150412961|gb|EDN08348.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 435
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 166/348 (47%), Gaps = 64/348 (18%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MDISGE V H + K RL S +E + + Q + G + + YCG C
Sbjct: 89 MDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDITALQLHSQTNKG-TDVDPDYCGQC 143
Query: 61 YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
YGA + CCN CEEVR+AY KGWA + ++QC++EG+ + + EGC +
Sbjct: 144 YGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGENVEQCEKEGYSANLDAQRKEGCRV 203
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV 174
G + VNKV GNFH APG+SF +H HD+ + N+ H+++ L FG P +
Sbjct: 204 EGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYHTPVQHNMGHRVHYLRFGPQLPEEL 263
Query: 175 ------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------SGH------ 208
NPLD P + YF+KVV T Y + S H
Sbjct: 264 SSRWKWTDNHHTNPLDNTEQHTTNPRFNFIYFVKVVSTSYLPLGWDPDASSSAHSKYSKN 323
Query: 209 --------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLS 242
+I+++Q+SVT H RS + G RL + +PGVF YD+S
Sbjct: 324 APLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDSAEGHKERLHSQGGIPGVFVNYDIS 383
Query: 243 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
P+KV E SF FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 384 PMKVINREARTKSFSGFLTGVCAVIGGTLTVAAAIDRVLYEGAVRVKK 431
>gi|452980033|gb|EME79795.1| hypothetical protein MYCFIDRAFT_64499 [Pseudocercospora fijiensis
CIRAD86]
Length = 436
Score = 183 bits (465), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 171/353 (48%), Gaps = 73/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGN---VIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE V H I K RL S + VIE ++ + A + L YC
Sbjct: 89 MDVSGEVQTGVLHGINKVRLSSVADGSKVIEKQKLDLDAAENSVHLA---------PDYC 139
Query: 58 GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + D CCN C EVR+AY W+ + ++QC+RE + +++ + EG
Sbjct: 140 GECYGAPAPDNAKKAGCCNTCAEVRDAYASVSWSFGRGENVEQCEREHYSEQLDAQRKEG 199
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLAFGEHF 170
C I G L VNKV GNFHFAPGKSF +HVHD+ + + +H I++L FG
Sbjct: 200 CRIEGALRVNKVVGNFHFAPGKSFSNGNLHVHDLDNYFNSGEVEHSFTHHIHRLRFGPPL 259
Query: 171 P----------GV------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-------- 206
P G+ +NPLD + + + YF+KVV T Y +
Sbjct: 260 PHDFDKRVGKKGMAWSNHHLNPLDDTHQETDDSAFNFMYFVKVVSTAYLPLGWEKTNSFS 319
Query: 207 -------------GH----TIQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
GH +I+++Q+SVT H RS + G + +PGVFF
Sbjct: 320 RSLPHELIDLGDYGHGEQGSIETHQYSVTSHKRSLQGGDAKDEGHKERVHARGGIPGVFF 379
Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E SF FL VCA++GG TV+ +D +Y G++ ++K
Sbjct: 380 SYDISPMKVINRETRAKSFSGFLVGVCAVIGGTLTVAAAVDRMLYEGEQRVRK 432
>gi|254569250|ref|XP_002491735.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|238031532|emb|CAY69455.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv41p [Komagataella pastoris GS115]
gi|328351763|emb|CCA38162.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Komagataella pastoris CBS 7435]
Length = 401
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 106/320 (33%), Positives = 165/320 (51%), Gaps = 33/320 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
MDI+G+ +D+ F+K G E+ + + K + +L +N YCG
Sbjct: 88 MDITGDLQIDLLMSGFQKTRVVDGLAKETTELRVNEYK------QENNKLTNSNNPYYCG 141
Query: 59 SCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
SCYGA + ++ CCN CE V++AY K GWA + I+QC+ EG++Q +
Sbjct: 142 SCYGALNQKDNENKPFDEKLCCNTCESVKKAYAKAGWAFYDGRNIEQCENEGYVQLVTSM 201
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
EGC + G ++N+V+GN HFAPG S H+HD+ F++ D FN H +N L+FG
Sbjct: 202 VDEGCQVSGTAQINRVSGNLHFAPGSSLTSGSRHIHDLSLFEKYPDKFNFDHTVNHLSFG 261
Query: 168 EHFPG---VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+ +PLDG + +Y YF+KVV T Y +SG +NQFS T H R
Sbjct: 262 KTIDNQEMSTHPLDGYEAATGNKNHLYSYFLKVVATRYESMSGLKWDTNQFSATYHDRPL 321
Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
E GR +PG FF +++SP+K+ E++ + F V A V GV T+
Sbjct: 322 EGGRDSDHPNTLHASGGIPGAFFHFEISPLKIINREQYSKTRSAFALGVSASVAGVLTLG 381
Query: 274 GIIDAFIYHGQRAIKKKIEI 293
++D I+ + +++K ++
Sbjct: 382 SVLDKTIWTADQILRQKKDL 401
>gi|320583549|gb|EFW97762.1| COPII-coated vesicle membrane protein Erv46, putative [Ogataea
parapolymorpha DL-1]
Length = 400
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 160/318 (50%), Gaps = 37/318 (11%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SG+ LD+ F K RLD QGN I G ++++ + TYCGS
Sbjct: 89 MDQSGDMQLDLLSSGFSKIRLDRQGNEI-----GQENMRVNQEF----ALTSSDPTYCGS 139
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYGA + CCN+CE V++AY + W + I+QC++EG++ RI
Sbjct: 140 CYGAADQSRNDELPQDQKVCCNSCESVKQAYARNAWKFYDGKDIEQCEKEGYVDRINARL 199
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGE 168
EGC + G E+ ++ GN HFAPG S + + HVHD+ + S FN H IN +FG
Sbjct: 200 DEGCRVRGTAEIARIGGNLHFAPGSSMNFNEKHVHDLSLYDMHSNKFNFDHTINHFSFGL 259
Query: 169 HFPGVVN-----PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
V + PLD +Y YF+KVV T Y + G +++NQFS T+H R
Sbjct: 260 DDHSVADYKTTHPLDATTHRDGRKYHVYSYFLKVVNTRYEFLDGRKVETNQFSATQHDRP 319
Query: 224 SEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTV 272
GR + LPGVFF +++SP+K+ E++ ++ F CA + GV TV
Sbjct: 320 FRGGRDEDHPNTIHAQGGLPGVFFHFEISPLKIINREQYNKTWSAFALGACAAISGVLTV 379
Query: 273 SGIIDAFIYHGQRAIKKK 290
++D I+ R +K K
Sbjct: 380 FTLLDRTIWAANRMLKDK 397
>gi|170586880|ref|XP_001898207.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
gi|158594602|gb|EDP33186.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
putative [Brugia malayi]
Length = 341
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 106/266 (39%), Positives = 149/266 (56%), Gaps = 19/266 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQD-GIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SG+ D+K D++K L +GN I RQ I + + ++ C
Sbjct: 90 MDLSGDNQDDIKDDVYKISLLNGKEGNGI--RQGVNINTTTV--------SSVPASQILC 139
Query: 58 GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
GSCYGA+ + CCN CEEV+EAY KKGW L N + ++QCK + +++++ E + EGC +Y
Sbjct: 140 GSCYGAK---DGCCNTCEEVKEAYIKKGWELVNIETVEQCKSDLWVKKMNEHKNEGCRVY 196
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G ++V KVAGNFH APG H HD+ + F+ SH +N L+FG FPG V PL
Sbjct: 197 GKVQVAKVAGNFHIAPGDPLKAHRSHFHDLHSLSPSKFDTSHTVNHLSFGNSFPGKVYPL 256
Query: 178 DGVRWTQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
DG + SG MYQY +K+VPT Y + S I S+ FSVT + + QG LPG
Sbjct: 257 DGKFFGSAKDSGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGF 315
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTN 261
F Y+ SP+ V + E + + N
Sbjct: 316 FIQYEFSPLMVKYEERRQYVVTIILN 341
>gi|448081831|ref|XP_004194985.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359376407|emb|CCE86989.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 178/322 (55%), Gaps = 35/322 (10%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGN--VIESR---QDGIGAPKIDKPLQRHGGRLEHNE 54
+D+SG+ DV F+K RL N V+++ ++ + I + + GG
Sbjct: 90 LDVSGDTQADVLKSGFEKYRLIPSSNEEVLDNAPVLRNDLSLEDIARNPNKEGG------ 143
Query: 55 TYCGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EE 110
+CGSCYGA + +E CCN+CE VR AY ++ WA + I+QC+ EG++ R+ + E+
Sbjct: 144 GFCGSCYGALPQGDNEYCCNDCETVRLAYAERMWAFYDGANIEQCENEGYVTRLNQRIEQ 203
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG- 167
EGC I G ++N+V+GN HFAPG + G H+HD+ +++ D FN H IN L+FG
Sbjct: 204 KEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDLSLYEKHFDKFNFDHVINHLSFGL 263
Query: 168 ---EHFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
+ P +PLDG R S + Y++KVV T + +SG +++NQFS H R
Sbjct: 264 DPVKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVATRFEFLSGLAMETNQFSAIPHHR 323
Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 271
G+ + +PGVFF +D+SP+K+ E++ ++ F+ V + + GV T
Sbjct: 324 PYRGGKDEDHRHTMHAKGGIPGVFFHFDISPMKIINKEQYAKTWSGFVLGVVSSIAGVLT 383
Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
V ++D ++ ++AIK K +I
Sbjct: 384 VGAVLDRSVWAAEKAIKSKKDI 405
>gi|402590490|gb|EJW84420.1| hypothetical protein WUBG_04668 [Wuchereria bancrofti]
Length = 341
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 104/265 (39%), Positives = 146/265 (55%), Gaps = 17/265 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
MD+SG+ D+K D++K L +GN I + P ++ CG
Sbjct: 90 MDLSGDNQDDIKDDVYKISLLNGKEGNGIRQGVNINTTTVSSAP---------ASQILCG 140
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGA+ + CCN CEEV+EAY KKGW L N + ++QCK + +++++ E + EGC +YG
Sbjct: 141 SCYGAK---DGCCNTCEEVKEAYIKKGWELVNIETVEQCKSDLWVKKMNEHKNEGCRVYG 197
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
++V KVAGNFH APG H HD+ + F+ SH +N L+FG FPG V PLD
Sbjct: 198 KVQVAKVAGNFHIAPGDPLKAHRSHFHDLHSLSPSKFDTSHTVNHLSFGNSFPGKVYPLD 257
Query: 179 GVRWTQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
G + SG MYQY +K+VPT Y + S I S+ FSVT + + QG LPG F
Sbjct: 258 GKFFGSAKDSGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFF 316
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTN 261
Y+ SP+ V + E + + N
Sbjct: 317 IQYEFSPLMVKYEERRQYVVTIILN 341
>gi|398398231|ref|XP_003852573.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
gi|339472454|gb|EGP87549.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
Length = 435
Score = 181 bits (459), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 121/359 (33%), Positives = 165/359 (45%), Gaps = 85/359 (23%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET----- 55
MD+SG+ V H I K RL + G IDK GRL+ NE
Sbjct: 89 MDVSGDVQTGVLHGIVKTRLKPESE---------GGGDIDK------GRLQVNEVEEAAK 133
Query: 56 -----YCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI 106
YCG CYGA + CCN C EVREAY W+ + ++QC RE + + +
Sbjct: 134 HLARDYCGDCYGAPPPANAIKSGCCNTCAEVREAYASVSWSFGRGENVEQCTREHYSEHL 193
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKL 164
E+ EGC + G + VNKV GNFHFAPGKSF +HVHD+ + SH I+ L
Sbjct: 194 DEQRKEGCRVDGVIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYLTGGGDHTPSHIIHHL 253
Query: 165 AFGEHFPGV-----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS- 206
FG P ++PLDG R + Y YF+KVVPT Y +
Sbjct: 254 RFGPLLPESYKHRVRDTERHWSNNHHLSPLDGFRQETNEKAYNYMYFVKVVPTAYLPLGY 313
Query: 207 -----------------------GHTIQSNQFSVTEHFR------SSEQGRLQTL----- 232
G +I+++Q+SVT H R ++++G + L
Sbjct: 314 ENLPSVGDYPHEHAHVGEYGISHGSSIETHQYSVTSHKRHLGGGDANDEGHKERLHARGG 373
Query: 233 -PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
PGVFF YD+SP+KV E SF FL +C ++GG TV+ +D + G + +KK
Sbjct: 374 IPGVFFSYDISPMKVIDREVRAKSFSSFLVGICGVLGGTLTVAAAVDRIWFEGTQRVKK 432
>gi|61555552|gb|AAX46728.1| serologically defined breast cancer antigen 84 isoform b [Bos
taurus]
Length = 283
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 89/164 (54%), Positives = 112/164 (68%), Gaps = 11/164 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F D+ K
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNVRTRWK 245
>gi|407929248|gb|EKG22082.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 442
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 169/358 (47%), Gaps = 77/358 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGE V H + K RL + SR + A L H H + YCG
Sbjct: 89 MDVSGEIQTGVMHGVNKVRLTPENE--GSRPIEVNA------LNLHADEASHMDPDYCGE 140
Query: 60 CYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CYGA + CCN C++VR+AY W+ + D ++QC+RE + +++ + EGC
Sbjct: 141 CYGAPAPTTAKKPGCCNTCDDVRDAYAAISWSFTRGDGVEQCEREHYGEKLDAQRREGCR 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
+ G + VNKV GNFHFAPGKSF +HVHD+ + +D + +H+++ L FG P
Sbjct: 201 VEGGIRVNKVIGNFHFAPGKSFSNGNMHVHDLENYFKDGAPHSFTHQVHSLRFGPQLPDD 260
Query: 174 V--------------------NPLDGVRWTQETPSGMYQYFIKVVPTVY----------T 203
V NPLD + + + YF+KVV T Y +
Sbjct: 261 VIAKLEASGMSASSLWTNHHINPLDNTEQRTDEKAFNFMYFVKVVSTAYLPLGWENKGSS 320
Query: 204 DVSG-------------------HTIQSNQFSVTEHFRSSEQG---------RLQT---L 232
+SG +I+++Q+SVT H RS G RL +
Sbjct: 321 SLSGLLPDADRAPLGSYGLASGEGSIETHQYSVTSHKRSLAGGNDEKDGHKERLHARGGI 380
Query: 233 PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
PGVFF YD+SP+KV E SF FL VCA++GG TV+ ID +Y G +KK
Sbjct: 381 PGVFFSYDISPMKVINRESRAKSFSGFLVGVCAVIGGTLTVAAAIDRALYEGSTKLKK 438
>gi|148674215|gb|EDL06162.1| ERGIC and golgi 3, isoform CRA_b [Mus musculus]
Length = 269
Score = 180 bits (456), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 88/164 (53%), Positives = 115/164 (70%), Gaps = 3/164 (1%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 100 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 156
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 157 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 216
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
EVNKVAGNFHFAPGKSF QS VHVH + SF + + + L
Sbjct: 217 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNPSDCL 260
>gi|451849936|gb|EMD63239.1| hypothetical protein COCSADRAFT_38106 [Cochliobolus sativus ND90Pr]
Length = 437
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 118/353 (33%), Positives = 169/353 (47%), Gaps = 72/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
MD+SGE + V H I K RL + ++G +I K L H H YCG
Sbjct: 89 MDVSGELQMGVTHGINKVRLSPE-------REGSKTIEI-KALDLHADEASHLAPDYCGE 140
Query: 60 CYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
C+GA + CCN C+EVR+AY W+ + ++QC+RE + + + E+ EGC
Sbjct: 141 CFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGCR 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
+ G + VNKV GNFH APGKSF +HVHD+ + +D + +HKI++L FG V
Sbjct: 201 LEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENYFKDEYAHTFTHKIHQLRFGPQLSDV 260
Query: 174 V---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
V NPLD + + + YFIKVV T Y
Sbjct: 261 VIQGIQDKHRGSGPGSWSNHHINPLDNTEQHTDEKAFNFMYFIKVVSTAYLPLGWEDAAP 320
Query: 204 ------DVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
++ G T I+++Q+SVT H R+ + G + +PGVFF
Sbjct: 321 RLTKHDELLGSTIDATHKGSIETHQYSVTSHKRNLKGGNDEKDGHKERVHARGGIPGVFF 380
Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FL +CA++GG TV+ +D +Y G IKK
Sbjct: 381 SYDISPMKVINREVREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNRIKK 433
>gi|444314203|ref|XP_004177759.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
gi|387510798|emb|CCH58240.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
Length = 406
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 111/322 (34%), Positives = 168/322 (52%), Gaps = 42/322 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD +GE LD+ F K RLDS+GN + + + + P ++ YCG
Sbjct: 88 MDDAGEIQLDILSSGFTKTRLDSRGNELGTFDFDLSKDISEYP--------PDDDKYCGP 139
Query: 60 CYGA--ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
CYGA +S+++D CC C +VR+AY GWA + I+QC+REG++QRI +
Sbjct: 140 CYGALDQSNNKDDMPMDEKVCCQTCADVRQAYLNAGWAFFDGKDIEQCEREGYVQRINDH 199
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGE 168
EGC I G +N++ GN HFAPG +F H HD + + + +H IN L+FG+
Sbjct: 200 LNEGCRIQGNARLNRIHGNVHFAPGLAFQNRRGHYHDTSLYDKKTELTFNHIINHLSFGK 259
Query: 169 HF-PGV--------VNPLDGVRWT-QETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV 217
H PG+ V+PLDG + + P + + YF K+VPT Y + I++ QFS
Sbjct: 260 HVKPGIGSKFSAASVSPLDGHQMILNDDPHNVQFIYFAKIVPTRYEYLDKDVIETAQFST 319
Query: 218 TEHFR----------SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
T H + + + R PG++ Y++SP+KV E+HV +++ F+ N +
Sbjct: 320 TTHSKALNNLADDKTTPKPSRRSGTPGLYINYEMSPLKVINREQHVQTWVSFILNCLTSI 379
Query: 267 GGVFTVSGIIDAFIYHGQRAIK 288
GGV V +ID Y QR I+
Sbjct: 380 GGVLAVGTVIDKIFYRAQRTIQ 401
>gi|401839164|gb|EJT42494.1| ERV46-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 415
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 166/337 (49%), Gaps = 59/337 (17%)
Query: 1 MDISGEQHLDVKHDIFK-KRLDSQGNVIESRQ------DGIG-APKIDKPLQRHGGRLEH 52
MD SGE LD+ F RLD +G + DG G AP D P
Sbjct: 88 MDDSGEMQLDILDAGFTMTRLDKEGRPVGDAAELQVGGDGDGVAPVNDDP---------- 137
Query: 53 NETYCGSCYGAES---------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
YCG CYGA +D+ CC +C+ VR AY GWA + I+QC+REG++
Sbjct: 138 --NYCGPCYGARDQTQNENLAQADKVCCQDCDAVRSAYLDAGWAFFDGKNIEQCEREGYV 195
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKIN 162
+I E EGC I G ++N++ GN HFAPG+ F + H HD+ +++ N +H IN
Sbjct: 196 SKINEHLHEGCRIEGSAQINRIQGNIHFAPGRPFQNANGHFHDVSLYEKTPDLNFNHMIN 255
Query: 163 KLAFGE--------------HFPGVV--NPLDGVRWTQE--TPSGMYQYFIKVVPTVYTD 204
L+FG+ H V+ +PLDG + E T S ++ YF K+VPT Y
Sbjct: 256 HLSFGKPIESRNKLLENDDRHGGAVIATSPLDGRKVFPERTTHSHLFSYFAKIVPTRYEY 315
Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-V 253
+ I++ QFS T H R GR Q +PG+F F+++SP+KV E+H
Sbjct: 316 LDDVVIETAQFSATYHSRPLRGGRDQDHPNTFHARGGIPGLFVFFEMSPLKVINKEQHGQ 375
Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
++ F+ N +GGV V ++D Y QR+I K
Sbjct: 376 TWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|291001965|ref|XP_002683549.1| predicted protein [Naegleria gruberi]
gi|284097178|gb|EFC50805.1| predicted protein [Naegleria gruberi]
Length = 391
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 103/302 (34%), Positives = 154/302 (50%), Gaps = 34/302 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK------IDKPLQRHGGRLE-HN 53
+D SG+ +DV H I K +DS G + + +PK + P ++ + H+
Sbjct: 91 VDASGDAAIDVAHHIHKVPVDSSGRITH-----LESPKHKTKLGTEMPQDKYDPTKDPHS 145
Query: 54 ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
YCG+CY E +CCN C++V E Y++ G + ++QC + + G
Sbjct: 146 IMYCGTCY-VEQRRGECCNTCQDVMEVYKRNGLPAPRVEDVEQCLFDA------SKNHPG 198
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGV----HVHDILAFQRDSFNISHKINKLAFGEH 169
CNIYG L+V KV GNFHF PG+SF Q H+H+ D +N +H I+ L+FG
Sbjct: 199 CNIYGTLDVQKVNGNFHFLPGRSFSQEYETRVHHIHEFNPILVDRYNSTHIIHSLSFGLR 258
Query: 170 FPGVVNPLDGV---------RWTQETPSGMYQYFIKVVPTVYTDVS--GHTIQSNQFSVT 218
P V PLD Q + +++YFIK VPT Y S TI + QFS T
Sbjct: 259 IPHVTYPLDETVGIIPKIEESDAQAPKTALFKYFIKAVPTTYIGSSYFSSTINTYQFSFT 318
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+H + ++ LPGVFF Y+ PI++T+ E + F HF+ ++ A+ G+F V IDA
Sbjct: 319 KHVMPFDSSKMMMLPGVFFVYNFEPIRITYEENGMPFTHFIVDLMAVCAGIFVVLNYIDA 378
Query: 279 FI 280
+
Sbjct: 379 LL 380
>gi|323454843|gb|EGB10712.1| hypothetical protein AURANDRAFT_2571, partial [Aureococcus
anophagefferens]
Length = 380
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 150/301 (49%), Gaps = 45/301 (14%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVI----------------ESRQDGIGAPKIDKPLQR 45
D SG+ V+ + K RLD+ G + + ++ + AP KP
Sbjct: 91 DESGQPLEGVQQHVIKTRLDTNGRRVLVNRKAANSVHKVGDTATSEEHLAAPDEAKP--- 147
Query: 46 HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
E CG CYGA+ + CC C++VR AYRK+GW + + QC E
Sbjct: 148 --------EVACGDCYGAQDDERPCCATCDDVRSAYRKRGWTF-HEHTVAQCAGELAEAA 198
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV-HVHDILAFQRDSFNISHKINKL 164
+ + EGC+I G LE+ V+GNFH APG+ SG+ D++ D FN+SH + +L
Sbjct: 199 LDLDSDEGCSIKGTLELPAVSGNFHVAPGRHLQTSGLFKGMDLVQLTFDKFNVSHTVKQL 258
Query: 165 AFG---------EHFPGVVNP-------LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
FG VV P LDG T GM+QY++KVVPTVY ++ G
Sbjct: 259 RFGPDERSLEPARASRKVVGPDVDLSSQLDGESRTLGDGYGMHQYYLKVVPTVYKNLGGK 318
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
T + Q+SVTEH R G + LPGVFFFY++SP+ F E +L LT + AIVGG
Sbjct: 319 TRELWQYSVTEHVRHVAPGSGKGLPGVFFFYEVSPLCAEFVERRNGWLALLTGLAAIVGG 378
Query: 269 V 269
V
Sbjct: 379 V 379
>gi|315044047|ref|XP_003171399.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
gi|311343742|gb|EFR02945.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma gypseum CBS 118893]
Length = 435
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 122/351 (34%), Positives = 163/351 (46%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALALHK-KEDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYG + + CCN CEEVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GDCYGVPAPSNAKKPGCCNTCEEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + +SH I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMSHTIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSDHKTDEARYNFMYFVKVVSTSYLPLGWDPTWSSEVHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
+ +I+++Q+SVT H RS S +G + +P V F Y
Sbjct: 321 HKDIPLGNHGVYFGTQGSIETHQYSVTSHQRSLDAEDASAEGHKERQHTRGGIPSVIFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 EISPMKVINREARPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGGLRVKK 431
>gi|452001785|gb|EMD94244.1| hypothetical protein COCHEDRAFT_1202021 [Cochliobolus
heterostrophus C5]
Length = 437
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 118/354 (33%), Positives = 168/354 (47%), Gaps = 74/354 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
MD+SGE + V H I K RL + G+ I+ K L H H YCG
Sbjct: 89 MDVSGELQMGVTHGINKVRLGPEKE---------GSKTIEIKALDLHADEASHLAPDYCG 139
Query: 59 SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
C+GA + CCN C+EVR+AY W+ + ++QC+RE + + + E+ EGC
Sbjct: 140 ECFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGC 199
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPG 172
+ G + VNKV GNFH APGKSF +HVHD+ + +D + +HKI++L FG
Sbjct: 200 RLEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENYFKDEYAHTFTHKIHQLRFGPQLSD 259
Query: 173 VV---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT-------- 203
VV NPLD + + + YFIKVV T Y
Sbjct: 260 VVIQGIQDKHKGSGPGSWSNHHINPLDNTEQHTDEKAFNFMYFIKVVSTAYLPLGWEDAA 319
Query: 204 -------DVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVF 236
++ G T I+++Q+SVT H R+ + G + +PGVF
Sbjct: 320 PRLTKHDELLGSTIDASHKGSIETHQYSVTSHKRNLKGGNDEKDGHKERIHARGGIPGVF 379
Query: 237 FFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
F YD+SP+KV E +F FL +CA++GG TV+ +D +Y G IKK
Sbjct: 380 FSYDISPMKVINREVREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNRIKK 433
>gi|326476034|gb|EGE00044.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
gi|326481270|gb|EGE05280.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Trichophyton equinum CBS 127.97]
Length = 435
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 122/351 (34%), Positives = 159/351 (45%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALALHK-KEDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYG + + CCN C+EVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + +SH I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMSHIIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEARYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRS------------SEQGRLQTLPGVFFFY 239
S +I+++Q+SVT H RS Q +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
D+SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 DISPMKVINRESRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431
>gi|366996541|ref|XP_003678033.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
gi|342303904|emb|CCC71687.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
Length = 409
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 113/329 (34%), Positives = 170/329 (51%), Gaps = 58/329 (17%)
Query: 4 SGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
SGE LD+ + K R+DS GN ++S + + + P Q ++ YCGSCY
Sbjct: 93 SGELQLDLLQEGSFTKTRVDSNGNALDSMKFKLDDEVGEYPPQ--------DDNYCGSCY 144
Query: 62 GA-ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
GA + S+ D CC +CE+VR AY GWA + I+QC+REG++ RI E
Sbjct: 145 GALDQSNNDNLPKDEKVCCQDCEQVRNAYLTAGWAFFDGKKIEQCEREGYVARINSHLNE 204
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
GC + G + +N++ GN HFAPG++F + H HD +++ S N +H IN L+FG+
Sbjct: 205 GCRVKGDVLLNRIHGNIHFAPGRAFQNTKGHFHDTSLYEQTLSLNFNHIINHLSFGKSVE 264
Query: 172 GV---------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT-- 218
+ +PLDG + + S +Y+ YF K+VPT Y + G ++ QFS T
Sbjct: 265 QLAEVRGASVSTSPLDGQQVSPSFDSHLYRYSYFTKIVPTRYEWLDGVVAETAQFSATFH 324
Query: 219 ------------EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS-----FLHFLTN 261
H R S G LPGVF ++++SP+KV E+H FLH +T+
Sbjct: 325 ESPVNGAMDPEHPHIRHSRTG----LPGVFIYFEMSPLKVINQEQHFKSWSGVFLHGITS 380
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
+GG+ V ++D Y QR I+K+
Sbjct: 381 ----MGGILAVGTVLDKIFYRAQRTIQKR 405
>gi|119596606|gb|EAW76200.1| ERGIC and golgi 3, isoform CRA_b [Homo sapiens]
Length = 239
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 85/154 (55%), Positives = 112/154 (72%), Gaps = 3/154 (1%)
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
+HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 85 IHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 144
Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT
Sbjct: 145 KVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 203
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 204 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 237
>gi|224011116|ref|XP_002294515.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220970010|gb|EED88349.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 454
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 108/301 (35%), Positives = 157/301 (52%), Gaps = 24/301 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP-KIDKPLQRHGGRLEH-NETYCG 58
+D++G+ LD+ +FK RL+ G + + A K D+ ++ + YCG
Sbjct: 158 IDVAGDSQLDLSDTLFKHRLNLDGTLRSKAKIATEANIKADEDKKKQEALSKDIPADYCG 217
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQR--IKEEEGEGCN 115
CYGA+ + DCCN C++V E Y+KK W + L +QC REG + + GEGCN
Sbjct: 218 PCYGADEKEGDCCNTCDDVMERYKKKRWNENAVQPLAEQCIREGKGKNEPKRMSNGEGCN 277
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH------ 169
+ G VN+VAGNFH A G+ + G H+H L R +FN SH +++L F +
Sbjct: 278 LSGHFTVNRVAGNFHIAMGEGVDRDGRHIHQFLPEDRMNFNASHVVHELIFMDEEYGDMV 337
Query: 170 ---FPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
PG +N + V +G++QYFIKVVPT Y SG T+ EH +
Sbjct: 338 IAGVPGETSMNSVSKVVTEDTGTTGLFQYFIKVVPTKYKGKSGGTLHEK----VEHHDTQ 393
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
LPGVFF Y++ P V T+ V F+H L + A VGGVFT+ G ID+ +Y +
Sbjct: 394 N----AVLPGVFFVYEIYPFAVEVTKNKVPFMHLLIRIMATVGGVFTIMGWIDSALYSRE 449
Query: 285 R 285
+
Sbjct: 450 K 450
>gi|67479189|ref|XP_654976.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56472072|gb|EAL49587.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
Length = 361
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 100/296 (33%), Positives = 165/296 (55%), Gaps = 26/296 (8%)
Query: 4 SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
SGE + ++ ++ K R+ G+++ + K +Q + C SCYGA
Sbjct: 86 SGESMIGIEQNVTKIRIHHDGSLVTENEM--------KAIQSKLSIETPDPKECRSCYGA 137
Query: 64 ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
E+ ++ CC C++V+EAY+K+GW L + +++ QC+ +Q K + EGC + G +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLN 196
Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
K+ GNFH APG S G H H++ + ++SHK N+L+FGE + ++T
Sbjct: 197 KIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGE---------NSKKFT 247
Query: 184 QETP----SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
E + M+QY++ ++P ++G T +S+ E+ RS E G Q PGVF +Y
Sbjct: 248 TEKKDTQMNSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRSGE-GEGQ--PGVFIYY 303
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
D+SP+ + TE + FLHFL +C+IVGG+FT + DA ++ +KKK+E+GK
Sbjct: 304 DVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 359
>gi|148674214|gb|EDL06161.1| ERGIC and golgi 3, isoform CRA_a [Mus musculus]
Length = 238
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 92/189 (48%), Positives = 123/189 (65%), Gaps = 14/189 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FKKRLD G + S + K++ + L+ N C SC
Sbjct: 61 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 117
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAES D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 118 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 177
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
EVNKV G G Q VHD+ +F D+ N++H I L+FGE +PG+VNPLD
Sbjct: 178 EVNKVPG------GSKARQL---VHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 228
Query: 181 RWTQETPSG 189
T P G
Sbjct: 229 NVT--APQG 235
>gi|296811622|ref|XP_002846149.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
gi|238843537|gb|EEQ33199.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
[Arthroderma otae CBS 113480]
Length = 435
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALDLHK-KDDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G+CYG + + CCN C EVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GNCYGVPAPSTAKKPGCCNTCAEVRDAYAEKNWAFGRGEGVTQCMDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + ++H I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHRTDEVRYNFLYFVKVVSTSYLPLGWDATWSSEVHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFY 239
S +I+++Q+SVT H RS + G +P V F Y
Sbjct: 321 HKDIPLGNHGVYFGSQGSIETHQYSVTSHKRSLDGGDDSAEGHKERQYARGGIPSVMFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 EISPMKVINRETRPKSLSTFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431
>gi|440301578|gb|ELP93964.1| endoplasmic reticulum-golgi intermediate compartment protein,
putative [Entamoeba invadens IP1]
Length = 363
Score = 176 bits (447), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 107/297 (36%), Positives = 156/297 (52%), Gaps = 26/297 (8%)
Query: 4 SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN-----ETYCG 58
SGE +D++ +I K RL+ G P + L+ +L N + C
Sbjct: 86 SGESMIDIEKNITKTRLNKNG-----------VPLTESELKATQQKLNANIKTVDQKTCR 134
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCYGAE+ CC C++V EAY+++GW L N I QC L+ K EGC + G
Sbjct: 135 SCYGAETPSRKCCYTCDDVIEAYKERGWNL-NIRTIAQCDNSEKLEMAKLTLEEGCRVEG 193
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
L +NK+ GNFH APG S + H H+I R +++H N L+FGE
Sbjct: 194 NLLLNKIGGNFHIAPGTSDNTWTGHHHNIEWTGRTKIDLTHTWNDLSFGEGSKTYSGSKK 253
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
+ +GM+QYF+ ++P ++G + F + E RS G+ + PGVF +
Sbjct: 254 DAKM-----NGMFQYFLTLIPKKNNFINGTKFVYD-FVINEQTRS---GQGEGEPGVFVY 304
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
YD+SP+ + E + FLHFL VCAI+GGVFTV +IDAF++ ++KKIE+GK
Sbjct: 305 YDVSPMLLEVNEFNHGFLHFLIGVCAIIGGVFTVFQLIDAFVFDSIHTLQKKIELGK 361
>gi|354544621|emb|CCE41346.1| hypothetical protein CPAR2_303350 [Candida parapsilosis]
Length = 412
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 170/322 (52%), Gaps = 30/322 (9%)
Query: 2 DISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETY-- 56
D SG+ LD+ + +K R+ QG+ + + P + + PL++ L +T
Sbjct: 91 DESGDLKLDIINSQLEKFRIIKQGHSSKPVEIKDEQPALQREVPLEQIAPGLPEGQTEGE 150
Query: 57 CGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
CGSCYGA D+ CCN C VR AY + W + + I QC++EG++QR+K+ E E
Sbjct: 151 CGSCYGAVPQDKKQYCCNTCAAVRRAYAEANWQFFDGENIAQCEQEGYVQRLKQRIGENE 210
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF 170
GC + G ++N+++G FAPG S + G HVHD+ +Q +D FN H IN L+FG +
Sbjct: 211 GCRVKGTAKINRISGTMDFAPGASMTKDGRHVHDLSLYQKYKDKFNFDHVINHLSFGNNP 270
Query: 171 P-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-HTIQSNQFSVTEHFR 222
P G + PLDG ++ Q YF+K+V T + + G H +NQFSV H R
Sbjct: 271 PASKLVDTGSITPLDGHKFLQHKKYHSINYFLKIVATRFESLDGKHKFDTNQFSVITHDR 330
Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFT 271
G+ + +PGV F +D+SP+K+ EE+ F+ V + + GV
Sbjct: 331 PLAGGKDEDHQHTLHARGGVPGVAFNFDISPLKIINREEYAKTRSGFILGVVSSIAGVLM 390
Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
V ++D ++ Q+AIK K ++
Sbjct: 391 VGSLMDRSVFAAQQAIKGKKDL 412
>gi|344230638|gb|EGV62523.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
Length = 409
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 172/326 (52%), Gaps = 37/326 (11%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNE---- 54
+D+SG LD+ + F+K R+ S G + + AP ID PL+ L+ E
Sbjct: 88 LDVSGNVELDILQNGFQKYRILSSGEEVLMKN----APLIDSTPLEVMAKGLDKPEDAEH 143
Query: 55 TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--E 110
T CG CYG+ D CCNNCE +R AY K WA + + I C+ EG+++ I+ E
Sbjct: 144 TPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDGENIKPCEDEGYVKAIQSEIFN 203
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE 168
EGC + G ++N+++GN HFAPG SF + HVHD+ + + D FN H IN L+FG+
Sbjct: 204 NEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSLYNKFPDRFNFDHTINHLSFGK 263
Query: 169 HFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---GHTIQSNQFSVT 218
N PLDG + +Y YF+KVV T Y + +++NQFS
Sbjct: 264 DPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVSTRYEYLQEKLKAPLETNQFSAI 323
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R + G+ + LPG++F++D+SP+K+ E++ ++ F+ V + +
Sbjct: 324 YHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKIINKEQYSKTWSGFVLGVISSIA 383
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV + ++D ++ ++AI+ K +I
Sbjct: 384 GVLMIGSLLDRSVWAAEKAIRAKKDI 409
>gi|302666755|ref|XP_003024974.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
gi|291189052|gb|EFE44363.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
Length = 435
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/351 (34%), Positives = 159/351 (45%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALALHK-KEDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYG + + CCN C+EVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + ++H I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRS------------SEQGRLQTLPGVFFFY 239
S +I+++Q+SVT H RS Q +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHSRGGIPSVMFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431
>gi|344230637|gb|EGV62522.1| hypothetical protein CANTEDRAFT_131007 [Candida tenuis ATCC 10573]
Length = 410
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 172/326 (52%), Gaps = 37/326 (11%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNE---- 54
+D+SG LD+ + F+K R+ S G + + AP ID PL+ L+ E
Sbjct: 89 LDVSGNVELDILQNGFQKYRILSSGEEVLMKN----APLIDSTPLEVMAKGLDKPEDAEH 144
Query: 55 TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--E 110
T CG CYG+ D CCNNCE +R AY K WA + + I C+ EG+++ I+ E
Sbjct: 145 TPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDGENIKPCEDEGYVKAIQSEIFN 204
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE 168
EGC + G ++N+++GN HFAPG SF + HVHD+ + + D FN H IN L+FG+
Sbjct: 205 NEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSLYNKFPDRFNFDHTINHLSFGK 264
Query: 169 HFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---GHTIQSNQFSVT 218
N PLDG + +Y YF+KVV T Y + +++NQFS
Sbjct: 265 DPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVSTRYEYLQEKLKAPLETNQFSAI 324
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R + G+ + LPG++F++D+SP+K+ E++ ++ F+ V + +
Sbjct: 325 YHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKIINKEQYSKTWSGFVLGVISSIA 384
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV + ++D ++ ++AI+ K +I
Sbjct: 385 GVLMIGSLLDRSVWAAEKAIRAKKDI 410
>gi|302511557|ref|XP_003017730.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
gi|291181301|gb|EFE37085.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
Length = 435
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALALHK-KEDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYG + + CCN C+EVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + ++H I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
S +I+++Q+SVT H RS S G + +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431
>gi|116181584|ref|XP_001220641.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
gi|88185717|gb|EAQ93185.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
Length = 438
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 168/353 (47%), Gaps = 73/353 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MDISGEQ V+H + K RL Q G I+++ + A R + +YC
Sbjct: 89 MDISGEQQHGVQHGVTKTRLRPQSEGGGDIDTKAVALHA--------RDEVATHLDPSYC 140
Query: 58 GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA+ + CCN CEEV++AY + WA + I+QC+RE + +++ E+ EG
Sbjct: 141 GPCYGAQPPPNAKKPGCCNTCEEVKDAYAQAAWAFGRGEGIEQCEREHYSEKLDEQRNEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFP 171
C I G L VNKV GNFH APG+SF +HVHD+ + SH+I+ L FG P
Sbjct: 201 CRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLKNYWDTPTKHTFSHQIHHLRFGPQLP 260
Query: 172 G-----------------VVNPLDGV------------------------RWT-QETPSG 189
NPLD RW ++T +G
Sbjct: 261 DNLHKKLDARKNMRGRSTTFNPLDDTPPGDGTTSTTTTCTSSRSCPHRTCRWAGRKTWAG 320
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFF 237
+ + + G +++++Q+SVT H RS Q RL +PGVFF
Sbjct: 321 FREEHHAELGSFGASADG-SVETHQYSVTSHKRSLAGGDDSAEGHQERLHARGGIPGVFF 379
Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV EE SFL F+ +CAIVGG TV+ ID ++ G +KK
Sbjct: 380 SYDISPMKVINREEKAKSFLGFIAGLCAIVGGTLTVAAAIDRALFEGGVRLKK 432
>gi|255712984|ref|XP_002552774.1| KLTH0D01144p [Lachancea thermotolerans]
gi|238934154|emb|CAR22336.1| KLTH0D01144p [Lachancea thermotolerans CBS 6340]
Length = 402
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/320 (34%), Positives = 168/320 (52%), Gaps = 38/320 (11%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD +GE L+V + + K RLD G V++++Q G +D + +E YCG
Sbjct: 88 MDSAGEMQLEVLNKGWSKTRLDPSGQVLDTKQFKPGKDVVDYAPE--------DENYCGP 139
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYGA ++ CC C++VREAY +K WA + I+QC+REG+++++ E
Sbjct: 140 CYGARDQSKNDEVNVDERVCCQTCDDVREAYAEKQWAFFDGKNIEQCEREGYVEQVNEHI 199
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
EGC I G ++N++ GN HFAPGK FH H HD +Q S N +H I+ L+FG+
Sbjct: 200 EEGCRIKGMAKLNRIGGNLHFAPGKGFHNIRGHFHDASLYQNSPSLNFNHIIHHLSFGKE 259
Query: 170 FPGVVN------PLDGVRWTQE--TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
+ PLDG + E T + YF K+VPT Y +SG T+++ QF+ T H
Sbjct: 260 VEDITGQGASTAPLDGTNVSPEFDTHKHQFSYFAKIVPTRYEYLSGETVETTQFTTTYHS 319
Query: 222 RSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 270
R + GR P V+F++++SP+KV +++ S+ F N +GGV
Sbjct: 320 RPLKGGRDSDHPTTLHSQGGFPSVYFYFEMSPLKVINKQQYAQSWSGFWLNCITSIGGVL 379
Query: 271 TVSGIIDAFIYHGQRAIKKK 290
V ++D Y QR++ K
Sbjct: 380 AVGTVLDKITYKAQRSMWGK 399
>gi|169731514|gb|ACA64886.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
(predicted) [Callicebus moloch]
Length = 237
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/153 (55%), Positives = 111/153 (72%), Gaps = 3/153 (1%)
Query: 145 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 204
HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY
Sbjct: 84 HDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMK 143
Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF HFLT V
Sbjct: 144 VDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 202
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
CAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 203 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 235
>gi|365982867|ref|XP_003668267.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
gi|343767033|emb|CCD23024.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
Length = 410
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 168/326 (51%), Gaps = 51/326 (15%)
Query: 1 MDISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D +G+ LD+ + K RLD GNVIE + KID + ++E YCG
Sbjct: 90 LDDAGDLQLDILNQGQFTKTRLDRMGNVIE-----VSKFKIDDDVAEFP---PNDENYCG 141
Query: 59 SCYGA----------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
CYG+ D+ CC CE+VREAY K GWA + I+QC+REG++ +I +
Sbjct: 142 PCYGSIDQSGNDKIESVKDKICCQTCEQVREAYLKAGWAFFDGKNIEQCEREGYVTKINK 201
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFG 167
EGC + G + +N++ GN HFAPGK+F H HD ++ N +H I+ L+FG
Sbjct: 202 HLNEGCRVKGNVLLNRIQGNIHFAPGKAFQNVKGHFHDSSLYETSPDLNFNHIIHHLSFG 261
Query: 168 EHFPGV---------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFS 216
+ + +PLDG + + S +Y+ YF+K+VPT Y + ++ QFS
Sbjct: 262 KTIEQLAQLRGATVATSPLDGQQISPSFDSHLYRYSYFVKIVPTRYEYLDKMISETAQFS 321
Query: 217 VTEHF------RSSEQGRLQT----LPGVFFFYDLSPIKVTFTEEHVS-----FLHFLTN 261
T H R E ++ LPG+F ++++SP+K+ TE+H FLH +T+
Sbjct: 322 ATFHQSLVTGERDPENPNIKYSRTGLPGLFIYFEMSPLKIINTEQHFKSWSGVFLHCITS 381
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAI 287
+GG+ V I+D F Y QR +
Sbjct: 382 ----IGGILAVGTILDKFFYKAQRTV 403
>gi|406866287|gb|EKD19327.1| copii-coated vesicle membrane protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 453
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 168/368 (45%), Gaps = 88/368 (23%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG-RLEH-NETYC 57
MD+SGEQ V H + K RL + G +I + L HG + H + YC
Sbjct: 89 MDVSGEQQTGVMHGVKKVRLGPEAE---------GGKEISIESLDLHGDDQATHLDPDYC 139
Query: 58 GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYGA + CCN CEEVREAY WA + ++QC+RE + +++ + EG
Sbjct: 140 GGCYGATAPPNAKKAGCCNTCEEVREAYASVSWAFGRGENVEQCEREHYGEKLDAQRKEG 199
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEH 169
C I G + VNKV GNFH APG+SF +HVHD+ + +H I+ L FG
Sbjct: 200 CRIEGGIRVNKVVGNFHIAPGRSFSNGNMHVHDLNNYFDTPVPGGHVFTHHIHSLRFGPQ 259
Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
P V NPLD R + + YF+KVVPT Y +
Sbjct: 260 LPESVTKKLGNKALPWTNHHINPLDDTRQVAPETAYNFMYFVKVVPTSYLPLGWDNSVTS 319
Query: 207 ------------GH----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
GH +++++QFSVT H RS G +L + +PGVFF
Sbjct: 320 EQRIDHVDIGSYGHLDDGSVETHQFSVTSHKRSLSGGDDGAEGHKEKLHSRGGIPGVFFS 379
Query: 239 Y----------------DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
Y D+SP+KV EE S FLT +CAI+GG TV+ +D +Y
Sbjct: 380 YVSSHFYPQKISTNKTQDISPMKVINREERAKSLAGFLTGLCAIIGGTLTVAAAVDRGVY 439
Query: 282 HGQRAIKK 289
G +KK
Sbjct: 440 EGTTRLKK 447
>gi|327296796|ref|XP_003233092.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326464398|gb|EGD89851.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 435
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD+SGE DV H + K RL S G VI+ + K D P L+ N YC
Sbjct: 89 MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALSLHK-KEDSP-----AHLDPN--YC 140
Query: 58 GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
G CYG + + CCN C+EVR+AY +K WA + + QC EG+ QRI E+ EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
C I G L VNKVAGNFH APG+S H HD+ + ++H I+KL FG P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260
Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
+NPLD + YF+KVV T Y +
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320
Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
S +I+++Q+SVT H RS S G + +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380
Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
++SP+KV E S F T VCA++GG TV+ +D +Y G +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431
>gi|190347075|gb|EDK39286.2| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 169/324 (52%), Gaps = 44/324 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN------ 53
+D+SG+ +D+ F+K RL G+ I + P+ G LE
Sbjct: 88 LDVSGDLQVDLLSSGFEKFRLLKDGSEIRD----------ESPVMSSAGELEERARGRAP 137
Query: 54 ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
+ CGSCYGA DE+ CCN+CE VR AY +K W + + I+QC+REG++ R+ E+
Sbjct: 138 DGSCGSCYGALPQDENSDYCCNDCETVRLAYAQKAWGFFDGENIEQCEREGYVARLNEKI 197
Query: 111 G--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF 166
EGC I G ++N+++GN HFAPG SF G H HD+ F + D F H IN L+F
Sbjct: 198 NNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDLSLFNKYDDKFTFDHVINHLSF 257
Query: 167 GEHFPGV-------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSV 217
G + +PLD ++ +Y Y++KVV T + ++ +T +++NQFSV
Sbjct: 258 GSDPHNIQFFEKQSTHPLDKSSMILKSKDRLYSYYLKVVATRFEFLTPNTPALETNQFSV 317
Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
H R G+ LPGVFF +++SP+K+ E++ ++ F+ V + +
Sbjct: 318 ISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEISPMKIINKEQYAKTWSGFVLGVISSI 377
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GV V ++D ++ +R I+ K
Sbjct: 378 AGVLMVGALLDRSVWAAERVIRAK 401
>gi|449705731|gb|EMD45722.1| endoplasmic reticulumgolgi intermediate compartment protein,
putative [Entamoeba histolytica KU27]
Length = 272
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 90/239 (37%), Positives = 142/239 (59%), Gaps = 10/239 (4%)
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE+ ++ CC C++V+EAY+K+GW L + +++ QC+ +Q K + EGC +
Sbjct: 42 CRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRL 100
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
G +NK+ GNFH APG S G H H++ + ++SHK N+L+FGE+
Sbjct: 101 IGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGENSKKFTTE 160
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
+ + M+QY++ ++P ++G T +S+ E+ RS E G Q PGVF
Sbjct: 161 KKDTQM-----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRSGE-GEGQ--PGVF 211
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+YD+SP+ + TE + FLHFL +C+IVGG+FT + DA ++ +KKK+E+GK
Sbjct: 212 IYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 270
>gi|443925078|gb|ELU44001.1| ER-derived vesicles protein ERV46 [Rhizoctonia solani AG-1 IA]
Length = 383
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 106/298 (35%), Positives = 155/298 (52%), Gaps = 43/298 (14%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGE D+ H++ K RLDS G +I QDG ++D +++ + YCGSCY
Sbjct: 91 DISGEIQQDLTHNMVKTRLDSNGQII---QDGFHNNELDNDVEK--TMKARPQGYCGSCY 145
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
G E + CC CE VR+AY +GW+ +PD I+QC E + +I E+ EGC+I G +
Sbjct: 146 GGEPPEGGCCQTCESVRQAYMNRGWSFGDPDAIEQCVAEHWTAKIHEQNSEGCHISGRVR 205
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF-GE-----HFPGV 173
VNKV GNFHF+PG+SF + H D++ + +D + H +++ F GE + G
Sbjct: 206 VNKVTGNFHFSPGRSFVLNRGHFQDLVPYLKDGNHHDFGHYVHEFRFEGESEAEDEWRGT 265
Query: 174 -------------VNPLDGVRW---TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
NPLD V + M+QYF+KVV T + + G I+S+Q+SV
Sbjct: 266 DRGTRWRKKVGISANPLDQVSAHVVDDRASNYMFQYFMKVVSTEFKYLDGDIIRSHQYSV 325
Query: 218 TEHFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
T + R G +Q LPG FF +++SP+ V E +F HF T+
Sbjct: 326 TSYERDLTHGDGAERDSHGTLTAHGVQGLPGAFFNFEISPMMVVHRETRQTFAHFATS 383
>gi|19113757|ref|NP_592845.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|1351651|sp|Q09895.1|YAI8_SCHPO RecName: Full=Uncharacterized protein C24B11.08c
gi|1061296|emb|CAA91773.1| COPII-coated vesicle component Erv46 (predicted)
[Schizosaccharomyces pombe]
Length = 390
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 113/312 (36%), Positives = 162/312 (51%), Gaps = 35/312 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SGE D+ H + K RL G +I IG + + G CG C
Sbjct: 89 LDVSGEFQRDIHHTVSKTRLSPSGEIISVDDLDIGN---QQSISDDGA------AECGDC 139
Query: 61 YGAES-SDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
YGA + ED CCN C+ VR+AY K W + + D QCK E F + + ++ EGCN+
Sbjct: 140 YGAADFAPEDTPGCCNTCDAVRDAYGKAHWRIGDVDAFKQCKDENFKELYEAQKVEGCNL 199
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFPGVV 174
G L VN++AGNFH APG+S HVHD + + D ++SH I+ L+FG V
Sbjct: 200 AGQLSVNRMAGNFHIAPGRSTQNGNQHVHDTRDYINELDLHDMSHSIHHLSFGPPLDASV 259
Query: 175 ---NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTEHFRSSEQGRL 229
NPLDG T Y+YFIK V + +S T I +N+++VT+H RS GR
Sbjct: 260 HYSNPLDGTVKKVSTADYRYEYFIKCVSYQFMPLSKSTLPIDTNKYAVTQHERSIRGGRE 319
Query: 230 QT----------LPGVFFFYDLSPIKVTFTEEHV---SFLHFLTNVCAIVGGVFTVSGII 276
+ +PGV+F +D+SP++V E V +F FL+NV A++GG T++ +
Sbjct: 320 EKVPTHVNFHGGIPGVWFQFDISPMRV--IERQVRGNTFGGFLSNVLALLGGCVTLASFV 377
Query: 277 DAFIYHGQRAIK 288
D Y Q+ K
Sbjct: 378 DRGYYEVQKLKK 389
>gi|367007030|ref|XP_003688245.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
gi|357526553|emb|CCE65811.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
Length = 407
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 162/320 (50%), Gaps = 43/320 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYC 57
MD +G LD+ FKK RLD G +E R+ L+ + R+ E YC
Sbjct: 90 MDDAGGLQLDILDSGFKKTRLDPNGKQLEFRE---------FDLKDNSKRIVSEKGPNYC 140
Query: 58 GSCYGA--------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
GSCYGA E + + CCN CE+VR AY WA + I+QC+ EG+++RI E
Sbjct: 141 GSCYGAIDQSHNDEEGAKKVCCNTCEDVRLAYVTANWAFFDGKNIEQCEDEGYVKRINEH 200
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGE 168
EGC + G ++N+V GN HFAPGK S H+HD +++ + N H I+ +FGE
Sbjct: 201 LNEGCRVTGKAKINRVKGNIHFAPGKPMQNSKGHLHDTSLYEKSPNMNFKHIIHHFSFGE 260
Query: 169 HFPG---------VVNPLD--GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
+ NPLD V+ +T + Y++KVVPT Y ++ +++ QFSV
Sbjct: 261 PIDRKAKSKGADVLTNPLDDYDVQPNIDTHYHQFSYYMKVVPTRYEYLNRMVVETAQFSV 320
Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
T H R G+ + +PGVFFF+D+S IKV E+ ++ F+ N +
Sbjct: 321 TFHDRPLRGGKDEDHPNTIHARNGIPGVFFFFDISSIKVINNEQITQTWSGFILNCIITI 380
Query: 267 GGVFTVSGIIDAFIYHGQRA 286
GGV V ++D Y Q+
Sbjct: 381 GGVLAVGSMVDRLSYKAQKT 400
>gi|344301666|gb|EGW31971.1| hypothetical protein SPAPADRAFT_50577 [Spathaspora passalidarum
NRRL Y-27907]
Length = 410
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 103/320 (32%), Positives = 173/320 (54%), Gaps = 27/320 (8%)
Query: 1 MDISGEQHLDVKHDIFKKR--LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH--NETY 56
+D +G+ L++ + F+K + +GN++ D A +D+PL L +
Sbjct: 91 LDETGDMQLNIINAGFQKLRLIKDKGNIVREISDDTPALNLDRPLSEVVKGLPEGGDPKT 150
Query: 57 CGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
CGSCYGA + + CCN+C V+ AY ++ W+ + + I+QC++EG+++R+++ + E
Sbjct: 151 CGSCYGALPQEKHQYCCNDCYSVKRAYAERRWSFFDGENIEQCEKEGYVKRLRQRINDNE 210
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG--- 167
GC I G ++N+V+G FAPG SF G HVHD+ + + D FN H IN L+FG
Sbjct: 211 GCRIKGSAKINRVSGTMDFAPGASFTSDGRHVHDVSLYGKYQDKFNFDHIINHLSFGSND 270
Query: 168 --EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSS 224
E V+PLDG ++ + Y++KVV T + + + +NQFSV H R
Sbjct: 271 AREEILNSVHPLDGYQFMLHKKHHVASYYLKVVATRFESLDQSKRLDTNQFSVITHDRPL 330
Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
G+ + +PGV F +D+SP+K+ E++ ++ F+ V + + GV V
Sbjct: 331 TGGKDEDHEHTLHARGGIPGVEFHFDISPLKIINKEQYAKTWSGFVLGVISSIAGVLMVG 390
Query: 274 GIIDAFIYHGQRAIKKKIEI 293
+ID +Y Q+AI+ K +I
Sbjct: 391 TLIDRSVYATQQAIRGKKDI 410
>gi|401626934|gb|EJS44847.1| erv46p [Saccharomyces arboricola H-6]
Length = 415
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 164/337 (48%), Gaps = 59/337 (17%)
Query: 1 MDISGEQHLDVKHDIFK-KRLDSQGNVIESRQ------DGIGAPKIDKPLQRHGGRLEHN 53
MD SGE LD+ F R+D G+ + +G GA D P
Sbjct: 88 MDDSGELQLDILDAGFTMTRVDKDGHPVGDATELHVGGNGEGATPNDDP----------- 136
Query: 54 ETYCGSCYGAESS---------DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 104
YCG CYGA D+ CC NC+ VR AY KGWA + I+QC++EG++
Sbjct: 137 -NYCGQCYGARDQSNNENLAQEDKVCCQNCDSVRSAYLDKGWAFFDGKDIEQCEKEGYVN 195
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRD-SFNISHKIN 162
+I + EGC I G ++N++ GN HFAPGK F + G H HD + + N +H IN
Sbjct: 196 KINDHLHEGCRIEGSAQINRIQGNIHFAPGKPFQDTRGNHRHDTSLYDKTPDLNFNHIIN 255
Query: 163 KLAFGE--------------HFPGVV--NPLDGVRWTQETPSGMYQ--YFIKVVPTVYTD 204
+L+FG+ H VV +PLDG + + P+ +Q YF K+VPT Y
Sbjct: 256 RLSFGKPIQSHHKRLGNDKLHGGAVVSTSPLDGRQVFPDRPTHFHQFSYFAKIVPTRYEY 315
Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLP----------GVFFFYDLSPIKVTFTEEH-V 253
+ I++ QFS T H R GR Q P G++ F+++SP+KV E+H
Sbjct: 316 LDSTVIETAQFSATYHSRPLGGGRDQDHPNTFHARGGISGLYVFFEMSPLKVINKEQHGQ 375
Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
++ F+ N +GGV V ++D Y QR+I K
Sbjct: 376 TWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|148674216|gb|EDL06163.1| ERGIC and golgi 3, isoform CRA_c [Mus musculus]
Length = 261
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 85/160 (53%), Positives = 112/160 (70%), Gaps = 9/160 (5%)
Query: 144 VHDILAFQRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 197
+HD+ +F D+ N++H I L+FGE +PG+VNPLD T S M+QYF+KV
Sbjct: 101 IHDLQSFGLDNPSDCLQINMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKV 160
Query: 198 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 255
VPTVY V G +++NQFSVT H + + G L Q LPGVF Y+LSP+ V TE+H SF
Sbjct: 161 VPTVYMKVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 219
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
HFLT VCAI+GG+FTV+G+ID+ IYH RAI+KKI++GK
Sbjct: 220 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 259
>gi|340055752|emb|CCC50073.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 404
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 106/315 (33%), Positives = 158/315 (50%), Gaps = 28/315 (8%)
Query: 5 GEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE 64
GE I K R+ +Q S P+ D+ + + + C SCYGAE
Sbjct: 94 GEYMTGAVRSITKVRVPTQDPAPVSE----ALPQSDRSVSTAALPVSNKMGGCVSCYGAE 149
Query: 65 SSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKE-EEGEGCNIYGFLEV 122
S DCCN+C++V A+R+ GW + D+ + QC EG L + EGCNI+ V
Sbjct: 150 ESPGDCCNSCDDVHAAFRRNGWEIDENDIKLSQCT-EGQLHNVGPVSPSEGCNIHSKFSV 208
Query: 123 NKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD---- 178
K+ GN HF PG+ + G ++ + N+SH + L FGE FPG VNPL+
Sbjct: 209 RKIKGNIHFVPGRRLNHRGQPMYVVRREAIKKMNLSHVFHSLEFGERFPGQVNPLNGIAN 268
Query: 179 --GVRWTQETPSGMYQYFIKVVPTVYTDV----SGHTIQSNQFSVTEHFRSSEQGRLQTL 232
GVR E SG + Y+++V+PT Y V S +++NQ+SV +HF S +
Sbjct: 269 ARGVRNASEVVSGRFSYYVQVLPTEYQFVPALGSRVRLETNQYSVKQHFTESWYTTDRRY 328
Query: 233 P---------GVFFFYDLSPIK--VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
P GVF YD+SP+K V T + S +H L +CA+ GG FTV+ +ID+ +
Sbjct: 329 PGWSDPTLVAGVFIVYDVSPVKTLVMRTSPYPSLIHLLLRMCAVGGGAFTVASMIDSLLL 388
Query: 282 HGQRAIKKKIEIGKF 296
+ ++K+ K+
Sbjct: 389 NILGHFRRKMRETKY 403
>gi|255732259|ref|XP_002551053.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
gi|240131339|gb|EER30899.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
Length = 414
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 172/326 (52%), Gaps = 33/326 (10%)
Query: 1 MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
+D++G+Q LD+ KK RL + QG+VI + +D A D L+ L
Sbjct: 89 LDVTGDQQLDIIDSGLKKVRLLKNKQGDVIINEIEDDKPALNSDVSLKELAKGLPEGSDQ 148
Query: 54 ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
YCG CYGA D+ CCN+C VR AY +K W + + I+QC++EG+++R++E
Sbjct: 149 NAYCGPCYGALPQDKKQFCCNDCNTVRRAYAEKQWQFFDGENIEQCEKEGYVKRLRERIN 208
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
EGC I G ++N+V+G FAPG SF+ G H HD+ +++ D FN H IN L+FG
Sbjct: 209 NNEGCRIKGSTKINRVSGTMDFAPGSSFNHDGRHFHDLSLYKKYNDKFNFDHVINHLSFG 268
Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
E ++PLD ++ + YF+KVV T Y + + +NQFSV
Sbjct: 269 EVPTNNGAEEMFDSIHPLDDYQFMLHKKDHVVSYFLKVVATRYESLDYSKRVDTNQFSVI 328
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R G+ + +PGV F +D+SP+K+ +++ ++ F+ V + +
Sbjct: 329 THDRPLIGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFILGVVSSIA 388
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV V ++D ++ Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|74267709|gb|AAI02327.1| ERGIC and golgi 3 [Bos taurus]
Length = 231
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 83/147 (56%), Positives = 103/147 (70%), Gaps = 11/147 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
MD++GEQ LDV+H++FKKRLD G + S + G K+ P R
Sbjct: 89 MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C SCYGAE D CCN+CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVH 143
YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVH 228
>gi|323356370|gb|EGA88170.1| Erv46p [Saccharomyces cerevisiae VL3]
Length = 415
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
MD SGE LD+ LD+ SR + G P D GG + ++
Sbjct: 88 MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTXPVNNDP 137
Query: 55 TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
YCG CYGA+ ++ CC +C+ VR AY + GWA + I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
I E EGC I G ++N++ GN HFAPGK + + H HD + + S N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257
Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
+FG+ H VV +PLDG V + T + YF K+VPT Y +
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
I++ QFS T H R GR + +PG+F F+++SP+KV E+H ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
F+ N +GGV V ++D Y QR+I K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|349576209|dbj|GAA21381.1| K7_Erv46p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 415
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
MD SGE LD+ LD+ SR + G P D GG + ++
Sbjct: 88 MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137
Query: 55 TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
YCG CYGA+ ++ CC +C+ VR AY + GWA + I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
I E EGC I G ++N++ GN HFAPGK + + H HD + + S N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257
Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
+FG+ H VV +PLDG V + T + YF K+VPT Y +
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
I++ QFS T H R GR + +PG+F F+++SP+KV E+H ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
F+ N +GGV V ++D Y QR+I K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|151941348|gb|EDN59719.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190406692|gb|EDV09959.1| ER-Golgi transport vesicle protein [Saccharomyces cerevisiae
RM11-1a]
gi|207348028|gb|EDZ74008.1| YAL042Wp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|256272276|gb|EEU07261.1| Erv46p [Saccharomyces cerevisiae JAY291]
gi|259144662|emb|CAY77603.1| Erv46p [Saccharomyces cerevisiae EC1118]
gi|323334778|gb|EGA76150.1| Erv46p [Saccharomyces cerevisiae AWRI796]
gi|323338873|gb|EGA80087.1| Erv46p [Saccharomyces cerevisiae Vin13]
gi|323349926|gb|EGA84136.1| Erv46p [Saccharomyces cerevisiae Lalvin QA23]
gi|365767200|gb|EHN08685.1| Erv46p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 415
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
MD SGE LD+ LD+ SR + G P D GG + ++
Sbjct: 88 MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137
Query: 55 TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
YCG CYGA+ ++ CC +C+ VR AY + GWA + I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
I E EGC I G ++N++ GN HFAPGK + + H HD + + S N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257
Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
+FG+ H VV +PLDG V + T + YF K+VPT Y +
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
I++ QFS T H R GR + +PG+F F+++SP+KV E+H ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
F+ N +GGV V ++D Y QR+I K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|6319274|ref|NP_009358.1| Erv46p [Saccharomyces cerevisiae S288c]
gi|1723191|sp|P39727.2|ERV46_YEAST RecName: Full=ER-derived vesicles protein ERV46
gi|1326054|gb|AAC04989.1| Yal042wp [Saccharomyces cerevisiae]
gi|285810158|tpg|DAA06944.1| TPA: Erv46p [Saccharomyces cerevisiae S288c]
gi|392301230|gb|EIW12318.1| Erv46p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 415
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
MD SGE LD+ LD+ SR + G P D GG + ++
Sbjct: 88 MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137
Query: 55 TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
YCG CYGA+ ++ CC +C+ VR AY + GWA + I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
I E EGC I G ++N++ GN HFAPGK + + H HD + + S N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257
Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
+FG+ H VV +PLDG V + T + YF K+VPT Y +
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
I++ QFS T H R GR + +PG+F F+++SP+KV E+H ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHVRGGIPGMFVFFEMSPLKVINKEQHGQTW 377
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
F+ N +GGV V ++D Y QR+I K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412
>gi|366987855|ref|XP_003673694.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
gi|342299557|emb|CCC67313.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
Length = 425
Score = 170 bits (430), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 109/334 (32%), Positives = 166/334 (49%), Gaps = 48/334 (14%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SGE LD+ F K R+D+ GN + S +G + +Q+ ++ YCGS
Sbjct: 93 MDDSGELQLDLLDSAFTKIRVDADGNELGSSTLEVGTDDLASEVQQRN----NDPDYCGS 148
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYG++ DE+ CC C +VREAY GW + I+QC++EG++ +I E
Sbjct: 149 CYGSKVQDENDKLPRESRVCCQTCNDVREAYLNIGWGFFDGKGIEQCEKEGYVAKINEHL 208
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSF-----HQSGVHVHDILAFQRDS-FNISHKINKL 164
EGC + G ++++ GN HFAPGKS+ S H HD + + S N +HKIN L
Sbjct: 209 KEGCRVKGQTLLSRIQGNIHFAPGKSYTSYKRSTSASHYHDTSLYDKTSNLNFNHKINHL 268
Query: 165 AFGEHFPGV------------VNPLDG---VRWTQETPSGMYQYFIKVVPTVY--TDVSG 207
+FG+ + ++PLDG + +T +Y Y+ K+VPT Y +
Sbjct: 269 SFGKPIDKLDEKVQDHSTEFSISPLDGREVIPTDIDTHYHVYSYYAKIVPTRYEFLNKKE 328
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFL 256
+I++ QFS T H R GR +PG+F ++++S +KV E H S+
Sbjct: 329 KSIETAQFSTTFHSRPLRGGRDADHPTTMHSQGGIPGLFIYFEMSAVKVINKEHHFRSWS 388
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
FL N VG V V + D Y Q++++ K
Sbjct: 389 SFLLNCITTVGSVLAVGTVSDKIFYRAQKSLQGK 422
>gi|12060847|gb|AAG48265.1|AF308298_1 serologically defined breast cancer antigen NY-BR-84, partial [Homo
sapiens]
Length = 239
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 79/143 (55%), Positives = 103/143 (72%), Gaps = 3/143 (2%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD++GEQ LDV+H++FK+RLD G + S + K++ + + C SC
Sbjct: 98 MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 154
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGAE+ D CCN CE+VREAYR++GWA NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 155 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 214
Query: 121 EVNKVAGNFHFAPGKSFHQSGVH 143
EVNKVAGNFHFAPGKSF QS VH
Sbjct: 215 EVNKVAGNFHFAPGKSFQQSHVH 237
>gi|449016424|dbj|BAM79826.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
10D]
Length = 499
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 100/275 (36%), Positives = 148/275 (53%), Gaps = 33/275 (12%)
Query: 55 TYCGSCYGAESSDED-----------CCNNCEEVREAYRKKGWALSNP-DLIDQCKREGF 102
YCGSCYGA + CCN C+E+R Y ++ WA +QC + +
Sbjct: 224 AYCGSCYGAVPQTDQVGEANQITSGVCCNTCDEIRVLYEERNWAFDQVLRTAEQCAEKRY 283
Query: 103 LQRIKEE---EGEGCNIYGFLEVNKVAGNFHFAPGKS-FHQSGVHVHDIL-AFQRDSFNI 157
L + E + GC + L++ +VAGNFHFAPGK H+ G HVH + ++N
Sbjct: 284 LTLLHEAGRVQSGGCRVSARLQLPRVAGNFHFAPGKGHTHRMGHHVHSVDDQLLHRTYNF 343
Query: 158 SHKINKLAFGEHFPGVVNPLDG-VRWTQETPSG-----MYQYFIKVVPTVYTD--VSGHT 209
SH+I L FG FP NPLDG +R ++ P G M Y+ K++PT Y G
Sbjct: 344 SHRIRHLRFGPLFPHQQNPLDGAMRILEQPPPGSPFGNMVLYYCKLIPTTYRRDRQRGDA 403
Query: 210 IQSNQFSVTEHFRSSEQGRLQ------TLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNV 262
++S +++ + +SSEQ R+ LPG+FFFY+ P+++ + E + LHF+ +
Sbjct: 404 LRSMEYAAADLTQSSEQDRVGITHSTGALPGIFFFYEPQPLQIAYFEGRMYGLLHFIVQL 463
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIK-KKIEIGKF 296
CAIVGGVFTVS +ID F++ I+ +K +GK
Sbjct: 464 CAIVGGVFTVSSMIDRFVFGAGTFIRAQKRRLGKL 498
>gi|226292523|gb|EEH47943.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 435
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/352 (32%), Positives = 164/352 (46%), Gaps = 72/352 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
MD+SGE + H I K RL + G+VI++ L +H + Y
Sbjct: 89 MDVSGEMQSGIIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 139
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA ++ +EVREAY + WA + ++QC+REG+ + + + E
Sbjct: 140 CGPCYGAPPPSHATKPGVALPAKEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF +H HD+ + ++SHKI++L FG
Sbjct: 200 GCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAHDLDTYYHTPVPHHMSHKIHQLRFGPQL 259
Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
+ NPLD P + YF+KVV T Y +
Sbjct: 260 SDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 319
Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
S +I+++Q+SVT H RS + G RL + +PGVF
Sbjct: 320 TLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSHGGIPGVFVN 379
Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YD+SP+KV E +F FLT VCA++GG TV+ +D +Y G +KK
Sbjct: 380 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGVARVKK 431
>gi|254581328|ref|XP_002496649.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
gi|238939541|emb|CAR27716.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
Length = 404
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 103/326 (31%), Positives = 160/326 (49%), Gaps = 43/326 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD +GE LD+ F K RLD G + S + D +E YCG+
Sbjct: 89 MDSAGEIQLDLLESGFTKTRLDQNGQSLGSSSLKVSDESYDP----------KDENYCGA 138
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYGA+ + CC C +VR AY + WA + I+QC+REG++ R+ E+
Sbjct: 139 CYGAKDQSRNNEVPKEERVCCQTCNDVRRAYLEANWAFFDGKNIEQCEREGYVDRVNEQL 198
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
EGC + G +N++ G HFAPG +F H HD+ +++ + N +H IN L+FG+
Sbjct: 199 NEGCRVQGSALLNRIQGTLHFAPGVAFQNPKGHFHDLSLYEKTHNLNFNHIINHLSFGKP 258
Query: 170 FPG---------VVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
PLDG + + + M+Q YF K+VPT Y + +++ QFS T
Sbjct: 259 VTSNARGRGASVATAPLDGRQAFPDRDTHMHQFSYFTKIVPTRYEYMDKMVVETAQFSAT 318
Query: 219 EHFRS----SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R ++Q TL PG+F ++++SP+KV E+H ++ F+ N +G
Sbjct: 319 LHDRPLHGGADQDHPTTLHTKGGFPGLFVYFEMSPLKVINREQHAQTWSGFILNCITSIG 378
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV V ++D Y Q++I K +
Sbjct: 379 GVLAVGTVLDKITYKAQKSIWGKKSV 404
>gi|403215799|emb|CCK70297.1| hypothetical protein KNAG_0E00290 [Kazachstania naganishii CBS
8797]
Length = 408
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 108/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)
Query: 1 MDISGEQHLDVKHD---IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET-Y 56
+D SG LDV + K R+D +G +++ A L +L + Y
Sbjct: 89 LDDSGVLLLDVDDENNHFTKTRIDQRGEPLDA------AAAASFKLDAEAAQLPPTDPDY 142
Query: 57 CGSCYGA---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 107
CGSCYG+ + +++ CCN C VREAY GWA + I+QC+REG++ +I
Sbjct: 143 CGSCYGSRDQTRNDELDPANKVCCNTCSSVREAYLDAGWAFFDGKNIEQCEREGYVDKIS 202
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAF 166
+ EGC I G + +N+V GN HFAPG +F + H HD + Q S N H I+ L+F
Sbjct: 203 QRITEGCRIKGGVRLNRVQGNIHFAPGDAFRSARGHFHDTSMYDQTGSLNFDHIIHHLSF 262
Query: 167 GEHFPGV----------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQ 214
G + + PLDG + S YQ YF K+VPT + SG I++ Q
Sbjct: 263 GPSVDNMQSLEKASNVAIAPLDGKQVLPRYDSHAYQYTYFTKIVPTRFEYFSGSVIETTQ 322
Query: 215 FSVTEHFRSSEQGRLQT-------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIV 266
FS T R G +T PG++F ++SP+KV E++ +S+ FL N +
Sbjct: 323 FSSTFSARPIGGGTTETATYTSGGTPGLYFNIEMSPLKVIHKEQNKISWSGFLLNCITSI 382
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GGV V ++D +Y +R + K
Sbjct: 383 GGVLAVGTVVDKILYRAERTLLNK 406
>gi|169603005|ref|XP_001794924.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
gi|111067148|gb|EAT88268.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
Length = 351
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 142/297 (47%), Gaps = 63/297 (21%)
Query: 56 YCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
YCG CYGA S CCN C+EVR+AY W+ + ++QC+RE + + + ++
Sbjct: 51 YCGECYGAPSPTNAIKAGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQ 110
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN--ISHKINKLAFGEH 169
EGC + G + VNKV GNFH APGKSF +HVHD+ + +D ++ +HKI+ L FG
Sbjct: 111 EGCRLEGSIRVNKVVGNFHIAPGKSFSTGNMHVHDLENYFKDEYSHTFTHKIHHLRFGPQ 170
Query: 170 FPGVV---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----- 203
V NPLD + + YF+KVV T Y
Sbjct: 171 LSNAVIADMQKKHQNTGPGGWTSHHINPLDNTEQQTSEKAYNFMYFVKVVSTAYLPLGWE 230
Query: 204 ----------DVSGHTIQSN--------QFSVTEHFRSSEQGRLQT------------LP 233
++ G TI+ N Q+SVT H RS G + +P
Sbjct: 231 KEAPRLTKHDELLGSTIEGNYKGSIETHQYSVTSHKRSLAGGNDEKEGHKERIHAKGGIP 290
Query: 234 GVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
GVFF YD+SP+KV E +F FL +CA++GG TV+ +D +Y G IKK
Sbjct: 291 GVFFSYDISPMKVINREVRDKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKK 347
>gi|356547537|ref|XP_003542168.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Glycine max]
Length = 351
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 100/291 (34%), Positives = 157/291 (53%), Gaps = 35/291 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RL+S G++I G I +++ EH++
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSYGHII-------GTEYISDLVEKEHTNQEHDDNKDHDH 141
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EEGEGCNIYG 118
+ S + N +E E ++++KE + GEGC +YG
Sbjct: 142 HHEHSEQKIHLQNLDE---------------------STENIIKKVKEALKNGEGCRVYG 180
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
L+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+ NPLD
Sbjct: 181 VLDVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLD 236
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
SG ++Y+IKVVPT Y +S + +NQFSV+E++ Q +T P V+F
Sbjct: 237 DTTRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFL 295
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YDLSPI VT EE SFLHF+T +CA++GG F V+G++D ++Y A+ K
Sbjct: 296 YDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMYRLLEALTK 346
>gi|146416067|ref|XP_001484003.1| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
6260]
Length = 404
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 169/324 (52%), Gaps = 44/324 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN------ 53
+D+SG+ +D+ F+K RL G +E R + P+ G LE
Sbjct: 88 LDVSGDLQVDLLLSGFEKFRLLKDG--LEIRDES--------PVMSSAGELEERARGRAP 137
Query: 54 ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
+ CGSCYGA DE+ CCN+CE VR AY +K W + + I+QC+REG++ R+ E+
Sbjct: 138 DGLCGSCYGALPQDENLDYCCNDCETVRLAYAQKAWGFFDGENIEQCEREGYVARLNEKI 197
Query: 111 G--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF 166
EGC I G ++N+++GN HFAPG SF G H HD+ F + D F H IN L F
Sbjct: 198 NNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDLSLFNKYDDKFTFDHVINHLLF 257
Query: 167 G------EHFPG-VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSV 217
G + F + +PLD ++ +Y Y++KVV T + ++ +T +++NQF V
Sbjct: 258 GLDPHNIQFFEKQLTHPLDKSSMILKSKDRLYSYYLKVVATRFEFLTPNTPALETNQFLV 317
Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
H R G+ LPGVFF +++ P+K+ E++ ++ F+ V + +
Sbjct: 318 ISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEILPMKIINKEQYAKTWSGFVLGVISSI 377
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GV V ++D ++ +R I+ K
Sbjct: 378 AGVLMVGALLDRSVWAAERVIRAK 401
>gi|356575088|ref|XP_003555674.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Glycine max]
Length = 347
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 98/289 (33%), Positives = 153/289 (52%), Gaps = 35/289 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RL+S G++I + + ++K H N +
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---ISDLVEKEHTHHKHDDNKNHEHSEQK 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
++ DE N ++V+EA + GEGC +YG L
Sbjct: 146 IHLQNLDESTENIIKKVKEALKN---------------------------GEGCRVYGVL 178
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+ NPLD
Sbjct: 179 DVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLDDT 234
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IKVVPT Y +S + +NQFSV+E++ Q +T P V+F YD
Sbjct: 235 TRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFLYD 293
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LSPI VT EE SFLHF+T +CA++GG F V+G++D ++Y + K
Sbjct: 294 LSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMYRLLETLTK 342
>gi|45188262|ref|NP_984485.1| ADR389Cp [Ashbya gossypii ATCC 10895]
gi|44983106|gb|AAS52309.1| ADR389Cp [Ashbya gossypii ATCC 10895]
Length = 392
Score = 167 bits (422), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 106/317 (33%), Positives = 155/317 (48%), Gaps = 42/317 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA--PKIDKPLQRHGGRLEHNETYC 57
+D +GE L++ + F K RLD G + + +G P D ++ YC
Sbjct: 88 IDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRVGETLPSTD------------DQDYC 135
Query: 58 GSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
G CYGA D++ CC C EVR AY + WA + +QCKREG+ +R++E
Sbjct: 136 GPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEMNWATFDGKGFEQCKREGYTERLQE 195
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG 167
+ EGC + G ++N+V GN HFAPG S H H HD ++ + +H I+ L+FG
Sbjct: 196 QINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHAHDDSFYKEHPHLSFNHVIHSLSFG 254
Query: 168 EHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
G PL+G E P+G + YF KVVP Y ++G +S +FSVT H R
Sbjct: 255 PEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSVTAHDRPV 312
Query: 225 EQGRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVS 273
GR P F +++SP+KV E++ S + F+ N +GGV V
Sbjct: 313 HGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVG 372
Query: 274 GIIDAFIYHGQRAIKKK 290
++D YH QR + K
Sbjct: 373 TVLDRVTYHTQRTLMGK 389
>gi|156838396|ref|XP_001642904.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
gi|156113483|gb|EDO15046.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
70294]
Length = 404
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 164/323 (50%), Gaps = 48/323 (14%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD SG LD+ F K R+ S G +Q G K+ + L + + ++ YCGS
Sbjct: 88 MDDSGNVQLDITESGFTKTRIGSDG-----QQLGTTNFKVSEDLLEYSPK---DKNYCGS 139
Query: 60 CYGA---------ESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
CYGA ES D+ CC CE+V+ AY GWA + I+QC+REG+++++ ++
Sbjct: 140 CYGARDQSKNDEAESVDKKVCCQTCEDVKNAYSDAGWAFFDGKNIEQCEREGYVEKMNDQ 199
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFG 167
EGC I G +N++ GN HFAPGK+F G H HD +F D + N H I L+FG
Sbjct: 200 LNEGCRISGEALLNRIHGNIHFAPGKAFQNRGGHFHDT-SFYNDHKNLNFKHMIEHLSFG 258
Query: 168 ---------EHFPGVVNPLDGVRWTQETPS-----GMYQYFIKVVPTVYTDVSGHTIQSN 213
+ + +PLDG QE PS + YF K+VPT + ++ +++
Sbjct: 259 RPVAQFKSNKDLVAMTSPLDG---HQELPSIDAHNHQFIYFAKIVPTRFEYLNKQAQETS 315
Query: 214 QFSVTEHFR--------SSEQGRLQTLPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCA 264
Q VT H + S+ Q +PG+F Y++SP+KV E+H ++ FL N
Sbjct: 316 QLVVTSHMKPIGDATDYSTTMNSRQGIPGLFIDYEISPLKVINREQHATTWSGFLLNCIT 375
Query: 265 IVGGVFTVSGIIDAFIYHGQRAI 287
+GG+ V + D ++ QR +
Sbjct: 376 SIGGILAVGTVADKIVHATQRVV 398
>gi|219110527|ref|XP_002177015.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411550|gb|EEC51478.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 500
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 164/343 (47%), Gaps = 72/343 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLD-----SQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN-- 53
MD++G+ L+++ + K+++D Q +++S Q + Q +L +
Sbjct: 161 MDVAGDSQLNIEDTLTKRKMDRTGRYGQAEILQSNQH--------EQEQSRKAKLRQDPL 212
Query: 54 -ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI----DQCKREGFLQRIKE 108
+TYCG CYGA+ + CCNNC+ + +AY+ KGW DL+ +QC REG Q+
Sbjct: 213 PDTYCGPCYGAQPDVDACCNNCDALLDAYKLKGW---RTDLVLYTAEQCIREGRDQKKLR 269
Query: 109 E--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
+GEGCN+ GF+ +N+VAGNFH A G+ + G H+H + +N SH I+ L+F
Sbjct: 270 PLIQGEGCNLSGFMSLNRVAGNFHIAMGEGLQRDGRHIHVFDPEDSEHYNASHVIHHLSF 329
Query: 167 GEHFPGVV-------NPLDGVRWTQETP----SGMYQYFIKVVPTVYTDVSGH-----TI 210
G G + L+GV TP +G++QYFIKVVPT Y G T
Sbjct: 330 GPEIQGKTKSGNLDSSSLNGVT-KMVTPEHGTTGLFQYFIKVVPTTYLGPGGRRDESGTF 388
Query: 211 QSNQFSVTEHFRS------SEQG------------------------RLQTLPGVFFFYD 240
++N++ TE FR E+ R LPGVFF Y+
Sbjct: 389 ETNRYFYTERFRPLMKEYLPEEAVAEDPKQAAVHAGGGHRTHDHHHVRNSVLPGVFFLYE 448
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
+ P V V H L + A +GGVFT+ +D + G
Sbjct: 449 IYPFAVEIHPVSVPLTHLLIRLMATIGGVFTIVRWVDTAVLEG 491
>gi|374107698|gb|AEY96606.1| FADR389Cp [Ashbya gossypii FDAG1]
Length = 392
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 105/317 (33%), Positives = 154/317 (48%), Gaps = 42/317 (13%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA--PKIDKPLQRHGGRLEHNETYC 57
+D +GE L++ + F K RLD G + + +G P D ++ YC
Sbjct: 88 IDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRVGETLPSTD------------DQDYC 135
Query: 58 GSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
G CYGA D++ CC C EVR AY + WA + +QCKREG+ +R++E
Sbjct: 136 GPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEMNWATFDGKGFEQCKREGYTERLQE 195
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG 167
+ EGC + G ++N+V GN HFAPG S H H HD ++ + +H I+ L+FG
Sbjct: 196 QINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHAHDDSFYKEHPHLSFNHVIHSLSFG 254
Query: 168 EHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
G PL+G E P+G + YF KVVP Y ++G +S +FS T H R
Sbjct: 255 PEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSATAHDRPV 312
Query: 225 EQGRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVS 273
GR P F +++SP+KV E++ S + F+ N +GGV V
Sbjct: 313 HGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVG 372
Query: 274 GIIDAFIYHGQRAIKKK 290
++D YH QR + K
Sbjct: 373 TVLDRVTYHTQRTLMGK 389
>gi|444321132|ref|XP_004181222.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
gi|387514266|emb|CCH61703.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
Length = 414
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 167/331 (50%), Gaps = 47/331 (14%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLEHNET-YC 57
+D SGE LDV F K R+D+ GN ++ DG ++D R L+ ++ YC
Sbjct: 87 VDDSGETSLDVLESGFTKIRVDTNGNELD---DG---SQLDVGTDRESLSSLDMDKAKYC 140
Query: 58 GSCYGA----------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 107
G CYGA +S++ CC C +VR+AY GWA + I+QC+REG++ RI
Sbjct: 141 GPCYGALDQSGNDNIDVASEKVCCQTCYDVRKAYTDVGWAFFDGKDIEQCEREGYVDRIN 200
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAF 166
+ EGC I G +N++ GN HFAPG +F + H HD + + + N +H IN L+F
Sbjct: 201 DHLHEGCRIVGSALLNRIQGNVHFAPGAAFETAKGHFHDTSLYDKTEQLNFNHIINHLSF 260
Query: 167 GEHFPGVVN-------------PLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTI 210
G+ ++ PLDG E+ + + YF K+VPT + +SG
Sbjct: 261 GKTGHELLTPKSSKSFSVSRRQPLDGRVMIPESRNTHFFQFSYFAKIVPTRFESLSGKVE 320
Query: 211 QSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFL 259
++ Q+SVT H R + GR + +PG+F ++ ++P+KV E H +F L
Sbjct: 321 EAAQYSVTFHSRPLQGGRDEDHPNTFHGRSGIPGLFIYFQMAPLKVIDIEAHSQTFSGLL 380
Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
N +GGV V ++D Y QR+I K
Sbjct: 381 LNCITTIGGVLAVGTMMDKVFYKAQRSIWGK 411
>gi|149237735|ref|XP_001524744.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146451341|gb|EDK45597.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 411
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 172/322 (53%), Gaps = 31/322 (9%)
Query: 2 DISGEQHLDVKHDIFKK-RLDSQGN--VIESRQDGIGAPKIDKPLQRHGGRLEHNET-YC 57
D +G+ LDV + +K R+ +GN V+E D A + ++PL L NE C
Sbjct: 91 DETGDMKLDVINSGLEKYRIIKRGNNKVVEELDDQ-PALRREQPLHEICKGLGENEQGEC 149
Query: 58 GSCYGAESSD--EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
GSCYGA D E CCN+C VR AY K W + + I+QC++EG++Q++K+ + EG
Sbjct: 150 GSCYGALPQDKKEYCCNSCAAVRRAYAHKKWQFFDGENIEQCEKEGYVQKLKDRINQNEG 209
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGEHFP 171
C + G ++N+VAG FAPG S +G HVHD+ + + D FN H I+ L+FG+
Sbjct: 210 CRVKGSAKINRVAGTMDFAPGISTTSNGQHVHDLSLYTKYPDKFNFDHVIHHLSFGKIPT 269
Query: 172 GVVN--------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-HTIQSNQFSVTEHFR 222
+ N PLDG + Q M Y++K+V T + ++ G + +NQFSV H R
Sbjct: 270 AITNLQETDSLSPLDGHSFLQHKRYHMNNYYLKIVSTRFENLDGTKKVDTNQFSVITHDR 329
Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 271
G+ + +P V F +D+SP+K+ E + ++ F+ V + V GV
Sbjct: 330 PLVGGKDEDHQHTLHARGGVPSVAFHFDISPLKIINRERYAKTWSGFVLGVVSSVAGVLM 389
Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
V ++D ++ Q+A+K K ++
Sbjct: 390 VGALLDRSVFAAQQAMKGKKDL 411
>gi|255637400|gb|ACU19028.1| unknown [Glycine max]
Length = 347
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/289 (33%), Positives = 152/289 (52%), Gaps = 35/289 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RL+S G++I + + ++K H N +
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---VSDLVEKEHTHHKHDDNKNHEHSEQK 145
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
++ DE N ++V+EA + GEGC +YG L
Sbjct: 146 IHLQNLDESTENIIKKVKEALKN---------------------------GEGCRVYGVL 178
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+ NPLD
Sbjct: 179 DVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLDDT 234
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IKVVPT Y +S + +NQFSV+E++ Q +T P V+F YD
Sbjct: 235 TRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFLYD 293
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LSPI VT EE SF HF+T +CA++GG F V+G++D ++Y + K
Sbjct: 294 LSPITVTIKEERRSFFHFITRLCAVLGGTFAVTGMLDRWMYRLLETLTK 342
>gi|50294900|ref|XP_449861.1| hypothetical protein [Candida glabrata CBS 138]
gi|49529175|emb|CAG62841.1| unnamed protein product [Candida glabrata]
Length = 415
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 113/332 (34%), Positives = 172/332 (51%), Gaps = 43/332 (12%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIG-APKIDKPLQRHGGRLEHNETYCG 58
MD SGE LD+ + F+K RL +G V+ + IG A K DK Q +L N YCG
Sbjct: 88 MDDSGEVQLDIMNAGFEKTRLSKEGKVLGTADMKIGEAAKKDKEAQL--AKLGAN--YCG 143
Query: 59 SCYGAESSDED----------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
+CYGA ++ CC C++VR+AY +K WA + I+QC+REG++Q+I +
Sbjct: 144 NCYGARDQGKNNDDTPRDQWVCCQTCDDVRQAYFEKNWAFFDGKDIEQCEREGYVQKIAD 203
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DILAFQRDSFNISHKINKLAFG 167
+ EGC + G ++N++ GN HFA G F H H D L Q + N +H IN L+FG
Sbjct: 204 QLQEGCRVSGSAQLNRIDGNLHFAAGPGFQNIRGHFHDDSLYIQHPNLNFNHIINHLSFG 263
Query: 168 EHFPG------------VVNPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQS 212
+ VNPLDG + ++ Y Y+ K+VPT Y ++ + +++
Sbjct: 264 KAVEPTKKGKVMGIEKVTVNPLDGHSMFPPRDAHFLQYSYYAKIVPTRYEGLNKKNMVET 323
Query: 213 NQFSVTEHFR----SSEQGRLQTL------PGVFFFYDLSPIKVTFTEEH-VSFLHFLTN 261
QFS T H R S+ T+ P ++ +++SP+KV EEH S+ F+ N
Sbjct: 324 AQFSSTFHIRPVGGGSDDDHPNTVHQRGGSPSMWINFEMSPLKVINREEHGQSWSGFVLN 383
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
+GGV V ++D +Y QR I +K ++
Sbjct: 384 CITSIGGVLAVGTVLDKALYKAQRTIFQKKDV 415
>gi|30686584|ref|NP_188868.2| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
gi|13877821|gb|AAK43988.1|AF370173_1 unknown protein [Arabidopsis thaliana]
gi|51969000|dbj|BAD43192.1| unknown protein [Arabidopsis thaliana]
gi|51970108|dbj|BAD43746.1| unknown protein [Arabidopsis thaliana]
gi|51970556|dbj|BAD43970.1| unknown protein [Arabidopsis thaliana]
gi|51970734|dbj|BAD44059.1| unknown protein [Arabidopsis thaliana]
gi|62319967|dbj|BAD94071.1| hypothetical protein [Arabidopsis thaliana]
gi|332643097|gb|AEE76618.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
thaliana]
Length = 354
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 104/297 (35%), Positives = 161/297 (54%), Gaps = 43/297 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
+D+SG+ +D+ +I+K RL+S G++I E D + G P +H G+ EH NET
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
EA G+ DQ E ++++K+ +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
C +YG L+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
NPLD SG ++Y+IK+VPT Y +S + +NQ+SVTE+F + +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYFTPMTEFD-RTWP 292
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
V+F YDLSPI VT EE SFLH +T +CA++GG F ++G++D +++ + KK
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTGMLDRWMFRFIESFNKK 349
>gi|115455745|ref|NP_001051473.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|14718311|gb|AAK72889.1|AC091123_8 unknown protein [Oryza sativa Japonica Group]
gi|108711422|gb|ABF99217.1| Serologically defined breast cancer antigen NY-BR-84, putative,
expressed [Oryza sativa Japonica Group]
gi|113549944|dbj|BAF13387.1| Os03g0784400 [Oryza sativa Japonica Group]
gi|215737170|dbj|BAG96099.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625918|gb|EEE60050.1| hypothetical protein OsJ_12848 [Oryza sativa Japonica Group]
Length = 350
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 35/289 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RLD G++I G ++ +++ G HN +
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLNDLVEKEHG--THNHDHDHEH 139
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ E N E+ + + A+ N GEGC +YG L
Sbjct: 140 EDEQKKQEHTFN--EDAEKMVKSVKQAMEN--------------------GEGCRVYGVL 177
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H + V + + N+SH I+ L+FG +PG+ NPLD
Sbjct: 178 DVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVSHIIHDLSFGPKYPGIHNPLDET 233
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IK+VPT Y +S + +NQFSVTE+F P V+F YD
Sbjct: 234 TRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYD 293
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LSPI VT EE +FLHFLT +CA++GG F ++G++D ++Y ++ K
Sbjct: 294 LSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDRWMYRLIESVTK 342
>gi|297830940|ref|XP_002883352.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
gi|297329192|gb|EFH59611.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
lyrata]
Length = 354
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 103/297 (34%), Positives = 161/297 (54%), Gaps = 43/297 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
+D+SG+ +D+ +I+K RL+S G++I E D + G P +H G+ EH NET
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
EA G+ DQ E ++++K+ +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
C +YG L+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
NPLD SG ++Y+IK+VPT Y +S + +NQ+SVTE++ + +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYYTPMTEFD-RTWP 292
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
V+F YDLSPI VT EE SFLH +T +CA++GG F ++G++D +++ + KK
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTGMLDRWMFRLIESFNKK 349
>gi|218193856|gb|EEC76283.1| hypothetical protein OsI_13786 [Oryza sativa Indica Group]
Length = 350
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 35/289 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RLD G++I G ++ +++ G HN +
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLNDLVEKEHG--THNHDHDHEH 139
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ E N E+ + + A+ N GEGC +YG L
Sbjct: 140 EDEQKKQEHTFN--EDAEKMVKSVKQAMEN--------------------GEGCRVYGVL 177
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H + V + + N+SH I+ L+FG +PG+ NPLD
Sbjct: 178 DVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVSHIIHDLSFGPKYPGIHNPLDET 233
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IK+VPT Y +S + +NQFSVTE+F P V+F YD
Sbjct: 234 TRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYD 293
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
LSPI VT EE +FLHFLT +CA++GG F ++G++D ++Y ++ K
Sbjct: 294 LSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDRWMYRLIESVTK 342
>gi|241955457|ref|XP_002420449.1| COPII-coated vesicle complex subunit, putative; ER-derived vesicle
protein, putative [Candida dubliniensis CD36]
gi|223643791|emb|CAX41527.1| COPII-coated vesicle complex subunit, putative [Candida
dubliniensis CD36]
Length = 414
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 172/326 (52%), Gaps = 33/326 (10%)
Query: 1 MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
+D++G+ L++ KK RL + QG+VI + +D A D L L
Sbjct: 89 LDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDDEPAFNNDIELTDLAKGLPEGSDE 148
Query: 54 ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
YCGSCYGA D+ CCN+C VR AY +K W+ + + I+QC++EG++ R++E
Sbjct: 149 NAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSFYDGENIEQCEKEGYVARLRERIN 208
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
EGC I G ++N+V+G FAPG SF + G H HD+ + + D FN H IN L+FG
Sbjct: 209 NNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHDLSLYTKYEDKFNFDHIINHLSFG 268
Query: 168 E--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
E ++PLD ++ + + Y++KVV T + + + I +NQFSV
Sbjct: 269 EMPVDGQADQLFDSIHPLDDHQFMLHKKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVI 328
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R G+ + +PGV F +D+SP+K+ +++ ++ F+ V + +
Sbjct: 329 THDRPLRGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFVLGVISSIA 388
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV V ++D ++ Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|224137484|ref|XP_002322569.1| predicted protein [Populus trichocarpa]
gi|222867199|gb|EEF04330.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 101/290 (34%), Positives = 155/290 (53%), Gaps = 36/290 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RL+S G++ G + +++ E +
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSHGHIT-------GTEYLSDLVEKEH---EAHNHDHDKD 138
Query: 61 YGAESSDEDCCNNCEEVREAYRKK-GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+ +S +E + ++ E KK AL+N GEGC +YG
Sbjct: 139 HHKDSHEEQHTHGFDDAAETMIKKVKQALAN--------------------GEGCRVYGV 178
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
L+V +VAGNFH S H + V ++ N+SH I+ L+FG +PG+ NPLDG
Sbjct: 179 LDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNVSHIIHDLSFGPKYPGIHNPLDG 234
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
SG+++Y+IK+VPT Y +S + +NQFSVTE+F S +T P V+F Y
Sbjct: 235 TARILRETSGIFKYYIKIVPTEYRYISKDVLPTNQFSVTEYF-SPITDFDRTWPAVYFLY 293
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
DLSPI VT EE SFLHF+T +CAI+GG F ++G++D ++Y A+ K
Sbjct: 294 DLSPITVTIKEERRSFLHFITRLCAILGGTFALTGMLDRWMYRLLEALTK 343
>gi|365989554|ref|XP_003671607.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
gi|343770380|emb|CCD26364.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
Length = 438
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/346 (31%), Positives = 167/346 (48%), Gaps = 57/346 (16%)
Query: 1 MDISGEQHLDVKHDIF-KKRLDSQGNVIESRQDGIGAPKID-----KPLQRHGGR----- 49
MD SGE LD+ F K RLD QGN +++ + + D L ++G +
Sbjct: 91 MDESGELQLDLLDSTFIKTRLDPQGNPLDN-DNNVADTDADLVIGVDDLTKNGEKRLKEI 149
Query: 50 LEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKRE 100
L + YCGSCYG++ E+ CC C +VR++Y GWA + I+QC+ E
Sbjct: 150 LAKDPDYCGSCYGSQDQTENESKSKDQKICCQTCNDVRDSYLNAGWAFFDGAQIEQCENE 209
Query: 101 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH----QSGVHVHDILAFQR-DSF 155
G++ +I + EGC I G +N++ GN HFAPGKS+ + H HD + +
Sbjct: 210 GYVAKINKHLEEGCRIKGQALLNRIQGNIHFAPGKSYSNYKAKGSTHRHDTSLYDKVKKM 269
Query: 156 NISHKINKLAFGEHFPGV---------------VNPLDGVRWTQE--TPS-GMYQYFIKV 197
N +H I+ L+FG+ V +NPLD + + P+ + Y+ K+
Sbjct: 270 NFNHIIHHLSFGKSIDKVGKNDLKDYSDRKKFSINPLDDRKVIVKDFNPAFHQFSYYTKI 329
Query: 198 VPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 245
VPT Y D +I++ QFS T H R + G + +PG+FFF+++SPIK
Sbjct: 330 VPTRYEFLDEKISSIETAQFSATYHSRPIQGGTDEDHPTTFHSRGGIPGLFFFFEMSPIK 389
Query: 246 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
V E H ++ FL N +G V V + D Y Q+ +K K
Sbjct: 390 VINKEHHFRTWSSFLLNCITSIGSVLAVGTVFDKIFYRAQKTLKAK 435
>gi|403215743|emb|CCK70242.1| hypothetical protein KNAG_0D05030 [Kazachstania naganishii CBS
8797]
Length = 422
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 109/338 (32%), Positives = 167/338 (49%), Gaps = 50/338 (14%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-----LEHNE 54
MD SG LDV D F K R+D GN++ G A + KP G R L+ +
Sbjct: 90 MDESGNVQLDVLFDQFTKTRVDVNGNMV-----GGSASEPYKPNSLSGKRAGAKDLQMDA 144
Query: 55 TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
YCGSCYG+++ + + CC C++V +AY + GWA + I+QC+ EG+++R
Sbjct: 145 DYCGSCYGSKNQENNAELPPEQRICCQTCDDVHDAYLEAGWAFFDGANIEQCESEGYVKR 204
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--------HVHDILAFQRDS-FN 156
I+E+ EGCN+ G +N++ GN HFAPGK + Q H HD+ ++R+ N
Sbjct: 205 IQEQLHEGCNVKGTALLNRIQGNLHFAPGKPYQQLAAGMPGQGLGHYHDVSLYERNRHMN 264
Query: 157 ISHKINKLAFGEHFPGVV--------NPLDGVRWTQETPS-GMYQYFIKVVPTVYTDV-S 206
++H IN+ FGE + PL+ + E P ++ Y+ VVPT Y + +
Sbjct: 265 LNHVINEFRFGEDPQSEIVAQKIQRSAPLEDTVASLENPHYYIFNYYTNVVPTRYEFLGA 324
Query: 207 GHTIQSNQFSVTEHFRSSEQGR----LQTL------PGVFFFYDLSPIKVTFTEEHV-SF 255
+ + Q+S T H R GR TL PGV+F + SP+K+ E +
Sbjct: 325 SKPLDTAQYSATYHDRPIMGGRDADHPTTLHGRGGTPGVYFNLEFSPLKIINRERRPQQW 384
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
L N +GG+ V + D +Y QR+I K ++
Sbjct: 385 STLLLNWITTIGGILAVGTVTDKVVYKAQRSIGAKKQL 422
>gi|150866674|ref|XP_001386342.2| hypothetical protein PICST_85013 [Scheffersomyces stipitis CBS
6054]
gi|149387930|gb|ABN68313.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 407
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 166/317 (52%), Gaps = 29/317 (9%)
Query: 1 MDISGEQHLDVKHDIFKK-RL--DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
MD +G+ LD+ F+K R+ DS+ +I+ I A + + + G E + C
Sbjct: 90 MDEAGDLQLDILKSGFEKFRIVKDSEEEIIDRESTPINADLSIEEMAK--GLKEGEDGEC 147
Query: 58 GSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG--EG 113
GSCYGA D+ CCN+CE V+ AY +K W + + I+QC+ EG++QR++ EG
Sbjct: 148 GSCYGALPQDKKQYCCNDCETVKLAYAEKLWGFYDGENIEQCENEGYVQRVQSRINGKEG 207
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG----E 168
C I G +N+++G FAPG SF SG HVHD+ + + N H +NKL FG E
Sbjct: 208 CRIKGNARINRISGTMDFAPGASFTSSGHHVHDLSLYDKHPHLNFDHIVNKLTFGPIPDE 267
Query: 169 HFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTEHFRSS 224
P +PLD + ++ Y++KVV T + ++G + + +NQFSV H R
Sbjct: 268 SVPTAESTHPLDNYGVALNDKNHVFTYYLKVVATRFEFLNGASKALDANQFSVITHDRPI 327
Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
G+ +PGV F +D+SP+K+ E++ S+ F+ V + V GV V
Sbjct: 328 SGGKDNDHQHTLHAKGGIPGVVFHFDISPLKIINREQYAKSWSGFVLGVVSSVAGVLIVG 387
Query: 274 GIIDAFIYHGQRAIKKK 290
++D +Y + AIK K
Sbjct: 388 SLLDRSVYAAESAIKGK 404
>gi|448086324|ref|XP_004196073.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
gi|359377495|emb|CCE85878.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
Length = 405
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/318 (31%), Positives = 167/318 (52%), Gaps = 27/318 (8%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQ---RHGGRLEHNETY 56
+D+SG+ DV F+K RL N E D + D L+ R+ +
Sbjct: 90 LDVSGDTQADVLKSGFEKYRLIPSSN--EEVLDNAPVLRNDLSLEDIARNPNKEGGGYCG 147
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EEGEGC 114
+ +E CCN+CE VR AY ++ WA + I+QC+ EG++ R+ + E+ EGC
Sbjct: 148 SCYGALPQGDNEFCCNDCETVRVAYAERMWAFYDGANIEQCENEGYVTRLNQRIEQKEGC 207
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG----E 168
I G ++N+V+GN HFAPG + G H+HD+ +++ D F+ H IN L+FG +
Sbjct: 208 RIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDLSLYEKHFDKFSFDHVINHLSFGLDPAK 267
Query: 169 HFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
P +PLDG R S + Y++KVV T + ++G ++++NQFS H R
Sbjct: 268 EDPNHQSTHPLDGYRLILNDKSRVISYYLKVVATRFEFLNGSSMETNQFSAIPHHRPYRG 327
Query: 227 GRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGI 275
G+ + +PGVFF +D+SP+K+ E++ ++ F+ V + + GV TV +
Sbjct: 328 GKDEDHRHTMHAKGGIPGVFFHFDISPMKIINKEQYAKTWSGFVLGVISSIAGVLTVGAV 387
Query: 276 IDAFIYHGQRAIKKKIEI 293
+D ++ ++ IK K +I
Sbjct: 388 LDRSVWAAEKVIKSKKDI 405
>gi|353242343|emb|CCA73995.1| related to ERV46-component of copii vesicles [Piriformospora indica
DSM 11827]
Length = 420
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 102/322 (31%), Positives = 160/322 (49%), Gaps = 46/322 (14%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
D+SGE +V H+I K RLDS+G + QD I + + + G+ YCGSCY
Sbjct: 94 DVSGEHMREVSHNIVKVRLDSEGKPYPN-QDHISDLRNEISRVKDIGK----PGYCGSCY 148
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
G + CCN CE+VR++Y +GWA S P+ I+QC REG+ ++IK + +GC I G +
Sbjct: 149 GGLEPEGGCCNTCEDVRKSYLDRGWAFSAPEHIEQCVREGWTEKIKVQANDGCQISGRVR 208
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAF---GEHFPGVVN- 175
+ KVA + F+ G+SF + H +++ + +D + H I L F E+ P N
Sbjct: 209 IKKVASSLIFSFGRSFQANSFHAQELVPYLKDGLIHDFGHHIETLQFQSDDEYDPRRANE 268
Query: 176 -------------PLDGV---------RWTQETPSGMYQYFIKVVPTVYTDVSGHTI--- 210
PL+G R + + M+QYFIKVV + + +
Sbjct: 269 AARLKKHLGVPKDPLNGFNSHYAKYSGRRGPDITTYMFQYFIKVVSADFETLDHEHVSSH 328
Query: 211 ------QSNQFSVTEHFRSSE----QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
+ H +++E PG+F D+SP++V TE+ F HFLT
Sbjct: 329 LYSYSSHTRNVGEAYHLKNTEGIETTHGYDAAPGLFINIDVSPMQVIHTEKRKPFAHFLT 388
Query: 261 NVCAIVGGVFTVSGIIDAFIYH 282
CAI+GGV TV+ ++D+ +++
Sbjct: 389 TFCAIIGGVLTVASLVDSALFN 410
>gi|413949705|gb|AFW82354.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
partial [Zea mays]
Length = 202
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 69/95 (72%), Positives = 85/95 (89%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI K+RL+S GNVIE+R++GIG K+++PLQ+HGGRL+ E YCG+CY
Sbjct: 91 DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 96
GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQ
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQ 185
>gi|68483709|ref|XP_714213.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|68483794|ref|XP_714172.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435713|gb|EAK95089.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
gi|46435761|gb|EAK95136.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
gi|238882494|gb|EEQ46132.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 414
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/326 (32%), Positives = 172/326 (52%), Gaps = 33/326 (10%)
Query: 1 MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
+D++G+ L++ KK RL + QG+VI + +D A D L L
Sbjct: 89 LDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDDEPAFNNDIELSDLAKGLPEGSDE 148
Query: 54 ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
YCGSCYGA D+ CCN+C VR AY +K W+ + + I+QC++EG++ R++E
Sbjct: 149 NAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSFYDGENIEQCEKEGYVGRLRERIN 208
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
EGC I G ++N+V+G FAPG SF + G H HD+ + + D FN H IN L+FG
Sbjct: 209 NNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHDLSLYTKYPDKFNFDHIINHLSFG 268
Query: 168 E--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
E ++PLD ++ + + Y++KVV T + + + I +NQFSV
Sbjct: 269 EMPVDGQADELFDSIHPLDDHQFMLHKKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVI 328
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R G+ + +PGV F +D+SP+K+ +++ ++ F+ V + +
Sbjct: 329 THDRPLVGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFVLGVISSIA 388
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
GV V ++D ++ Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414
>gi|367017984|ref|XP_003683490.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
gi|359751154|emb|CCE94279.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
Length = 406
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 162/323 (50%), Gaps = 41/323 (12%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD +GE LD+ F K R+DS G I + + +E YCGS
Sbjct: 89 MDNAGELQLDIMEAGFTKTRIDSNGKEISTSSFDASD--------SSSDYVPDDENYCGS 140
Query: 60 CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYGA+ D++ CC C++VR+AY + WA + I+QC+REG+++RI ++
Sbjct: 141 CYGAKDQDKNDELPKEERVCCQTCDDVRKAYLEAEWAFYDGKNIEQCEREGYVERINQQL 200
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
EGC + G ++++ G HFAPG+ F + H HD+ + N +H I+ L+FG+
Sbjct: 201 NEGCRVQGNALLSRIQGTIHFAPGRGFQNNRGHFHDMSLYDNTPQLNFNHIIHHLSFGKP 260
Query: 170 F---------PGVVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
+PLDG + + + ++Q YF K+VPT Y + +++ QFS T
Sbjct: 261 INSGAEDRGAATSTHPLDGRQVFPDRDTHLHQFSYFAKIVPTRYEYLDDVVVETAQFSTT 320
Query: 219 EHFRSSEQG----RLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
H R G TL PG+F ++++SP+KV E+H ++ FL N +G
Sbjct: 321 YHDRPLRGGVDDDHPNTLHSRGGSPGMFVYFEMSPLKVINKEQHAQTWSGFLLNCITSIG 380
Query: 268 GVFTVSGIIDAFIYHGQRAIKKK 290
GV V ++D +Y Q++I K
Sbjct: 381 GVLAVGTVLDKVLYKAQKSIWGK 403
>gi|388493200|gb|AFK34666.1| unknown [Medicago truncatula]
Length = 106
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 75/107 (70%), Positives = 92/107 (85%), Gaps = 2/107 (1%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
M QYFIKVVPTVYTD+ G I SNQ+SVTEHF+SSE G +PGVFFFYD+SPIKV F
Sbjct: 1 MCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELG--AAVPGVFFFYDISPIKVNFK 58
Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
EEH+ FLHFLTN+CAI+GG+FT++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 59 EEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIGKY 105
>gi|449479952|ref|XP_004155757.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 266
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 120/192 (62%), Gaps = 7/192 (3%)
Query: 100 EGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
E ++++K+ EE +GC +YG L+V +VAGNFH S H + V ++ N+
Sbjct: 74 ENLVKKVKQALEEAQGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFGGSKHVNV 129
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH I+ L+FG +PG+ NPLDG SG ++Y+IK+VPT Y +S + +NQFSV
Sbjct: 130 SHMIHDLSFGPKYPGIHNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSV 189
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
TE+F S ++ P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G++D
Sbjct: 190 TEYF-SPMTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLD 248
Query: 278 AFIYHGQRAIKK 289
+++ A+ K
Sbjct: 249 RWMFRFLEALTK 260
>gi|118482697|gb|ABK93267.1| unknown [Populus trichocarpa]
Length = 366
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/192 (43%), Positives = 120/192 (62%), Gaps = 7/192 (3%)
Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
E ++++K+ GEGC +YG L+V +VAGNFH S H + V ++ N+
Sbjct: 172 ETMIKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNV 227
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH I+ L+FG +PG+ NPLDG SG+++Y+IK+VPT Y +S + +NQFSV
Sbjct: 228 SHIIHDLSFGPKYPGIHNPLDGTARILRETSGIFKYYIKIVPTEYRYISKDVLPTNQFSV 287
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
TE+F S +T P V+F YDLSPI VT EE SFLHF+T +CAI+GG F ++G++D
Sbjct: 288 TEYF-SPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAILGGTFALTGMLD 346
Query: 278 AFIYHGQRAIKK 289
++Y A+ K
Sbjct: 347 RWMYRLLEALTK 358
>gi|449445069|ref|XP_004140296.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3-like [Cucumis sativus]
Length = 388
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/193 (43%), Positives = 125/193 (64%), Gaps = 9/193 (4%)
Query: 100 EGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
E ++++K+ EE +GC +YG L+V +VAGNFH S H + V ++ N+
Sbjct: 196 ENLVKKVKQALEEAQGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFGGSKHVNV 251
Query: 158 SHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 216
SH I+ L+FG +PG+ NPLDG VR ++T SG ++Y+IK+VPT Y +S + +NQFS
Sbjct: 252 SHMIHDLSFGPKYPGIHNPLDGTVRILRDT-SGTFKYYIKIVPTEYKYISKAVLPTNQFS 310
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
VTE+F S ++ P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G++
Sbjct: 311 VTEYF-SPMTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGML 369
Query: 277 DAFIYHGQRAIKK 289
D +++ A+ K
Sbjct: 370 DRWMFRFLEALTK 382
>gi|302414546|ref|XP_003005105.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261356174|gb|EEY18602.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 349
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 108/303 (35%), Positives = 143/303 (47%), Gaps = 62/303 (20%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEHNET---Y 56
MD+SGEQ + I K RL SQ +DG G ID K L H Y
Sbjct: 89 MDVSGEQQHGIVSGISKVRLRSQ-------KDGGGV--IDTKALSLHAADEAATHLAPDY 139
Query: 57 CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA++ + CCN CEEVREAY + WA + ++QC RE + +R+ E+ E
Sbjct: 140 CGDCYGAKAPANAVKQGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLDEQRAE 199
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
GC I G L VNKV GNFH APG+SF +HVHD+ + + +H+I+ L F
Sbjct: 200 GCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDAEIIHDFTHQIHALRF---- 255
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
V +D + S H RL
Sbjct: 256 ------------------------------VLSDEPQAQLSGGDDSAEGHAE-----RLH 280
Query: 231 T---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
T +PGVFF YD+SP+KV EE SF FLT +CA++GG TV+ +D ++ G
Sbjct: 281 TRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAAAVDRGMFEGSLR 340
Query: 287 IKK 289
+KK
Sbjct: 341 LKK 343
>gi|448531492|ref|XP_003870264.1| Erv46 protein [Candida orthopsilosis Co 90-125]
gi|380354618|emb|CCG24134.1| Erv46 protein [Candida orthopsilosis]
Length = 411
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 165/322 (51%), Gaps = 31/322 (9%)
Query: 2 DISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETY-- 56
D SG+ LD+ + +K R+ G+ + + P + + PL++ L +T
Sbjct: 91 DESGDLKLDIINSQLEKFRIIKSGHSSKPTEIKDDQPPLQREMPLEQIAPGLPDGQTEGE 150
Query: 57 CGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
CGSCYGA D+ CCN+C VR AY + W + + I QC+ EG++QR+++ + E
Sbjct: 151 CGSCYGAVPQDKKQYCCNSCAAVRRAYAEANWQFYDGENIAQCEEEGYVQRLRQRINDNE 210
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF 170
GC + G ++N+VAG FAPG S + HVHD+ + +D FN H IN L+FG +
Sbjct: 211 GCRVKGTTKINRVAGTMDFAPGASMTKER-HVHDLSLYMKYKDKFNFDHVINHLSFGNNP 269
Query: 171 P-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-TIQSNQFSVTEHFR 222
P G ++PLDG ++ Q YF+K+V T + + G +NQFS H R
Sbjct: 270 PDSQLVDTGSISPLDGHKFLQHKKLHSINYFLKIVATRFESLEGKDKFDTNQFSAITHDR 329
Query: 223 SSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFT 271
G+ +PGV F +D+SP+K+ EE+ F+ V + + GV
Sbjct: 330 PLAGGKDDDHQHTLHARAGVPGVAFNFDISPLKIINREEYAKTRSGFILGVVSSIAGVLM 389
Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
V ++D ++ Q+AIK K ++
Sbjct: 390 VGSLMDRSVFAAQQAIKGKKDL 411
>gi|294657513|ref|XP_459821.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
gi|199432751|emb|CAG88060.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
Length = 402
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 102/317 (32%), Positives = 166/317 (52%), Gaps = 27/317 (8%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
+DISG+ LD+ F+K R+ + N D D L+ + N CG
Sbjct: 89 LDISGDLQLDILKSGFQKYRILKESN--HEILDEAPVLSNDLSLEEMAKGVGANGK-CGP 145
Query: 60 CYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEGCN 115
CYGA + ++E CCN+CE V+ AY +K WA + I+QC+ EG++ R+ E EGC
Sbjct: 146 CYGALPQDNNEYCCNSCETVKLAYAEKMWAFYDGKDIEQCENEGYVSRLTERINNNEGCR 205
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE----- 168
+ G ++N+++GN HFAPG S G H+HD+ F++ D FN H IN +FG
Sbjct: 206 VKGTAQINRISGNLHFAPGSSSTAPGRHIHDLSLFEKYEDKFNFDHVINHFSFGSDPHDN 265
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQG 227
+ +PLD + + + Y++KVV T + + + + +NQFSV H R G
Sbjct: 266 NLQQSTHPLDNHQLVFDEKYHVASYYLKVVATRFEFIDTSLPLDTNQFSVISHHRPLRGG 325
Query: 228 RLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGII 276
+ + LPGVFF +++SP+K+ E++ ++ F+ V + V GV V ++
Sbjct: 326 KDEDHKHTLHARGGLPGVFFHFEISPMKIINKEQYAKTWSGFILGVISSVAGVLMVGTVL 385
Query: 277 DAFIYHGQRAIKKKIEI 293
D ++ ++AIK K ++
Sbjct: 386 DRSVWAAEKAIKGKKDM 402
>gi|255563175|ref|XP_002522591.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
gi|223538182|gb|EEF39792.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Ricinus communis]
Length = 191
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 119/191 (62%), Gaps = 9/191 (4%)
Query: 102 FLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
++++K+ GEGC +YG L+V +VAGNFH S H + V ++ N+SH
Sbjct: 1 MIKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAIHVNVSH 56
Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
I+ L+FG FPG+ NPLDG SG ++Y+IK+VPT Y +S + +NQFSVTE
Sbjct: 57 IIHDLSFGPKFPGLHNPLDGTARILHDASGTFKYYIKIVPTEYRYISKEVLPTNQFSVTE 116
Query: 220 HFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+F SE R T P V+F YDLSPI VT EE SFLHF+T +CA++GG F ++G++D
Sbjct: 117 YFSPMSEYDR--TWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLDR 174
Query: 279 FIYHGQRAIKK 289
++Y A+ K
Sbjct: 175 WMYRLLEAVTK 185
>gi|303290895|ref|XP_003064734.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226453760|gb|EEH51068.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 363
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/287 (38%), Positives = 157/287 (54%), Gaps = 32/287 (11%)
Query: 1 MDISGEQHLDVKHDIFKKR-LDSQGNVIES--RQDGIGAPK-IDKPLQRHGGRLEHNETY 56
MD +GE DV KKR LDS G +E + + A K I + ++ H L +E Y
Sbjct: 98 MDQAGEAFHDVHSGHLKKRRLDSDGKPLEGVFKHEKANAHKEIREDIESHALALSGDEEY 157
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQRIKEEEGEGCN 115
++S+ED ++G + N L+D+ G + K E EGC
Sbjct: 158 -------KTSEEDLM----------PEEGLTMFNLKQLLDKQFPGGIEKAFKNEAREGCE 200
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
+ G+LEVN+V G+F +PGKS HV L Q N+SH IN+ AFG+ FPG V+
Sbjct: 201 VIGYLEVNRVPGSFSVSPGKSIRLGMEHVQ--LNVQ-SRLNMSHTINRFAFGKSFPGFVS 257
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL--- 232
PLDG P+ ++QYF+K+VPT +T + G +QSNQ+SVTE S+ L +
Sbjct: 258 PLDG-NARDLDPNYVHQYFLKIVPTSFTPLRGEYLQSNQYSVTE--ASAPAKALNVVGSK 314
Query: 233 -PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
GV+F YDLSP++V + E S F+T+VCAIVGGV ++SG++ A
Sbjct: 315 PSGVYFNYDLSPLRVDYVESRNSMTEFITSVCAIVGGVASMSGLVQA 361
>gi|357112836|ref|XP_003558212.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
compartment protein 3-like [Brachypodium distachyon]
Length = 349
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 99/281 (35%), Positives = 146/281 (51%), Gaps = 36/281 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RLD G +I + E+
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGTIIGT---------------------EYLSDLVEKE 127
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+GA D ++ EE KK N D K R E GEGC +YG L
Sbjct: 128 HGAHHHDNGHEHHDEE------KKPEHTFNEDADKMVKS----VRQALENGEGCRVYGML 177
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H ++V + + N+SH I++L+FG +PG+ NPLD
Sbjct: 178 DVQRVAGNFHI----SVHGLNIYVAEKIFEGSSHVNVSHVIHELSFGPKYPGIHNPLDDT 233
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IKVVPT Y +S + +NQFSVTE+F ++ P V+F YD
Sbjct: 234 TRILHDASGTFKYYIKVVPTEYRYLSKQVLPTNQFSVTEYFVPIRPAD-RSWPAVYFLYD 292
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
LSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 293 LSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 333
>gi|242059085|ref|XP_002458688.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
gi|241930663|gb|EES03808.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
Length = 350
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 147/284 (51%), Gaps = 41/284 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RLD G++I + K H EH++
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGHIIGTEYLSDLVEKGHGAHHDHDHGQEHHD------ 142
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ E N EE + + AL N GEGC +YG L
Sbjct: 143 --EQKKPEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGML 178
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H + V + + N+SH I++L+FG +PG+ NPLD
Sbjct: 179 DVQRVAGNFHI----SVHGLNIFVAEKIFEGSSHVNVSHVIHELSFGPKYPGIHNPLDET 234
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVFF 237
SG ++Y+IKVVPT Y +S + +NQFSVTE+F R S++ P V+F
Sbjct: 235 SRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPSDRA----WPAVYF 290
Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
YDLSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 291 LYDLSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 334
>gi|11036454|dbj|BAB17274.1| unnamed protein product [Arabidopsis thaliana]
Length = 333
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 101/281 (35%), Positives = 152/281 (54%), Gaps = 43/281 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
+D+SG+ +D+ +I+K RL+S G++I E D + G P +H G+ EH NET
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
EA G+ DQ E ++++K+ +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
C +YG L+V +VAGNFH S H ++V ++ + N+SH I+ L+FG +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
NPLD SG ++Y+IK+VPT Y +S + +NQ+SVTE+F + +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYFTPMTEFD-RTWP 292
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
V+F YDLSPI VT EE SFLH +T +CA++GG F ++G
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTG 333
>gi|224086657|ref|XP_002307923.1| predicted protein [Populus trichocarpa]
gi|222853899|gb|EEE91446.1| predicted protein [Populus trichocarpa]
Length = 351
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 81/192 (42%), Positives = 118/192 (61%), Gaps = 7/192 (3%)
Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
E ++++K+ GEGC +YG L+V +VAGNFH S H + V ++ N+
Sbjct: 157 ETMVKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNV 212
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH I+ L+FG +PG+ NPLDG SG ++Y+IK+VPT Y +S + +NQFSV
Sbjct: 213 SHIIHDLSFGPKYPGIHNPLDGTTRILHETSGTFKYYIKIVPTEYRYISKEVLPTNQFSV 272
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
TE+F S +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F ++G++D
Sbjct: 273 TEYF-SPMTDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLD 331
Query: 278 AFIYHGQRAIKK 289
++ A+ K
Sbjct: 332 RWMCRLLEALTK 343
>gi|50305633|ref|XP_452777.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49641910|emb|CAH01628.1| KLLA0C12947p [Kluyveromyces lactis]
Length = 405
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 41/323 (12%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
+D SGE +++ F K R+ +G + + +G + G YCG
Sbjct: 88 LDDSGEFQINLLDSGFTKIRISPEGKELSKEKFQVGDKSSKQSFNEEG--------YCGP 139
Query: 60 CYGA-ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
CYGA + S D CC C++VR AY +KGWA + ++QC+REG+++ I
Sbjct: 140 CYGALDQSKNDELPQDQKVCCQTCDDVRAAYGQKGWAFKDGKGVEQCEREGYVESINARI 199
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAFGEH 169
EGC + G ++N++ G HF PG S H HD + N +H IN L FGE
Sbjct: 200 HEGCRVQGRAQLNRIQGTIHFGPGSSMRNIRGHFHDTSLYDAYPHLNFNHIINTLTFGEK 259
Query: 170 F---------PGVVNPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
++PLD V ++T + YF K++PT + + G +++ QFS T
Sbjct: 260 PKDGDSELIGSASISPLDSRQVFPDRDTHFHEFSYFCKIIPTRFEFLDGKKVETTQFSAT 319
Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVG 267
H R GR + +PGVFF +++SP+KV E+H S+ FL N +G
Sbjct: 320 YHDRPLRGGRDEDHPNTVHSKGGVPGVFFNFEMSPLKVINKEQHATSWSGFLLNCITSIG 379
Query: 268 GVFTVSGIIDAFIYHGQRAIKKK 290
GV V +ID Y Q++I K
Sbjct: 380 GVLAVGTVIDKITYRAQKSIWGK 402
>gi|225446891|ref|XP_002284045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 3 [Vitis vinifera]
gi|296086333|emb|CBI31774.3| unnamed protein product [Vitis vinifera]
Length = 351
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 97/294 (32%), Positives = 152/294 (51%), Gaps = 44/294 (14%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---- 56
+D+SG+ +D+ +I+K RL+ G +I G + +++ +H+
Sbjct: 89 IDMSGKHEVDLDTNIWKLRLNRDGFII-------GTEYLSDLVEKEHADHKHDHNKDHHG 141
Query: 57 -CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
A S D+D N ++V++A L+N GEGC
Sbjct: 142 DSDQKLHAHSFDQDAENMVKKVKQA-------LAN--------------------GEGCR 174
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
+YG L+V +VAGNFH S H + V ++ N+SH I+ L+FG +PG+ N
Sbjct: 175 VYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAIHVNVSHIIHDLSFGPKYPGLHN 230
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
PLDG SG ++Y+IK+VPT Y +S + +NQFSV E+F + +T P V
Sbjct: 231 PLDGTVRILRGASGTFKYYIKIVPTEYRYISKEVLPTNQFSVMEYFSPMNEFD-RTWPAV 289
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+F YDLSP+ VT EE SFLHF+T +CA++GG F ++G++D ++Y + K
Sbjct: 290 YFLYDLSPVTVTIKEERRSFLHFITRLCAVLGGTFALTGMLDRWMYRFLEMLTK 343
>gi|212275606|ref|NP_001131002.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
gi|194690678|gb|ACF79423.1| unknown [Zea mays]
gi|413952089|gb|AFW84738.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 293
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 152/285 (53%), Gaps = 43/285 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR-HGGRLEHNETYCGS 59
+D+SG+ +D+ +I+K RLD G++I G + +++ HG +H+ +
Sbjct: 32 IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLSDLVEKGHGAHHDHDHDHDHH 84
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+ E N EE + + AL N GEGC +YG
Sbjct: 85 D--EQKKHEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGM 120
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
L+V +VAGNFH S H + V + + + N+SH I++L+FG +PG+ NPLD
Sbjct: 121 LDVQRVAGNFHI----SVHGLNIFVAEKIFEGSNHVNVSHVIHELSFGPKYPGIHNPLDE 176
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVF 236
SG ++Y+IKVVPT Y +S + +NQFSVTE+F R +++ P V+
Sbjct: 177 TSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRA----WPAVY 232
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
F YDLSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 233 FLYDLSPITVTIKEERRNFLHFVTRLCAVLGGTFAMTGMLDRWMY 277
>gi|194708090|gb|ACF88129.1| unknown [Zea mays]
gi|195607866|gb|ACG25763.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|195619788|gb|ACG31724.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
gi|413952088|gb|AFW84737.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 350
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 98/285 (34%), Positives = 152/285 (53%), Gaps = 43/285 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR-HGGRLEHNETYCGS 59
+D+SG+ +D+ +I+K RLD G++I G + +++ HG +H+ +
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLSDLVEKGHGAHHDHDHDHDHH 141
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+ E N EE + + AL N GEGC +YG
Sbjct: 142 D--EQKKHEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGM 177
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
L+V +VAGNFH S H + V + + + N+SH I++L+FG +PG+ NPLD
Sbjct: 178 LDVQRVAGNFHI----SVHGLNIFVAEKIFEGSNHVNVSHVIHELSFGPKYPGIHNPLDE 233
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVF 236
SG ++Y+IKVVPT Y +S + +NQFSVTE+F R +++ P V+
Sbjct: 234 TSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRA----WPAVY 289
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
F YDLSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 290 FLYDLSPITVTIKEERRNFLHFVTRLCAVLGGTFAMTGMLDRWMY 334
>gi|367004394|ref|XP_003686930.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
gi|357525232|emb|CCE64496.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
Length = 439
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 102/343 (29%), Positives = 167/343 (48%), Gaps = 63/343 (18%)
Query: 4 SGEQHLDV-----KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
SG LD+ + K RL+++G VI + KI L + E E YCG
Sbjct: 95 SGNVQLDIDLEEASSNFVKTRLNNRGEVIGKAK----KFKITDDLGEYAP--EDKENYCG 148
Query: 59 SCYGAES----------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
SCYG++ +D+ CCN+CE+VR+AY + GWA + I+QC+REG+++ I E
Sbjct: 149 SCYGSKDQTKNEDIEKITDKVCCNSCEDVRQAYSEAGWAFFDGKNIEQCEREGYVKTINE 208
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFG 167
EGC + G +NK+ GN HFAPGK+F H HD F Q + N H IN L+FG
Sbjct: 209 RLSEGCRVKGEALLNKIHGNLHFAPGKAFQNRRGHFHDTSLFNQHKNLNFQHVINHLSFG 268
Query: 168 EHFPGVVN----------------PLDGVRWTQETPSG--------------MYQYFIKV 197
+ +V P+DG + + +G + Y+ ++
Sbjct: 269 KPIRQLVTSNFQDTMSDSLRAQTAPIDGHQAFIQDNTGDSDSASTTIAAHDYQFIYYAEI 328
Query: 198 VPTVYTDVSGHTIQSNQFSVTEHFRS----SEQGRLQTL------PGVFFFYDLSPIKVT 247
+ T + + G +++Q +VT H++ + Q +Q + PG++ +++SP+KV
Sbjct: 329 ISTRFEYLKGDLEETSQLTVTSHYKKIGYQNGQDYMQGMQSRSGIPGLYIDFEVSPLKVI 388
Query: 248 FTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
E++ S+ +L +GG+ V +ID +Y Q A+K+
Sbjct: 389 NKEQYSTSWSGYLLKTITSIGGILAVGTVIDKVVYATQTALKQ 431
>gi|260950825|ref|XP_002619709.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
gi|238847281|gb|EEQ36745.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
Length = 415
Score = 154 bits (388), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 99/331 (29%), Positives = 168/331 (50%), Gaps = 42/331 (12%)
Query: 1 MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY--- 56
+D++G+ HLD+ F+ R+ G E D + K + G L +E
Sbjct: 89 LDMTGDLHLDIVESGFEMFRVLPSG---EEISDDLPLLSGAKKFEDVCGPLTEDEISRGV 145
Query: 57 -CGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI--KEEEG 111
CG CYGA ++ ++ CCN CE VR AY + W + I+QC+REG+++++ +
Sbjct: 146 PCGPCYGAVDQTDNKRCCNTCEAVRMAYAVQEWGFFDGSNIEQCEREGYVEKMVSRINNN 205
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGEH 169
EGC I G ++N+++GN HFAPG ++G H HD+ + + + F+I HKIN +FGE
Sbjct: 206 EGCRIKGSAKINRISGNLHFAPGVPLSRNGRHSHDLSLWTKYSNKFSIDHKINHFSFGED 265
Query: 170 FPGV--------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG--HTIQSN 213
P ++PLDG + + + + Y++ VV T + + G + +N
Sbjct: 266 -PSASRRLASTDDSQEPSIHPLDGFHFDLKKKNHVASYYLSVVSTRFEFLDGKKEAVDTN 324
Query: 214 QFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNV 262
QFSV H R GR +PG FF +D+SP+K+ EE+ ++ F+ V
Sbjct: 325 QFSVITHDRPIVGGRDDDHQNTMHAQGGVPGAFFHFDISPMKIISREEYAKTWSGFILGV 384
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
+ + GV TV +D ++ ++ ++ K ++
Sbjct: 385 VSSIAGVLTVGAALDRSVWTAEQVLRGKKDM 415
>gi|410083920|ref|XP_003959537.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
gi|372466129|emb|CCF60402.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
Length = 417
Score = 150 bits (380), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 103/332 (31%), Positives = 160/332 (48%), Gaps = 53/332 (15%)
Query: 4 SGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
+G+ LD+ + + K R+DS G + + IG + K + + YCGSCYG
Sbjct: 91 AGDLQLDLLESGLTKTRVDSNGVSLTTESFNIGNEALIKR--------DFPQDYCGSCYG 142
Query: 63 A---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
A ++++ CC CE+V +AY GWA + I+QC+ EG++ RI E EG
Sbjct: 143 ALDQGKNDELNANEKVCCQTCEDVHDAYLNIGWAFYDGKNIEQCETEGYVDRINEHLNEG 202
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQS------GVHVHDILAFQRD-SFNISHKINKLAF 166
C + G +N+V GN HFAPGKS+ H HD + + S + +H I+ +F
Sbjct: 203 CRVQGSARLNRVQGNIHFAPGKSYQDYSRRNSFATHFHDTSLYDKTHSLSFNHIIHHFSF 262
Query: 167 GE---------HFPGV----VNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT-- 209
G+ H G+ NPLDG + + S Y YF ++VPT Y ++ +
Sbjct: 263 GKPIENSYVNNHNEGLSKISTNPLDGRKVFPDRDSHFIQYSYFAEIVPTRYEYLNNKSDP 322
Query: 210 IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHF 258
+++ QFS T H R GR + +PG+F +++ SP+KV E++ ++ F
Sbjct: 323 VETTQFSATFHSRPLRGGRDEDHPTTLHQRGGIPGLFIYFETSPLKVINKEQYSQAWSTF 382
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
L N +GG+ V D Y QR I K
Sbjct: 383 LLNCITTIGGILAVGTSFDKITYKAQRTIWGK 414
>gi|221114903|ref|XP_002155889.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Hydra magnipapillata]
Length = 399
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 78/173 (45%), Positives = 106/173 (61%), Gaps = 2/173 (1%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+E +GC IYG +EVNKVAGNFH GKS H H ++N SH+I+ L+FGE
Sbjct: 174 KEFDGCRIYGNIEVNKVAGNFHITAGKSIPHPRGHAHLSALVSELNYNFSHRIDMLSFGE 233
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
PG++NPLDG TP MYQY+I +VPT + +TI++NQ+SVT+ R +
Sbjct: 234 PHPGIINPLDGDLMITTTPYHMYQYYIAIVPTTIQTLK-NTIKTNQYSVTQRSRQLNLNS 292
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
Q +PG+FF YD + I V+ EE SF FL +C I+GGVF SG++ + I
Sbjct: 293 GSQGVPGIFFKYDFNAISVSVNEERRSFNEFLIRLCGIIGGVFATSGMLHSAI 345
>gi|397564627|gb|EJK44287.1| hypothetical protein THAOC_37187 [Thalassiosira oceanica]
Length = 506
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 108/356 (30%), Positives = 163/356 (45%), Gaps = 75/356 (21%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D++G+ L+V +FK+RLD G + A ++ +R YCG
Sbjct: 142 IDVAGDSQLEVSDKMFKQRLDLDGTPRPLAKISAEANAKALEDKKRREVVEKSVGPDYCG 201
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREG---FLQRIKEEEGEGC 114
CYGA+ + +DCCN C++V E Y+KK W + L +QC REG + + GEGC
Sbjct: 202 PCYGAQENAQDCCNTCDDVIERYKKKRWNDNAVQPLAEQCIREGRAGVSEPKRMAGGEGC 261
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF-------- 166
N+ G VN+VAGNFH A G+ + G H+H L R +F +H I++L+F
Sbjct: 262 NLSGHFTVNRVAGNFHIAMGEGVERDGRHIHQFLPEDRVNFIANHVIHELSFLDDEYGDI 321
Query: 167 -GEHFPGVVNP--LDGVRWTQET---------PSGMYQYFIKVVPTVY------------ 202
GE F +++ ++G R + +G++QYFIKVVPT Y
Sbjct: 322 EGEGFLNLMSKAGVNGERSMNGSVKTVTEETGTTGLFQYFIKVVPTKYKGDIIDDMGVST 381
Query: 203 -TDVSGHTIQSNQFSVTEHFRS------------------------SEQGRLQ------- 230
+D +++N++ TE FR S+ G Q
Sbjct: 382 LSDGQEKQLETNRYFYTERFRPLIGDIDEEALLAGDVEKGTAGAHVSKAGGTQHQQAEHH 441
Query: 231 -----TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
LPGVFF Y++ P V + V F+H + A VGGVFT+ ID ++
Sbjct: 442 AATNAVLPGVFFVYEIYPFMVEVSRNRVPFMHLWIRIMATVGGVFTMMSWIDGALH 497
>gi|326490247|dbj|BAJ84787.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326493774|dbj|BAJ85349.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 348
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/281 (33%), Positives = 146/281 (51%), Gaps = 37/281 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RLD G + IG + +++ ++ G
Sbjct: 89 IDMSGKHEVDLHTNIWKLRLDKYGQI-------IGTEYLSDLVEK---EHGTHDHDHGHG 138
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ + E N E+ + + A+ N GEGC +YG L
Sbjct: 139 HDVQKQPEHTFN--EDADKMVKSVKLAMEN--------------------GEGCRVYGAL 176
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
+V +VAGNFH S H + V + + N+SH I++L+FG +PG+ NPLD
Sbjct: 177 DVQRVAGNFHI----SVHGLNIFVANQIFDGSSHVNVSHVIHRLSFGPEYPGIHNPLDDT 232
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
SG ++Y+IKVVPT Y +S + +NQFSVTE+F ++ P V+F YD
Sbjct: 233 SRILHDTSGTFKYYIKVVPTEYRYLSKGVLPTNQFSVTEYFVPIRPTD-RSWPAVYFLYD 291
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
LSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 292 LSPITVTIREERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 332
>gi|67623967|ref|XP_668266.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis TU502]
gi|54659454|gb|EAL38030.1| serologically defined breast cancer antigen 84 [Cryptosporidium
hominis]
Length = 397
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 97/303 (32%), Positives = 159/303 (52%), Gaps = 37/303 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-------PLQRHGGR---L 50
+D + LD+ DI RL + ++S D +G ++D P+ +G +
Sbjct: 87 VDNTINNKLDIMLDITFPRLRCEEISVDS-VDYVGENQVDSKEYMAKIPIDLNGQEVRNI 145
Query: 51 EHNE-----TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID---QCKREGF 102
++N+ C SCYGAE+++ CCN+C+ ++ AYR KGW S D++ QC
Sbjct: 146 KYNQQNDLKIECMSCYGAETNEFLCCNDCDSLKTAYRSKGW--SYLDIVSKAPQCI---- 199
Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKI 161
E GC I G ++VNKV+GN H A G + ++G HVH+ + FN SH I
Sbjct: 200 -------EKVGCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVSRGFNTSHII 252
Query: 162 NKLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTE 219
++L FG + P + +PL+ ++ + M+ Y++K++PT Y +G + NQ++ TE
Sbjct: 253 HELRFGSDRIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVNLYGNQYAFTE 312
Query: 220 HFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
R + G L LPGVF YD P + + V H +T+ CAIVGG++++ ++D
Sbjct: 313 RERDVHVQNGELSGLPGVFIVYDFQPFLLQKIYKRVPISHLITSFCAIVGGIYSIMSLLD 372
Query: 278 AFI 280
F+
Sbjct: 373 TFV 375
>gi|123389547|ref|XP_001299739.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121880652|gb|EAX86809.1| hypothetical protein TVAG_100310 [Trichomonas vaginalis G3]
Length = 351
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 99/297 (33%), Positives = 144/297 (48%), Gaps = 34/297 (11%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
D+ G + + ++K R+D GN I Q CG CY
Sbjct: 86 DMMGSGNRPDQKTLYKVRVDQNGNPIPQTQIA---------------------EDCGPCY 124
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
GAESS CC CE+V AY++KGW + N QC+ EG + KE C YG L
Sbjct: 125 GAESSQRKCCQTCEDVVAAYQEKGWGIGNLSSWAQCRAEGVMFDGKER----CQAYGNLH 180
Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
VN + G FH APG + HVHD D+ N++H+I ++FG P +PLD R
Sbjct: 181 VNAIEGGFHLAPGINVFSRFGHVHDFSPLV-DTLNLTHEIEHISFGA--PIDKSPLDNTR 237
Query: 182 WTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVT-EHFRSSEQGRLQTLPGVFFFY 239
Q+ P + Y+Y +K VPTV +V+G + +F+V + +GR PG+FF Y
Sbjct: 238 VVQKKPGQIHYRYNLKAVPTV-KEVNGKVHRFFRFTVNYAEIPVTARGRYG--PGIFFVY 294
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
+P+ +T T + + L + +I GG F ++ +ID+F Y I+ K I KF
Sbjct: 295 SFAPVAITSTYDRPNITVLLARLISIFGGSFMLARLIDSFTYR-LNTIEGKDRINKF 350
>gi|66363024|ref|XP_628478.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
gi|46229502|gb|EAK90320.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
and possible N region transmembrane [Cryptosporidium
parvum Iowa II]
Length = 397
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 93/288 (32%), Positives = 151/288 (52%), Gaps = 36/288 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D GE +D K + K +D G + + K Q++ ++E C SC
Sbjct: 116 VDYVGENQVDSKEYMVKIPIDLNGQEVRNI----------KYNQQNDLKIE-----CMSC 160
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID---QCKREGFLQRIKEEEGEGCNIY 117
YGAE+++ CCN+C+ ++ AYR KGW S D++ QC E GC I
Sbjct: 161 YGAETNEFLCCNDCDSLKTAYRSKGW--SYLDIVSKAPQCI-----------EKVGCRIN 207
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKINKLAFG-EHFPGVVN 175
G ++VNKV+GN H A G + ++G HVH+ + FN SH I++L FG + P + +
Sbjct: 208 GRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVSRGFNTSHIIHELRFGSDKIPFLFS 267
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRSS--EQGRLQTL 232
PL+ ++ + M+ Y++K++PT Y +G + NQ++ TE R + G L L
Sbjct: 268 PLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVNLYGNQYAFTERERDVHVQNGELSGL 327
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PG+F YD P + + V H +T+ CAIVGG++++ ++D F+
Sbjct: 328 PGIFIVYDFQPFLLQKIYKRVPISHLITSFCAIVGGIYSIMSLLDTFV 375
>gi|209876426|ref|XP_002139655.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555261|gb|EEA05306.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 395
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/304 (32%), Positives = 160/304 (52%), Gaps = 36/304 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRL-------DSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-- 51
+D + Q LD++ DI L D+ NV E++ + G + P+ HG ++
Sbjct: 87 VDDNMNQKLDIRLDISFPSLRCSEISVDTVDNVGENQVNAHGNL-LKIPIDIHGNEVQEE 145
Query: 52 ----HNETY---CGSCYGAESSDEDCCNNCEEVREAYRKKGWA-LSNPDLIDQCKREGFL 103
+NE+ C SC+GAES CCN CE ++ A+R KGW+ L QC
Sbjct: 146 IMAQYNESTSMKCLSCFGAESIHYKCCNTCESLKSAFRYKGWSYLDIASKAPQCINT--- 202
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKIN 162
GC ++G L+VNKV+GN H A G++ + G HVH+ + FN SH I+
Sbjct: 203 --------VGCRLHGSLQVNKVSGNIHVALGQATVRDGKHVHEFNMNDISRGFNTSHTIH 254
Query: 163 KLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTE 219
+L FG ++ + +PL+ + T + M+ Y++K+VPT + SG++ + SNQ++ TE
Sbjct: 255 ELRFGKDNIEFIGSPLENTKKIVTTGTSMFHYYLKLVPTQFIK-SGYSKVLFSNQYTYTE 313
Query: 220 HFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
+ + G L LPGVF YD P + + HFLT+ CAI+GG++++ ++D
Sbjct: 314 RQKDVLVKDGELSGLPGVFIVYDFQPFVIRKIHNSIPTTHFLTSFCAIIGGIYSLMSLVD 373
Query: 278 AFIY 281
+ ++
Sbjct: 374 SILF 377
>gi|428171090|gb|EKX40010.1| hypothetical protein GUITHDRAFT_154283 [Guillardia theta CCMP2712]
Length = 331
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 95/279 (34%), Positives = 137/279 (49%), Gaps = 65/279 (23%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE LD+ HD++K+ ++S+ + +G P I
Sbjct: 97 MDVSGEHELDIVHDVYKR-------AMDSKGNALG-PVI--------------------- 127
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
E+V+ A ALS + +Q +R EGCNIYG L
Sbjct: 128 -------------SEKVKLARD----ALSISHIKEQLERH-----------EGCNIYGTL 159
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
KV+GNFH S H HV + R + N SH +N L+FG +PG+ NPLDG
Sbjct: 160 NAQKVSGNFHL----SLHAQDFHVLAQVFPDRATVNTSHIVNHLSFGRDYPGLKNPLDGE 215
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
+ SG ++Y+IK+VPT + + G I +NQ+SVT+HFR + G P V+F YD
Sbjct: 216 MKVLDQGSGTFEYYIKIVPTKFHHLDGTIIDTNQYSVTDHFRKLQDG----FPAVYFIYD 271
Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+SPI V + SF H+ T +CAI GG++ V+G + A
Sbjct: 272 ISPIMVRVKQWKQSFSHYATQLCAITGGMYVVTGQLHAL 310
>gi|384253563|gb|EIE27037.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 327
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 79/181 (43%), Positives = 109/181 (60%), Gaps = 15/181 (8%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-----DSFNISHKINKLAF 166
EGCNI+G+L++ +VAGNF + VHV D A R N SH I++++F
Sbjct: 152 EGCNIFGWLDLQRVAGNFRVS---------VHVEDFFALTRLQADTTGINSSHIIHRVSF 202
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G FPG VNPLDG + SG ++YF+KVVPT Y +G +NQ+SVTE+ +
Sbjct: 203 GPTFPGQVNPLDGAERILDKESGTFKYFLKVVPTEYQWSAGTRTTTNQYSVTEYDTVVHK 262
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
G +Q +P V+F YD+SPI VT +E SF H L CA+VGGVF V+G+ D +++ A
Sbjct: 263 GEMQ-MPSVWFSYDISPISVTISEIRKSFAHLLVRFCAVVGGVFAVTGMFDRWVHRIVTA 321
Query: 287 I 287
I
Sbjct: 322 I 322
>gi|384501765|gb|EIE92256.1| hypothetical protein RO3G_17063 [Rhizopus delemar RA 99-880]
Length = 291
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 80/173 (46%), Positives = 107/173 (61%), Gaps = 11/173 (6%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD SGEQ D+ K RLD+ GN+IES K+ LE CGSC
Sbjct: 81 MDESGEQSSGYSQDVTKIRLDTLGNIIESGH----TVKLGDHTNDAKKALEE-APECGSC 135
Query: 61 YGAESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
YGA+ ED CC++C++VREAY K+GW L N I+QC REG+L +++ + EGCN++G
Sbjct: 136 YGAKPLREDGCCHSCQDVREAYVKQGWGLVNTKEIEQCIREGWLAKLENQSNEGCNVHGH 195
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-----SFNISHKINKLAFG 167
L VNKV GNFHFAPG +F +HVHD+ + + SF++SH+I+KL FG
Sbjct: 196 LLVNKVRGNFHFAPGGAFQAGSMHVHDLQEYTQGAPNGHSFDMSHRIHKLKFG 248
>gi|410078101|ref|XP_003956632.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
gi|372463216|emb|CCF57497.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
Length = 414
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 88/269 (32%), Positives = 134/269 (49%), Gaps = 35/269 (13%)
Query: 53 NETYCGSCYGAESS-----------DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREG 101
+E YCG CYGA+ D CC C +V+ +Y GWA + I+QC+REG
Sbjct: 137 DENYCGPCYGAKDQSINDKEGIKKEDRVCCQTCSDVKNSYLDAGWAFFDGKNIEQCEREG 196
Query: 102 FLQRIKEEEGEGCNIYGF-LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISH 159
++++I + EGC I G + +N+V GN HFAPG+++H H HD + + N +H
Sbjct: 197 YIEKINSQLNEGCQIKGSNVLINRVNGNLHFAPGEAYHNPNGHYHDTSFYDLKPQLNFNH 256
Query: 160 KINKLAFGE--------HFPGVVN-PLDGVRWTQETPSGMY--QYFIKVVPTVYTDVSGH 208
IN +FG H ++N PLDG + E S Y YF K+V T Y +
Sbjct: 257 IINHFSFGNGAVDRDATHDTTLMNSPLDGTQVLPEYDSHAYAFTYFNKIVSTRYEYLERD 316
Query: 209 TIQSNQFSVTEHFRSSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEH-VSFLH 257
+++ QF+ H R G +PG+F ++D+SP+K+ E+H V++
Sbjct: 317 PLETVQFTSMFHDRQINGGNDIHDEKIKHARGGIPGLFIYFDISPMKIINKEQHTVNWST 376
Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
F+ N +GG+ V +ID Y QR
Sbjct: 377 FVLNCITSIGGILAVGTVIDKIFYKTQRT 405
>gi|302823246|ref|XP_002993277.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
gi|302825185|ref|XP_002994225.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300137936|gb|EFJ04730.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
gi|300138947|gb|EFJ05698.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
Length = 333
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 158/291 (54%), Gaps = 52/291 (17%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K RL G+++ G+ + +++ EH
Sbjct: 87 IDMSGKHEVDLDTNIWKLRLHKDGHIL-------GSEYLSDLVEK-----EH-------- 126
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
A + ++ EE+R A + ++++ + ++GEGC ++G L
Sbjct: 127 --AHDNLTGIFHSHEELRSAVK----------VVNEINK-------ALQDGEGCRVFGVL 167
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD- 178
+V +VAGNFH S H + + H + N+SH IN L+FG +PG+ NPLD
Sbjct: 168 DVERVAGNFHI----SMHGMSLQIFHSV-----KEVNVSHIINDLSFGPKYPGIHNPLDR 218
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
VR ++T +G ++YFIK+VPT Y ++G + +NQFSV E++ ++ + + P V+F
Sbjct: 219 TVRILRDT-AGTFKYFIKIVPTEYRYLNGGKLPTNQFSVGEYYLAARDDDI-SWPAVYFL 276
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
YDLSPI V EE SF H LT CAIVGG F+++G++D +IY +I +
Sbjct: 277 YDLSPITVLIKEERRSFGHLLTRFCAIVGGTFSLTGMLDRWIYRLVESITR 327
>gi|412992535|emb|CCO18515.1| predicted protein [Bathycoccus prasinos]
Length = 428
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 155/314 (49%), Gaps = 30/314 (9%)
Query: 1 MDISGEQHLDVKHD--IFKKRLDSQGNVIESR----QDGIGAPKIDKPLQRHGGRL---- 50
+D +GE H DV HD I K+RLD G I R +D + + +H +L
Sbjct: 121 LDAAGEVHHDV-HDGHITKRRLDRDGKPIPRRDSSAKDDVAVTREKPNKHKHIEKLVREK 179
Query: 51 -EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA------LSNPDLIDQCKREGFL 103
+ E + E +E R + A LI + G
Sbjct: 180 EKEEEGKKNEGEQEQEQQEQNHEQHDEKRRKLQNTALAGFGGGFFDINALIHEQFPNGLE 239
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+ K + EGC + G+LEVN+V G+F +PGKS H+ + N+SH IN+
Sbjct: 240 EAFKNKNKEGCEVMGYLEVNRVPGSFSISPGKSLQIGMSHIQLNVV---SHLNMSHTINR 296
Query: 164 LAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
LAFGE FPG +N LD R+ P+ ++QYF+KVVPT + + T+ +NQ+SVTE
Sbjct: 297 LAFGEAFPGALNLLDKNTRYL--PPNAVHQYFLKVVPTSFARLKDTTLATNQYSVTESSS 354
Query: 223 SSEQ-----GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
S++Q G G++F Y+LSPI++ F E SF F+ +VC+I+GGV T SGI+
Sbjct: 355 SAKQSFFGMGSSGKPSGIYFHYELSPIRIDFKERRNSFGEFMLSVCSIIGGVATSSGILH 414
Query: 278 AFIYHGQ-RAIKKK 290
I Q RA KK
Sbjct: 415 KLIVFIQTRARSKK 428
>gi|145351005|ref|XP_001419879.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580112|gb|ABO98172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 373
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 99/281 (35%), Positives = 150/281 (53%), Gaps = 27/281 (9%)
Query: 2 DISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
D +GE+H DV HD I K+R+D G VI++ + K +K + + NET S
Sbjct: 95 DKAGEEHYDV-HDGHIEKRRIDKHGKVIDA---AFTSEKPNKHKEIEQALQKMNET--DS 148
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+ A+S E + G L+ + EG + E EGC + G+
Sbjct: 149 AHAADS----------HAMEHVQPFGGMFGLQSLLQEVFPEGVEHAFRNENQEGCEVKGY 198
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
LEVN+V G F +PG+S V L Q + N++H I++L+FGE FPG+V+PLDG
Sbjct: 199 LEVNRVPGRFSISPGRSLMMGMQMVK--LNVQ-TALNLTHTIHRLSFGESFPGLVSPLDG 255
Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTL----PG 234
+ P+ + QYF+ VV T + + I ++Q+SVTE F SS++ + T PG
Sbjct: 256 THRSLP-PNAVQQYFLNVVSTTFEPLGENKIISTHQYSVTETFTSSQRSIMGTSNGRDPG 314
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
V F Y++SPI+V F E SF F+ +C+++GGV T++GI
Sbjct: 315 VIFTYEISPIRVDFKETRTSFGAFVLGICSVIGGVVTMAGI 355
>gi|156402826|ref|XP_001639791.1| predicted protein [Nematostella vectensis]
gi|156226921|gb|EDO47728.1| predicted protein [Nematostella vectensis]
Length = 413
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 105/184 (57%), Gaps = 1/184 (0%)
Query: 98 KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
KRE +E + C +YG +VNKVAGNFH GKS H H H +S N
Sbjct: 156 KREESKDAANTKEHDACRVYGSFKVNKVAGNFHITSGKSIHHPRGHAHLSSMVPVESLNF 215
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH+I+ L+FG+ PG+V+PLDG E MYQY+I+VVPT ++ I++NQ+S+
Sbjct: 216 SHRIDMLSFGKRVPGIVHPLDGEMQITEKRRMMYQYYIQVVPTSIKSLNSEEIKTNQYSM 275
Query: 218 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
T+ R S + G+FF YD+S I V +H S + FL +C IVGG+F SG++
Sbjct: 276 TQRIREISHDSGSHGIAGLFFKYDMSSIMVRVKHQHHSMVGFLVRLCGIVGGIFATSGML 335
Query: 277 DAFI 280
FI
Sbjct: 336 HDFI 339
>gi|168004249|ref|XP_001754824.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693928|gb|EDQ80278.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 347
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 149/282 (52%), Gaps = 43/282 (15%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D+SG+ +D+ +I+K R+ G V+ S E
Sbjct: 91 IDMSGKHEVDLDTNIWKLRIHRDGYVLGS------------------------EFVNDLV 126
Query: 61 YGAESSDEDCCNNCEEVREA-YRKKGWALSNPD-LIDQCKREGFLQRIKEEEGEGCNIYG 118
G +E + +E ++ +RKK +P +I++ K+ ++GEGC I+G
Sbjct: 127 EGEHRKEEPKADKKDEHKDGDHRKK-----DPQKVINEVKK-------AIDDGEGCQIFG 174
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
L+V +VAGNFH S H ++V + N+SH I+ L+FG +PG NPLD
Sbjct: 175 VLDVERVAGNFHI----SMHGLSLYVASKIFEAGYEVNVSHVIHDLSFGPTYPGHHNPLD 230
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
G SG ++YF+K+VPT Y + G + +NQFSVTE+++ ++ ++ P V+F
Sbjct: 231 GSERILHDTSGTFKYFLKIVPTEYHYLHGEVMPTNQFSVTEYYQRTKPSD-RSYPAVYFV 289
Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
YDLSPI VT E +F HF+T +CA++GG F V+G++D ++
Sbjct: 290 YDLSPIVVTIREHRRNFGHFITRLCAVLGGTFAVTGMLDRWM 331
>gi|145476255|ref|XP_001424150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391213|emb|CAK56752.1| unnamed protein product [Paramecium tetraurelia]
Length = 339
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 27/205 (13%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLA 165
+E EGC I G++ VNKV GNFH S H G +H + FQR + ++SH IN ++
Sbjct: 146 KEKEGCQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQIQTLDLSHTINHIS 199
Query: 166 FGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQS 212
FGE GV+NPLD + + G M+QY+I VVPT Y DVSG
Sbjct: 200 FGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSG----- 254
Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
N++ V + +S + LP +F YDLSP+ V F + SFLHFL +CAI+GGVFT+
Sbjct: 255 NEYYVHQFTANSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTI 314
Query: 273 SGIIDAFIYHGQRAIKKKIEIGKFS 297
+ I+D I+ A+ KK E+GK S
Sbjct: 315 ASIVDGMIHKSVVALLKKYEMGKLS 339
>gi|145511431|ref|XP_001441642.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408894|emb|CAK74245.1| unnamed protein product [Paramecium tetraurelia]
Length = 329
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 27/205 (13%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLA 165
+E EGC I G++ VNKV GNFH S H G +H + FQR + ++SH IN ++
Sbjct: 136 KEKEGCQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQIQTLDLSHTINHIS 189
Query: 166 FGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQS 212
FGE GV+NPLD + + G M+QY+I VVPT Y DVSG+
Sbjct: 190 FGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSGNEYYV 249
Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+QF+ +S + LP +F YDLSP+ V F + SFLHFL +CAI+GGVFT+
Sbjct: 250 HQFTA-----NSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTI 304
Query: 273 SGIIDAFIYHGQRAIKKKIEIGKFS 297
+ I+D I+ A+ KK E+GK S
Sbjct: 305 ASIVDGMIHKSVVALLKKYEMGKLS 329
>gi|195162746|ref|XP_002022215.1| GL25735 [Drosophila persimilis]
gi|194104176|gb|EDW26219.1| GL25735 [Drosophila persimilis]
Length = 313
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 87/201 (43%), Positives = 115/201 (57%), Gaps = 21/201 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
MD SG+ HL V HDIFK RLD +G P + P++ N+ CGS
Sbjct: 89 MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
CYGAE + CCN CE+V +AYR W + D I+QCK G +R E+ EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLHKWNVQ-VDKIEQCK--GKYKRTDEDAFKEGCRIQG 194
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
LEVN++AG+FHFAPGKSF H+HD FQ + +SH IN L+FGE +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251
Query: 178 DGVRW-TQETPSGMYQYFIKV 197
DG+R ET + M+ +++K+
Sbjct: 252 DGLRVDVAETKTEMFNHYLKI 272
>gi|156030895|ref|XP_001584773.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980]
gi|154700619|gb|EDO00358.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 381
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 144/317 (45%), Gaps = 79/317 (24%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
MD+SGEQ + V H + K RL +Q G VI++ D A + L + Y
Sbjct: 67 MDVSGEQQVGVMHGVKKVRLSAQEEGGKVIDTTALDLHNADEAATHL---------DPNY 117
Query: 57 CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
CG CYGA + + CCN C+EVREAY WA + ++QC+RE + +R+ + E
Sbjct: 118 CGPCYGATPPPNAKKQGCCNTCDEVREAYASVSWAFGRGENVEQCEREHYGERLDSQRKE 177
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGE 168
GC I G L VNKV GNFH APG+SF +HVHD+ + SH I+ L FG
Sbjct: 178 GCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLNNYFDTPVPGGHVFSHHIHSLRFGP 237
Query: 169 HFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS----- 206
P V NPLD + + YF+KVV T Y +
Sbjct: 238 ELPEEVTKKLGSDSIIPWTNHHLNPLDNTEQITHEAAYNFMYFVKVVSTSYLPLGWETTY 297
Query: 207 --------------GH----TIQSNQFSVTEHFRS------SEQGRLQTL------PGVF 236
GH +I+++Q+SVT H RS S +G + L PGVF
Sbjct: 298 NSPPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSLNGGDDSAEGHKEKLHARGGIPGVF 357
Query: 237 FFYDLSPIKVTFTEEHV 253
F Y V+F E H+
Sbjct: 358 FSY------VSFLEIHM 368
>gi|195997845|ref|XP_002108791.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
gi|190589567|gb|EDV29589.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
Length = 324
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/170 (45%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G + +NKVAGNFH G S + H H R+S N SH+I+ LAFG P
Sbjct: 137 DACRIHGNIPLNKVAGNFHVTAGMSINHPMGHAHVSDLVPRESVNFSHRIDLLAFGVAAP 196
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE--QGRL 229
V+NPLDGV + + MYQYFIK+VPT S I + Q+SVTEHF + G+
Sbjct: 197 NVINPLDGVEFITKITDKMYQYFIKIVPTKVKTFSV-AIDTYQYSVTEHFSKVDHMNGK- 254
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+FF YDLSPI V TE V F L +C IVGG+F SG+I F
Sbjct: 255 HGVSGLFFKYDLSPISVQVTEARVPFGQLLIRLCGIVGGIFATSGMIHIF 304
>gi|229594330|ref|XP_001024169.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila]
gi|225566928|gb|EAS03924.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila
SB210]
Length = 348
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 87/212 (41%), Positives = 121/212 (57%), Gaps = 26/212 (12%)
Query: 103 LQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISH 159
L+R+K+ + EGC I GF+ VNKV GNFH S H G ++ I R ++ ++SH
Sbjct: 146 LERVKKAFNDREGCKISGFMLVNKVPGNFHI----SSHAYGNYLQRIFQDARINTLDLSH 201
Query: 160 KINKLAFGEHF----------PGVVNPLDGVRWTQ----ETPSGMYQYFIKVVPTVYTDV 205
IN L+FGE G++ PLD + + T +QY+I VVPT Y D+
Sbjct: 202 VINHLSFGEENDLNRIKKTFQQGILQPLDHTKKIKPENLRTVGVTHQYYINVVPTTYKDL 261
Query: 206 SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
S + ++ V + +S + Q LP VFF YDLSP+ V F++ SFLHFL VCAI
Sbjct: 262 S-----NRKYHVYQFVANSNEMTTQHLPAVFFRYDLSPVTVQFSQTRESFLHFLVQVCAI 316
Query: 266 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
+GGVFTV+GIID+ ++ I KK E+GK S
Sbjct: 317 IGGVFTVAGIIDSIVHRSVVHILKKAEMGKLS 348
>gi|123451578|ref|XP_001313964.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895945|gb|EAY01112.1| hypothetical protein TVAG_442240 [Trichomonas vaginalis G3]
Length = 375
Score = 140 bits (354), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 97/293 (33%), Positives = 148/293 (50%), Gaps = 19/293 (6%)
Query: 7 QHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESS 66
Q DV +DI ++R+D G I D + ++ + + + E + YCG CYGA
Sbjct: 96 QSTDV-NDIKQQRIDENGFAI----DSVNWIRLKRAAKSKKQKKEQPQQYCGKCYGALPQ 150
Query: 67 DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA 126
+ CCN+CE+V A++ KGW + D QC EG+ KE CN+YG + V ++
Sbjct: 151 GK-CCNSCEDVINAFKAKGWGIDGIDRWQQCIDEGYADLGKES----CNVYGDINVAHIS 205
Query: 127 GNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG---EHFPGVVNPLDGVRWT 183
G +FA + + H DI +N++H IN L FG H PG PLDG+
Sbjct: 206 GFLYFAL-EDYKVGDKHPKDISRLSH-KYNLTHTINYLEFGPRVSHEPG---PLDGLTVL 260
Query: 184 QETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
QE P M Y Y ++VVPT + G + + +F ++ + + +PG+F Y+L+
Sbjct: 261 QEEPGLMQYNYDLEVVPTKWFSSRGFPVSTYKFHPMITQKNFTEKVNRGVPGIFLNYNLA 320
Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
PI + E S +T+VCAIVGG FT + D + +I+ K +IGK
Sbjct: 321 PISLVQYEVISSPWKLITSVCAIVGGCFTCVSLADQIFFRTLSSIEGKRQIGK 373
>gi|405968654|gb|EKC33703.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Crassostrea gigas]
Length = 345
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 98/295 (33%), Positives = 144/295 (48%), Gaps = 18/295 (6%)
Query: 14 DIFKKRLDSQGNVIESRQDGIGAPKIDKPLQ-RHG-GRLEHNETYCGSCYGAESSDEDCC 71
D F K LD S IGA +D Q HG G L++ ET+ + +
Sbjct: 20 DAFPKVLDDCQEKTASGGGTIGADVLDVTGQDTHGFGELKYEETH----FELSPNQRHYH 75
Query: 72 NNCEEVREAYRKKGWALSNPDLIDQ----CKREGFLQRIKEEEGE--GCNIYGFLEVNKV 125
+E+ E R + AL + + + + G +R EGE C +YG LEVNKV
Sbjct: 76 ETVQEISEFLRSEYHALQDVMWMSRGLIATYKTGMPKREIPAEGEPDACRVYGSLEVNKV 135
Query: 126 AGNFHFAPGKS---FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 182
AGNFH GKS F + H H + +N SH+I+ +FGE G++NPLDG
Sbjct: 136 AGNFHITAGKSVPVFPRG--HAHISMMVHEKEYNFSHRIDHFSFGESVKGIINPLDGEEQ 193
Query: 183 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDL 241
++ YFIK+VPT + I + QFSVT+ R+ + +PG+F YDL
Sbjct: 194 VSSDNFHVFNYFIKIVPTEVRTYAAGNIDTYQFSVTQRNRTINHSKGSHGVPGIFVKYDL 253
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
+ +K+ E+H F FL +C IVGG+F VSG++ + + K ++GK+
Sbjct: 254 NALKIRVVEKHRPFSQFLIRLCGIVGGIFAVSGMLHNWTEFFMEVVCCKFKLGKY 308
>gi|148224086|ref|NP_001087666.1| ERGIC and golgi 2 [Xenopus laevis]
gi|51950053|gb|AAH82468.1| MGC81917 protein [Xenopus laevis]
Length = 377
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/175 (41%), Positives = 98/175 (56%), Gaps = 14/175 (8%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E+ C I+G L++NKVAGNFH GK+ H H DS+N SH+I+ +FGE
Sbjct: 165 EQPNACRIHGHLDINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHFSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTDVSGHTIQSNQFSVTEHFR 222
P ++NPLDG E + MYQYFI +VPT VY D ++QFSVTE R
Sbjct: 225 PLPAIINPLDGTEKIAEDSNQMYQYFITIVPTKLNTNKVYCD-------THQFSVTERER 277
Query: 223 SSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YD+S + VT TE+H+ FL +C I+GG+FT +G+I
Sbjct: 278 VINHATGSHGVSGIFMKYDISSLMVTVTEDHMPLWKFLVRLCGIIGGIFTTTGMI 332
>gi|345567560|gb|EGX50490.1| hypothetical protein AOL_s00075g219 [Arthrobotrys oligospora ATCC
24927]
Length = 354
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 71/175 (40%), Positives = 104/175 (59%), Gaps = 11/175 (6%)
Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKI 161
L K +G+ C I+G ++VN+V G+FH A G + G HV D+FN SH +
Sbjct: 153 LNLPKRPKGKSCRIWGSMDVNRVMGDFHITAKGHGYWDPGQHV------DHDTFNFSHVV 206
Query: 162 NKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
N+L+FGE +P +VNPLDGV E YQYF+ VVPT Y G T+Q+NQ+SVTE
Sbjct: 207 NELSFGEFYPKLVNPLDGVASVTEDKFYRYQYFMSVVPTTYK-AHGRTLQTNQYSVTEQG 265
Query: 222 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
RS Q++PG+FF +D+ PI +T T+ H +++ + + ++GGV G +
Sbjct: 266 RSMNP---QSVPGIFFKFDIEPIMLTITDTHTPWIYLIVRLANVIGGVMVAGGWL 317
>gi|89272944|emb|CAJ82943.1| ptx1 [Xenopus (Silurana) tropicalis]
Length = 377
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 73/170 (42%), Positives = 98/170 (57%), Gaps = 4/170 (2%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E C I+G LE+NKVAGNFH GK+ H H DS+N SH+I+ +FGE
Sbjct: 165 EPPNACRIHGHLEINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHFSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQG 227
PG+VNPLDG E + MYQYFI +VPT ++T+ ++QFSVTE R
Sbjct: 225 PLPGIVNPLDGTEKIAEDSNQMYQYFITIVPTKLHTNKVD--CDTHQFSVTERERVINHA 282
Query: 228 R-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YD+S + V TE+H+ FL +C IVGG+FT +G+I
Sbjct: 283 SGSHGVSGIFMKYDISSLMVMVTEDHMPLWKFLVRLCGIVGGIFTTTGMI 332
>gi|126339088|ref|XP_001363644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Monodelphis domestica]
Length = 378
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 99/166 (59%), Gaps = 2/166 (1%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
G++NPLDG + M+QYFI VVPT + + ++QFSVTE R+ +
Sbjct: 228 GIINPLDGTEKIANDHNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERAINHAAGSH 286
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F FL +C I+GG+F+ +G++
Sbjct: 287 GVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGML 332
>gi|313661438|ref|NP_001186332.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gallus gallus]
Length = 377
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 103/174 (59%), Gaps = 6/174 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+F YD+S + VT TEEH+ F FL +C I+GG+F+ +GI+ F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335
>gi|326911226|ref|XP_003201962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Meleagris gallopavo]
Length = 377
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/174 (43%), Positives = 103/174 (59%), Gaps = 6/174 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+F YD+S + VT TEEH+ F FL +C I+GG+F+ +GI+ F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335
>gi|327273481|ref|XP_003221509.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Anolis carolinensis]
Length = 377
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGELIP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI--QSNQFSVTEHFRSSEQGR- 228
G++NPLDG + M+QYFI VVP T + H I +++QFSVTE R
Sbjct: 228 GIINPLDGTEKVASDHNQMFQYFITVVP---TKLHTHKISAETHQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YD+S + VT TEEH+ F FL +C I+GG+F+ +GI+
Sbjct: 285 SHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGIL 332
>gi|387015774|gb|AFJ50006.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2-like
[Crotalus adamanteus]
Length = 377
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSADACRIHGHLYVNKVAGNFHVTVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI--QSNQFSVTEHFRSSEQ 226
PG++NPLDG + M+QYF+ VVP T + H I +++QF+VTE R
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFVTVVP---TKLQTHKISAETHQFAVTERERIINH 281
Query: 227 GR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YD+S + VT TEEH+ F FL +C IVGG+F+ +GI+
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIVGGIFSTTGIL 332
>gi|395537817|ref|XP_003770886.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Sarcophilus harrisii]
Length = 378
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 71/166 (42%), Positives = 99/166 (59%), Gaps = 2/166 (1%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
G++NPLDG + M+QYFI VVPT + + ++QFSVTE R+ +
Sbjct: 228 GIINPLDGTEKIAIDHNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERAINHAAGSH 286
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F FL +C I+GG+F+ +G++
Sbjct: 287 GVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGML 332
>gi|224093106|ref|XP_002193654.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Taeniopygia guttata]
Length = 377
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 103/174 (59%), Gaps = 6/174 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+F YD+S + VT TEEH+ F FL +C I+GG+F+ +GI+ F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335
>gi|449278843|gb|EMC86582.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Columba livia]
Length = 377
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/174 (42%), Positives = 103/174 (59%), Gaps = 6/174 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+F YD+S + VT TEEH+ F FL +C I+GG+F+ +GI+ F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335
>gi|57208596|emb|CAI42845.1| ERGIC and golgi 3 [Homo sapiens]
Length = 129
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/128 (54%), Positives = 88/128 (68%), Gaps = 9/128 (7%)
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQSNQFSVTEHFRSSEQGRL 229
PLD T S M+QYF+KVVPTVY V G +++NQFSVT H + + G L
Sbjct: 1 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAPLPPQVLRTNQFSVTRHEKVAN-GLL 59
Query: 230 --QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
Q LPGVF Y+LSP+ V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH RAI
Sbjct: 60 GDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAI 119
Query: 288 KKKIEIGK 295
+KKI++GK
Sbjct: 120 QKKIDLGK 127
>gi|431908425|gb|ELK12022.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Pteropus alecto]
Length = 377
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG E + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAEDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|355686514|gb|AER98081.1| ERGIC and golgi 2 [Mustela putorius furo]
Length = 365
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++ F
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGMLHGF 335
>gi|417399911|gb|JAA46936.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 376
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 75/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 167 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 226
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQGR- 228
G+VNPLDG + M+QYFI VVPT ++T +S T +QFSVTE R
Sbjct: 227 GIVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADT---HQFSVTERERVVNHAAG 283
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 284 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 331
>gi|348562091|ref|XP_003466844.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Cavia porcellus]
Length = 377
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|345441780|ref|NP_001230861.1| ERGIC and golgi 2 [Sus scrofa]
Length = 377
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|82074366|sp|Q5EHU7.1|ERGI2_GECJA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
Length = 377
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|301783747|ref|XP_002927289.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ailuropoda melanoleuca]
Length = 377
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|410964074|ref|XP_003988581.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Felis catus]
Length = 377
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|417399168|gb|JAA46612.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein 2 isoform 1 [Desmodus rotundus]
Length = 337
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 77/173 (44%), Positives = 102/173 (58%), Gaps = 7/173 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 167 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 226
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQGR- 228
G+VNPLDG + M+QYFI VVPT ++T +S T +QFSVTE R
Sbjct: 227 GIVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADT---HQFSVTERERVVNHAAG 283
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G D+F++
Sbjct: 284 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTG-KDSFLF 335
>gi|123438593|ref|XP_001310077.1| MGC83277 protein [Trichomonas vaginalis G3]
gi|121891831|gb|EAX97147.1| MGC83277 protein, putative [Trichomonas vaginalis G3]
Length = 355
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 88/277 (31%), Positives = 140/277 (50%), Gaps = 15/277 (5%)
Query: 8 HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD 67
H+DV +I + +G+V R D G P + K ++ + + YCG+CYG +S
Sbjct: 79 HVDVIDNIKESDESYEGHVRMERFDEKGNPILKKSYPKNSS-VTKDPGYCGNCYGQKSG- 136
Query: 68 EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
CCN C+EVR+A++ I QC EG+ + + +GE C ++G L V++ G
Sbjct: 137 --CCNTCKEVRKAFKANNRPPPPIIHIQQCVDEGYKEELIAMKGEACRVHGTLTVHRAPG 194
Query: 128 NFHFAPGKSFHQSGVHVH--DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 185
FH APG+S++ +G H H + L D N SH IN + G PLDG Q+
Sbjct: 195 TFHVAPGESYNINGEHDHYYEDLGINIDEMNFSHTINHFSIGMPTANSYYPLDGHTEIQQ 254
Query: 186 TPSGMYQ-YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPI 244
M YF++ VP ++ G S S +++R S + PGVFF YD+S I
Sbjct: 255 KTGRMKMIYFLRAVP---INLDGRVF-SFGASSYQNYRGSNSTK---YPGVFFSYDVSLI 307
Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
+ + ++ S + +T + +I+GGVF ++ +D Y
Sbjct: 308 GIV-SSQNSSLMDLVTELMSILGGVFAIATFLDMLSY 343
>gi|432943284|ref|XP_004083140.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 372
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 98/173 (56%), Gaps = 2/173 (1%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G + VNKVAGN H GK H H H +S+N SH+I++L FGE
Sbjct: 156 QSPDACRIHGDIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHESYNFSHRIDRLCFGE 215
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
PG++NPLDG + MYQYFI VVPT T ++QFSVTE R +
Sbjct: 216 EIPGIINPLDGTEKITYDNNQMYQYFITVVPTKLKTYKI-TADTHQFSVTERERVINHTA 274
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+FF YD S + VT +E+H+ FL +C I+GG+++ +G++ + I
Sbjct: 275 GSHGVSGIFFKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIYSTTGMLHSLI 327
>gi|21312962|ref|NP_080444.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 1 [Mus musculus]
gi|81903633|sp|Q9CR89.1|ERGI2_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|12835992|dbj|BAB23451.1| unnamed protein product [Mus musculus]
gi|12843481|dbj|BAB25998.1| unnamed protein product [Mus musculus]
gi|12844310|dbj|BAB26318.1| unnamed protein product [Mus musculus]
gi|13905198|gb|AAH06895.1| ERGIC and golgi 2 [Mus musculus]
gi|17390417|gb|AAH18188.1| ERGIC and golgi 2 [Mus musculus]
gi|20072972|gb|AAH26558.1| ERGIC and golgi 2 [Mus musculus]
gi|26326029|dbj|BAC26758.1| unnamed protein product [Mus musculus]
gi|40353061|gb|AAH64749.1| ERGIC and golgi 2 [Mus musculus]
gi|74191314|dbj|BAE39481.1| unnamed protein product [Mus musculus]
gi|148678796|gb|EDL10743.1| ERGIC and golgi 2, isoform CRA_c [Mus musculus]
Length = 377
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332
>gi|12846043|dbj|BAB27008.1| unnamed protein product [Mus musculus]
Length = 377
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332
>gi|12841082|dbj|BAB25070.1| unnamed protein product [Mus musculus]
Length = 377
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332
>gi|47214843|emb|CAF95749.1| unnamed protein product [Tetraodon nigroviridis]
Length = 299
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 19/149 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD-----GIGAPKIDKPLQRHGGRLEHNET 55
MD++GEQ LDV+H++FK+RLD + + + G ++ P L+ N
Sbjct: 89 MDVAGEQQLDVEHNLFKQRLDKNLKPVSTEAEKHELGGAEDVEVFDP-----STLDPNR- 142
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
C SCYGAE+ D CCN+C++VREAYR++GWA N D I+QCKREGF Q+++E++ EGC
Sbjct: 143 -CESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQ 201
Query: 116 IYGFLEVNKVA-------GNFHFAPGKSF 137
+YG LEVNKV+ G F GK F
Sbjct: 202 VYGVLEVNKVSLIAQEGGGKFSLCSGKKF 230
>gi|291392459|ref|XP_002712727.1| PREDICTED: PTX1 protein [Oryctolagus cuniculus]
gi|291416214|ref|XP_002724342.1| PREDICTED: PTX1 protein-like [Oryctolagus cuniculus]
Length = 377
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/171 (42%), Positives = 101/171 (59%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
PG++NPLDG + M+QYFI +VPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPGIINPLDGTEKIAIDHNQMFQYFITIVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|149048933|gb|EDM01387.1| rCG29652, isoform CRA_c [Rattus norvegicus]
Length = 377
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332
>gi|410918691|ref|XP_003972818.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 378
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 72/175 (41%), Positives = 96/175 (54%), Gaps = 6/175 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E C IYG + VNKVAGN H GK H H H +++N SH+I+ L+FGE
Sbjct: 164 EPHNACRIYGHIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGE 223
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS-SE 225
G++NPLDG + MYQYFI VVPT V VS T +QFSVTE R +
Sbjct: 224 EITGIINPLDGTEKITSKHTQMYQYFITVVPTRLVTHKVSADT---HQFSVTERERVINH 280
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD S + VT TE+H+ FL +C IVGG+F+ +G++ +
Sbjct: 281 AAGSHGVSGIFVKYDTSSLTVTVTEQHMPLWQFLVRLCGIVGGIFSTTGMLHGLV 335
>gi|74189495|dbj|BAE22750.1| unnamed protein product [Mus musculus]
Length = 303
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 94 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 153
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 154 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 210
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 211 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 258
>gi|344267803|ref|XP_003405755.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Loxodonta africana]
Length = 377
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|320170541|gb|EFW47440.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
Length = 408
Score = 132 bits (333), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 8/205 (3%)
Query: 76 EVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAP 133
E R+ ++ +LS + + + + +EG + C ++G + +K+AGNFH
Sbjct: 177 ENRKPLTREHLSLSGTTRKAKKNFQAMPRELSSQEGTPDACRLHGSVSADKIAGNFHIIA 236
Query: 134 GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQY 193
G + G H H + + N +H+IN L+FGE PG+ PLDG W + + YQY
Sbjct: 237 GAAVEVPGGHAHMGQMIPQHALNFTHRINHLSFGEEMPGMEFPLDGDEWITTSHTMAYQY 296
Query: 194 FIKVVPTVYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 251
FI+VVPTVYT + ++S QFSVT H E LPG+FF YD PI VT
Sbjct: 297 FIQVVPTVYTRHANDPEQLRSGQFSVTRH----ESPNSNRLPGLFFKYDTFPILVTVQYS 352
Query: 252 HVSFLHFLTNVCAIVGGVFTVSGII 276
SF H L + I+GGVF SG I
Sbjct: 353 PYSFWHLLIRLSGIIGGVFATSGFI 377
>gi|430811512|emb|CCJ31046.1| unnamed protein product [Pneumocystis jirovecii]
Length = 264
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/185 (43%), Positives = 99/185 (53%), Gaps = 19/185 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD+SGE DV H++ K RLD G I S I +P++ YCGSC
Sbjct: 89 MDVSGELQTDVSHNVVKNRLDKNGIFINST--SINTLNFQQPIKVLPS------DYCGSC 140
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
YGA+ E CCN CE+V AY W + N +QCK + + EGCN G +
Sbjct: 141 YGAK---EGCCNTCEDVINAYIANNWPIPNKRTFEQCKDSNNM----DGPDEGCNFVGRI 193
Query: 121 EVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV-NP 176
EVNKV GNFHFAPG S +G HVHDI + DS + SH INKL+FG G + NP
Sbjct: 194 EVNKVIGNFHFAPGHSSQTITGGHVHDIYDYLTDSLPHDFSHMINKLSFGPEIEGSLQNP 253
Query: 177 LDGVR 181
LD V+
Sbjct: 254 LDNVK 258
>gi|432862155|ref|XP_004069750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oryzias latipes]
Length = 373
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 100/178 (56%), Gaps = 2/178 (1%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
QR C I+G L VNKVAGNFH GKS H H DS+N SH+I+
Sbjct: 157 QRDSSSPPNACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDH 216
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
L+FGE PG+++PLDG + M+QYFI +VPT + + +++Q+SVTE R
Sbjct: 217 LSFGEAIPGLISPLDGTEKIAADYNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERV 275
Query: 224 -SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ + G+F YD+S + V TE+H+ F FL +C IVGG+F+ +G+I +
Sbjct: 276 INHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWKFLVRLCGIVGGIFSTTGMIHGLV 333
>gi|149713890|ref|XP_001502984.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Equus caballus]
Length = 377
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|340502903|gb|EGR29544.1| hypothetical protein IMG5_153610 [Ichthyophthirius multifiliis]
Length = 342
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 83/218 (38%), Positives = 117/218 (53%), Gaps = 31/218 (14%)
Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAP---GKSFHQSGVHVHDILAFQRDS 154
E L R+K + EGC I G + VNK GNFH + + HQ HV+ +
Sbjct: 136 EARLNRLKSAFLDQEGCKIQGHIFVNKAPGNFHVSAHSFDRILHQIASHVN------IST 189
Query: 155 FNISHKINKLAFGEHFP-----------GVVNPLDGVRWT----QETPSGMYQYFIKVVP 199
++SH IN ++FG+ G+++PLD R Q+ S YQY+I VV
Sbjct: 190 IDVSHIINHISFGDETDIIRIKRQFKSQGILDPLDRTRKIKTEDQKNISISYQYYINVVH 249
Query: 200 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
T Y + IQ ++SV + ++ + LP FF YDLSP+ V F++ +SFLHF+
Sbjct: 250 TTYVN-----IQKKEYSVYQFTANNNELLSDRLPACFFRYDLSPVIVRFSQSRMSFLHFI 304
Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
VCAI+GGVFTV+GIID+ I+ I KK E+GK S
Sbjct: 305 VQVCAIIGGVFTVAGIIDSIIHKSVVHILKKAEMGKLS 342
>gi|57106442|ref|XP_534852.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Canis lupus familiaris]
Length = 377
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGEVVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|432879813|ref|XP_004073560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oryzias latipes]
Length = 271
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/201 (40%), Positives = 110/201 (54%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +GEGC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 86 MKIPINQGEGCRFEGKFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHSIHK 133
Query: 164 LAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFG+ + G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 134 LAFGDTLQVHNVKGAFNALGGADKLSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVA 193
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GII
Sbjct: 194 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 251
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 252 DSCIFTASEA-WKKIQIGKMS 271
>gi|395839293|ref|XP_003792530.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Otolemur garnettii]
Length = 377
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/171 (43%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRISGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPGIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|328875761|gb|EGG24125.1| DUF1692 family protein [Dictyostelium fasciculatum]
Length = 1172
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 95/319 (29%), Positives = 148/319 (46%), Gaps = 34/319 (10%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKP-------LQRHGGRLEHN 53
+D+S +++ D+ L ++ES G P D L R G LE
Sbjct: 864 VDVSRGNRMNINFDVHFPSLICSDIIVESVDGVDGKPIKDAAHQIVKERLNRRGSPLERL 923
Query: 54 ETYCG--SCYGAESSDE-------DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 104
G SC E + CCN+CE++R YR D QC +
Sbjct: 924 HARAGLFSCTKCELPPKYQLLEKRKCCNSCEDLRTFYRTNKVPQHLADESPQCTIGKPVT 983
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS----GVHVHDI---LAFQRDSFNI 157
E EGC ++G L V K+ G+ H G+ +S HVH + +A + FNI
Sbjct: 984 -----EDEGCRVFGILSVQKMKGDIHIIAGRPHEESHDGHSHHVHKLTPEIAQRIHKFNI 1038
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
SH I+K +FG+ G++NPL+G G+ Y+++VVPT+Y + + +++NQ+S
Sbjct: 1039 SHHIHKFSFGQDVEGLINPLEGFGIVVPMGLGLQTYYLQVVPTIYKQ-NNYILETNQYSY 1097
Query: 218 TEHFRSSEQGRLQTL-PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
T ++S L L PG++F YDLSP+ + + F +T++CAI GG++ G+
Sbjct: 1098 TREYKSINYNNLGYLFPGIYFKYDLSPLMIEVDQSSKPFSELITSICAIGGGMYVAFGL- 1156
Query: 277 DAFIYHGQRAIKKKIEIGK 295
YH I KI+ K
Sbjct: 1157 ---FYHVTARIVGKIKKQK 1172
>gi|326672443|ref|XP_003199668.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Danio rerio]
Length = 365
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 4/174 (2%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E C I+G + VNKVAGNFH GK H H + + +N SH+I+ L+FG
Sbjct: 166 ESQNACRIHGKIYVNKVAGNFHITLGKPIETHKGHAHYASFIKDEVYNFSHRIDHLSFGN 225
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQ 226
PG +NPLDG+ T + ++QYFI VVPT S ++ +QFSVTE R S+E+
Sbjct: 226 DVPGHINPLDGMEKTTLEQNTLFQYFITVVPT-KLHTSNVSVDMHQFSVTERERVVSNEK 284
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
G Q + G+FF Y LSP+ V +EEH+ FL +C IVGG+F+ S ++ I
Sbjct: 285 GN-QGVSGIFFKYKLSPLMVRVSEEHMPLAAFLVRLCGIVGGIFSTSDLLHRLI 337
>gi|426225295|ref|XP_004006802.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Ovis aries]
Length = 377
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 73/168 (43%), Positives = 99/168 (58%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI VVPT ++T +S T +QF+VTE R +
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFAVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|443716796|gb|ELU08142.1| hypothetical protein CAPTEDRAFT_19918 [Capitella teleta]
Length = 403
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/170 (41%), Positives = 96/170 (56%), Gaps = 11/170 (6%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
GC YG L+VNKVAGNFH GKS + G H H + + +N +H+I +FG+
Sbjct: 169 GCRFYGTLDVNKVAGNFHITAGKSVPLNIGGHAHMAMMVKESDYNFTHRIEHFSFGDKVS 228
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVP----TVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
G +NPLDG MYQYFI+VVP T++TD I + QFSVTE R+ G
Sbjct: 229 GRINPLDGEEKNTNDNYHMYQYFIQVVPTHVKTLFTD-----INTYQFSVTEQNRTISHG 283
Query: 228 R-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ +PG+F YDL+P+ V E H F L +C I+GG+F SG++
Sbjct: 284 KGSHGIPGIFVKYDLAPMMVKVIESHKPFSQLLIRLCGIIGGLFATSGML 333
>gi|229366152|gb|ACQ58056.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Anoplopoma fimbria]
Length = 290
Score = 131 bits (329), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 82/201 (40%), Positives = 108/201 (53%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +G+GC G +NKV GNFH V H A Q S +++H I+K
Sbjct: 105 MKIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQSPDMTHNIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFGE G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 153 LAFGEKIQVQRVQGAFNALGGADRLSSNPLASHDYILKIVPTVYEDLSGKQRFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAIVGG FTV+GII
Sbjct: 213 NKEYVAYSHAGRI--IPAIWFRYDLSPITVKYTERRQPVYRFITTICAIVGGTFTVAGII 270
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290
>gi|388501278|gb|AFK38705.1| unknown [Medicago truncatula]
Length = 148
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 62/134 (46%), Positives = 86/134 (64%)
Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
N+SH I+ L+FG +PG+ NPLD SG ++Y+IK+VPT Y +S + +NQF
Sbjct: 10 NVSHVIHDLSFGPKYPGIHNPLDETSRILHDASGTFKYYIKIVPTEYRYISKEVLPTNQF 69
Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
SVTE+F +T P V+F YDLSPI VT EE SFLHF+T +CA++GG F V+G+
Sbjct: 70 SVTEYFSPITSQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGM 129
Query: 276 IDAFIYHGQRAIKK 289
+D ++Y A K
Sbjct: 130 LDRWMYRLVEAATK 143
>gi|403269250|ref|XP_003926667.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Saimiri boliviensis boliviensis]
Length = 377
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 73/171 (42%), Positives = 99/171 (57%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQ 226
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 227 GRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSYGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|302853436|ref|XP_002958233.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
gi|300256421|gb|EFJ40687.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
nagariensis]
Length = 337
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 74/187 (39%), Positives = 107/187 (57%), Gaps = 11/187 (5%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--HVHDILAFQR--DSFNISHKIN 162
+ E EGC++YG ++V +VAG HF S HQ+ V + +L R NISH I
Sbjct: 156 EAEHHEGCHVYGTMDVKRVAGRLHF----SVHQNMVFQMLPQLLGAHRIPKVANISHTIK 211
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
L FG H+PG +NPLDG + P ++YF+KVVPT Y + G +++Q+SVTE+ +
Sbjct: 212 HLGFGPHYPGQLNPLDGYVRMVKGPPQSFKYFLKVVPTEYYNRLGRVTETHQYSVTEYTQ 271
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
E G + TL YDLSPI +T E S LHF+ +CA+VGG F ++ + D ++
Sbjct: 272 PLEPGYVPTLD---VHYDLSPIVMTINERPPSLLHFVVRLCAVVGGAFAITRMTDRWVDW 328
Query: 283 GQRAIKK 289
R + K
Sbjct: 329 FVRLVTK 335
>gi|115497448|ref|NP_001069031.1| endoplasmic reticulum-Golgi intermediate compartment protein 2 [Bos
taurus]
gi|113912114|gb|AAI22616.1| ERGIC and golgi 2 [Bos taurus]
gi|296487341|tpg|DAA29454.1| TPA: PTX1 protein [Bos taurus]
Length = 377
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 70/168 (41%), Positives = 97/168 (57%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRS-SEQGR 228
G++NPLDG + M+QYFI +VP T + + I ++ QF+VTE R +
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITIVP---TKLQTYKISADTHQFAVTERERVINHAAG 284
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|443700340|gb|ELT99344.1| hypothetical protein CAPTEDRAFT_162161 [Capitella teleta]
Length = 110
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 62/108 (57%), Positives = 83/108 (76%), Gaps = 2/108 (1%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 247
M+ Y++KVVPT Y +G + SNQ+SVT+H + G L Q LPGVF Y+LSP+ V
Sbjct: 1 MFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVK 60
Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+TE++ SF+HFLT VCAI+GGVFTV+G++DAFIYH RAI+KKI++GK
Sbjct: 61 YTEKNRSFMHFLTGVCAIIGGVFTVAGLVDAFIYHSARAIQKKIDLGK 108
>gi|348516790|ref|XP_003445920.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Oreochromis niloticus]
Length = 290
Score = 130 bits (327), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFGE G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 153 LAFGEKLQVQKVQGAFNALGGADKMSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGAFTVAGII 270
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290
>gi|291232448|ref|XP_002736170.1| PREDICTED: MGC81917 protein-like [Saccoglossus kowalevskii]
Length = 395
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/235 (34%), Positives = 121/235 (51%), Gaps = 21/235 (8%)
Query: 82 RKKGW---------ALSNP----DLIDQCKREGFLQRIKEEEGE------GCNIYGFLEV 122
R+K W AL+N DL+ + +G + E E + C I+G + +
Sbjct: 120 RQKQWQKKLQAVRSALANEHAIQDLLFKVGFDGSPTSMPEREDKPAGAPNSCRIHGSMSL 179
Query: 123 NKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 182
NKVAGNFH GKS H H + +N SH+I+ +FG PG+VNPLDG +
Sbjct: 180 NKVAGNFHITLGKSIPHPRGHAHLAAFISQSQYNFSHRIDHFSFGVPTPGIVNPLDGDQR 239
Query: 183 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDL 241
+ + MYQYFI++VPT + + ++Q++VTE R S + G+FF YDL
Sbjct: 240 VTQENARMYQYFIQIVPT-RVNTRRASADTHQYAVTERDRVISHSSGSHGVAGIFFKYDL 298
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
S + V TEE+ + FL +C I+GGVF SG++ + I I K + GK+
Sbjct: 299 SSVSVKVTEEYQPYWQFLVRLCGIIGGVFATSGMLHSLIGCLYDLICCKYQFGKY 353
>gi|15010925|gb|AAK77355.1|AF302767_1 PTX1 protein [Homo sapiens]
Length = 377
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 99/167 (59%), Gaps = 6/167 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
++NPLDG + M+QYFI VVPT ++T +S +T +QFSVTE R +
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISAYT---HQFSVTERERIINHAAGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|323306137|gb|EGA59869.1| Erv46p [Saccharomyces cerevisiae FostersB]
Length = 349
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/254 (35%), Positives = 124/254 (48%), Gaps = 48/254 (18%)
Query: 1 MDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG---APKIDKPLQRHGGRLEH 52
MD SGE LD+ F RL+S+G + E G G AP + P
Sbjct: 88 MDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP---------- 137
Query: 53 NETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
YCG CYGA+ ++ CC +C+ VR AY + GWA + I+QC+REG++
Sbjct: 138 --NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYV 195
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKIN 162
+I E EGC I G ++N++ GN HFAPGK + + H HD + + S N +H IN
Sbjct: 196 SKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIIN 255
Query: 163 KLAFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTD 204
L+FG+ H VV +PLDG V + T + YF K+VPT Y
Sbjct: 256 HLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEY 315
Query: 205 VSGHTIQSNQFSVT 218
+ I++ QFS T
Sbjct: 316 LDNVVIETAQFSAT 329
>gi|75075986|sp|Q4R5C3.1|ERGI2_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|67970720|dbj|BAE01702.1| unnamed protein product [Macaca fascicularis]
Length = 377
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|348505737|ref|XP_003440417.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 374
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 2/169 (1%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GKS H H DS+N SH+I+ L+FGE PG
Sbjct: 168 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVAHDSYNFSHRIDHLSFGEPLPG 227
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
+++PLDG + M+QYFI +VPT + + +++Q+SVTE R +
Sbjct: 228 IISPLDGTEKIATDSNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERVINHAAGSHG 286
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD+S + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 287 VSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGLV 335
>gi|380787459|gb|AFE65605.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|383418929|gb|AFH32678.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
gi|384941148|gb|AFI34179.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Macaca mulatta]
Length = 377
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|225717192|gb|ACO14442.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Esox lucius]
Length = 379
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 95/169 (56%), Gaps = 2/169 (1%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G + VNKVAGNFH GK H H H D++N SH+I+ +FGE PG
Sbjct: 168 ACRIHGHVYVNKVAGNFHITVGKPIHHPRGHAHIAAFVSHDTYNFSHRIDHFSFGEEIPG 227
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
++NPLDG + M+ YFI VVPT S + ++QFSVTE R +
Sbjct: 228 IINPLDGTEKVTTNNNHMFLYFITVVPT-KLHTSKVSADTHQFSVTERERVINHAAGSHG 286
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD S + VT +E+H+ FL +C I+GG+F+ +G+I F+
Sbjct: 287 VSGIFMKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTTGMIHGFV 335
>gi|330803630|ref|XP_003289807.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
gi|325080118|gb|EGC33688.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
Length = 388
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 88/287 (30%), Positives = 146/287 (50%), Gaps = 37/287 (12%)
Query: 3 ISGEQHLDVKHDIFKKRLDSQG-----NVIESRQDGIGAPKIDK---PLQRHGGRLEHNE 54
I G+ D + I K+RLDS+G V + + GI + + + P Q+ G + +
Sbjct: 122 IDGKPIKDAAYQIVKERLDSKGVPFAKGVALAGKKGIFSSRCTECEFPKQKKGSSVFFRQ 181
Query: 55 TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
CCN+C+++RE YR + D QC E +Q + EGC
Sbjct: 182 K--------------CCNSCDDLREYYRLNRIPQNFADDAPQCLIERPIQ-----DDEGC 222
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQS----GVHVHDILAF---QRDSFNISHKINKLAFG 167
IYG L+V K+ G+FH G S +S HVH I + FNI+H I+K +FG
Sbjct: 223 RIYGSLQVQKMKGDFHILAGLSADESHDGHAHHVHRITKENIGRVTQFNITHHIHKFSFG 282
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+ G++NPL+G ++ + Y+I+VVP +Y + + +++NQ+S T +R+
Sbjct: 283 DDIDGLINPLEGFGIVAQS-LAVQNYYIQVVPAIYKK-NDYVLETNQYSYTYDYRNVNVF 340
Query: 228 RL-QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
L + PG++F YD+SP+ + + + +T++CAI GG+F +S
Sbjct: 341 NLGRIFPGIYFKYDMSPLMIEVDQTSKPIVELITSICAIGGGIFYIS 387
>gi|397517363|ref|XP_003828883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Pan paniscus]
gi|410259224|gb|JAA17578.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410298004|gb|JAA27602.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334949|gb|JAA36421.1| ERGIC and golgi 2 [Pan troglodytes]
gi|410334951|gb|JAA36422.1| ERGIC and golgi 2 [Pan troglodytes]
Length = 377
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|332233018|ref|XP_003265701.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2 isoform 1 [Nomascus leucogenys]
Length = 377
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|402885549|ref|XP_003906216.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Papio anubis]
Length = 364
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 152 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 211
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 212 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 268
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 269 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 319
>gi|213512030|ref|NP_001133523.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|209154344|gb|ACI33404.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 381
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 66/169 (39%), Positives = 95/169 (56%), Gaps = 2/169 (1%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H D++N SH+I+ L+FGE PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQT 231
++NPLDG + M+QYFI +VPT + + +NQ+SVTE R
Sbjct: 229 IINPLDGTEKVCTDHNQMFQYFITIVPT-KLNTYQISADTNQYSVTERERVINHAVGSHG 287
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD+S + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 288 VSGIFMKYDISSLMVKVTEQHMPLWRFLVRLCGIIGGIFSTTGMIHGMV 336
>gi|297262047|ref|XP_001105686.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Macaca mulatta]
Length = 374
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE
Sbjct: 162 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 221
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
P ++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 222 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 278
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 279 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 329
>gi|410907774|ref|XP_003967366.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Takifugu rubripes]
Length = 388
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/169 (37%), Positives = 98/169 (57%), Gaps = 2/169 (1%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GKS H H DS+N SH+I+ L+FGE PG
Sbjct: 167 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGEDLPG 226
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
+++PLDG + ++QYFI +VPT + + +++Q+SVTE R+ +
Sbjct: 227 IISPLDGTEKVSADSNHIFQYFITIVPT-KLNTYRVSAETHQYSVTEQDRAINHAAGSHG 285
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD++ + V TE+H+ FL +C I+GG+F+ +G+I +
Sbjct: 286 VSGIFMKYDINSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGIV 334
>gi|322791472|gb|EFZ15869.1| hypothetical protein SINV_02690 [Solenopsis invicta]
Length = 403
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
C ++G L VNKVAGNFH GKS H+H I AF D +N +H+IN+ +FG P
Sbjct: 183 ACRVHGSLNVNKVAGNFHITAGKSLSVPHGHIH-ISAFMTDRDYNFTHRINRFSFGGPSP 241
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
G+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + +
Sbjct: 242 GIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDHQRPIDHHKGS 299
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+FF YD+S +K+ T+E + FL +CA VGG+F SG+I +
Sbjct: 300 HGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGLIKNIV 350
>gi|50959176|ref|NP_057654.2| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Homo sapiens]
gi|108935982|sp|Q96RQ1.2|ERGI2_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|22760017|dbj|BAC11037.1| unnamed protein product [Homo sapiens]
gi|38173702|gb|AAH00887.2| ERGIC and golgi 2 [Homo sapiens]
gi|78070782|gb|AAI07795.1| ERGIC and golgi 2 [Homo sapiens]
gi|119616998|gb|EAW96592.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|119617000|gb|EAW96594.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
gi|167773797|gb|ABZ92333.1| ERGIC and golgi 2 [synthetic construct]
Length = 377
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 6/167 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|62897157|dbj|BAD96519.1| CDA14 variant [Homo sapiens]
Length = 377
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 6/167 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332
>gi|357627966|gb|EHJ77470.1| putative PTX1 protein isoform 1 [Danaus plexippus]
Length = 353
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 105/185 (56%), Gaps = 2/185 (1%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C ++G L +NKVAGNFH GKS H H+H + F N SH+IN+L+FG
Sbjct: 142 DACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDTPQNFSHRINRLSFGSPAN 201
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
G++ PL+G S +YQYF++VVPT D + +I++ Q+SV E R +
Sbjct: 202 GIIYPLEGDEKITSDESMLYQYFLEVVPT-DVDTTFESIKTFQYSVKELARPISHSKGSH 260
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
+PGVFF YD++ +KV +E + L F+ + +I+GG++ + I+ + + + KK
Sbjct: 261 GVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVIISFINTIVLTAKTLLVKK 320
Query: 291 IEIGK 295
E+ K
Sbjct: 321 PEVKK 325
>gi|41055383|ref|NP_956701.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Danio rerio]
gi|82188148|sp|Q7T2D4.1|ERGI2_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 2
gi|32451749|gb|AAH54593.1| ERGIC and golgi 2 [Danio rerio]
gi|182890474|gb|AAI64472.1| Ergic2 protein [Danio rerio]
Length = 376
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 98/175 (56%), Gaps = 14/175 (8%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H +++N SH+I+ L+FGE PG
Sbjct: 168 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHETYNFSHRIDHLSFGEEIPG 227
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTDVSGHTIQSNQFSVTEHFRS-SE 225
++NPLDG + M+QYFI +VPT VY D ++Q+SVTE R +
Sbjct: 228 ILNPLDGTEKVSADHNQMFQYFITIVPTKLQTYKVYAD-------THQYSVTERERVINH 280
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD+S + V TE+H+ F FL +C I+GG+F+ +G++ +
Sbjct: 281 AAGSHGVSGIFMKYDISSLMVKVTEQHMPFWQFLVRLCGIIGGIFSTTGMLHNLV 335
>gi|12857352|dbj|BAB30984.1| unnamed protein product [Mus musculus]
Length = 377
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/170 (42%), Positives = 99/170 (58%), Gaps = 10/170 (5%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ +FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHCSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSS---EQ 226
G++NPLDG + M+QYFI V+PT ++T +S T +QFSVTE R S
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVMPTKLHTYKISADT---HQFSVTE--RESIINHA 282
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 283 AGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332
>gi|395744111|ref|XP_003780425.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pongo abelii]
Length = 387
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/169 (42%), Positives = 100/169 (59%), Gaps = 7/169 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 177 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVP 236
Query: 172 GVVNPLDGV-RWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQG 227
++NPLDG + + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 237 AIINPLDGTEKIAIDRKHQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAA 293
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 294 GSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 342
>gi|348529156|ref|XP_003452080.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Oreochromis niloticus]
Length = 379
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 96/173 (55%), Gaps = 2/173 (1%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E C I+G + VNKVAGN H GK H H H +++N SH+I+ L+FGE
Sbjct: 164 EPLNACRIHGHVYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGE 223
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
PG++NPLDG + M+QYFI VVPT + + ++QFSVTE R +
Sbjct: 224 ELPGIINPLDGTEKITYNNNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERVINHAA 282
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F YD S + VT +E+H+ FL +C I+GG+F+ +G++ +
Sbjct: 283 GSHGVSGIFVKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTTGMLHGLV 335
>gi|410914052|ref|XP_003970502.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Takifugu rubripes]
Length = 290
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +G GC G +NKV GNFH S H + Q + +++H I+K
Sbjct: 105 MKIPLNQGAGCRFEGEFIINKVPGNFHI----STHSASA--------QPQNPDMTHFIHK 152
Query: 164 LAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 153 LAFGDKLQMHQEKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 270
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290
>gi|307188057|gb|EFN72889.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Camponotus floridanus]
Length = 386
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GKS H+H I A+ D +N +H+IN+ +FG P
Sbjct: 169 ACRIHGSLVVNKVAGNFHITAGKSLSLPRGHIH-ISAYMTDQDYNFTHRINRFSFGGPSP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
G+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + +
Sbjct: 228 GIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDHQRPIDHHKGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+FF YD+S +K+ T+E + FL +CA VGG+F SG++ +
Sbjct: 286 HGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGLVKNIV 336
>gi|57208595|emb|CAI42844.1| ERGIC and golgi 3 [Homo sapiens]
Length = 156
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 71/157 (45%), Positives = 89/157 (56%), Gaps = 35/157 (22%)
Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH---------- 208
H I L+FGE +PG+VNPLD T S M+QYF+KVVPTVY V G
Sbjct: 1 HYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAQQERGRSRG 60
Query: 209 ----------------------TIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPI 244
+++NQFSVT H + + G L Q LPGVF Y+LSP+
Sbjct: 61 GADGGWSQVLALALAQAPLPPQVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPM 119
Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
V TE+H SF HFLT VCAI+GG+FTV+G+ID+ IY
Sbjct: 120 MVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIY 156
>gi|301093181|ref|XP_002997439.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262110695|gb|EEY68747.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 278
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 80/198 (40%), Positives = 112/198 (56%), Gaps = 15/198 (7%)
Query: 99 REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
+E LQ+ +EE GE GC +YG ++V KVAG+ FA H+ + V F +F
Sbjct: 92 KEIMLQKDIQEEPYGENGCRLYGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 144
Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
N SH +N L FG P + PL V Y+YF+ VVP+ Y ++G ++ + Q+
Sbjct: 145 NSSHVVNHLRFGPQIPDMETPLIDVSKILTKNLATYKYFVSVVPSRYVYLNGRSVTTFQY 204
Query: 216 SVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
SVTEH SS Q + PGV F Y+ SPI V + E +S LHFLT+ AIVGGVF V+
Sbjct: 205 SVTEHETSSRGPNGQVSFPGVIFSYEFSPIAVEYIESKLSVLHFLTSTSAIVGGVFAVAR 264
Query: 275 IIDAFIYHGQRAIKKKIE 292
+ID IY ++ KK++
Sbjct: 265 MIDGAIY----SVSKKVD 278
>gi|47222972|emb|CAF99128.1| unnamed protein product [Tetraodon nigroviridis]
Length = 288
Score = 127 bits (318), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +G GC G +NKV GNFH S H + Q + +++H I+K
Sbjct: 103 MKIPLNQGGGCRFEGEFNINKVPGNFHI----STHSASA--------QPQNPDMTHFIHK 150
Query: 164 LAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 151 LAFGDKLQMHQVKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVA 210
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAIVGG FTV+GII
Sbjct: 211 NKEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 268
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 269 DSCIFTASEA-WKKIQIGKMS 288
>gi|71480113|ref|NP_001025133.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Danio rerio]
gi|78099248|sp|Q4V8Y6.1|ERGI1_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|66911928|gb|AAH97146.1| Zgc:114085 [Danio rerio]
Length = 290
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 107/201 (53%), Gaps = 22/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
++ G GC G +NKV GNFH V H A Q S +++H I+K
Sbjct: 105 MKVPLNNGHGCRFEGEFSINKVPGNFH-----------VSTHSATA-QPQSPDMTHIIHK 152
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
LAFG +H G N L G Q + Y +K+VPTVY ++ G S Q++V
Sbjct: 153 LAFGAKLQVQHVQGAFNALGGADRLQSNALASHDYILKIVPTVYEELGGKQRFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F F+T +CAI+GG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRRPFYRFITTICAIIGGTFTVAGII 270
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ A KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290
>gi|332020071|gb|EGI60517.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Acromyrmex echinatior]
Length = 390
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 70/194 (36%), Positives = 111/194 (57%), Gaps = 9/194 (4%)
Query: 86 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 145
W + L + + + + + C ++G L +NKVAGNFH GKS H+H
Sbjct: 145 WKSNQVTLYSEMPKRSY---VPDYAPNACRVHGSLNINKVAGNFHITAGKSLSVPHGHIH 201
Query: 146 DILAFQRD-SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT 203
I AF D +N +H+INK +FG PG+V+PL+G + +YQYF++VVPT + T
Sbjct: 202 -ISAFMTDRDYNFTHRINKFSFGGPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRT 260
Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
++ T ++ Q+SV +H R + + +PG+FF YD+S +K+ T+E + FL +
Sbjct: 261 LLT--TSKTYQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKL 318
Query: 263 CAIVGGVFTVSGII 276
CA VGG+F SG++
Sbjct: 319 CATVGGIFVTSGLV 332
>gi|405119686|gb|AFR94458.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Cryptococcus neoformans var. grubii H99]
Length = 431
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 98/188 (52%), Gaps = 14/188 (7%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
K E+G C IYG +EV KV N H H ++FQ N+SH +++
Sbjct: 202 KVEDGPACRIYGSVEVKKVTANLHIT---------TLGHGYMSFQHTDHHLMNLSHVVHE 252
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
+FG FP + PLD E P ++QYF++VVPT Y D S + ++Q++VT++ RS
Sbjct: 253 FSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 312
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
E G+ +PG+FF YDL P+ V E S FL + +VGGV+TV+
Sbjct: 313 FEHGK--GVPGLFFKYDLEPMSVVIRERTTSLYQFLIRLAGVVGGVWTVAAFALRVFNRA 370
Query: 284 QRAIKKKI 291
QR + K +
Sbjct: 371 QREVSKAV 378
>gi|308808274|ref|XP_003081447.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059910|emb|CAL55969.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 406
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/295 (32%), Positives = 153/295 (51%), Gaps = 39/295 (13%)
Query: 2 DISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
D +GEQH DV HD I K+R+D G I++ P K + + ++ ++ G+
Sbjct: 124 DKAGEQHYDV-HDGHIEKRRVDKDGKPIDATFTS-EKPNKHKEMVQALEKMNQTDSVVGN 181
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
+ D A+R G ++ + EG + E EGC + G+
Sbjct: 182 ETALQKQDR-----------AHRFAG-VFGFESMLKEAFPEGIENAFRNEAREGCEVKGY 229
Query: 120 LEVNKVAGNFHFAPGK----SFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
LEVN+V G +PG+ Q ++VH L N++H I++L+FGE FPG+V+
Sbjct: 230 LEVNRVPGRISISPGRVVMMGMQQFKLNVHTDL-------NLTHTIHRLSFGERFPGLVS 282
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRSSEQ-------G 227
PLDG + P+ + QYF+ VV T + + G I ++Q+SVTE F +S++ G
Sbjct: 283 PLDGTHRSLP-PNAVQQYFLNVVATTFQPLRGDARISTHQYSVTETFTTSQRSLGGSSNG 341
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
R PGVFF Y++ PI+V F E +F F+ +C+I+GGV T++G++ + + H
Sbjct: 342 RD---PGVFFTYEIEPIRVDFKETRTTFGAFIIGICSIIGGVVTMAGVVQSAVEH 393
>gi|340709072|ref|XP_003393139.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus terrestris]
Length = 392
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GKS H+H IL F D +N +H+INK +FG P
Sbjct: 169 SCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKDYNFTHRINKFSFGGPSP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
G+++PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + Q
Sbjct: 228 GIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPIDHQKGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PG+FF YD+S +K+ T++ + FL +CA VGG+F SG++ + +
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGMVKSIV 336
>gi|350419069|ref|XP_003492060.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Bombus impatiens]
Length = 392
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 71/171 (41%), Positives = 103/171 (60%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GKS H+H IL F D +N +H+INK +FG P
Sbjct: 169 SCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKDYNFTHRINKFSFGGPSP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
G+++PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + Q
Sbjct: 228 GIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPIDHQKGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PG+FF YD+S +K+ T++ + FL +CA VGG+F SG+I +
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGMIKNIV 336
>gi|224067439|ref|XP_002195791.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Taeniopygia guttata]
Length = 290
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 153 LSFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|326928384|ref|XP_003210360.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Meleagris gallopavo]
Length = 321
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 108/200 (54%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 136 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 183
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 184 LSFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 243
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+
Sbjct: 244 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 301
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 302 DSCIFTASEA-WKKIQLGKM 320
>gi|322792513|gb|EFZ16471.1| hypothetical protein SINV_10123 [Solenopsis invicta]
Length = 141
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 52/109 (47%), Positives = 75/109 (68%)
Query: 70 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
CCN CE+V EAYR+K WA +P + QC+ + ++++K +GC IYG++EVN+V G+F
Sbjct: 12 CCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMEKLKHAFTQGCQIYGYMEVNRVGGSF 71
Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
H APG SF + VHVHD+ + FN++HKI L+FG + PG NP+D
Sbjct: 72 HIAPGVSFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPGKTNPMD 120
>gi|145524934|ref|XP_001448289.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124415833|emb|CAK80892.1| unnamed protein product [Paramecium tetraurelia]
Length = 324
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 28/201 (13%)
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISH----------K 160
I G++ VNKV GNFH S H G +H + FQR + ++SH K
Sbjct: 135 VKIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQISTLDLSHTYQSYSHLVKK 188
Query: 161 INKLAFGEHF-PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFS 216
+ + + F GV+NPLD + + G M+QY+I VVPT Y DVSG N++
Sbjct: 189 DDLVKIKKQFQKGVLNPLDNTKKIAQPQGGTGMMFQYYISVVPTTYIDVSG-----NEYY 243
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
V + +S + + LP V+F YDLSP+ V F + SFLHFL +CAI+GGVFT++ II
Sbjct: 244 VHQFTANSNEVQTDHLPAVYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTIASII 303
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D I+ A+ KK E+GK S
Sbjct: 304 DGMIHKSVVALLKKYEMGKLS 324
>gi|194689880|gb|ACF79024.1| unknown [Zea mays]
gi|413949702|gb|AFW82351.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
[Zea mays]
Length = 176
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 54/84 (64%), Positives = 70/84 (83%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
DISGEQH D++HDI K+RL+S GNVIE+R++GIG K+++PLQ+HGGRL+ E YCG+CY
Sbjct: 91 DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150
Query: 62 GAESSDEDCCNNCEEVREAYRKKG 85
GAE SDE CCN+CEE + R+KG
Sbjct: 151 GAEESDEQCCNSCEESGKHIRRKG 174
>gi|145551751|ref|XP_001461552.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124429387|emb|CAK94179.1| unnamed protein product [Paramecium tetraurelia]
Length = 317
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/193 (35%), Positives = 114/193 (59%), Gaps = 24/193 (12%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFG 167
++ EGC + G++ +++V GNFH S H G V+ +L F + + ++SH I L+FG
Sbjct: 128 DQKEGCEMTGYIIISRVPGNFHI----SAHSYGGQVNIVLPFVEMSTIDLSHTIKHLSFG 183
Query: 168 ---------EHFP-GVVNPLDGVRW--TQETPSG--MYQYFIKVVPTVYTDVSGHTIQSN 213
E F G++NPLDG+ TQE + +QY+I +VPT+Y D+ N
Sbjct: 184 NQNDIQKIREKFQQGLLNPLDGISRIKTQELKNVGVTHQYYISIVPTIYVDIDNREYFVN 243
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
QF+ ++ + + ++P ++F YD+SP+ V FT+ + +F HF+ +CAI+GGVFT++
Sbjct: 244 QFTA-----NTNEAQTNSMPAIYFRYDISPVTVQFTKYYETFNHFIVQLCAILGGVFTIA 298
Query: 274 GIIDAFIYHGQRA 286
GIID+ Y Q+
Sbjct: 299 GIIDSVFYALQKT 311
>gi|145546125|ref|XP_001458746.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124426567|emb|CAK91349.1| unnamed protein product [Paramecium tetraurelia]
Length = 325
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/267 (33%), Positives = 135/267 (50%), Gaps = 57/267 (21%)
Query: 30 RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 89
+QD IG + Q G L + T G S D N E ++AY++K
Sbjct: 82 QQDVIGTHQ-----QNVEGELYKSRTLNGKVIDKYLSTNDSLN-LERAQQAYQQK----- 130
Query: 90 NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 149
EGC++ G++ +++V GNFH S H G V+ +L
Sbjct: 131 ----------------------EGCDLAGYIIISRVPGNFHI----SAHPYGGQVNMVLP 164
Query: 150 FQRDS-FNISHKINKLAFG---------EHFP-GVVNPLDGVRW--TQE-TPSGM-YQYF 194
F S ++SH I L+FG E F G++NPLDG+R TQE T G+ +QY+
Sbjct: 165 FVGLSVIDLSHSIKHLSFGKQNDIQKIREKFKQGLLNPLDGIRRIKTQELTNVGVTHQYY 224
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
I +VPT+Y D+ NQF+ ++ + + +P V+F YD+SP+ V FT+ + S
Sbjct: 225 ISIVPTLYVDIDNKEYFVNQFAA-----NTNEAQTTQMPAVYFRYDISPVTVQFTKYYES 279
Query: 255 FLHFLTNVCAIVGGVFTVSGIIDAFIY 281
F HF+ +CAI+GGVFT++GIID+ Y
Sbjct: 280 FNHFIVQLCAILGGVFTIAGIIDSIFY 306
>gi|66500700|ref|XP_395190.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Apis mellifera]
Length = 389
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GKS H+H +N +H+INK +FG PG
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFGGPSPG 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + Q
Sbjct: 229 IVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPINHQKGSH 286
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PG+FF YD+S +K+ T++ + FL +CA VGG+F SG++ +
Sbjct: 287 GSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIV 336
>gi|380016475|ref|XP_003692209.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Apis florea]
Length = 392
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 99/170 (58%), Gaps = 4/170 (2%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GKS H+H +N +H+INK +FG PG
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFGGPSPG 228
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV +H R + Q
Sbjct: 229 IVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPINHQKGSH 286
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PG+FF YD+S +K+ T++ + FL +CA VGG+F SG++ +
Sbjct: 287 GSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIV 336
>gi|340505495|gb|EGR31815.1| hypothetical protein IMG5_101180 [Ichthyophthirius multifiliis]
Length = 327
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 115/204 (56%), Gaps = 25/204 (12%)
Query: 103 LQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISH 159
LQR + + EGCNI G + VNKV GNFH S H G + +L+ +++ ++SH
Sbjct: 126 LQRATQAYMDKEGCNISGTMLVNKVPGNFHI----SSHAYGHVLGQVLSNAGKNTIDLSH 181
Query: 160 KINKLAFGEHFP----------GVVNPLDGVRW--TQETPSGM-YQYFIKVVPTVYTDVS 206
K+ L+FG+ F G+++P+D + Q +G+ YQY+I +VPT Y D
Sbjct: 182 KVKHLSFGDEFDLKNIKRQFSQGLLHPMDNKQKDKPQNILNGITYQYYINIVPTTYVDTG 241
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
QF+ + S+EQ LP V++ YDLSP+ V F+ + SFLHFL +CAI+
Sbjct: 242 NKNYHVYQFT----YNSNEQIN-NHLPTVYYRYDLSPVTVKFSMQKESFLHFLVQICAII 296
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GG+FTV+ I+D+ +Y I K+
Sbjct: 297 GGIFTVASIVDSIVYRAVLNILKR 320
>gi|334311203|ref|XP_001380577.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Monodelphis domestica]
Length = 321
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I GEGC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 136 MKIPLNNGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 183
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 184 LSFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 243
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 244 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 301
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 302 DSCIFTASEAW-KKIQLGKM 320
>gi|66813156|ref|XP_640757.1| DUF1692 family protein [Dictyostelium discoideum AX4]
gi|60468793|gb|EAL66793.1| DUF1692 family protein [Dictyostelium discoideum AX4]
Length = 421
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 87/286 (30%), Positives = 135/286 (47%), Gaps = 29/286 (10%)
Query: 3 ISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
I G D + I K+RLDS G E G+ L G + T C
Sbjct: 140 IDGNPIKDAAYQIVKQRLDSYG---EPFAQGVA-------LAGKKGIFSRSCTECEFPKS 189
Query: 63 AESSD----EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
S + CCN+CE++R+ YR + D QC E +Q + EGC IYG
Sbjct: 190 KRVSSVFYKQKCCNSCEDLRQYYRLNRIPQNLADDSPQCLIERPVQ-----DDEGCRIYG 244
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDS------FNISHKINKLAFGEHFP 171
L V K+ G+FH G QS R++ FNI+H I+K +FGE
Sbjct: 245 SLSVQKMKGDFHILAGTGIDQSHDGHVHHAHHIPRENIGRIKHFNITHHIHKFSFGEDIE 304
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-Q 230
G++NPL+ ++ + Y+++VVP +Y + +++NQ+S T +R L Q
Sbjct: 305 GLINPLEDFGIVAQS-LAVQTYYLQVVPAIYKK-NDFVLETNQYSYTYDYRIVNMFNLGQ 362
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
PG++F YDLSP+ + + + +T++CAI GG++ V G++
Sbjct: 363 LFPGIYFKYDLSPLMIEVDQTSKPLVELITSICAIGGGMYVVLGLV 408
>gi|395505103|ref|XP_003756885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Sarcophilus harrisii]
Length = 290
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 106/200 (53%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +GEGC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNDGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|387015778|gb|AFJ50008.1| ER Golgi intermediate [Crotalus adamanteus]
Length = 290
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 106/200 (53%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV GNFH + H A Q + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------ISTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 153 LSFGDKLQVPNIHGAFNALGGTDRLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|9963759|gb|AAG09679.1|AF183410_1 cd002 protein [Homo sapiens]
Length = 387
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 98/168 (58%), Gaps = 7/168 (4%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DILAFQRDSFNISHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GK+ H H +S+N SH+I+ L+FGE P
Sbjct: 178 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCSTMESYNFSHRIDHLSFGELVP 237
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 238 AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 294
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 295 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 342
>gi|312376736|gb|EFR23738.1| hypothetical protein AND_12338 [Anopheles darlingi]
Length = 265
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 76/182 (41%), Positives = 102/182 (56%), Gaps = 22/182 (12%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
D +GEQHL ++H I+K+RLD +GN IE PK + +Q R+ ET S
Sbjct: 37 DSTGEQHLHIEHSIYKRRLDLEGNQIEE-------PKKED-IQVSTKRVSSTETPVTS-- 86
Query: 62 GAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEEGEGCNIYGF 119
S+ + C N V +AYR++ W NP++ D QCK + EGC+IYG
Sbjct: 87 ---STIKPACGN---VIDAYRERKW---NPNVEDFEQCKNSNHGAIEGKAFNEGCHIYGT 137
Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPLD 178
+EVN+V G FH APGKSF +HVHD+ + FN SH+IN L+FGE F G PLD
Sbjct: 138 MEVNRVEGRFHIAPGKSFSIQNIHVHDVQPYSSSRFNTSHRINTLSFGEQFDFGTTQPLD 197
Query: 179 GV 180
G+
Sbjct: 198 GL 199
>gi|345320110|ref|XP_001521132.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Ornithorhynchus anatinus]
Length = 283
Score = 123 bits (309), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 98 MKIPLNNGDGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 145
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P Y Y +K+VPTVY D +G S Q++V
Sbjct: 146 LSFGDKLQVQNIHGAFNALGGADKRSSNPLASYDYILKIVPTVYEDKNGKQRYSYQYTVA 205
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 206 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 263
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 264 DSCIFTASEA-WKKIQLGKM 282
>gi|109079798|ref|XP_001099287.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Macaca mulatta]
Length = 379
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 194 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 241
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 242 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 301
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 302 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 359
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 360 DSCIFTASEAW-KKIQLGKM 378
>gi|449272958|gb|EMC82607.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Columba livia]
Length = 297
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 107/200 (53%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 112 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 159
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L+G P + Y +K+VPTVY D+ G S Q++V
Sbjct: 160 LSFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMGGKQRYSYQYTVA 219
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+
Sbjct: 220 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 277
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 278 DSCIFTASEA-WKKIQLGKM 296
>gi|73953406|ref|XP_852891.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 isoform 1 [Canis lupus familiaris]
Length = 290
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/200 (38%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
RI G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MRIPVNNGAGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|392351111|ref|XP_001066818.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 497
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 313 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 360
Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 361 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 420
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+D
Sbjct: 421 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 478
Query: 278 AFIYHGQRAIKKKIEIGKF 296
+ I+ A KKI++GK
Sbjct: 479 SCIFTASEAW-KKIQLGKI 496
>gi|6330243|dbj|BAA86495.1| KIAA1181 protein [Homo sapiens]
Length = 336
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 151 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 198
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 199 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 258
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 259 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 316
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 317 DSCIFTASEAW-KKIQLGKM 335
>gi|58261152|ref|XP_567986.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
neoformans JEC21]
gi|134115843|ref|XP_773404.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50256029|gb|EAL18757.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57230068|gb|AAW46469.1| ER to Golgi transport-related protein, putative [Cryptococcus
neoformans var. neoformans JEC21]
Length = 431
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 98/188 (52%), Gaps = 14/188 (7%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
K ++G C IYG +EV KV N H H ++FQ N+SH +++
Sbjct: 202 KVQDGPACRIYGSVEVKKVTANLHIT---------TLGHGYMSFQHTDHHLMNLSHVVHE 252
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
+FG FP + PLD E P ++QYF++VVPT Y D S + ++Q++VT++ RS
Sbjct: 253 FSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 312
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
E G+ +PG+FF YDL P+ V E S FL + +VGGV+TV+
Sbjct: 313 FEHGK--GVPGLFFKYDLEPMSVIIRERTTSLYQFLIRLAGVVGGVWTVAAFALRVFNRA 370
Query: 284 QRAIKKKI 291
Q+ + K +
Sbjct: 371 QKHVSKAV 378
>gi|326427137|gb|EGD72707.1| hypothetical protein PTSG_04435 [Salpingoeca sp. ATCC 50818]
Length = 357
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 99/174 (56%), Gaps = 3/174 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ E + C ++G L V KVA NFH GKS H S H H D+ N SH+I++ +F
Sbjct: 165 DSEPDACRLHGVLPVAKVAANFHITAGKSVHHSRGHSHVNSMVPPDAVNFSHRIDRFSFS 224
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQ 226
E G + LDG T + P ++QYF++VVP+ + +SNQ+SVTE R ++
Sbjct: 225 EEPRGAMA-LDGDLRTTDQPRQVFQYFLEVVPSTTQRLGQRQPFRSNQYSVTEQHRVLKE 283
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
G + +PG++F +D+ I V+ +EEH L +C IVGG+ SG++ +FI
Sbjct: 284 G-ARGIPGIYFKFDIESIGVSVSEEHPPLSRLLIRLCGIVGGIVAASGMLHSFI 336
>gi|194382656|dbj|BAG64498.1| unnamed protein product [Homo sapiens]
Length = 235
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 50 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 97
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 98 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 157
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 158 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 215
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 216 DSCIFTASEAW-KKIQLGKM 234
>gi|395817675|ref|XP_003782285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Otolemur garnettii]
Length = 356
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 171 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 218
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 219 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 278
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 279 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 336
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 337 DSCIFTASEAW-KKIQLGKM 355
>gi|383865060|ref|XP_003707993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Megachile rotundata]
Length = 392
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
C I+G L VNKV+GNFH GKS H+H I AF D +N +H+INK +FG P
Sbjct: 169 ACRIHGSLNVNKVSGNFHITAGKSLSIPRGHIH-ISAFMIDRDYNFTHRINKFSFGGPSP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
GVV+PL+G + +YQYF++VVPT + T +S T ++ Q+SV ++ R + Q
Sbjct: 228 GVVHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDYQRPIDHQKGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+FF YD+S +K+ T++ + FL +CA VGG+F SG++ +
Sbjct: 286 HGVPGIFFKYDMSALKIKVTQQRDTVSQFLVKLCATVGGIFVTSGLVKNIV 336
>gi|417409674|gb|JAA51332.1| Putative endoplasmic reticulum-golgi intermediate compartment
protein, partial [Desmodus rotundus]
Length = 318
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 133 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 180
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 181 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 240
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 241 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 298
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 299 DSCIFTASEA-WKKIQLGKM 317
>gi|114603487|ref|XP_001145588.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan troglodytes]
Length = 424
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 239 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 286
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 287 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 346
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 347 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 404
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 405 DSCIFTASEA-WKKIQLGKM 423
>gi|410949214|ref|XP_003981318.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Felis catus]
Length = 398
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 213 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 260
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 261 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 320
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 321 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 378
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 379 DSCIFTASEA-WKKIQLGKM 397
>gi|281351238|gb|EFB26822.1| hypothetical protein PANDA_005115 [Ailuropoda melanoleuca]
Length = 238
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 53 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 100
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 101 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 160
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 161 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 218
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 219 DSCIFTASEAW-KKIQLGKM 237
>gi|115452719|ref|NP_001049960.1| Os03g0321400 [Oryza sativa Japonica Group]
gi|113548431|dbj|BAF11874.1| Os03g0321400, partial [Oryza sativa Japonica Group]
Length = 83
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/83 (71%), Positives = 70/83 (84%), Gaps = 1/83 (1%)
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
QFSVTEHFR + G + PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+
Sbjct: 1 QFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVA 59
Query: 274 GIIDAFIYHGQRAIKKKIEIGKF 296
GIID+F+YHG RAIKKK+EIGK
Sbjct: 60 GIIDSFVYHGHRAIKKKMEIGKL 82
>gi|50510831|dbj|BAD32401.1| mKIAA1181 protein [Mus musculus]
Length = 320
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 135 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 182
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 183 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 242
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 243 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 300
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 301 DSCIFTASEAW-KKIQLGKI 319
>gi|123449396|ref|XP_001313417.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121895300|gb|EAY00488.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 361
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 43/281 (15%)
Query: 19 RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVR 78
RLDSQG IE+ + ++ +Q CGSCY A+ CC +C+EV
Sbjct: 117 RLDSQGKPIEALD---LSTLVNTTVQEK----------CGSCYNAKDPKRICCRSCQEVF 163
Query: 79 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 138
+AYR + I+QCK +++ + EGEGC + + +VA H APG S++
Sbjct: 164 DAYRDAAFKPPVLTEIEQCKPVA--EKVAKMEGEGCKVDASFKALRVASEMHIAPGYSWN 221
Query: 139 QSGVHVHDILAFQRD--SFNISHKINKLAFGEH---FPGVVNPLDGVRWTQETPSGMYQY 193
G HVHD+ F ++ S N++H I+ L+F E +P +N L+ V +T +G +
Sbjct: 222 SEGWHVHDLSLFTKEFASLNLTHTIHYLSFSEKEGDYP--LNNLNNV----QTENGAW-- 273
Query: 194 FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK-VTFTEEH 252
+VV T ++ Q + F S G+FF YD+SPI VT+T+
Sbjct: 274 --RVVYTADILEGNYSASKYQMYNPKSFAS----------GLFFKYDVSPISAVTYTDSE 321
Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
F H LT + ++GGV + +IDA +H +R +K+ EI
Sbjct: 322 PVF-HLLTRILTVLGGVLGLCRLIDAITFHTRR-MKRTEEI 360
>gi|407927953|gb|EKG20833.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
Length = 366
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/167 (40%), Positives = 97/167 (58%), Gaps = 14/167 (8%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L+ N+V G+FH A G + + G H+ FN SH+IN+L+FG ++
Sbjct: 171 DSCRIYGSLDANRVQGDFHITARGHGYMEFGEHL------DHSQFNFSHQINELSFGPYY 224
Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
P + NPLD R TP +QY++ VVPTVYTD S HTI +NQ++VTE S +
Sbjct: 225 PSLTNPLDYTRAVTPTPDDHFYKFQYYLSVVPTVYTDNS-HTIVTNQYAVTEQSHSVPE- 282
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
++PGVF +D+ PIK+T +E + FL L + +V GV G
Sbjct: 283 --MSVPGVFVKFDIEPIKLTISEYNGGFLALLIRLVNVVSGVMVAGG 327
>gi|355686511|gb|AER98080.1| endoplasmic reticulum-golgi intermediate compartment 1 [Mustela
putorius furo]
Length = 312
Score = 122 bits (305), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 128 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 175
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 176 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 235
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 236 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 293
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 294 DSCIFTASEA-WKKIQLGKM 312
>gi|390459630|ref|XP_002744599.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Callithrix jacchus]
Length = 342
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 157 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 204
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 205 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 264
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 265 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 322
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 323 DSCIFTASEA-WKKIQLGKM 341
>gi|351705474|gb|EHB08393.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Heterocephalus glaber]
Length = 305
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 120 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 167
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 168 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQWYSYQYTVA 227
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 228 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 285
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 286 DSCIFTASEA-WKKIQLGKM 304
>gi|301763094|ref|XP_002916978.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Ailuropoda melanoleuca]
Length = 306
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 121 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 168
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 169 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 228
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 229 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 286
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 287 DSCIFTASEA-WKKIQLGKM 305
>gi|354477345|ref|XP_003500881.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cricetulus griseus]
Length = 333
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 148 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 195
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 196 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 255
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 256 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 313
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 314 DSCIFTASEA-WKKIQLGKI 332
>gi|340507573|gb|EGR33515.1| hypothetical protein IMG5_050820 [Ichthyophthirius multifiliis]
Length = 290
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/191 (38%), Positives = 109/191 (57%), Gaps = 24/191 (12%)
Query: 103 LQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
LQRI++ + EGC + GF+ VN+V GNFH + +F Q +V I ++ ++SHK
Sbjct: 88 LQRIQQAIQNKEGCKLSGFMYVNRVPGNFHIS-CHAFGQILGYVFRITGI--NTIDLSHK 144
Query: 161 INKLAFGEH----------FPGVVNPLDGVRWTQ----ETPSGMYQYFIKVVPTVYTDVS 206
IN L+FG+ GV+NP+D + T+ E Y Y++ VVPT Y D
Sbjct: 145 INHLSFGDEDEIKIVKKQFTLGVLNPMDKLVKTKQKHFENYGISYNYYLNVVPTTYIDEW 204
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
G+T NQF TE+ Q + +P ++F YDLSP+ V F ++ + FLHFL V AIV
Sbjct: 205 GYTYYVNQFVFTEN-----QIQTDYIPAIYFRYDLSPVTVMFKKDRMPFLHFLVQVSAIV 259
Query: 267 GGVFTVSGIID 277
GG+FT++ +D
Sbjct: 260 GGIFTIAAFMD 270
>gi|13385678|ref|NP_080446.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Mus
musculus]
gi|52000733|sp|Q9DC16.1|ERGI1_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|12835932|dbj|BAB23423.1| unnamed protein product [Mus musculus]
gi|13529617|gb|AAH05516.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|26351067|dbj|BAC39170.1| unnamed protein product [Mus musculus]
gi|26353098|dbj|BAC40179.1| unnamed protein product [Mus musculus]
gi|53236959|gb|AAH83144.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
musculus]
gi|71059789|emb|CAJ18438.1| 1200007D18Rik [Mus musculus]
gi|74185526|dbj|BAE30231.1| unnamed protein product [Mus musculus]
gi|148690563|gb|EDL22510.1| RIKEN cDNA 1200007D18 [Mus musculus]
gi|158148953|dbj|BAF82010.1| MAA-136 protein [Mus musculus]
Length = 290
Score = 121 bits (304), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKI 289
>gi|410349413|gb|JAA41310.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349417|gb|JAA41312.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|71409973|ref|XP_807304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70871276|gb|EAN85453.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 393
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 146/299 (48%), Gaps = 40/299 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D++G +L+V +IFK +D+QGN I +RQ G+G + ++ +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164
Query: 59 SCYGAE---SSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
C+ +E S E+ CCN C +V AY ++G + ++QC + L RI
Sbjct: 165 RCFISEHQLSMSENKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYD--LSRINP---- 218
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
GCN G L V K G FAP + G + D++ F DS SH INKL+ G+
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIRDVMQF--DS---SHIINKLSIGDERVT 271
Query: 171 ----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
GV +PL+G + + +YF+KVVPT+Y +SG S F+ T +
Sbjct: 272 RFSRRGVQHPLNGHEFDTQRRFTEIRYFLKVVPTMY--LSGK--NSASFNATYEYSVQWS 327
Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
RL + P V +D P++V SF HFL +C IVGG+F V G+ID +
Sbjct: 328 HRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFLVQLCGIVGGLFVVLGLIDGLV 386
>gi|313247758|emb|CBY15879.1| unnamed protein product [Oikopleura dioica]
Length = 285
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 77/191 (40%), Positives = 102/191 (53%), Gaps = 22/191 (11%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
K ++ GC +G VNKV GNFH + S Q H HD FN HKINKL F
Sbjct: 108 KNQQKSGCRFHGEFYVNKVPGNFHVSTHASKKQP--HKHD--------FN--HKINKLFF 155
Query: 167 GE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
GE PG L G T E PS Y Y +K+VPTV+ D T Q++VT
Sbjct: 156 GEDLSALELPGNQTSLAGQATTNE-PSLSYDYTLKIVPTVHNDNKRRTTFGYQYTVTSKT 214
Query: 222 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
+ +G P ++F Y+++PI V +T + F H LT +CAIVGG FTV+G+ID+ I+
Sbjct: 215 FKNTRGT----PAIWFRYEIAPITVKYTHKKKPFYHLLTTICAIVGGTFTVAGMIDSMIF 270
Query: 282 HGQRAIKKKIE 292
+A+KK E
Sbjct: 271 SAHQAVKKASE 281
>gi|403290258|ref|XP_003936243.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Saimiri boliviensis boliviensis]
Length = 415
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 230 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 277
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 278 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGRQQYSYQYTVA 337
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 338 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 395
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 396 DSCIFTASEAW-KKIQLGKM 414
>gi|338713524|ref|XP_001499596.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Equus caballus]
Length = 356
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
++ G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 171 MKVPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 218
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 219 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 278
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 279 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 336
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 337 DSCIFTASEAW-KKIQLGKM 355
>gi|397485838|ref|XP_003814045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pan paniscus]
Length = 290
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDMLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|402873423|ref|XP_003900575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Papio anubis]
gi|380784387|gb|AFE64069.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|383408185|gb|AFH27306.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
gi|384941372|gb|AFI34291.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Macaca mulatta]
Length = 290
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|348575225|ref|XP_003473390.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Cavia porcellus]
Length = 345
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 160 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 207
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 208 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 267
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 268 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 325
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 326 DSCIFTASEAW-KKIQLGKM 344
>gi|432100023|gb|ELK28916.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Myotis davidii]
Length = 298
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 113 MKIPLNSGAGCRFEGQFSINKVPGNFH-----------VSTHSASA-QPQNPDMTHVIHK 160
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 161 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 220
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 221 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 278
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 279 DSCIFTASEA-WKKIQLGKM 297
>gi|350594414|ref|XP_003134100.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Sus scrofa]
Length = 313
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 106/200 (53%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I +G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 128 MKIPLNDGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPPNPDMTHVIHK 175
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 176 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 235
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 236 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 293
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 294 DSCIFTASEA-WKKIQLGKM 312
>gi|72534712|ref|NP_001026881.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Homo sapiens]
gi|332248275|ref|XP_003273290.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Nomascus leucogenys]
gi|426351000|ref|XP_004043047.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Gorilla gorilla gorilla]
gi|51701446|sp|Q969X5.1|ERGI1_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|15215343|gb|AAH12766.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|15680269|gb|AAH14490.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[Homo sapiens]
gi|119581826|gb|EAW61422.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1,
isoform CRA_a [Homo sapiens]
gi|208966210|dbj|BAG73119.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
[synthetic construct]
gi|410301142|gb|JAA29171.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
gi|410349415|gb|JAA41311.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
troglodytes]
Length = 290
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|145349688|ref|XP_001419260.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579491|gb|ABO97553.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 310
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 99/184 (53%), Gaps = 15/184 (8%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
R + + EGC ++G LE +VAG + G ++ ++D + ++ H +
Sbjct: 134 REAKADVEGCRLHGELEARRVAGTLRASTGPESYEFLKEIYD----EPWEIDMRHAVKTF 189
Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------TIQSNQFS 216
FG FPG VNP++GVR ET SG+Y+YF+KVVPT Y+ ++NQ+S
Sbjct: 190 TFGAEFPGAVNPMNGVR-RMETKSGIYKYFMKVVPTTYSSTRALFGFIPWTVRTRTNQYS 248
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
VTEHF E LP +FF YDLS I V T S ++FLT A +GG+F ++ +
Sbjct: 249 VTEHF--IETPHWGALPQLFFIYDLSAIAVNITVTSKSIVYFLTKTLATMGGIFALTRTV 306
Query: 277 DAFI 280
D +I
Sbjct: 307 DRYI 310
>gi|355691849|gb|EHH27034.1| hypothetical protein EGK_17136, partial [Macaca mulatta]
gi|355750428|gb|EHH54766.1| hypothetical protein EGM_15664, partial [Macaca fascicularis]
Length = 290
Score = 121 bits (303), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|395736490|ref|XP_002816264.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Pongo abelii]
Length = 290
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|407424942|gb|EKF39210.1| hypothetical protein MOQ_000571 [Trypanosoma cruzi marinkellei]
Length = 393
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 93/299 (31%), Positives = 144/299 (48%), Gaps = 40/299 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D++G +L+V +IFK +D+QGN I +RQ G+G + ++ +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164
Query: 59 SCY------GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
C+ + + CCN C++V AY ++G ++QC + L RI
Sbjct: 165 RCFINEHQVSVKENKNRCCNTCDDVLNAYDQQGLPRPRKSEVEQCIYD--LSRINP---- 218
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
GCN G L V K G FAP + G + D++ F DS SH INKL+ G+
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--SGGFLIKDVMQF--DS---SHVINKLSIGDERVT 271
Query: 171 ----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
GV +PL+G ++ + +YF+K+VPT+Y +SG S F+ T +
Sbjct: 272 RFSRRGVQHPLNGHKFDTQRRITEIRYFLKIVPTMY--LSGK--NSAPFNATYEYSVQWS 327
Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
RL + P V +D P++V SF HF+ +C IVGG+F V G+ID +
Sbjct: 328 QRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|322792517|gb|EFZ16475.1| hypothetical protein SINV_13267 [Solenopsis invicta]
Length = 110
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 58/110 (52%), Positives = 80/110 (72%), Gaps = 7/110 (6%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGVFFFYDLSPIK 245
M+ ++IK+VPT Y G T+ +NQFSVT H ++Q L T +PG+FF Y+LSP+
Sbjct: 4 MFYHYIKIVPTTYVRADGSTLLTNQFSVTRH---AKQVSLLTGESGMPGIFFSYELSPLM 60
Query: 246 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
V +TE+ SF HF TN CAI+GGVFTV+G+ID+ +YH RAI++KIE+GK
Sbjct: 61 VKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELGK 110
>gi|440902711|gb|ELR53466.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1,
partial [Bos grunniens mutus]
Length = 290
Score = 120 bits (301), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|390337315|ref|XP_792272.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Strongylocentrotus purpuratus]
gi|390337317|ref|XP_003724529.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Strongylocentrotus purpuratus]
Length = 388
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 98/171 (57%), Gaps = 4/171 (2%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C ++G L NKVAGNFH GKS H H L +++N SH+I+ ++G P
Sbjct: 169 DACRLHGSLTTNKVAGNFHVTIGKSIPHPRGHAHLALMIDPNNYNFSHRIDHFSYGTPVP 228
Query: 172 GVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
G+VNPLDG ++ T E+ +YQYFI++VPT ++Q++VTE R G
Sbjct: 229 GIVNPLDGDLKVTNESLQ-IYQYFIQIVPT-KVKTRAAKAHTHQYAVTERERVINHGAGS 286
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+FF Y+LS + ++ E + F L +C IVGGVF SGII++ +
Sbjct: 287 HGVTGIFFKYELSSLVISVEEVYDPFWKLLVRLCGIVGGVFATSGIINSLM 337
>gi|426246271|ref|XP_004016918.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1 [Ovis aries]
Length = 290
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|158292441|ref|XP_001688474.1| AGAP005044-PB [Anopheles gambiae str. PEST]
gi|157016994|gb|EDO64057.1| AGAP005044-PB [Anopheles gambiae str. PEST]
Length = 287
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 102/172 (59%), Gaps = 2/172 (1%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
I E+ + C I+G L +NKVAGNFH GK+ H S H+H F N SH+IN+ +
Sbjct: 79 IPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFANTQTNFSHRINRFS 138
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG+H G+++PL+G + M QYFI+VVPT H+ ++ Q++V E+ + +
Sbjct: 139 FGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-KTYQYTVRENLQLID 197
Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ +Q + G++F YD+S ++V ++ S HF+ + +I+ G+ +SG++
Sbjct: 198 IDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGML 249
>gi|307206941|gb|EFN84785.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Harpegnathos saltator]
Length = 396
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 101/171 (59%), Gaps = 6/171 (3%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GKS H+H I AF D +N +H+IN+ +FG P
Sbjct: 169 ACRIHGSLNVNKVAGNFHITTGKSLSVPRGHIH-ISAFMTDRDYNFTHRINRFSFGGPSP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
G+V+PL+G + +YQYF++VVPT + T +S T ++ Q+SV ++ R
Sbjct: 228 GIVHPLEGDEKIADYNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDYQRPINHNEGS 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+F Y++S +K+ T++ + FL +CA VGG+F SG+I +
Sbjct: 286 HGVPGIFIKYNMSALKIKVTQQRDTIFQFLVKLCATVGGIFVTSGLIKNIV 336
>gi|149052230|gb|EDM04047.1| rCG34297 [Rattus norvegicus]
Length = 283
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 98 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 145
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 146 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 205
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 206 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 263
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 264 DSCIFTASEA-WKKIQLGKI 282
>gi|392331685|ref|XP_003752358.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Rattus norvegicus]
Length = 290
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKI 289
>gi|344265732|ref|XP_003404936.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Loxodonta africana]
Length = 338
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 153 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 200
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G P + Y +K+VPTVY D +G S Q++V
Sbjct: 201 LSFGDTLQVQNVQGAFNALGGADRLHSNPLASHDYILKIVPTVYEDKNGKQRYSYQYTVA 260
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 261 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 318
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 319 DSCIFTASEAW-KKIQLGKM 337
>gi|158292439|ref|XP_313915.3| AGAP005044-PA [Anopheles gambiae str. PEST]
gi|157016993|gb|EAA09437.3| AGAP005044-PA [Anopheles gambiae str. PEST]
Length = 371
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 102/172 (59%), Gaps = 2/172 (1%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
I E+ + C I+G L +NKVAGNFH GK+ H S H+H F N SH+IN+ +
Sbjct: 163 IPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFANTQTNFSHRINRFS 222
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG+H G+++PL+G + M QYFI+VVPT H+ ++ Q++V E+ + +
Sbjct: 223 FGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-KTYQYTVRENLQLID 281
Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ +Q + G++F YD+S ++V ++ S HF+ + +I+ G+ +SG++
Sbjct: 282 IDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGML 333
>gi|307105802|gb|EFN54050.1| hypothetical protein CHLNCDRAFT_136126 [Chlorella variabilis]
Length = 319
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/180 (36%), Positives = 93/180 (51%), Gaps = 17/180 (9%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEH 169
EGCNI+G+LEV +VAGN HFA ++ I+ D+ NISH
Sbjct: 152 EGCNIHGWLEVQRVAGNVHFAVRPEALFLSMNAEAIMQLHPDASKLNISHA--------- 202
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
NPL+GV T +G+ +YF+KVVPT + + G + Q+SVTE++ G
Sbjct: 203 -----NPLEGVAQIDRTATGIDKYFVKVVPTDFYTLWGRKTHTYQYSVTEYYHQFRGGEE 257
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
Q P V+ YD SPI V E L L VCA+VGG F ++G+ D ++ A+K+
Sbjct: 258 QP-PAVYLLYDASPIMVDIREMRPGLLRLLVRVCAVVGGAFALTGLFDKMVHRAVVAVKR 316
>gi|7341109|gb|AAF61208.1|AF216751_1 CDA14 [Homo sapiens]
Length = 378
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 97/168 (57%), Gaps = 7/168 (4%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI-SHKINKLAFGEHFP 171
C I+G L VNKVAGNFH GK+ H H Q + I SH+I+ L+FGE P
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCQPWNLTIFSHRIDHLSFGELVP 228
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R +
Sbjct: 229 AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 285
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 286 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 333
>gi|115497382|ref|NP_001069885.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Bos
taurus]
gi|111308658|gb|AAI20358.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Bos
taurus]
Length = 290
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|308806572|ref|XP_003080597.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
gi|116059058|emb|CAL54765.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
Length = 327
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 102/191 (53%), Gaps = 29/191 (15%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPG-KSFHQSGVHVHDILAFQRDSFN------I 157
R + + EGC ++G +E +VAG+ + G +SF F R+ FN
Sbjct: 141 RKAKADMEGCRLHGRVEARRVAGSLRISTGPESFE-----------FLREMFNEPWEIDA 189
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------HT 209
H I AFG FPG VNPL+GV+ +E SG+Y+YF+KVVPT Y +
Sbjct: 190 RHAIKTFAFGPEFPGSVNPLNGVK-RKEKKSGIYKYFMKVVPTTYANSRNLFGMIPWTMR 248
Query: 210 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
+++NQ+SVTEHF +E LP + F YD+S I V + S ++FLT A VGGV
Sbjct: 249 VRTNQYSVTEHF--TESAHWGMLPQILFSYDISAISVNVESQSKSGVYFLTKTIATVGGV 306
Query: 270 FTVSGIIDAFI 280
F ++ ID ++
Sbjct: 307 FALTRTIDRYV 317
>gi|392577310|gb|EIW70439.1| hypothetical protein TREMEDRAFT_43159 [Tremella mesenterica DSM
1558]
Length = 435
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 63/168 (37%), Positives = 91/168 (54%), Gaps = 9/168 (5%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
K + G C IYG +EV KV N H + S H L N+SH +++ +F
Sbjct: 196 KADNGPACRIYGSVEVKKVTANLHITTLGHGYMSFEHTDHAL------MNLSHVVHEFSF 249
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G FP + PLD + P QYF++VVPT Y D +G + ++Q++VT++ RS +
Sbjct: 250 GPFFPAIAQPLDMTMQVSDNPFTAIQYFLRVVPTTYIDANGRKLVTSQYAVTDYLRSFQH 309
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC-AIVGGVFTVS 273
G Q +PG+FF YDL + VT E S HF+ + IVGGV+TV+
Sbjct: 310 G--QGVPGIFFKYDLEAMAVTVRERTTSLYHFVIRLIGVIVGGVWTVA 355
>gi|296475934|tpg|DAA18049.1| TPA: endoplasmic reticulum-golgi intermediate compartment 32 kDa
protein [Bos taurus]
Length = 290
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 152
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVA 212
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289
>gi|242006215|ref|XP_002423949.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
gi|212507219|gb|EEB11211.1| Endoplasmic reticulum-golgi intermediate compartment protein,
putative [Pediculus humanus corporis]
Length = 349
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 112/208 (53%), Gaps = 11/208 (5%)
Query: 86 WALSNPDLID-QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 144
W ++P I+ R+ R + C IYG L +NKVAGNFH + GKS H+
Sbjct: 149 WKSASPSFINVYVPRKNLPNR----PYDACRIYGELVLNKVAGNFHISAGKSLQLPRGHI 204
Query: 145 HDILAFQRDS-FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VY 202
H I F D FN SH++N +FG++ PG+V+PL+G YQYFI+VVPT V
Sbjct: 205 H-IATFMSDKEFNFSHRLNYFSFGDYSPGIVHPLEGDEKIATDAMMSYQYFIEVVPTEVK 263
Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
T ++ + Q+SV ++ R +PG+FF YD+S +KV +E S ++F
Sbjct: 264 TFLTNQL--TYQYSVKDYQRPINHNTGSHGIPGIFFKYDMSALKVIVMQERDSPINFAVK 321
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+CA +GG+ SG+++ I + KK
Sbjct: 322 LCASIGGIHITSGLVNNIILYLINFYKK 349
>gi|270003406|gb|EEZ99853.1| hypothetical protein TcasGA2_TC002635 [Tribolium castaneum]
Length = 380
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 107/173 (61%), Gaps = 8/173 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEH 169
+ C I+G L +NKV+GNFH GKS + H+H I AF +RD +N SH+I+ +FG+
Sbjct: 175 DACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD-YNFSHRIDTFSFGDS 232
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
PG+++PL+G ++ YFI+VVPT V T ++ + + Q+SV E R + +
Sbjct: 233 SPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNTYQYSVKELNRPIDHDK 290
Query: 229 -LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+FF YD+S +KVT ++E FL +C+I+GG+F SG +++F+
Sbjct: 291 GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFVCSGFVNSFV 343
>gi|189235693|ref|XP_966630.2| PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
Length = 373
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 107/173 (61%), Gaps = 8/173 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEH 169
+ C I+G L +NKV+GNFH GKS + H+H I AF +RD +N SH+I+ +FG+
Sbjct: 168 DACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD-YNFSHRIDTFSFGDS 225
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
PG+++PL+G ++ YFI+VVPT V T ++ + + Q+SV E R + +
Sbjct: 226 SPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNTYQYSVKELNRPIDHDK 283
Query: 229 -LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+PG+FF YD+S +KVT ++E FL +C+I+GG+F SG +++F+
Sbjct: 284 GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFVCSGFVNSFV 336
>gi|301626814|ref|XP_002942582.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like, partial [Xenopus (Silurana) tropicalis]
Length = 298
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I GC GF +NKV GNFH V H +A Q + ++ H I+K
Sbjct: 113 MKIPINNAHGCRFEGFFSINKVPGNFH-----------VSTHSAMA-QPANPDMRHIIHK 160
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG E+ G N L G + Y +K+VPTVY D++G S Q++V
Sbjct: 161 LSFGNTLQVENIHGAFNALGGADKLASQALESHDYVLKIVPTVYEDMNGEQQFSYQYTVA 220
Query: 219 E--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ S GR+ +P ++F YDLSPI V +TE F+T VCAI+GG FTV+GI+
Sbjct: 221 NKAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPIYRFITTVCAIIGGTFTVAGIL 278
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+FI+ A KKI++GK
Sbjct: 279 DSFIFTASEA-WKKIQLGKM 297
>gi|328862174|gb|EGG11276.1| hypothetical protein MELLADRAFT_33547 [Melampsora larici-populina
98AG31]
Length = 361
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/226 (33%), Positives = 112/226 (49%), Gaps = 24/226 (10%)
Query: 51 EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE-- 108
E E G E++++ + + VR+A + GW R F ++ K
Sbjct: 111 EGTEFSIGQAARLETNNDAGISASKMVRDA--QGGWT-----------RPTF-KKTKPLI 156
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
EG C I+G V KV GN H + S H L N++H I++ +FGE
Sbjct: 157 PEGPACRIFGSTHVKKVTGNLHITTLGHGYLSWEHTDHQL------MNLTHVISEFSFGE 210
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
FP +V PLD + P ++QYFI VVPT Y + G + +NQ+SVT+ RS+E GR
Sbjct: 211 FFPNMVQPLDNSVEITDKPFHIFQYFISVVPTTYINSGGRQVFTNQYSVTDMSRSTEHGR 270
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+PG+FF YD+ P+ +T E + + FL + IVGG+ +G
Sbjct: 271 --GVPGIFFKYDIEPMYLTIRERTTTLVQFLVRLAGIVGGIVVCTG 314
>gi|407859749|gb|EKG07137.1| hypothetical protein TCSYLVIO_001725 [Trypanosoma cruzi]
Length = 393
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/299 (31%), Positives = 142/299 (47%), Gaps = 40/299 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D++G +L+V +IFK +D+QGN I +RQ G+G + ++ +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164
Query: 59 SCYGAE------SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
C+ +E + CCN C +V AY ++G + ++QC E L I
Sbjct: 165 RCFISEHQLSMMDNKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYE--LSLINP---- 218
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
GCN G L V K G FAP + G + D++ F DS SH INKL+ G+
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIKDVMQF--DS---SHIINKLSIGDERVT 271
Query: 172 -----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
GV +PL+G + + +YF+KVVPT+Y SG S F+ T +
Sbjct: 272 RFSRRGVQHPLNGHEFVAQRRFTEIRYFLKVVPTMY--FSGK--NSASFNATYEYSVQWS 327
Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
RL + P V +D P++V SF HF+ +C IVGG+F V G+ID +
Sbjct: 328 HRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386
>gi|225712696|gb|ACO12194.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Lepeophtheirus salmonis]
Length = 372
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 99/178 (55%), Gaps = 6/178 (3%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
I +E + C I+G L +NKVAGNFH +PGK+ HVH + +N +H+I++ +
Sbjct: 166 IPDEPHDACRIHGSLTLNKVAGNFHISPGKTLPLFRAHVHFATFGGDEVYNFTHRIDRFS 225
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT---IQSNQFSVTEHFR 222
FG G+V PL+G S YQY I+VVP TD+ G+T + Q+SV EH R
Sbjct: 226 FGTPHGGIVQPLEGEEKIAMQDSMHYQYLIQVVP---TDIQGYTDLIWSTYQYSVKEHKR 282
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
++++ PG++F YD+S +KV +++ FL + A VGG S I+ FI
Sbjct: 283 ATKERGSGDTPGIYFKYDMSALKVLASQDREPIFKFLVRLLAAVGGRIATSQIVCVFI 340
>gi|148678794|gb|EDL10741.1| ERGIC and golgi 2, isoform CRA_a [Mus musculus]
Length = 375
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 91/168 (54%), Gaps = 16/168 (9%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 176 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 235
Query: 172 GVVNPLDGVR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGR 228
G++NPLDG P+ ++ Y I + ++QFSVTE R +
Sbjct: 236 GIINPLDGTEKIAVDLVPTKLHTYKI-------------SADTHQFSVTERERIINHAAG 282
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ G+F YDLS + VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 283 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 330
>gi|154418008|ref|XP_001582023.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121916255|gb|EAY21037.1| hypothetical protein TVAG_172950 [Trichomonas vaginalis G3]
Length = 371
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 78/245 (31%), Positives = 116/245 (47%), Gaps = 14/245 (5%)
Query: 56 YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
YCG+CY S+D+ CCN C EV + ++ KG +QC REG L + E C
Sbjct: 134 YCGNCY--LSTDKKCCNTCREVMDVFKAKGLTYYASFRWEQCIREGVL----DFGNETCR 187
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
I G L+V K +GNFH A G + + + H HD+ + S ++H I+ L FGE
Sbjct: 188 IKGKLKVKKQSGNFHIALGANTNDNYKGHSHDLSSVDA-SHKLNHVIHSLTFGEPVDYYK 246
Query: 175 NPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
L V +G M Y++ P + + I S ++S R +
Sbjct: 247 PQLTDVEMQLPELNGSNYWMVTYYLHAAPERIS--TTDKIDSYRYSAFPSRRKVTNKTKK 304
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
PG+ F+YD +P+ V + H S + ++C IVGG F+ + IIDA + I+ K
Sbjct: 305 GFPGIVFYYDFAPMIVVYQPTHGSIRSIIVDICGIVGGAFSFAAIIDALAFGALSGIRGK 364
Query: 291 IEIGK 295
IGK
Sbjct: 365 TMIGK 369
>gi|321258600|ref|XP_003194021.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
gi|317460491|gb|ADV22234.1| ER to Golgi transport-related protein, putative [Cryptococcus
gattii WM276]
Length = 444
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 92/171 (53%), Gaps = 14/171 (8%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
K ++G C IYG ++V KV N H H ++FQ N+SH +++
Sbjct: 204 KVQDGPACRIYGSVQVKKVTANLHITTLG---------HGYMSFQHTDHHLMNLSHVVHE 254
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
+FG FP + PLD P ++QYF++VVPT Y D S + ++Q++VT++ RS
Sbjct: 255 FSFGPFFPAIAQPLDQSYEITLQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 314
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
E G+ +PG+FF YDL P+ V E S FL + +VGGV+TV+
Sbjct: 315 FEHGK--GVPGLFFKYDLEPMSVVIRERTTSLFQFLIRLAGVVGGVWTVAA 363
>gi|412991249|emb|CCO16094.1| predicted protein [Bathycoccus prasinos]
Length = 409
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 109/235 (46%), Gaps = 69/235 (29%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E+ EGC +YG + V +V GNFH + +++ H + + NISH I L+FG
Sbjct: 173 EKKEGCRLYGRMHVQRVGGNFHISAHAEEYETLQHAFGAV----NKINISHTITHLSFGA 228
Query: 169 HFPGVVNPLDGV------------------------------------------------ 180
+PG+VNPLDGV
Sbjct: 229 GYPGLVNPLDGVARSGSDDEFHYDESSKDSRSSDRKNIEKEKEEEEKRKKKEQVRRSRLM 288
Query: 181 --RWTQETPSGMYQYFIKVVPTVYTDVSG---------HTIQSNQFSVTEHFRSSEQGRL 229
W E SG+Y+YF+K+VPT Y ++ +NQ+SVTE+FR ++
Sbjct: 289 DLTW-DENGSGVYKYFLKLVPTFYRTHRSVFLGLFSWTKSVSTNQYSVTEYFRKTDAWS- 346
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT----VSGIIDAFI 280
+LP V+F YD SPI VT + F++FLT +CA+ GGVF +S ++DA +
Sbjct: 347 GSLPAVYFLYDFSPIAVTIDTKRPHFVYFLTRLCAVCGGVFAFAHMISNLVDALL 401
>gi|358333955|dbj|GAA52416.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Clonorchis sinensis]
Length = 306
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 91/168 (54%), Gaps = 6/168 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ CNI G V KVAGN H PG+ F G HVH + FN SH+IN L+FG
Sbjct: 86 DACNIVGTFHVQKVAGNMHVLPGRPFDGPGGSHVHIAPFVRLADFNFSHRINHLSFGAQV 145
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
VNPLD V P ++Y+I +VPT VY + ++ + Q+++T R++E +
Sbjct: 146 ANRVNPLDAVEEISYNPMETFRYYISIVPTRVVY---AFSSLDTYQYAITVKNRTAEGNK 202
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
++PG+FF YD P+ V TE F FL + A+VGG+F G I
Sbjct: 203 SDSIPGIFFSYDTFPLLVQVTESRELFGTFLARLAALVGGLFATVGFI 250
>gi|331239265|ref|XP_003332286.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|309311276|gb|EFP87867.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 366
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 92/175 (52%), Gaps = 12/175 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG +V KV GN H + S H L N+SH I + +FG+
Sbjct: 157 DGPACRIYGNTQVKKVTGNLHITTLGHGYLSWEHTDHKL------MNLSHVITEFSFGQF 210
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP +V PLD + P ++QYFI VVPT Y D G + +NQ+SVT+ R E G
Sbjct: 211 FPKIVQPLDNSVELTDKPFHIFQYFISVVPTTYIDRLGRQLHTNQYSVTDMSRPVEHG-- 268
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG----IIDAFI 280
Q +PG+FF YD+ P+ + E S + FL + ++GG+ +G ++D F+
Sbjct: 269 QGIPGLFFKYDMEPMSLILHERTTSLIQFLVRLAGMIGGIVVCTGWTFRLVDRFV 323
>gi|340372649|ref|XP_003384856.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Amphimedon queenslandica]
Length = 347
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 1/169 (0%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C ++G ++VNKV+GNFH G++ H H + N SH+I+ FG PG
Sbjct: 164 SCRVHGHIQVNKVSGNFHITAGQAVPHPQGHAHLSAFVPTNMINFSHRIDSFGFGVSTPG 223
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
+V+PL+G + ++QY+I++VPT G + +NQ+SVTE R+ S +
Sbjct: 224 MVDPLEGTYVIARESNRLFQYYIQIVPTTLQMRGGSDLHTNQYSVTERNRAISHKAGSHG 283
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
LPG+FF Y++ + V E FL +CAIVGGVF G+I F+
Sbjct: 284 LPGLFFKYEIYSLMVLMKEVDRPLSLFLVRLCAIVGGVFATLGMISQFL 332
>gi|198422133|ref|XP_002131157.1| PREDICTED: similar to ptx1 [Ciona intestinalis]
Length = 391
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 63/172 (36%), Positives = 94/172 (54%), Gaps = 6/172 (3%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C YG L +NKVAGNFH GK G H H + F +N SH+I+ +FG
Sbjct: 173 DACRFYGNLPLNKVAGNFHIVAGKPIQMFGGHAHLSMMFSPIPYNFSHRIDHFSFGNMKT 232
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRSSEQGR- 228
G +N LDG + S ++QY++ VV T ++ I ++ QFSV+E R+ +
Sbjct: 233 GFINALDGDERVTSSESYIFQYYLDVVS---TKINSRRITTDTFQFSVSEQSRALDHASG 289
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
PGVFF Y+ SP+ V TE+ + F L +C+IVGG+F S +++A +
Sbjct: 290 SHGQPGVFFKYNFSPLSVMITEQKMPFYRLLVRLCSIVGGIFATSHVLNALL 341
>gi|156553212|ref|XP_001600226.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Nasonia vitripennis]
Length = 391
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 98/176 (55%), Gaps = 2/176 (1%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
I C IYG L+VNKVAGNFH GKS H H ++N +H+IN+ +
Sbjct: 161 IPSYPSNACRIYGSLDVNKVAGNFHVTSGKSVILPRGHFHFTSFHSSTAYNFTHRINRFS 220
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG+ PG+++PL+G ++QYFI+VV T ++ H ++ Q+SV +H R
Sbjct: 221 FGKPSPGIIHPLEGDEKITTDNMMLFQYFIEVVSTD-INMLMHKSKTYQYSVKDHQRPIN 279
Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ +PG+FF YD S +K+ ++E S FL +CA VG +F +GI+++ +
Sbjct: 280 HAKGSHGIPGIFFKYDTSALKIKVSQERDSIGQFLVKLCATVGCIFVTNGILNSIV 335
>gi|385302035|gb|EIF46185.1| erv46p [Dekkera bruxellensis AWRI1499]
Length = 266
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 64/180 (35%), Positives = 97/180 (53%), Gaps = 17/180 (9%)
Query: 1 MDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
MD++G+ D+ + + + RLD G I + + K++K + YCGS
Sbjct: 89 MDLTGDVQADILEGNFLRTRLDRDGKEIATDE----PFKVNKEDXVKSELSTEDSQYCGS 144
Query: 60 CYGA--ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
CYGA +S +E CCN+CE V+ AY K W + + I+QC++EG++ RI +
Sbjct: 145 CYGAIDQSGNEKESDPTKWVCCNSCEAVKLAYSKAAWKFYDGEGIEQCEKEGYVDRINKR 204
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
EGC + G ++N++ GN HFAPG S + HVHD+ F + D FN H IN +FG
Sbjct: 205 LDEGCRVKGTAQLNRIGGNLHFAPGSSITMNDRHVHDLSLFDKHQDKFNFDHVINHFSFG 264
>gi|321465392|gb|EFX76393.1| hypothetical protein DAPPUDRAFT_306117 [Daphnia pulex]
Length = 289
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 25/207 (12%)
Query: 101 GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
GF++ + +G+GC +N+V GNFH + H D Q DS ++
Sbjct: 98 GFIENTLKTPWNKGKGCIFESRFHINRVPGNFHVS---------THSADK---QPDSADM 145
Query: 158 SHKINKLAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 212
+H I L FGE + PG NPL +Q P+ + Y +K+VPT+Y D +G T+ S
Sbjct: 146 AHYITSLTFGEMLDNKNLPGNFNPLARRDRSQADPAESHDYTMKIVPTIYEDSAGTTLVS 205
Query: 213 NQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
Q++ + + S GR + ++F YDL+PI V + E FLT+VCAI+GG F
Sbjct: 206 YQYTYAYSNYVSFSLGGR--SPAAIWFRYDLNPITVKYHERRQPIYAFLTSVCAIIGGTF 263
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGKFS 297
TV+GIID+F++ I KK E+GK S
Sbjct: 264 TVAGIIDSFVFTASE-IFKKFELGKLS 289
>gi|159464951|ref|XP_001690702.1| hypothetical protein CHLREDRAFT_180779 [Chlamydomonas reinhardtii]
gi|158270379|gb|EDO96229.1| predicted protein [Chlamydomonas reinhardtii]
Length = 656
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 100/190 (52%), Gaps = 27/190 (14%)
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGV-----------HVHDILAFQRDSFNISHKINKLA 165
Y +V +VAG H S HQ+ V H+ IL N+SH I L
Sbjct: 84 YHTPQVKRVAGRLHL----SVHQNMVFQMLPQLLGTHHIPKIL-------NMSHVIKHLG 132
Query: 166 FGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
FG H+PG +NPLDG VR P Y+YF+KVVPT Y + G +++Q+SVTE+ +
Sbjct: 133 FGPHYPGQLNPLDGYVRMVGREPFS-YKYFLKVVPTEYYNRLGRATETHQYSVTEYAQPL 191
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
++G P V YDLSPI +T E S LHF+ +CA+VGGVF ++ + D ++
Sbjct: 192 QRG---YAPAVDVHYDLSPIVMTINERPPSLLHFVVRLCAVVGGVFAITRLTDRWVDWLV 248
Query: 285 RAIKKKIEIG 294
R + K G
Sbjct: 249 RLVNKAAARG 258
>gi|324516732|gb|ADY46617.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Ascaris suum]
Length = 286
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 109/206 (52%), Gaps = 24/206 (11%)
Query: 101 GFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
GF+ + + E GC E+NKV GNFH S H + A Q +S+++
Sbjct: 96 GFITDVTKVPTEENGCRFEANFEINKVPGNFHL----STHSA--------ASQPESYDMR 143
Query: 159 HKINKLAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
H +N + FG+ G NPL Q P ++Y +KVVP+VY D++G T S
Sbjct: 144 HIVNSVKFGDDLQEKAQIGSFNPLQDRTALQGDPLNTHEYILKVVPSVYEDIAGRTKYSY 203
Query: 214 QFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
Q++ E+ GR+ +P V+F Y+L PI V +TE F+T+VCA+VGG FT
Sbjct: 204 QYTYAHKEYIAYHHSGRI--IPAVWFKYELQPITVKYTERRQPLYAFITSVCAVVGGTFT 261
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGKFS 297
V+GIID+ ++ + KK ++GK S
Sbjct: 262 VAGIIDSSLF-SLSELYKKHQLGKLS 286
>gi|340058906|emb|CCC53277.1| conserved hypothetical protein [Trypanosoma vivax Y486]
Length = 394
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 81/301 (26%), Positives = 141/301 (46%), Gaps = 32/301 (10%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-HNETYCGSC 60
D +G +V ++ K LD+ G + + D + ++ + + + +CG C
Sbjct: 111 DATGSTRFNVTMNVHKTPLDASGKSVFVGERHF---HTDYTVPQYNAKFDPTSPKFCGKC 167
Query: 61 YGA------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
+ + + C N CE+V E + ++ A + ++QC E EE GC
Sbjct: 168 FVGRKYSYLQQPETPCRNTCEQVMEEFERRKLAKPSKSTVEQCIGE------LSEENPGC 221
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF---- 170
N G L++ K +G FAP ++ ++D++ FN SH INKL+ G+
Sbjct: 222 NYRGSLKLKKASGTLIFAP--KMFENVFRINDLM-----QFNASHVINKLSIGDDLVRRF 274
Query: 171 --PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSN-QFSVTEHFRSSEQ 226
GV PL+ R+ +YF+K+VPT Y +D + + + S ++SV R
Sbjct: 275 SKRGVYFPLNNQRFVTTKQFAQVRYFMKIVPTTYISDNTANPVASTYEYSVQWDHRQVPL 334
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
G + +P V F +D S ++V + SF HF+ ++C IVGG+F V G++D + R
Sbjct: 335 GSGE-IPSVVFSFDFSSMQVNNYFQRPSFCHFIVSLCGIVGGLFVVLGMVDGLVARVLRL 393
Query: 287 I 287
+
Sbjct: 394 L 394
>gi|296415728|ref|XP_002837538.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633410|emb|CAZ81729.1| unnamed protein product [Tuber melanosporum]
Length = 341
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 94/165 (56%), Gaps = 11/165 (6%)
Query: 114 CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C IYG + VN++ G+FH A G + + G H+ SFN SH I +L+FG+++P
Sbjct: 155 CRIYGSMGVNRILGDFHITAKGHGYWEDGAHI------DHRSFNFSHVITELSFGDYYPK 208
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
+VNPLDGV + +QYF+ +VPT Y + SG ++ +NQ++VTE R +
Sbjct: 209 LVNPLDGVVSKTDENFHKFQYFLSIVPTTYESQTSGKSLLTNQYAVTEQSRKISS---HS 265
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+PG++F YD+ PI + ++ + L F+ + IV G+ G +
Sbjct: 266 VPGIYFKYDIEPISLKISDRRTALLAFVVRLVNIVSGILVGGGWV 310
>gi|440801547|gb|ELR22565.1| serologically defined breast cancer antigen 84 isoform 1, putative
[Acanthamoeba castellanii str. Neff]
Length = 355
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/173 (39%), Positives = 93/173 (53%), Gaps = 8/173 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD----ILAFQRDSFNISHKINKLA 165
+G GC ++G EV KV GN H A G + QS I Q SFN+SH I L+
Sbjct: 148 KGSGCRVFGKAEVQKVKGNLHIAAGSNAPQSHDGHQHHVHHITPEQVASFNVSHFIPHLS 207
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
FG FP +PL R + P+ M + I++VPT+Y D G+ I+ Q+S +++
Sbjct: 208 FGPAFPRRTDPLSWTRVIE--PNAMQVNHMIQLVPTIYEDWGGNVIEGYQYSAQTNYKHI 265
Query: 225 EQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
G LPGVF +D+SP + + E SF HFLT +CAI GG F V G+I
Sbjct: 266 VPGASSFPLPGVFIKWDMSPFVIQYRETGRSFAHFLTRLCAITGGTFVVLGLI 318
>gi|115623567|ref|XP_794044.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Strongylocentrotus purpuratus]
Length = 289
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/201 (36%), Positives = 110/201 (54%), Gaps = 21/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
++I G+GC Y +NKV GNFH V H + Q S + +H I++
Sbjct: 103 KKIPLNNGQGCLFYSAFTINKVPGNFH-----------VSTHAVGMNQPQSTDFAHIIHE 151
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV 217
++FG+ NPL+G R +++ S + + Y++K+VPTVY D+ G S Q++
Sbjct: 152 VSFGDDIQNKTLGASFNPLEG-RDKRDSKSDLSHDYYMKIVPTVYEDLWGTKNVSYQYTY 210
Query: 218 T-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ + S GR + LP ++F YD+SPI V + E+ F F+T VCAIVGG FTV+GI
Sbjct: 211 AYKDYGSQGHGR-RVLPAIWFRYDISPITVKYHEKRAPFYTFITTVCAIVGGTFTVAGIF 269
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ KK E+GK S
Sbjct: 270 DSIIFTAAEVFKKA-ELGKLS 289
>gi|148223633|ref|NP_001084786.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Xenopus laevis]
gi|78099249|sp|Q6NS19.1|ERGI1_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
protein 1; AltName: Full=ER-Golgi intermediate
compartment 32 kDa protein; Short=ERGIC-32
gi|47125098|gb|AAH70532.1| MGC78834 protein [Xenopus laevis]
Length = 290
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 104/200 (52%), Gaps = 22/200 (11%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I GC G +NKV GNFH V H +A Q + ++ H I+K
Sbjct: 105 MKIPINNAYGCRFEGLFSINKVPGNFH-----------VSTHSAIA-QPANPDMRHIIHK 152
Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG ++ G N L G + Y +K+VPTVY D++G S Q++V
Sbjct: 153 LSFGNTLQVDNIHGAFNALGGADKLASKALESHDYVLKIVPTVYEDLNGKQQFSYQYTVA 212
Query: 219 E--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ S GR+ +P ++F YDLSPI V +TE F+T VCAI+GG FTV+GI+
Sbjct: 213 NKAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPMYRFITTVCAIIGGTFTVAGIL 270
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+FI+ A KKI++GK
Sbjct: 271 DSFIFTASEA-WKKIQLGKM 289
>gi|325187435|emb|CCA21973.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 283
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 8/181 (4%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
++ EGC G L + K+ G+ F G S + + +++ R FN SH I KL F
Sbjct: 110 EDPHNEGCRYKGTLTIQKLQGDIFFCHGGS-----LSIFNLMEMFR--FNSSHVITKLNF 162
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G P + PL V T Y+YF KVVP+ Y + G + + Q+SVTEH +
Sbjct: 163 GLSIPKMQTPLTDVHKTVLAQVATYKYFAKVVPSRYVYLDGKSTMTYQYSVTEHLLKMD- 221
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
G + +PGV YD SPI V + E + HF+TN CAI+GGV V+ I DA +Y +
Sbjct: 222 GFVTNIPGVIISYDFSPIAVDYIETKPNIFHFITNTCAILGGVIAVARIFDAALYSMSKK 281
Query: 287 I 287
+
Sbjct: 282 L 282
>gi|261334705|emb|CBH17699.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
DAL972]
Length = 391
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 141/298 (47%), Gaps = 29/298 (9%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
D+SG ++V ++ K +D GN+ + +R+ P+ +R+ ++ +CG
Sbjct: 111 DVSGTFSINVTENLLKTPVDVGGNLAYLGTRR-FFTDPRSPLYTRRND---PNSPDFCGR 166
Query: 60 CYG---AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C+ A + ++CCN CEEV + +KG N ++++QC E L E GCN
Sbjct: 167 CFTGNKAIAGGKNCCNTCEEVMAEHDRKGLPRPNKNVVEQCIGELSL------ENPGCNY 220
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF------ 170
G L V KV+G F P ++ + + D+L F+ SH INK + G+
Sbjct: 221 RGALNVRKVSGVIFFTP--KVIKNTIKMEDLL-----KFDASHVINKFSIGDESVRRHSR 273
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG-RL 229
GV+NPL+ R+ +Y++ +VPT Y + + + + ++ S E
Sbjct: 274 RGVLNPLEKQRFNGSGRFMKVRYYLNIVPTTYGSGASSGLHPPTYEYSANWNSREVAIGY 333
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
P V F +D P++V + HFL +C IVGG+F V G++D+ + R +
Sbjct: 334 GGFPSVEFSFDFFPMQVNNNFKREPIYHFLVQLCGIVGGLFVVLGLVDSVVARLTRLV 391
>gi|363738942|ref|XP_414530.3| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 1 [Gallus gallus]
Length = 291
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 106/200 (53%), Gaps = 21/200 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G+GC G +NKV+ H V H A Q + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVSP-------WXLH---VSTHSATA-QPQNPDMTHIIHK 153
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L+G P + Y +K+VPTVY D+SG S Q++V
Sbjct: 154 LSFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 213
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ S GR+ +P ++F YDLSPI V +TE F+T++CAI+GG FTV+GI+
Sbjct: 214 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 271
Query: 277 DAFIYHGQRAIKKKIEIGKF 296
D+ I+ A KKI++GK
Sbjct: 272 DSCIFTASEA-WKKIQLGKM 290
>gi|388858415|emb|CCF48009.1| uncharacterized protein [Ustilago hordei]
Length = 415
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 91/165 (55%), Gaps = 8/165 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG +EV +V GN H + S H L N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSLEHTDHKL------MNLSHVIHEFSFGPY 224
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP + PLD T + ++QYFI VPT++ D G + ++Q+SVT++ R E G+
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFISAVPTLFVDARGRKLHTHQYSVTDYTRQIEHGK- 283
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+PG+F YD+ PI++T E +F+ FL + ++GGV+ G
Sbjct: 284 -GVPGIFIKYDIEPIQMTIRERSSTFVQFLVRLAGVLGGVWVCVG 327
>gi|71755761|ref|XP_828795.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
GUTat10.1]
gi|70834181|gb|EAN79683.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
927/4 GUTat10.1]
Length = 391
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 81/298 (27%), Positives = 141/298 (47%), Gaps = 29/298 (9%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
D+SG ++V ++ K +D GN+ + +R+ P+ +R+ ++ +CG
Sbjct: 111 DVSGTFSINVTENLLKTPVDVGGNLAYLGTRR-FFTDPRSPLYTRRND---PNSPDFCGR 166
Query: 60 CYG---AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C+ A + ++CCN CEEV + +KG N ++++QC E L E GCN
Sbjct: 167 CFTGNKAIAGGKNCCNTCEEVMAEHDRKGLPRPNKNVVEQCIGELSL------ENPGCNY 220
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF------ 170
G L V KV+G F P ++ + + D+L F+ SH INK + G+
Sbjct: 221 RGALNVRKVSGVIFFTP--KVIKNTIKMEDLL-----KFDASHVINKFSIGDESVRRHSR 273
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG-RL 229
GV+NPL+ R+ +Y++ +VPT Y + + + + ++ S E
Sbjct: 274 RGVLNPLEKQRFNGSGRFMKVRYYLNIVPTTYGSGASSGLHPPTYEYSANWNSREVAIGY 333
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
P V F +D P++V + HFL +C I+GG+F V G++D+ + R +
Sbjct: 334 GGFPSVEFSFDFFPMQVNNNFKREPIYHFLVQLCGIIGGLFVVLGLVDSVVARLTRLV 391
>gi|190346055|gb|EDK38054.2| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 101/181 (55%), Gaps = 13/181 (7%)
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
REG+ + E C+I+G + VN+V+G+FH ++ HV + N S
Sbjct: 204 REGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------DPQALNFS 254
Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
H I + +FGE +P + NPLD T + Y+Y+ KVVPT+Y + G + +NQ+S+T
Sbjct: 255 HIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVDTNQYSIT 313
Query: 219 EHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
E R E GR+Q +PG+FF Y+ IK+ +++ + F F+ + I+GGVF V+G
Sbjct: 314 ESHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTSFVARLATIIGGVFIVAGY 373
Query: 276 I 276
+
Sbjct: 374 L 374
>gi|327265232|ref|XP_003217412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Anolis carolinensis]
Length = 291
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 102/199 (51%), Gaps = 22/199 (11%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
+I G+GC +NK+ GNFH V H A Q + +++H I+KL
Sbjct: 107 KIPLNNGDGCRFESHFSINKIPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 154
Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
+FG+ G N L+G P + Y +K+VPTVY D+SG Q++V
Sbjct: 155 SFGDQLQAQKIRGSFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQQYPFQYTVAN 214
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
E+ S GR+ P ++F YDL+PI + + E F+T +CAI+GG FTV+GI D
Sbjct: 215 KEYVVYSHTGRIT--PAIWFRYDLTPITLKYIERRQPLYRFITTICAIIGGTFTVAGIFD 272
Query: 278 AFIYHGQRAIKKKIEIGKF 296
+ I+ A KKI++GK
Sbjct: 273 SCIFTASEA-WKKIQLGKM 290
>gi|343427702|emb|CBQ71229.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
Length = 412
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 90/165 (54%), Gaps = 8/165 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG +EV +V GN H + S H L N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------MNLSHVIHEFSFGPY 224
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP + PLD T + ++QYF+ VPT++ D G + ++Q+SVT++ R E G+
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFVSAVPTLFVDARGRKLHTHQYSVTDYTRQIEHGK- 283
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+PG+F YD+ P+++T E + L FL + ++GGV+ G
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRERSTTLLQFLVRLAGVLGGVWVCVG 327
>gi|361126303|gb|EHK98312.1| putative ER-derived vesicles protein 41 [Glarea lozoyensis 74030]
Length = 343
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 71/176 (40%), Positives = 99/176 (56%), Gaps = 18/176 (10%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R++ G+ C IYG LEVNKV G+FH A G + + G D AF N SH +N+
Sbjct: 142 RLRGNVGDSCRIYGNLEVNKVQGDFHLTARGHGYQEWGAGHLDHTAF-----NFSHIVNE 196
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-DVSGH----TIQSNQFS 216
L+FG +P ++NPLD R TP+ +QYF+ VVPT YT D S TI +NQ++
Sbjct: 197 LSFGAFYPSLLNPLD--RTVSTTPNHFHKFQYFLSVVPTAYTVDSSSRSARDTIFTNQYA 254
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
VTE S + +++PG+FF YD+ P+ +T E SFL F+ V + GV
Sbjct: 255 VTEQ---SHEVNERSVPGIFFKYDIEPMLLTVEESRDSFLRFVVKVVNVFSGVLVA 307
>gi|260800124|ref|XP_002594986.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
gi|229280225|gb|EEN50997.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
Length = 292
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 28/220 (12%)
Query: 92 DLIDQCKRE--GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
D+ D+ R GF++ ++ G GC G +NKV GNFH S H + V
Sbjct: 87 DIQDEMGRHEVGFVEDTEKVPVNNGLGCRFEGRFWINKVPGNFHM----STHSAHV---- 138
Query: 147 ILAFQRDSFNISHKINKLAFGE--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVV 198
Q S +++H ++ L FGE H G NPLD V + YF+K+V
Sbjct: 139 ----QPASPDMTHVVHDLRFGEDLAAFLPDHIKGSFNPLDEVERLHANALSSHDYFLKIV 194
Query: 199 PTVYTDVSGHTIQSNQFSVT-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
PT++ + S + Q++ + + S G + +P ++F YDLSPI V +T++ F H
Sbjct: 195 PTIFENRSDKKSFAFQYTYAYKDYISFGHGN-RVMPAIWFRYDLSPITVKYTDKRKPFYH 253
Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
F+T +CA+VGG FTV+GIID+ I+ KK E+GK S
Sbjct: 254 FITTICAVVGGTFTVAGIIDSVIFTAAEVFKKA-ELGKLS 292
>gi|291244956|ref|XP_002742359.1| PREDICTED: endoplasmic reticulum-golgi intermediate compartment
(ERGIC) 1-like [Saccoglossus kowalevskii]
Length = 318
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/199 (34%), Positives = 97/199 (48%), Gaps = 17/199 (8%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I GC + ++NKV GNFH S H +G + Q + H I++
Sbjct: 132 NKIPLNNNAGCRFEAYFKINKVPGNFHV----STHAAG-------SRQPQKADFVHTIHE 180
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
+ G+ NPL G + + Y++KVVPTVY DV G S Q++
Sbjct: 181 IIIGDDIQNKSINAAFNPLAGYDRSDAAAESSHDYYMKVVPTVYEDVWGRVNLSYQYTYA 240
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
S + +P ++F YD+SPI V + E+ F F+T +CAIVGG FTV+GIID+
Sbjct: 241 YKDYVSYGHGHRVMPAIWFRYDISPITVKYHEKRAPFYTFITTICAIVGGTFTVAGIIDS 300
Query: 279 FIYHGQRAIKKKIEIGKFS 297
IY KK EIGK S
Sbjct: 301 MIYSASEVFKKA-EIGKLS 318
>gi|348667045|gb|EGZ06871.1| hypothetical protein PHYSODRAFT_319561 [Phytophthora sojae]
Length = 469
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 110/228 (48%), Gaps = 44/228 (19%)
Query: 92 DLIDQCKREGFLQRIK--EEEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
D ++ K+E F Q K E+G EGC +YG L V +V GNFH
Sbjct: 257 DAVEARKKELFEQDKKNAREQGKAIARSAVGPEGCRLYGHLYVKRVPGNFH--------- 307
Query: 140 SGVHVHDILAFQRDS--FNISHKINKLAFGEHFPG--------------VVNPLDGVRWT 183
VH+ + A+ DS N SH +N+L FGEH + LD +T
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEHLTSGEMSMLPRDAQMQLYTHRLDNQDYT 364
Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
+ Y ++IKVV Y I N + T H S+E LP + F YDLSP
Sbjct: 365 SFYKNHTYVHYIKVVTNSYVQSDAADI--NVYKYTAH--SNEYLETDDLPSIMFRYDLSP 420
Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
+ V +E+ V F HFLT+ CAI+GGVFTV GI+D I+ RA+ KK+
Sbjct: 421 MSVRISEDSVPFYHFLTSACAIIGGVFTVIGILDQIIHQTARALNKKV 468
>gi|146421059|ref|XP_001486481.1| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
6260]
Length = 407
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 101/181 (55%), Gaps = 13/181 (7%)
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
REG+ + E C+I+G + VN+V+G+FH ++ HV + N S
Sbjct: 204 REGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------DPQALNFS 254
Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
H I + +FGE +P + NPLD T + Y+Y+ KVVPT+Y + G + +NQ+S+T
Sbjct: 255 HIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVDTNQYSIT 313
Query: 219 EHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
E R E GR+Q +PG+FF Y+ IK+ +++ + F F+ + I+GGVF V+G
Sbjct: 314 ELHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTLFVARLATIIGGVFIVAGY 373
Query: 276 I 276
+
Sbjct: 374 L 374
>gi|156406959|ref|XP_001641312.1| predicted protein [Nematostella vectensis]
gi|156228450|gb|EDO49249.1| predicted protein [Nematostella vectensis]
Length = 287
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 78/217 (35%), Positives = 110/217 (50%), Gaps = 26/217 (11%)
Query: 92 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
D+ D+ R GF + ++ E GEGC I +NKV GNFH S H +G
Sbjct: 86 DIQDEMGRHEVGFKENVERREINNGEGCFISTRFTINKVPGNFHV----STHGAGK---- 137
Query: 147 ILAFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
Q DS +++H IN + FG + PG L + + Y +K+VPT+Y
Sbjct: 138 ----QPDSPDMNHIINAVNFGSRIMDKLPGAFTALKDRKRHDTNGLASHDYILKIVPTIY 193
Query: 203 TDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
+ G T S Q++ E+ S G Q LP ++F YDLSPI V + E HF+T
Sbjct: 194 QKLDGTTTFSYQYTWAYKEYVSYSHGG--QMLPAIWFRYDLSPITVKYIERRQPLYHFIT 251
Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
VCAIVGG FTV+GIID+ ++ +K ++GK S
Sbjct: 252 TVCAIVGGTFTVAGIIDSAVFTASEMWRKH-QLGKLS 287
>gi|388583623|gb|EIM23924.1| DUF1692-domain-containing protein [Wallemia sebi CBS 633.66]
Length = 396
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 60/166 (36%), Positives = 91/166 (54%), Gaps = 7/166 (4%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
++G C IYG +E KV GN H + S H L N+SH I++ +FG+
Sbjct: 158 KDGPACRIYGSVETKKVNGNMHITTLGHGYSSLEHTDHKL------MNLSHTIDEFSFGQ 211
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
HFP + PLD + +YQYF+ VVPT Y D SGH++ +NQ+S E + +
Sbjct: 212 HFPYISQPLDKSVEITDNHFPVYQYFMHVVPTTYVDASGHSLSTNQYSAREDIKFIHNHQ 271
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ +PG+FF Y+L PI ++ + +SF L + A++GGV+ SG
Sbjct: 272 -RGIPGLFFRYELEPIHLSLSATTMSFTKLLIRLTALIGGVWCCSG 316
>gi|301100294|ref|XP_002899237.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
gi|262104154|gb|EEY62206.1| endoplasmic reticulum-Golgi intermediate compartment protein,
putative [Phytophthora infestans T30-4]
Length = 469
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 116/232 (50%), Gaps = 52/232 (22%)
Query: 92 DLIDQCKREGFLQRIKE--EEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
D+++ K+E F Q K+ E+G EGC ++G L V +V GNFH
Sbjct: 257 DVVEARKKELFEQDKKDAREQGRAIARSAVGPEGCRLFGHLYVKRVPGNFH--------- 307
Query: 140 SGVHVHDILAFQRDS--FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQY---- 193
VH+ + A+ DS N SH +N+L FGEH + P D R +E + +Y +
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEH----LAPGDMSRLPREAQTQLYTHRLEN 360
Query: 194 --------------FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
+IKVV Y V G + N + T H S+E LP V F Y
Sbjct: 361 QDFTSLYKNHTYVHYIKVVTNSY--VQGDGSEINVYKYTAH--SNEYLETDDLPSVMFRY 416
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
DLSP+ V +E+ V F HF+T+ CAI+GGVFTV GI+D I+ RA+ KK+
Sbjct: 417 DLSPMSVRISEDTVPFYHFVTSACAIIGGVFTVIGIVDQIIHQTARALNKKV 468
>gi|310800159|gb|EFQ35052.1| hypothetical protein GLRG_10196 [Glomerella graminicola M1.001]
Length = 377
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 93/168 (55%), Gaps = 14/168 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
EG+ C IYG L+VN+V G+FH A G + + G H+ +FN SH I++L+FG
Sbjct: 183 EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGAHL------DHAAFNFSHIISELSFGP 236
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
+P +VNPLD +QY++ VVPTVYT S +TI +NQ++VTE + +
Sbjct: 237 FYPSLVNPLDRTVNLARINFHKFQYYLSVVPTVYTVGKSASSSNTIFTNQYAVTEQSKET 296
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+ +PG+FF YD+ PI ++ E FL L + IV GV
Sbjct: 297 DD---HNIPGIFFKYDIEPILLSVEESRDGFLQLLMKIVNIVSGVLVA 341
>gi|71013590|ref|XP_758634.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
gi|46098292|gb|EAK83525.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
Length = 415
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 89/165 (53%), Gaps = 8/165 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG +EV +V GN H + S H L N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSVEHTDHKL------MNLSHVIHEFSFGPY 224
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP + PLD T E ++QYF+ VPT++ D G + ++Q+SVT++ R E G+
Sbjct: 225 FPEISQPLDSSVETTEKHFTVFQYFVSAVPTLFIDARGRKLHTHQYSVTDYTRQIEHGK- 283
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+PG+F YD+ P+++T + S FL + ++GGV+ G
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRQRSTSLFQFLVRLAGVLGGVWVCVG 327
>gi|313220803|emb|CBY31643.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)
Query: 92 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
D+ D+ R G+L+ +++ G+GC G VNKV GNFH S H S V
Sbjct: 86 DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137
Query: 147 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 199
Q + +++H+I++L+FGE G+ + PL+G + E + + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192
Query: 200 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
TVY D+ T QF+ V + F + G + +P ++F Y++SPI V +TE+ HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LT CAI+GG FTV+G+ID+ I+ + +KK E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289
>gi|313230728|emb|CBY08126.1| unnamed protein product [Oikopleura dioica]
Length = 289
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)
Query: 92 DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
D+ D+ R G+L+ +++ G+GC G VNKV GNFH S H S V
Sbjct: 86 DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137
Query: 147 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 199
Q + +++H+I++L+FGE G+ + PL+G + E + + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192
Query: 200 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
TVY D+ T QF+ V + F + G + +P ++F Y++SPI V +TE+ HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
LT CAI+GG FTV+G+ID+ I+ + +KK E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289
>gi|336269097|ref|XP_003349310.1| hypothetical protein SMAC_05593 [Sordaria macrospora k-hell]
gi|380089883|emb|CCC12416.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 379
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 10/169 (5%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R+ + C ++G LE+NKV G+FH A G + + G H+ +FN SH I++
Sbjct: 181 RLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL------DHSAFNFSHIISE 234
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
L++G P +VNPLD + +QYFI VVPTVY+ G +I +NQ++VTE
Sbjct: 235 LSYGPFLPSLVNPLDQTVNLATSNFHKFQYFISVVPTVYSVSGGRSIVTNQYAVTEQ--- 291
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
S++ + +PG+F YD+ PI + EE SFL FL V ++ G
Sbjct: 292 SQEVTERIIPGIFVKYDIEPILLNIVEERDSFLLFLIKVVNVISGALVA 340
>gi|348690307|gb|EGZ30121.1| COPII vesicle trafficking protein [Phytophthora sojae]
Length = 306
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 115/227 (50%), Gaps = 44/227 (19%)
Query: 99 REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
+E L++ +EE GE GC ++G ++V KVAG+ FA H+ + V F +F
Sbjct: 91 KEILLKKDIQEEPFGENGCRLFGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 143
Query: 156 NISHKINKLAFGEHFPGVVNPLDGV------RWTQET----------------------- 186
N SH +N L FG P + PL V TQE+
Sbjct: 144 NSSHVVNHLRFGPQIPDMETPLIDVSKILERNCTQESCWLARSWDSVAALLTSFIALLLF 203
Query: 187 PSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIK 245
Y+YF+ VVP+ Y ++G ++ + Q+SVTEH SS Q + PGV F Y+ SPI
Sbjct: 204 TVATYKYFVNVVPSRYVYLNGRSVTTFQYSVTEHETSSRGPNGQVSFPGVIFSYEFSPIA 263
Query: 246 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
V + E S LHFLT+ AIVGGVF V+ +ID IY ++ KKI+
Sbjct: 264 VEYIESKPSVLHFLTSTSAIVGGVFAVARMIDGAIY----SVSKKID 306
>gi|67623433|ref|XP_667999.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis TU502]
gi|54659178|gb|EAL37768.1| serologically defined breast cancer antigen 84 like (42.9 kD)
(XQ234) [Cryptosporidium hominis]
Length = 388
Score = 111 bits (277), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 78/265 (29%), Positives = 119/265 (44%), Gaps = 38/265 (14%)
Query: 57 CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
CG CY A +++ +CCN C++V Y KKG L + QC + +RI
Sbjct: 117 CGPCYDASINNDLGVVNCCNTCKDVFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 175
Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
EGC I + KV G + H+ V + D+ + FN S+K+N L
Sbjct: 176 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 230
Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
FGE PG+ N + Q + + + +PT Y ++ +I
Sbjct: 231 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFDDAYIDFDMHCIPTQYNTINNKSIN 290
Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
S+QFSV ++ S G+ ++PG+ YD +P V TE SFL F+T CA
Sbjct: 291 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKMTESRRSFLSFITECCA 350
Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
I+GG+F SG+ID F + ++ K
Sbjct: 351 IIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|336472105|gb|EGO60265.1| hypothetical protein NEUTE1DRAFT_56465 [Neurospora tetrasperma FGSC
2508]
gi|350294686|gb|EGZ75771.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
2509]
Length = 379
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 14/185 (7%)
Query: 92 DLIDQCKREGFLQRIKEEEG---EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 147
D++ +R+ R G + C ++G LE+NKV G+FH A G + + G H+
Sbjct: 166 DIVSLGRRKAKWARTPRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL--- 222
Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
+FN SH I++L+FG P +VNPLD +QYFI VVPTVY+ SG
Sbjct: 223 ---DHSAFNFSHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SG 278
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
+I +NQ++VTE S++ + +PG+F YD+ PI + EE SFL F+ V ++
Sbjct: 279 KSIVTNQYAVTEQ---SQEVTERIIPGIFVKYDIEPILLNIEEERDSFLVFIIKVVNVIS 335
Query: 268 GVFTV 272
G
Sbjct: 336 GALVA 340
>gi|443683891|gb|ELT87978.1| hypothetical protein CAPTEDRAFT_224400 [Capitella teleta]
Length = 292
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 106/202 (52%), Gaps = 23/202 (11%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
++ EGC ++NKV GNFH + S Q N+ H +++L
Sbjct: 105 KVPINNNEGCRFKSSFKINKVPGNFHISTHASKEQP------------PQPNMKHIVHEL 152
Query: 165 AFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
FG+ H PG NPL ++ + Y++K+VP V+ D SG T+ + + T
Sbjct: 153 IFGDRVPQTIHIPGSFNPLLEKDKSESNALSSHDYYLKIVPAVFNDYSGKTLM-HPYQYT 211
Query: 219 EHFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGI 275
+R S ++G +P ++F Y L+P+ V ++E+ + F HFLT VCAIVGG FTV+GI
Sbjct: 212 FAYRHSIRQRGGQVVIPAIWFKYKLNPMCVKYSEQRPIPFYHFLTAVCAIVGGTFTVAGI 271
Query: 276 IDAFIYHGQRAIKKKIEIGKFS 297
D+F++ I KK E+GK S
Sbjct: 272 FDSFLFTAAE-IFKKAELGKLS 292
>gi|325184531|emb|CCA19024.1| endoplasmic reticulumGolgi intermediate compartment protein
putative [Albugo laibachii Nc14]
Length = 466
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 72/192 (37%), Positives = 96/192 (50%), Gaps = 26/192 (13%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
EGC +YG L V +V GNFH H S H + N SH +N+L FGE
Sbjct: 288 EGCQLYGHLIVKRVPGNFHI------HLS----HPFYSMNSSLVNASHTVNELWFGEVLS 337
Query: 172 GVV-------NPLDGVRWT-QETPSGM----YQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
LD R QE + M Y ++IKVV Y +G I + +++
Sbjct: 338 ASALAKLPPNTRLDSHRLARQEFTAYMQNYTYVHYIKVVTNTYVQRNGEVISAYRYTA-- 395
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
S+E + LP V F YDLSP+ V TE + F HF+T+ CAI+GGVFTV GIID
Sbjct: 396 --HSNEYLETEDLPSVMFRYDLSPMSVRITERSMPFYHFVTSACAIIGGVFTVIGIIDQL 453
Query: 280 IYHGQRAIKKKI 291
++ RA+ KK+
Sbjct: 454 VHQTVRAMNKKV 465
>gi|62319241|dbj|BAD94459.1| hypothetical protein [Arabidopsis thaliana]
Length = 56
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 53/56 (94%), Positives = 56/56 (100%)
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
SPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 1 SPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 56
>gi|402085784|gb|EJT80682.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Gaeumannomyces graminis var. tritici R3-111a-1]
Length = 379
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 22/206 (10%)
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
R G R+ + C ++G L++NKV G+FH A G + + G H+ D+FN
Sbjct: 174 RWGKTPRLWGSTADSCRLFGSLDLNKVQGDFHITARGHGYMEFGEHL------DHDAFNF 227
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-----GHTIQS 212
+H IN+ +FGE +P +VNPLD T +QYF+ VVPTVY+ S G TI +
Sbjct: 228 THIINEFSFGEFYPSLVNPLDRTINGANTHFHKFQYFLSVVPTVYSVKSSAGGFGSTIFT 287
Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV--- 269
NQ++VTE + + +PG+FF YD+ P+ + E +FL FL V I+ G
Sbjct: 288 NQYAVTEQNAEISE---RAIPGIFFKYDIEPVLLNIEESRDTFLLFLVKVVNILSGAMVA 344
Query: 270 ----FTVSGIIDAFIYHGQRAIKKKI 291
FT++ I + +RA I
Sbjct: 345 GHWGFTMTEWIKEIMGKRRRATSGMI 370
>gi|403337257|gb|EJY67839.1| hypothetical protein OXYTRI_11647 [Oxytricha trifallax]
Length = 279
Score = 110 bits (275), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 116/218 (53%), Gaps = 28/218 (12%)
Query: 99 REGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 156
R+ L+RIK+E + +GC + GF +N+V GNFH + S Q + V+ L Q +F+
Sbjct: 71 RQDLLKRIKDEMDQKQGCQLKGFFNINRVPGNFHIS---SHSQKDLIVN--LEMQGYTFD 125
Query: 157 ISHKINKLAFG--EHFP---------GVVNPLDGVRWTQE-----TPSGM-YQYFIKVVP 199
+HKIN ++FG E F GV+NPLDG+ ++ P + +F+ V
Sbjct: 126 FTHKINHVSFGRQEDFKVIQKNFKQQGVLNPLDGLEFSANQDNKGKPQALATNFFMVAVS 185
Query: 200 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+ Y D + +T Q + T +S+ L F Y+LSPIKV F +E + + F+
Sbjct: 186 SYYMDTNRNTYNMYQLTSTHKSQSNANVNENML---VFSYELSPIKVLFNQEKENIVDFM 242
Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
+CAI+GGVFT+S ++D I H ++ K IGK S
Sbjct: 243 IQLCAIIGGVFTISSVVDTII-HRSVSLLFKQRIGKLS 279
>gi|289741661|gb|ADD19578.1| cOPII vesicle protein [Glossina morsitans morsitans]
Length = 418
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/205 (30%), Positives = 112/205 (54%), Gaps = 3/205 (1%)
Query: 86 WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 145
+ + +P+ +D+ E + + EE+ + C ++G L +NKVAG H G + H
Sbjct: 153 YIIQSPE-VDETATEEDEKPLSEEQYDACRLHGTLGINKVAGVLHLVGGTQPVVDLLGEH 211
Query: 146 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 205
++ F+ + N +H+IN+L+FG++ +V PL+G + QYF+ +VPT
Sbjct: 212 LMIGFRHIAANFTHRINRLSFGQYARRIVQPLEGDETFVSEEGTIVQYFLNIVPT-EIHK 270
Query: 206 SGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
+ TI + Q+SVTE+ R + R PG++F YD S +K+ + + L F+ +C+
Sbjct: 271 TFTTISTYQYSVTENVRVLDSDRNSYGSPGIYFKYDWSALKIIVRTDRDNMLQFIIRLCS 330
Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
I+ G+ +SGI++ F+ +R I K
Sbjct: 331 IISGIVVLSGILNVFLLTLRRNIIK 355
>gi|429862433|gb|ELA37083.1| copii-coated vesicle protein [Colletotrichum gloeosporioides Nara
gc5]
Length = 375
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 93/168 (55%), Gaps = 14/168 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
EG+ C IYG L+VN+V G+FH A G + + G H+ +FN SH I++++FG
Sbjct: 183 EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAAFNFSHIISEMSFGP 236
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
+P +VNPLD +QY++ VVPTVYT + +TI +NQ++VTE +
Sbjct: 237 FYPSLVNPLDRTVNAARINFHKFQYYLSVVPTVYTVGKSASTSNTIFTNQYAVTEQSKEV 296
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+ +PG+FF YD+ PI ++ E FL FL + +V GV
Sbjct: 297 DD---HNVPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLVA 341
>gi|198421328|ref|XP_002120997.1| PREDICTED: similar to Endoplasmic reticulum-Golgi intermediate
compartment protein 1 (ER-Golgi intermediate compartment
32 kDa protein) (ERGIC-32) [Ciona intestinalis]
Length = 289
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 67/201 (33%), Positives = 110/201 (54%), Gaps = 21/201 (10%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+++ +G GC ++NKV GNFH V H + Q D+ +++H+I +
Sbjct: 103 EKVPTHDGNGCLFTSRFQINKVPGNFH-----------VSTHSARS-QPDNPDMTHEIKE 150
Query: 164 LAFGEHF--PGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS- 216
L G++ PGV N L+G + P + Y +K+VPTVY + G+ Q++
Sbjct: 151 LRIGDNMVIPGVKSQSFNALEGKTTFDKHPLSSHDYIMKIVPTVYESIDGNLRYLYQYTN 210
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ + + G+ + +P ++F Y+++PI V +TE F HF+T VCAI+GG FTV+GII
Sbjct: 211 AYKDYIAYGHGQ-RVMPAIWFRYEMTPITVKYTERRKPFYHFITMVCAIIGGTFTVAGII 269
Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
D+ I+ + KK+ IGK S
Sbjct: 270 DSMIFSATE-MYKKLTIGKLS 289
>gi|332373256|gb|AEE61769.1| unknown [Dendroctonus ponderosae]
Length = 382
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 95/170 (55%), Gaps = 2/170 (1%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C IYG L +NKVAGNF + GK + + +N +H+IN+ +FG P
Sbjct: 175 DACRIYGTLGLNKVAGNFLISGGKRYMFGLGYQQFRTLISEGEYNFTHRINRFSFGHSSP 234
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
G+V+PL+G P + YFI++VPT + +TI + Q+SV E R + +
Sbjct: 235 GIVHPLEGDELILPDPMTVVNYFIEIVPTT-VNTFMYTISTYQYSVKELTRPIDHNKGSH 293
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
P ++F YD+S ++VT ++E FL +C+IVGGV+ SGI+++ +
Sbjct: 294 GTPAIYFKYDMSALRVTVSQERDHLGMFLARLCSIVGGVYVCSGILNSIV 343
>gi|380492334|emb|CCF34678.1| hypothetical protein CH063_01185 [Colletotrichum higginsianum]
Length = 377
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 60/168 (35%), Positives = 94/168 (55%), Gaps = 14/168 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+G+ C +YG L+VN+V G+FH A G + + G H+ +FN SH +++L+FG
Sbjct: 183 DGDSCRVYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAAFNFSHIVSELSFGP 236
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
+P +VNPLD +QY++ +VPTVYT S +TI +NQ++VTE + +
Sbjct: 237 FYPSLVNPLDRTVNLARINFHKFQYYLSIVPTVYTVGKSASSSNTIFTNQYAVTEQSKET 296
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+ +PG+FF YD+ PI ++ E FL FL + +V GV
Sbjct: 297 DD---HNIPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLVA 341
>gi|85101064|ref|XP_961083.1| hypothetical protein NCU04293 [Neurospora crassa OR74A]
gi|11611445|emb|CAC18610.1| conserved hypothetical protein [Neurospora crassa]
gi|28922621|gb|EAA31847.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 379
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 14/185 (7%)
Query: 92 DLIDQCKREGFLQRIKEEEG---EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 147
D++ +R+ R G + C ++G LE+NKV G+FH A G + + G H+
Sbjct: 166 DIVSLGRRKAKWARTPRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL--- 222
Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
+FN SH I++L+FG P +VNPLD +QYFI VVPTVY+ SG
Sbjct: 223 ---DHSAFNFSHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SG 278
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
+I +NQ++VTE S++ + +PG+F YD+ PI + EE SFL F+ V ++
Sbjct: 279 KSIVTNQYAVTEQ---SQEVTERIIPGIFVKYDIEPILLHIDEERDSFLVFIIKVVNVIS 335
Query: 268 GVFTV 272
G
Sbjct: 336 GALVA 340
>gi|342878666|gb|EGU79974.1| hypothetical protein FOXB_09504 [Fusarium oxysporum Fo5176]
Length = 376
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 102/183 (55%), Gaps = 18/183 (9%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L++NKV G+FH A G + +G H+ FN SH I++L++G +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYRGNGEHL------DHSKFNFSHIISELSYGPFY 242
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
P +VNPLDG T +QY++ VVPTVY+ V+ +I +NQ++VTE ++ ++ +
Sbjct: 243 PSLVNPLDGTVNTAPDNFHKFQYYLSVVPTVYS-VNSKSILTNQYAVTEQSKAVDE---R 298
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
+PG+FF YD+ PI +T E + L V I+ GV FT+S I I
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLLVKVINIMSGVLVAGHWGFTISDWIHDVIGRR 358
Query: 284 QRA 286
+R+
Sbjct: 359 RRS 361
>gi|66360024|ref|XP_627190.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
gi|46228832|gb|EAK89702.1| ERV41 like membrane associated protein involved in vesicular
transport with a transmembrane region near the
C-terminus [Cryptosporidium parvum Iowa II]
Length = 403
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/265 (29%), Positives = 118/265 (44%), Gaps = 38/265 (14%)
Query: 57 CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
CG CY A ++ +CCN C+++ Y KKG L + QC + +RI
Sbjct: 132 CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 190
Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
EGC I + KV G + H+ V + D+ + FN S+K+N L
Sbjct: 191 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 245
Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
FGE PG+ N + Q + + + +PT Y ++ +I
Sbjct: 246 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYIDFDMHCIPTQYNTINNKSIN 305
Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
S+QFSV ++ S G+ ++PG+ YD +P V TE SFL F+T CA
Sbjct: 306 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKITESRRSFLSFITECCA 365
Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
I+GG+F SG+ID F + ++ K
Sbjct: 366 IIGGIFAFSGMIDIFFFKFLSSVNK 390
>gi|323509323|dbj|BAJ77554.1| cgd8_2900 [Cryptosporidium parvum]
gi|323510503|dbj|BAJ78145.1| cgd8_2900 [Cryptosporidium parvum]
Length = 388
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/265 (29%), Positives = 118/265 (44%), Gaps = 38/265 (14%)
Query: 57 CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
CG CY A ++ +CCN C+++ Y KKG L + QC + +RI
Sbjct: 117 CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 175
Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
EGC I + KV G + H+ V + D+ + FN S+K+N L
Sbjct: 176 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 230
Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
FGE PG+ N + Q + + + +PT Y ++ +I
Sbjct: 231 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYIDFDMHCIPTQYNTINNKSIN 290
Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
S+QFSV ++ S G+ ++PG+ YD +P V TE SFL F+T CA
Sbjct: 291 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKITESRRSFLSFITECCA 350
Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
I+GG+F SG+ID F + ++ K
Sbjct: 351 IIGGIFAFSGMIDIFFFKFLSSVNK 375
>gi|328771759|gb|EGF81798.1| hypothetical protein BATDEDRAFT_86854 [Batrachochytrium
dendrobatidis JAM81]
Length = 333
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 100/212 (47%), Gaps = 21/212 (9%)
Query: 84 KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGV 142
KG S+ DL D G + C G + NKV G HF A G + GV
Sbjct: 138 KGLRDSSRDLEDHASESG--------TPDACRFRGSFQANKVEGMLHFTALGHGYF--GV 187
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
H D+ N +H+I++L+FG +P + NPLD T + YF+ VVPT+Y
Sbjct: 188 HT------PHDAINFTHRIDELSFGARYPDLHNPLDHTLEIGTTNFDSFMYFLGVVPTIY 241
Query: 203 TDVS----GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
D + G T+ +NQ++VTE + + LPG+F Y + PI V TE + + F
Sbjct: 242 VDKARSLFGATLLTNQYAVTEFSHAVDPQNPDALPGIFIKYHIEPISVRITESRLGLVQF 301
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
T +C I+GG F G I F + + + K
Sbjct: 302 TTRMCGIIGGAFVTIGAILGFFRNVRTMLSAK 333
>gi|367025937|ref|XP_003662253.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
gi|347009521|gb|AEO57008.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
42464]
Length = 380
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 95/175 (54%), Gaps = 19/175 (10%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R+ E + C IYG LE+NKV G+FH A G + + G H+ ++FN SH I++
Sbjct: 181 RLWGAEADSCRIYGSLELNKVQGDFHITARGHGYMEFGEHL------DHNAFNFSHIISE 234
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTIQSNQFS 216
L+FG P +VNPLD R P+ Y QYF+ VVPT Y+ + ++ +NQ++
Sbjct: 235 LSFGPFLPSLVNPLD--RTVNTAPAHFYKFQYFLSVVPTTYSVGHPEERGSRSVLTNQYA 292
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
VTE ++ + T+PG+F YD+ PI + E SF FL V +V GV
Sbjct: 293 VTEQSKAVPE---NTVPGIFVKYDIEPILLNIVETRDSFFVFLIKVINVVSGVLV 344
>gi|449303002|gb|EMC99010.1| hypothetical protein BAUCODRAFT_120300 [Baudoinia compniacensis
UAMH 10762]
Length = 387
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/176 (35%), Positives = 92/176 (52%), Gaps = 17/176 (9%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
K +E + C IYG + NKV G+FH A G + + G H+ + SFN SH IN+L+
Sbjct: 184 KSKEADSCRIYGSMHGNKVQGDFHITARGHGYMEFGQHL------EHSSFNFSHHINELS 237
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-------DVSGHTIQSNQFSVT 218
FG +P + NPLD E +QY++ VVPT+YT ++ T+ +NQ++VT
Sbjct: 238 FGPFYPSLTNPLDNTLAATEFNFFKFQYYLSVVPTIYTTNAKALRKITKSTVFTNQYAVT 297
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
E R + + +PGVF YD+ PI + EE SF + ++ GV G
Sbjct: 298 EQSRPVPENQ---VPGVFVKYDIEPILLMIAEERNSFPALFIRLVNVISGVLVAGG 350
>gi|238880883|gb|EEQ44521.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 345
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)
Query: 90 NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
PDL D+ +E + E EG C+I+G + VN+V G+F GK F ++
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
HV +S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLY 237
Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ G I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296
Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
+ + I GG+ +G + FI++GQ+A+++ E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|167383125|ref|XP_001736415.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901233|gb|EDR27345.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 116
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 54/115 (46%), Positives = 75/115 (65%), Gaps = 9/115 (7%)
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQG 227
+VNP+DG+ T + MYQYF++VVP YT + I +N +SVTEH+R S EQG
Sbjct: 1 MVNPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNRIINTNGYSVTEHYRPGNLKSPEQG 60
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
+PGVF YD+S I+V + EE SF H LT++C I+GGVF + ++D FI+H
Sbjct: 61 ----IPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 111
>gi|68465583|ref|XP_723153.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|68465876|ref|XP_723006.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445018|gb|EAL04289.1| likely COPII secretory vesicle component [Candida albicans SC5314]
gi|46445174|gb|EAL04444.1| likely COPII secretory vesicle component [Candida albicans SC5314]
Length = 345
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)
Query: 90 NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
PDL D+ +E + E EG C+I+G + VN+V G+F GK F ++
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
HV +S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLY 237
Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ G I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296
Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
+ + I GG+ +G + FI++GQ+A+++ E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|241560364|ref|XP_002401002.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|215501827|gb|EEC11321.1| COPII vesicle protein, putative [Ixodes scapularis]
gi|442749161|gb|JAA66740.1| Putative copii vesicle protein [Ixodes ricinus]
Length = 285
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 106/204 (51%), Gaps = 22/204 (10%)
Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
GF++ K G GC G ++KV GNFH S H + A Q D +++H
Sbjct: 97 GFVENTEKTPVGAGCRFEGKFYIHKVPGNFHM----STHAA--------AKQPDKIDMTH 144
Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
I+ L FG E G N LD + ++ + Y +K+VPTV+ I+S Q+
Sbjct: 145 IIHDLTFGNKMVEGVRGSFNSLDEMDKSEANGLESHDYVMKIVPTVFEKSPSERIESYQY 204
Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
+ + S GR+ +P ++F YDL+PI V +T V FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRSVPLYSFLTSVCAIVGGTFTVA 262
Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
GI+D+ ++ I KK E+GK S
Sbjct: 263 GIVDSLVFTASE-IFKKYEMGKLS 285
>gi|302882273|ref|XP_003040047.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256720914|gb|EEU34334.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 376
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 108/205 (52%), Gaps = 20/205 (9%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ +++ + + +G + C +YG L +NKV G+FH A G + +G H
Sbjct: 167 DIVALGRKKAKWAKTPKVKGRADSCRVYGSLHLNKVQGDFHITARGHGYMGNGEH----- 221
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
+FN SH I++L++G +P +VNPLDG +QY++ +VPTVY+ V
Sbjct: 222 -LDHKNFNFSHIISELSYGPFYPSLVNPLDGTVNAASDNFHKFQYYLSIVPTVYS-VGSR 279
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+I +NQ++VTE +S + +PG+FF YD+ PI +T E L FL + IV G
Sbjct: 280 SILTNQYAVTEQSKSVNE---HYIPGIFFKYDIEPILLTVHESRDGILTFLVKIINIVSG 336
Query: 269 V-------FTVSGIIDAFIYHGQRA 286
V FT+S + I +R+
Sbjct: 337 VLVAGHWGFTISDWVKDVIGRRRRS 361
>gi|401427507|ref|XP_003878237.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322494484|emb|CBZ29786.1| hypothetical protein, unknown function [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 309
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
EGC + G+++V KV GNFH + H H + N+ H I+ L+FG
Sbjct: 130 AEGCRLEGYIKVGKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTTD 182
Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
++PLDG E P +YQYF+ +VPT+Y + S T+ + QF+ T SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ + V F Y LSPI V ++ VS HFLT VCAI+GGV+TV+G++ F++
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297
Query: 285 RAIKKKIEIGK 295
++++ +GK
Sbjct: 298 AQFQRRV-LGK 307
>gi|119497911|ref|XP_001265713.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
gi|119413877|gb|EAW23816.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
fischeri NRRL 181]
Length = 397
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 100/203 (49%), Gaps = 27/203 (13%)
Query: 94 IDQCKREGFLQRIKEEEGEG---CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 149
+ + R+ F + K G+ C IYG LE NKV G+FH A G +H S H+
Sbjct: 170 VRRNPRKKFAKGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYHNSAPHL----- 224
Query: 150 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD----- 204
+ +FN SH I +L+FG H+P ++NPLD T E YQYF+ +VPT+Y+
Sbjct: 225 -EHKTFNFSHMITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLAL 283
Query: 205 -----------VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
S + I +NQ++ T S+ +PG+FF Y++ PI + +EE
Sbjct: 284 DTYANAPPTSRYSKNLIFTNQYAATSQ-SSAIPENPYFIPGIFFKYNIEPILLMISEERT 342
Query: 254 SFLHFLTNVCAIVGGVFTVSGII 276
SFL L + + GV G +
Sbjct: 343 SFLSLLVRLVNTISGVMVTGGWL 365
>gi|398021306|ref|XP_003863816.1| hypothetical protein, unknown function [Leishmania donovani]
gi|322502049|emb|CBZ37133.1| hypothetical protein, unknown function [Leishmania donovani]
Length = 309
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
EGC + G+++V KV GNFH + H H + N+ H I+ L+FG
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182
Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
++PLDG E P +YQYF+ +VPT+Y + S T+ + QF+ T SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ + V F Y LSPI V ++ VS HFLT VCAI+GGV+TV+G++ F++
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297
Query: 285 RAIKKKIEIGK 295
++++ +GK
Sbjct: 298 AQFQRRV-LGK 307
>gi|146097219|ref|XP_001468078.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
gi|134072444|emb|CAM71154.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
Length = 309
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
EGC + G+++V KV GNFH + H H + N+ H I+ L+FG
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182
Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
++PLDG E P +YQYF+ +VPT+Y + S T+ + QF+ T SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ + V F Y LSPI V ++ VS HFLT VCAI+GGV+TV+G++ F++
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297
Query: 285 RAIKKKIEIGK 295
++++ +GK
Sbjct: 298 AQFQRRV-LGK 307
>gi|241953329|ref|XP_002419386.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
gi|223642726|emb|CAX42980.1| COPii-coated vesicle-associated protein, putative [Candida
dubliniensis CD36]
Length = 345
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)
Query: 90 NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
PDL D+ +E + E EG C+I+G + VN+V G+F GK F ++
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
HV +S N SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKITEERLQTYMYYAKVVPTLY 237
Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
+ G I +NQ+S+TE H +Q R +PG++F YD PIK+ E+ + F F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296
Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
+ + I GG+ +G + FI++GQ+A+++ E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336
>gi|400594740|gb|EJP62573.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Beauveria bassiana ARSEF 2860]
Length = 374
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/162 (38%), Positives = 91/162 (56%), Gaps = 10/162 (6%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+ C IYG L++NKV G+FH A G + + G H+ D FN SH I++L++G
Sbjct: 185 ADSCRIYGSLDLNKVQGDFHITARGHGYMEFGQHL------DHDKFNFSHVISELSYGAF 238
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
+P +VNPLD +QY++ VVPTVY+ V TIQ+NQ++VTE +S E
Sbjct: 239 YPSLVNPLDRTVNVAAAHFHKFQYYLSVVPTVYS-VGRSTIQTNQYAVTE--QSKEIDEH 295
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
+PG+F YD+ PI + E SF+ FL + +V GV
Sbjct: 296 SAVPGIFVKYDIEPILLAVHESRDSFIVFLLKLINVVSGVLV 337
>gi|308494873|ref|XP_003109625.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
gi|308245815|gb|EFO89767.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
Length = 286
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 18/190 (9%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
GC E+NKV GNFH + + A Q D++++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPDNYDMRHTIHSIKFGDDVSH 157
Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+ G +PL +QE ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 158 KNLKGSFDPLANRDTSQENGLNTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 288 KKKIEIGKFS 297
KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286
>gi|358058634|dbj|GAA95597.1| hypothetical protein E5Q_02253 [Mixia osmundae IAM 14324]
Length = 682
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 85/166 (51%), Gaps = 8/166 (4%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E G C IYG + V KV GN H + S H L N+SH I++ +FG
Sbjct: 170 ENGPACRIYGTMAVKKVTGNLHITTLGHGYLSWEHTDHKL------MNLSHVIHEFSFGP 223
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
FPG+ PLD E+ ++QYF+ +V T Y D + +++ Q+SVT+ R++ GR
Sbjct: 224 LFPGISQPLDNTLEVTESSFHIFQYFMSIVSTTYVDHHRNVLETAQYSVTDMSRATVHGR 283
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+PG+F YD P+ +T E + FL + IVGGV SG
Sbjct: 284 --GVPGIFLKYDPEPMMLTLRERTTTLGQFLIRLAGIVGGVIVCSG 327
>gi|312081872|ref|XP_003143209.1| HT034 [Loa loa]
gi|307761627|gb|EFO20861.1| hypothetical protein LOAG_07628 [Loa loa]
Length = 292
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 20/204 (9%)
Query: 101 GFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
GF+Q ++ GC G E++KV GNFH + H D Q +++++
Sbjct: 102 GFVQNTEKIPIGTSGCRFEGKFEISKVPGNFHLS---------THAADT---QPETYDMR 149
Query: 159 HKINKLAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
H I+ + FG++ G NPL Q S + Y +K+VP+VY D++G+T S
Sbjct: 150 HTIHSVVFGDNIITSQNLGSFNPLKNREALQTDGSFTHDYVLKIVPSVYEDINGNTKYSY 209
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q++ + + +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+
Sbjct: 210 QYTYAHKEYVTYHYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVA 269
Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
GIIDA ++ + +K +IGK S
Sbjct: 270 GIIDASLF-SLTELYRKHQIGKLS 292
>gi|212527292|ref|XP_002143803.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
gi|210073201|gb|EEA27288.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
marneffei ATCC 18224]
Length = 402
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 94/183 (51%), Gaps = 25/183 (13%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G +++ G H+ +FN +H + +L+FG H+
Sbjct: 191 DSCRIYGSLESNKVHGDFHITARGHGYNEVGQHL------DHSNFNFTHMVTELSFGPHY 244
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----------------DVSGHTIQSN 213
P ++NPLD + ET +QYFI VVPT+Y + S +TI +N
Sbjct: 245 PSLLNPLDKTVASTETHYYKFQYFINVVPTIYAKGNNAVEKYTANPAKAFEKSRNTIFTN 304
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q+S T + T PG+FF Y++ PI + +EE SFL L + +V GV
Sbjct: 305 QYSATSQSHPLPESPFNT-PGIFFKYNIEPILLFVSEERGSFLALLVRLVNVVSGVIVTG 363
Query: 274 GII 276
G +
Sbjct: 364 GWL 366
>gi|427788003|gb|JAA59453.1| Putative copii vesicle protein [Rhipicephalus pulchellus]
Length = 285
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 105/204 (51%), Gaps = 22/204 (10%)
Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
GF++ K G GC G ++KV GNFH S H + A Q D +++H
Sbjct: 97 GFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA--------AKQPDKIDMTH 144
Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
I+ L FG + G N LD + + + Y +K+VPTVY G I+S Q+
Sbjct: 145 IIHDLTFGVKMTDEVRGSFNSLDEMDKSGANGIESHDYVMKIVPTVYEKSKGERIESYQY 204
Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
+ + S GR+ +P ++F YDL+PI V +T + FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRGIPLYSFLTSVCAIVGGTFTVA 262
Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
GI+D+ ++ +K E+GK S
Sbjct: 263 GIVDSLVFTASEVF-RKFEMGKLS 285
>gi|390594538|gb|EIN03948.1| DUF1692-domain-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 551
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 82/167 (49%), Gaps = 8/167 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C IYG L+V KV N H + S HV D N+SH I + +FG
Sbjct: 173 QPDGGACRIYGTLQVKKVTANLHITTAGHGYASVQHV------PHDQMNLSHVITEFSFG 226
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+FP + PLD P YQYF+ VVPT Y +++ Q+SVT + R E G
Sbjct: 227 PYFPDITQPLDDSFEITTDPFIAYQYFLHVVPTTYVAPRSSPLKTAQYSVTHYTRVLEHG 286
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
R PG+FF ++L P+ +T + + V +VGG+F +G
Sbjct: 287 R--GTPGIFFKFELDPLSITVNQRTTTLAQLFIRVIGVVGGIFVCAG 331
>gi|195439332|ref|XP_002067585.1| GK16119 [Drosophila willistoni]
gi|194163670|gb|EDW78571.1| GK16119 [Drosophila willistoni]
Length = 443
Score = 107 bits (266), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 111/202 (54%), Gaps = 4/202 (1%)
Query: 89 SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
S P ++D ++ LQ+ E + + C ++G L +NKVAG H G H ++
Sbjct: 179 SLPAVLD-LHQDTHLQQ-PEAKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFQDHWMI 236
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
F+R N +H+IN+L+FG++ +V PL+G + + QYF+K+VPT + +
Sbjct: 237 EFRRMPANFTHRINRLSFGQYSRRIVQPLEGDETIIQEEATTVQYFLKIVPT-EIEQTFS 295
Query: 209 TIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
TI + Q+SVTE+ R + R PG++F YD S +K+ + + L F+ +C+I+
Sbjct: 296 TINTFQYSVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVSNDRDHILTFVIRLCSIIS 355
Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
G+ +SG I++ + QR + +
Sbjct: 356 GIIVLSGAINSLLLGMQRRLLR 377
>gi|398412138|ref|XP_003857398.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
gi|339477283|gb|EGP92374.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
Length = 407
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 39/207 (18%)
Query: 98 KREGFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 155
KR + R + EE + C IYG + NKV G+FH A G + H+ +F
Sbjct: 172 KRYQYTPRTPRNEEADSCRIYGSMHSNKVQGDFHITARGHGYMAYSQHL------DHSAF 225
Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------------ 203
N SH IN+L+FG ++P +VNPLD E +QY++ VVPT+YT
Sbjct: 226 NFSHHINELSFGPYYPKLVNPLDSTYARTEAHFHKFQYYLSVVPTIYTVDVNALKRMDSK 285
Query: 204 ----------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
V+ H++ +NQ++VTE S + +PG+FF YD+ P+++T
Sbjct: 286 YETPSSGDDGLNQHPRRVTQHSVFTNQYAVTEQSHSVPENH---VPGIFFKYDIEPLQLT 342
Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSG 274
EE S L + +V G+ G
Sbjct: 343 IAEEWTSVPALLLRIVNVVSGLLVAGG 369
>gi|157874469|ref|XP_001685717.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
gi|68128789|emb|CAJ08922.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
Friedlin]
Length = 309
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 101/191 (52%), Gaps = 19/191 (9%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
EGC + G+++V KV GNFH + H H + N+ H I+ L+FG
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182
Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
++PLDG E P +YQYF+ +VPT+Y + S T+ + QF+ T SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEMPM-VYQYFLDIVPTIY-ESSFSTVYTYQFTGTS---SS 237
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ + V F Y LSPI V ++ VS HFLT VCAI+GGV+TV+G++ F++
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297
Query: 285 RAIKKKIEIGK 295
++ + +GK
Sbjct: 298 AQFQRHV-LGK 307
>gi|19112857|ref|NP_596065.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe 972h-]
gi|74582843|sp|O94283.1|ERV41_SCHPO RecName: Full=ER-derived vesicles protein 41
gi|3850069|emb|CAA21880.1| COPII-coated vesicle component Erv41 (predicted)
[Schizosaccharomyces pombe]
Length = 333
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 97/184 (52%), Gaps = 12/184 (6%)
Query: 97 CKREGFLQRIKEEEGEG--CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 153
+ E F ++ E G G C IYG L VN+V G H APG + +S + H
Sbjct: 133 ARTEKFRKKNNAEPGSGTACRIYGQLVVNRVNGQLHITAPGWGYGRSNIPFH-------- 184
Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
S N +H I +L+FGE++P +VN LDG +QY++ V+PT Y S + ++N
Sbjct: 185 SLNFTHYIEELSFGEYYPALVNALDGHYGHANDHPFAFQYYLSVLPTSYKS-SFRSFETN 243
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q+S+TE+ + G PG+F YDL P+ V ++H + L + AI GG+ TV+
Sbjct: 244 QYSLTENSVVRQLGFGSLPPGIFIDYDLEPLAVRVVDKHPNVASTLLRILAISGGLITVA 303
Query: 274 GIID 277
I+
Sbjct: 304 SWIE 307
>gi|448105220|ref|XP_004200441.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|448108351|ref|XP_004201072.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359381863|emb|CCE80700.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
gi|359382628|emb|CCE79935.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
Length = 344
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 100/194 (51%), Gaps = 17/194 (8%)
Query: 89 SNPDLIDQCKREGFLQRIK------EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 142
+ PDL D+ RE E+ C+IYG + VNKVAG+FH GK F +
Sbjct: 125 NTPDL-DEVMRETVRAEFNVAGTRMNEDASACHIYGSIPVNKVAGDFHIT-GKGFGYADR 182
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
H + F++ N SH I + +FGE +P + NPLD Y+YF+ VPT+Y
Sbjct: 183 HR---VPFEK--LNFSHVIMEFSFGEFYPMIKNPLDFTGKIASQKLQSYKYFMTAVPTLY 237
Query: 203 TDVSGHTIQSNQFSVTEHFR---SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+ G + + Q+S+TE R + E G +PG++F YD IK+ E+ + FL F+
Sbjct: 238 EKL-GIEVDTYQYSLTEQHRAITTDETGLPSDIPGLYFKYDFDTIKLLIAEKRIPFLQFV 296
Query: 260 TNVCAIVGGVFTVS 273
+ IV G+F V+
Sbjct: 297 ARLATIVSGLFIVA 310
>gi|336370998|gb|EGN99338.1| hypothetical protein SERLA73DRAFT_108802 [Serpula lacrymans var.
lacrymans S7.3]
gi|336383753|gb|EGO24902.1| hypothetical protein SERLADRAFT_449635 [Serpula lacrymans var.
lacrymans S7.9]
Length = 503
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 87/181 (48%), Gaps = 8/181 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C IYG L+V KV N H + S VHV N+SH I + +FG
Sbjct: 166 QADGSACRIYGTLQVKKVTANLHITTLGHGYTSNVHV------DHTKMNLSHVITEFSFG 219
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+FP + PLD + P YQYF+ VVPT + + +NQ+SVT H+ +G
Sbjct: 220 PYFPDITQPLDYSFEVAKDPFVAYQYFLHVVPTTFIAPRSEPLHTNQYSVT-HYTRVLKG 278
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
T PG+FF +DL P+ +T + SFL ++GGVFT + F A+
Sbjct: 279 HHGT-PGIFFKFDLDPMVITIHQRTTSFLQLFIRCVGVIGGVFTCTSYFLRFTTRAVDAV 337
Query: 288 K 288
Sbjct: 338 S 338
>gi|119191516|ref|XP_001246364.1| hypothetical protein CIMG_00135 [Coccidioides immitis RS]
gi|392864406|gb|EAS34753.2| COPII-coated vesicle protein [Coccidioides immitis RS]
Length = 399
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 114/242 (47%), Gaps = 52/242 (21%)
Query: 64 ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ D+ + EVR +++++ G L D++D C+ IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRQFPPGPKLKRKDVVDSCR-----------------IYGSL 199
Query: 121 EVNKVAGNFHF-APGKSFHQSG--VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
E NKV GNFH A G ++ V+V+D+ N +H I +L+FG H+P ++NPL
Sbjct: 200 EGNKVQGNFHITAKGLGYYDPTGMVNVNDM--------NFTHLITELSFGPHYPTLLNPL 251
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSV 217
D + YQY++ VVPT+YT +TI +NQ++V
Sbjct: 252 DKTVAATKDKFYKYQYYLSVVPTIYTRAGTVDPYSQRLPDPSTITVSQRKNTIFTNQYAV 311
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
T R+ QG ++PG+FF +D+ PI + +EE S L L + +V GV G +
Sbjct: 312 TSQSRTISQGPY-SVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWVF 370
Query: 278 AF 279
F
Sbjct: 371 NF 372
>gi|346469653|gb|AEO34671.1| hypothetical protein [Amblyomma maculatum]
Length = 285
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 105/204 (51%), Gaps = 22/204 (10%)
Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
GF++ K G GC G ++KV GNFH S H + A Q + +++H
Sbjct: 97 GFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA--------AKQPEKIDMTH 144
Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
I+ L FG + G N LD + + + Y +K+VPTVY G I+S Q+
Sbjct: 145 IIHDLTFGVKMTDEVKGSFNSLDEMDKSGGNGIESHDYVMKIVPTVYEKSRGERIESYQY 204
Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
+ + S GR+ +P ++F YDL+PI V +T V FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHTGRI--MPAIWFRYDLTPITVKYTRRGVPLYSFLTSVCAIVGGTFTVA 262
Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
GI+D+ I+ +K E+GK S
Sbjct: 263 GIVDSLIFTASEVF-RKFEMGKLS 285
>gi|303313533|ref|XP_003066778.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|240106440|gb|EER24633.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
delta SOWgp]
gi|320036232|gb|EFW18171.1| COPII-coated vesicle protein [Coccidioides posadasii str. Silveira]
Length = 399
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/242 (30%), Positives = 114/242 (47%), Gaps = 52/242 (21%)
Query: 64 ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ D+ + EVR +++++ G L D++D C+ IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRQFPPGPKLKRKDVVDSCR-----------------IYGSL 199
Query: 121 EVNKVAGNFHF-APGKSFHQSG--VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
E NKV GNFH A G ++ V+V+D+ N +H I +L+FG H+P ++NPL
Sbjct: 200 EGNKVQGNFHITAKGLGYYDPTGMVNVNDM--------NFTHLITELSFGPHYPTLLNPL 251
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSV 217
D + YQY++ VVPT+YT +TI +NQ++V
Sbjct: 252 DKTVAATKDKFYKYQYYLSVVPTIYTRAGTVDPYSQRLPDPSTITPSQRKNTIFTNQYAV 311
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
T R+ QG ++PG+FF +D+ PI + +EE S L L + +V GV G +
Sbjct: 312 TSQSRTISQGPY-SVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWVF 370
Query: 278 AF 279
F
Sbjct: 371 NF 372
>gi|402591333|gb|EJW85263.1| hypothetical protein WUBG_03826, partial [Wuchereria bancrofti]
Length = 244
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 101/190 (53%), Gaps = 18/190 (9%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
GC + G E++KV GNFH + H D Q +++++ H I+ + FG+
Sbjct: 68 GCRLEGKFEISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDIST 115
Query: 172 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
G NPL + S + Y +K+VP+VY D++G+ S Q++ +
Sbjct: 116 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 175
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+GIIDA ++ +
Sbjct: 176 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 234
Query: 288 KKKIEIGKFS 297
+K ++GK S
Sbjct: 235 YRKHQMGKLS 244
>gi|17570549|ref|NP_508375.1| Protein Y102A11A.6 [Caenorhabditis elegans]
gi|351063407|emb|CCD71590.1| Protein Y102A11A.6 [Caenorhabditis elegans]
Length = 286
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 101/190 (53%), Gaps = 18/190 (9%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
GC E+NKV GNFH + + A Q +S+++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPESYDMRHLIHSIKFGDDVSH 157
Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+ G +PL +QE ++Y +K+VP+V+ D SG + S Q++ +
Sbjct: 158 KNLKGSFDPLAKRNTSQENGLNTHEYILKIVPSVHEDYSGTILNSYQYTFGHKSYITYHH 217
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 288 KKKIEIGKFS 297
KK+ +GK +
Sbjct: 278 KKQ-RLGKLT 286
>gi|70988875|ref|XP_749289.1| COPII-coated vesicle protein (Erv41) [Aspergillus fumigatus Af293]
gi|66846920|gb|EAL87251.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus Af293]
gi|159128703|gb|EDP53817.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
fumigatus A1163]
Length = 379
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/252 (30%), Positives = 110/252 (43%), Gaps = 53/252 (21%)
Query: 44 QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKRE 100
Q H RL E +D + EVR RKK G L D +D C+
Sbjct: 130 QEHADRLSEQE-----------ADAHVHHVLGEVRRNPRKKFAKGPKLRRGDAVDSCR-- 176
Query: 101 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISH 159
IYG LE NKV G+FH A G +H + H+ + +FN SH
Sbjct: 177 ---------------IYGSLEGNKVQGDFHITARGHGYHNNAPHL------EHKTFNFSH 215
Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------DVSGHTIQSN 213
I +L+FG H+P ++NPLD T E YQYF+ +VPT+Y+ D + SN
Sbjct: 216 MITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLALDTYANAPPSN 275
Query: 214 Q----FSVTEHFRSSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
+ T + + Q + +PG+FF Y++ PI + +EE SFL L +
Sbjct: 276 RRGKNLVFTNQYAVTSQSSVIPESPYFIPGLFFKYNIEPILLLISEERTSFLSLLVRLVN 335
Query: 265 IVGGVFTVSGII 276
V GV G +
Sbjct: 336 TVSGVMVTGGWL 347
>gi|346970151|gb|EGY13603.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium dahliae VdLs.17]
Length = 373
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 100/190 (52%), Gaps = 20/190 (10%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ Q K+ R G + C I+G L++NKV G+FH A G + +G H+
Sbjct: 160 DIVAQSKKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL---- 215
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT--- 203
SFN SH +N+L+FG +P + NPLD R P+ +QY++ +VPTVYT
Sbjct: 216 --DHTSFNFSHIVNELSFGAFYPNLENPLD--RTVNLAPANFHKFQYYLSIVPTVYTVGR 271
Query: 204 -DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
+T+ +NQF+VTE +S E G ++PGVF YD+ PI + E F+ F V
Sbjct: 272 SASKANTVYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKV 328
Query: 263 CAIVGGVFTV 272
++ GV
Sbjct: 329 INVLSGVLVA 338
>gi|320591987|gb|EFX04426.1| copii-coated vesicle protein [Grosmannia clavigera kw1407]
Length = 385
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 96/182 (52%), Gaps = 19/182 (10%)
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
R G R++ + C I+G L++N+V G++H A G + + G H+ SFN
Sbjct: 179 RWGKTPRLRGAAPDSCRIFGSLDLNRVQGDYHITARGHGYMEMGDHL------DHTSFNF 232
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTI 210
SH +N+L+FG +P +VNPLD E + Y QYF+ +VPTVY+ S +I
Sbjct: 233 SHVVNELSFGPFYPSLVNPLDQT--VNEATANFYRFQYFMSIVPTVYSVGHAGSRSARSI 290
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+NQ++VTE +Q + +PG+FF YD+ PI + E FL F+ + ++ G
Sbjct: 291 VTNQYAVTEQSAEIDQ---RAIPGIFFKYDIEPILLYIEESRDGFLVFVLKIVNVLSGAL 347
Query: 271 TV 272
Sbjct: 348 VA 349
>gi|294655234|ref|XP_457337.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
gi|199429792|emb|CAG85341.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
Length = 354
Score = 105 bits (261), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 101/172 (58%), Gaps = 11/172 (6%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E C+I+G + VN+V G+FH GK F + G ++ F+ + N +H I++ ++G
Sbjct: 150 EGAPACHIFGSIPVNQVKGDFHIT-GKGFGYNDG---RSVVPFE--ALNFTHVISEFSYG 203
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
+ +P + NPLD E Y+Y+ KVVPT+Y + G I +NQ+S+TE ++ +
Sbjct: 204 DFYPFINNPLDFTGKVTEQKLQAYKYYSKVVPTIYEKL-GMIIDTNQYSLTEQHNVYKVN 262
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
++ +PG+FF Y+ PIK+ +E+ + F+ F++ + I+GG+ V+G +
Sbjct: 263 RFNNVEGIPGIFFKYEFEPIKLIISEKRIPFIQFVSRLATIIGGLLIVAGYL 314
>gi|164661257|ref|XP_001731751.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
gi|159105652|gb|EDP44537.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
Length = 454
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 98/173 (56%), Gaps = 20/173 (11%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFH---FAP---GKSFHQSGVHVHDILAFQRDSFNISHKIN 162
EE C +YG + V KV GN H F P + H++G+ + ++SH I+
Sbjct: 217 EEARACRVYGSILVKKVTGNLHISTFVPTFMAVNAHENGMGI-----------DMSHIIH 265
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
+ +FG++FP + PLD + P+ +QYF+ VVPT + I++NQ+SV + ++
Sbjct: 266 EFSFGDYFPNIAEPLDASLELTDDPAAAFQYFLSVVPTHFIH-GRRVIKTNQYSVHD-YK 323
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
+ QG L T PG++F YD+ P+ + T + VS + F+ VC+++GG++ + +
Sbjct: 324 RNPQGSL-TFPGLYFKYDIEPLTMKVTHKSVSLVAFIVRVCSVLGGLWICTDL 375
>gi|443897407|dbj|GAC74748.1| CDK9 kinase-activating protein cyclin T [Pseudozyma antarctica
T-34]
Length = 414
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 8/154 (5%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG +EV +V GN H + S H L N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------MNLSHVIHEFSFGPY 224
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP + PLD T + ++QYF+ +PT++ D G + ++Q+SVT++ R E G+
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFVSAIPTLFIDARGRRLHTHQYSVTDYARPIEHGK- 283
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
+PG+F YD+ P+++T E VS + FL +
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRERSVSLVQFLVRLA 316
>gi|121710902|ref|XP_001273067.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
gi|119401217|gb|EAW11641.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
clavatus NRRL 1]
Length = 401
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/302 (29%), Positives = 130/302 (43%), Gaps = 64/302 (21%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI--GAPKIDKPLQRHGGRLEHNETYCGS 59
D SG++ L ++ ++ S +E R I GA + Q HG RL E
Sbjct: 106 DASGDRIL--AGELLQRERTSWNLWMEKRNYEIHGGAHEYQTLNQEHGDRLAEQE----- 158
Query: 60 CYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
D + EVR RKK G L D++D C+ I
Sbjct: 159 ------QDAHVHHVLGEVRRNPRKKFPRGPRLRRGDVVDSCR-----------------I 195
Query: 117 YGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
YG LE NKV G+FH A G +H + H+ + +FN SH + +L+FG H+P ++N
Sbjct: 196 YGSLEGNKVQGDFHITARGHGYHAAAPHL------EHSTFNFSHMVTELSFGPHYPTILN 249
Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYT---------------------DVSGHTIQSNQ 214
PLD T E YQYF+ VVPT+Y+ + + + I +NQ
Sbjct: 250 PLDKTIATTEEHYYKYQYFLSVVPTIYSKGNLALDAYSGSAPTLHDPNRNRNRNLIFTNQ 309
Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
++ T + + +PG+FF Y + PI + +EE SFL L + V GV G
Sbjct: 310 YAATSQSTALPESPY-FVPGIFFKYSIEPILLIISEERGSFLTLLVRLVNTVSGVIVTGG 368
Query: 275 II 276
+
Sbjct: 369 WL 370
>gi|123483410|ref|XP_001324018.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121906894|gb|EAY11795.1| conserved hypothetical protein [Trichomonas vaginalis G3]
Length = 384
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/242 (30%), Positives = 110/242 (45%), Gaps = 13/242 (5%)
Query: 51 EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC--KREGFLQRIKE 108
E + C SCYG + CCN+CE+ + G A + D QC K G K
Sbjct: 121 ETISSICHSCYGL-LPEGSCCNSCEQTLLLHIMNGKAANTKDW-PQCQGKNPG-----KV 173
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E E C I G + +NK GNFH APG + + HVHD L+ Q +F++SH I + G
Sbjct: 174 YENEKCRIKGKVCLNKAQGNFHIAPGTNMKERYGHVHD-LSGQLPNFDLSHVIQGMRVGP 232
Query: 169 HFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
P NPL V+ Q +Y+Y + V P VY SG+ I + T G
Sbjct: 233 KIPLTYNPLRYVQQIQNPNQPVVYRYDLVVTPAVYK--SGNRILGKGYDYTAMINRFFVG 290
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
PG++F Y +P VT +++ T++ + G + + IID ++ + +
Sbjct: 291 NSGGAPGIYFHYSFTPYGVTVNATYLTIAQIFTSIFGFMSGAYAIFSIIDESMFKDDKRM 350
Query: 288 KK 289
K
Sbjct: 351 AK 352
>gi|46137745|ref|XP_390564.1| hypothetical protein FG10388.1 [Gibberella zeae PH-1]
Length = 376
Score = 104 bits (260), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 99/183 (54%), Gaps = 18/183 (9%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L++NKV G+FH A G + G H+ FN SH I++L++G +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNFSHIISELSYGPFY 242
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
P + NPLDG T + +QY++ VVPTVY+ V+ +I +NQ++VTE ++ + +
Sbjct: 243 PSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAVTEQSKAVDD---R 298
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
+PG+FF YD+ PI +T E + + I+ GV FT+S I I
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGFTISDWIHDVIGRR 358
Query: 284 QRA 286
+R+
Sbjct: 359 RRS 361
>gi|408393109|gb|EKJ72376.1| hypothetical protein FPSE_07400 [Fusarium pseudograminearum CS3096]
Length = 376
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 99/183 (54%), Gaps = 18/183 (9%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L++NKV G+FH A G + G H+ FN SH I++L++G +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNFSHIISELSYGPFY 242
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
P + NPLDG T + +QY++ VVPTVY+ V+ +I +NQ++VTE ++ + +
Sbjct: 243 PSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAVTEQSKAVDD---R 298
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
+PG+FF YD+ PI +T E + + I+ GV FT+S I I
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGFTISDWIHDVIGRR 358
Query: 284 QRA 286
+R+
Sbjct: 359 RRS 361
>gi|389749487|gb|EIM90658.1| DUF1692-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 533
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/163 (34%), Positives = 82/163 (50%), Gaps = 7/163 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C +YG LEV KV N H + S VHV N+SH I + +FG
Sbjct: 169 QADGSACRVYGSLEVKKVTANLHITSLGHGYASKVHV------DHTKINMSHVITEFSFG 222
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
HFP +V PLD YQYF++VVPT Y + +NQ+SVT + R+ EQ
Sbjct: 223 PHFPDIVQPLDNSFEITHDHFTAYQYFMRVVPTTYVAPRSAPLNTNQYSVTHYTRTFEQ- 281
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
PG+FF +++ P+++ + +F F +VGGVF
Sbjct: 282 HSGLAPGIFFKFEIEPVRLIQHQRTTTFAQFFVRWAGVVGGVF 324
>gi|346322712|gb|EGX92310.1| COPII-coated vesicle protein (Erv41), putative [Cordyceps militaris
CM01]
Length = 376
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/163 (37%), Positives = 90/163 (55%), Gaps = 10/163 (6%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+ C +YG L++NKV G+FH A G + + G H+ + FN SH I++L++G
Sbjct: 186 ADSCRVYGSLDLNKVQGDFHITARGHGYMEFGQHL------DHNQFNFSHVISELSYGAF 239
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
+P +VNPLD +QY++ VVPT+Y+ V TIQ+NQ++VTE +S E
Sbjct: 240 YPSLVNPLDRTVNLAAAHFHKFQYYLSVVPTIYS-VGSSTIQTNQYAVTE--QSKEIDEH 296
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+PG+F YD+ PI + E SF FL + IV GV
Sbjct: 297 SAVPGIFVKYDIEPILLAVHESRDSFPVFLLKLINIVSGVLVA 339
>gi|302422316|ref|XP_003008988.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
gi|261352134|gb|EEY14562.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Verticillium albo-atrum VaMs.102]
Length = 374
Score = 104 bits (259), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 97/188 (51%), Gaps = 16/188 (8%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ Q K+ R G + C I+G L++NKV G+FH A G + +G H+
Sbjct: 161 DIVAQSKKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL---- 216
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----D 204
SFN SH +N+L+FG +P + NPLD +QY++ +VPTVYT
Sbjct: 217 --DHTSFNFSHIVNELSFGAFYPNLENPLDRTVNLASANFHKFQYYLSIVPTVYTVGRSA 274
Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
+T+ +NQF+VTE +S E G ++PGVF YD+ PI + E F+ F V
Sbjct: 275 SKANTVYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVIN 331
Query: 265 IVGGVFTV 272
++ GV
Sbjct: 332 VLSGVLVA 339
>gi|358390077|gb|EHK39483.1| hypothetical protein TRIATDRAFT_302881 [Trichoderma atroviride IMI
206040]
Length = 372
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 105/205 (51%), Gaps = 20/205 (9%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D+I +R R G + C ++G +++NKV G+FH A G + G H+
Sbjct: 163 DIIALTQRRAKWARTPRPRGKPDSCRMFGSMDLNKVQGDFHITARGHGYMGMGQHL---- 218
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
D FN SH I+++++G ++P +VNPLD + +QY++ VVPTVY +
Sbjct: 219 --DHDKFNFSHIISEMSYGPYYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANRR 275
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+ +NQ++VTEH ++ +PG+FF YD+ PI ++ E FL F+ + I G
Sbjct: 276 IVNTNQYAVTEHSKTISD---HQIPGIFFKYDIEPILLSVEESRDGFLSFVIKIVNIFSG 332
Query: 269 V-------FTVSGIIDAFIYHGQRA 286
V FT+S I I +R+
Sbjct: 333 VMVAGHWGFTLSDWIREVIGKRRRS 357
>gi|123425245|ref|XP_001306773.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121888365|gb|EAX93843.1| hypothetical protein TVAG_177510 [Trichomonas vaginalis G3]
Length = 353
Score = 104 bits (259), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 139/301 (46%), Gaps = 42/301 (13%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY--CG 58
+D SG + + DI ++RLD KPL++ + + CG
Sbjct: 86 IDASGNPQPNARQDISRQRLDVHF----------------KPLEQLISDSDPKSVFQTCG 129
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
+C GA S CC C ++ ++R+ + N ++QC R+ + E+ E C I
Sbjct: 130 NCLGANVSK--CCLTCTDIANSFRQMEEFIPNLQNVEQCNRD----KKAIEDKETCRIVA 183
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD------SFNISHKINKLAFGEHFPG 172
L N HF GK +G V + ++ D + N++H I+ L FG F G
Sbjct: 184 KL-------NTHFTKGKLTIMAGGIVPTPVNYKFDLSHFGDNVNLTHTIHTLRFGRDFEG 236
Query: 173 VVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
+ NPLD Q S MY Y I +VPT+ DV I ++Q+S + + + +
Sbjct: 237 LKNPLDNYTNNQLKKSQFMYNYKIDLVPTITNDVENQ-IPAHQYSASSSSKEITKMITKK 295
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
PG+ F +D +P+ F E S FLT +CAI+GG FT+ G ID+FI+ R KK
Sbjct: 296 HPGITFDFDTAPVAARFIVEKQSLSSFLTQLCAILGGGFTLGGFIDSFIF---RVRAKKF 352
Query: 292 E 292
E
Sbjct: 353 E 353
>gi|341874049|gb|EGT29984.1| hypothetical protein CAEBREN_24080 [Caenorhabditis brenneri]
Length = 286
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 102/190 (53%), Gaps = 18/190 (9%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
GC E+NKV GNFH + + A Q +++++ H I+ + FG+
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ASQPENYDMKHIIHSIKFGDDVSH 157
Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+ G +PL QE ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 158 KNLKGSFDPLANRDSLQENGLSTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P V+F Y+L PI + TE+ SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277
Query: 288 KKKIEIGKFS 297
KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286
>gi|169778245|ref|XP_001823588.1| COPII-coated vesicle protein (Erv41) [Aspergillus oryzae RIB40]
gi|83772325|dbj|BAE62455.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 390
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G + G H+ +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGTHY 239
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
P ++NPLD E+ YQYF+ VVPT+Y+ + S ++ T
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299
Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
+ ++ QG +PG+FF Y++ PI + +EE SFL L + V GV G
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359
Query: 276 I 276
+
Sbjct: 360 L 360
>gi|170108190|ref|XP_001885304.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
gi|164639780|gb|EDR04049.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
bicolor S238N-H82]
Length = 398
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 8/167 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ G C ++G L+V +V N H + S HV + N+SH I + +FG
Sbjct: 169 QPHGNACRVWGSLQVKRVTANLHITTLGHGYASYEHV------DHNQMNLSHVITEFSFG 222
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
HFP + PLD + + YQYF+ VVPT Y +Q++Q+SVT + R +
Sbjct: 223 PHFPDITQPLDNSFESTDERFVAYQYFLHVVPTTYIAPRSAPLQTHQYSVTHYTRVMQHN 282
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
Q PG+FF +DL P+ +T + +FL L ++GGVF G
Sbjct: 283 --QGTPGIFFKFDLDPLAITQHQRTTTFLQLLIRCVGVIGGVFVCMG 327
>gi|123430864|ref|XP_001307985.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121889642|gb|EAX95055.1| hypothetical protein TVAG_428580 [Trichomonas vaginalis G3]
Length = 358
Score = 103 bits (258), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 133/299 (44%), Gaps = 35/299 (11%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
MD G Q +K+ + +RL++ G VI D + C C
Sbjct: 90 MDSLGFQRSYIKNTVTFRRLNNLGRVIGYTNDTLSD-------------------VCEPC 130
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE---GFLQRIKEEEGEGCNIY 117
Y ++ ++CCN+C +V+ +L +D K + ++ E C +
Sbjct: 131 YNLSTNPDECCNSCLKVQL------LSLMQNKPVDFSKYRVCNNYEKKPNVSLSEKCLVK 184
Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
G L VN++ G+FH APG + QS ++HD+ + Q +++H I +L FG H P NPL
Sbjct: 185 GKLTVNRIPGSFHIAPGTNVPQSA-YLHDLSSMQM-FHDMTHSIQRLRFGPHIPRTSNPL 242
Query: 178 DGVRWTQETPS--GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
D + Q+ P+ Y Y + + P ++ ++ +++ + Q PG+
Sbjct: 243 DNFKSFQQIPTHDRTYFYNLLITPVIFYRDGVEYLKGYEYTAFSEAIDTFQ-LFGISPGL 301
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
FF Y +P + + +FL F++N ++ G++ I+D I G+ +EIG
Sbjct: 302 FFQYQFTPYTIVVSANRQNFLQFISNTFGVISGIYACLSILDKLI--GEDIGSNVVEIG 358
>gi|391872305|gb|EIT81439.1| COPII vesicle protein [Aspergillus oryzae 3.042]
Length = 390
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G + G H+ +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGPHY 239
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
P ++NPLD E+ YQYF+ VVPT+Y+ + S ++ T
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299
Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
+ ++ QG +PG+FF Y++ PI + +EE SFL L + V GV G
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359
Query: 276 I 276
+
Sbjct: 360 L 360
>gi|238495520|ref|XP_002378996.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
gi|220695646|gb|EED51989.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
NRRL3357]
Length = 390
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G + G H+ +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGPHY 239
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
P ++NPLD E+ YQYF+ VVPT+Y+ + S ++ T
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299
Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
+ ++ QG +PG+FF Y++ PI + +EE SFL L + V GV G
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359
Query: 276 I 276
+
Sbjct: 360 L 360
>gi|242783317|ref|XP_002480163.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
gi|218720310|gb|EED19729.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
stipitatus ATCC 10500]
Length = 400
Score = 103 bits (258), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 92/183 (50%), Gaps = 25/183 (13%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G +++ G H+ +FN +H I +L+FG H+
Sbjct: 190 DSCRIYGSLESNKVHGDFHITARGHGYNELGEHL------DHKTFNFTHMITELSFGPHY 243
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----------------DVSGHTIQSN 213
P ++NPLD E +QYF+ VVPT+Y S +TI +N
Sbjct: 244 PSLLNPLDKTVAYTEDHYYKFQYFLNVVPTIYAKGNNAVEKYTANPALAFKKSRNTIFTN 303
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q+S T + + T PG+FF Y++ PI + +EE SFL L + +V GV
Sbjct: 304 QYSATSQSHALPENPYNT-PGIFFKYNIEPILLFVSEERGSFLALLVRLVNVVSGVIVTG 362
Query: 274 GII 276
G +
Sbjct: 363 GWL 365
>gi|300123494|emb|CBK24766.2| unnamed protein product [Blastocystis hominis]
Length = 235
Score = 103 bits (257), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 69/228 (30%), Positives = 106/228 (46%), Gaps = 24/228 (10%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGIGAPKIDKPLQR-----HGGRLEHN 53
D G D++++I K LD GN I + Q + P ++ L+ + +
Sbjct: 8 DALGNDRADIENEILKTNLDVNGNPIGKTDKSQVTVTVPTKEEVLENTKHDDDEIVVIDD 67
Query: 54 ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGE 112
+ CG C+GA+ E CCN CEE+ AYRKK W + QC +LQ+ K
Sbjct: 68 KKECGDCFGAKEKSE-CCNTCEELIAAYRKKNWDVDRIKAQAPQCAGFNYLQKWKNGVER 126
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-----DSFNISHKINKLAFG 167
GC + G L + KV G+ PG+ ++D+L+ +S N++H I+ + G
Sbjct: 127 GCRLEGKLSITKVQGHVFIIPGR--------INDLLSNSEIRQIANSLNVTHTIHHFSLG 178
Query: 168 EHFPGVVNPLDGVRWTQETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
E P NP R + MYQYF+ +PT Y + SG ++S Q
Sbjct: 179 EAIPEQKNPFVDHRGVMAVDHASMYQYFVNAIPTTYINKSGKELKSYQ 226
>gi|225712562|gb|ACO12127.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Lepeophtheirus salmonis]
Length = 290
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 100/195 (51%), Gaps = 20/195 (10%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G GC +NKV GNFH + H D+ Q D +N SH+I++++FG
Sbjct: 109 DGVGCLFEAHFHINKVPGNFHVS---------THSVDV---QPDEYNFSHEIHEVSFGSK 156
Query: 170 FP-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
G N L G ++ ++Y +K+VPT Y + G + + Q++
Sbjct: 157 IKKISSKNIGTFNSLSGRDSSESGALDSHEYVMKIVPTTYESLGGAKLFAYQYTYAYRSY 216
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
S + +P ++F YDL+PI V + E HFLT VCAIVGG FTV+GIID+ ++
Sbjct: 217 VSFGHGGRVVPALWFRYDLNPITVKYHETRPPIYHFLTTVCAIVGGTFTVAGIIDSTLFT 276
Query: 283 GQRAIKKKIEIGKFS 297
+ + KK E+GK S
Sbjct: 277 ATQ-LFKKFELGKLS 290
>gi|358374656|dbj|GAA91246.1| COPII-coated vesicle protein [Aspergillus kawachii IFO 4308]
Length = 399
Score = 103 bits (257), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 109/237 (45%), Gaps = 48/237 (20%)
Query: 63 AESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
A +D + EVR+ R+K G L D +D C+ IYG
Sbjct: 156 AREADAHVHHVLGEVRKNPRRKFAKGPRLRRGDTVDSCR-----------------IYGS 198
Query: 120 LEVNKVAGNFHF-APGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
LE NKV G+FH A G + G H+ H + FN SH + +L+FG H+P ++NPL
Sbjct: 199 LEGNKVQGDFHITARGHGYRNFGEHLDHGV-------FNFSHMVTELSFGPHYPTLLNPL 251
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVY------------------TDVSGHTIQSNQFSVTE 219
D T ET YQYF+ VVPT+Y T+ + + + +NQ++ T
Sbjct: 252 DKTIATTETHYYKYQYFLSVVPTLYSKGASALDTYTNHPDLIATNRNRNLVFTNQYAATT 311
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ + +PG+FF Y++ PI + +EE SFL L + V GV G I
Sbjct: 312 QAQELPENPY-FIPGIFFKYNIEPILLMISEERTSFLSLLIRLVNTVSGVMVTGGWI 367
>gi|321479391|gb|EFX90347.1| hypothetical protein DAPPUDRAFT_309719 [Daphnia pulex]
Length = 369
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 98/183 (53%), Gaps = 9/183 (4%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQRDSFNISHKINKL 164
+ + + C ++G L++ KVAGNFH GK H H + FN SH+I+K
Sbjct: 162 VPSQPSDACRLHGTLQLTKVAGNFHITAGKVLPLPMRAHAHLSPMMDDERFNYSHRIDKF 221
Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHT-----IQSNQFSVT 218
+FG H ++ PL+G + + ++QYF+ VPT + + VS + +++ Q+SV
Sbjct: 222 SFG-HSSTLIQPLEGDEVITDKGAMLFQYFVTAVPTEIESLVSASSGIHGSMKTWQYSVR 280
Query: 219 EHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
R Q +PG++F YD++P++V + L F+ +CAIVGGV+T +GI+
Sbjct: 281 NQSRIIGHQKGSHGIPGIYFKYDVAPLRVRVVPDAPPLLRFVLRLCAIVGGVYTSAGIVH 340
Query: 278 AFI 280
I
Sbjct: 341 KVI 343
>gi|154286632|ref|XP_001544111.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407752|gb|EDN03293.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 315
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 99/205 (48%), Gaps = 32/205 (15%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E+ + C IYG LE NKV G+FH A G + + G H+ D+FN SH + +L+FG
Sbjct: 102 EKADSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------SHDAFNFSHMVTELSFG 155
Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 156 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 213
Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
G TI +NQ++ T + +PG+FF Y++ PI + +EE S L L + +
Sbjct: 214 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGSLLALLVRLVNV 272
Query: 266 VGGVFTVSGIIDAFIYHGQRAIKKK 290
+ GV G + +KK+
Sbjct: 273 LAGVVVAGGWLFQISTWAMENLKKR 297
>gi|344229081|gb|EGV60967.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
gi|344229082|gb|EGV60968.1| hypothetical protein CANTEDRAFT_115996 [Candida tenuis ATCC 10573]
Length = 352
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 98/198 (49%), Gaps = 19/198 (9%)
Query: 90 NPDLIDQCKREGFL-------QRIKEEEGE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSG 141
PDL D RE Q+I + G C+I+G + VN V G FH G
Sbjct: 127 TPDL-DHVMRENIRAEFYISGQKINQVAGAPACHIFGTIPVNHVQGEFHIT------AKG 179
Query: 142 VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 201
V D L + N SH I + +FG +P + NPLD Y+Y+ VVPT+
Sbjct: 180 VGYQDSLHTPWERMNFSHVIQEFSFGTFYPMIDNPLDMSGKITHESLQSYKYYSNVVPTL 239
Query: 202 YTDVSGHTIQSNQFSVTEH---FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
Y + G + +NQ+S++E R GR+ + PG+FF Y+ PIK+T E+ + F+ F
Sbjct: 240 YERL-GIVVDTNQYSISEQHLVIRKDSNGRIYSPPGIFFKYEFEPIKLTIVEKRLPFIQF 298
Query: 259 LTNVCAIVGGVFTVSGII 276
+ + I+GG+ ++G +
Sbjct: 299 VARLGTILGGLLILAGYV 316
>gi|123472317|ref|XP_001319353.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121902134|gb|EAY07130.1| hypothetical protein TVAG_342940 [Trichomonas vaginalis G3]
Length = 358
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/223 (30%), Positives = 111/223 (49%), Gaps = 16/223 (7%)
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
CGSCYGA S CCN C++V+ A++KKG + I QC R+ + E C++
Sbjct: 135 CGSCYGAASG---CCNTCKDVKNAFKKKGRVPPSLSTIRQC-RDAVIDY-NHIRNESCHV 189
Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
YG + V G G S+ L D FN +HKIN + GE+ G +P
Sbjct: 190 YGTVIVPPTHGTIVMNSGDSYGAQMNTTTSSLGISIDDFNFTHKINDIYIGENDLG-DHP 248
Query: 177 LDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
L G++ Q+ G Y+ YFI+ + + + S+ H+ +G PG
Sbjct: 249 LKGIKKVQKE-VGRYKGLYFIRTLREQKGSLQVYRATSS------HYDRYREGTTGKFPG 301
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
++F YD+SPI V + + + L+F+ + AI+GG++++ ++D
Sbjct: 302 LYFNYDVSPIIVMYKRD-TTVLNFVIELMAILGGIYSLGSLLD 343
>gi|440632946|gb|ELR02865.1| hypothetical protein GMDG_05797 [Geomyces destructans 20631-21]
Length = 384
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 91/170 (53%), Gaps = 14/170 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+G+ C I+G + +NKV G+FH A G + ++ H SFN SH +++ +FG
Sbjct: 187 DGDSCRIFGSMMLNKVQGDFHITARGHGYQEAFGTKH----LDHSSFNFSHIVSEFSFGA 242
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH------TIQSNQFSVTEHFR 222
+P ++NPLD T QYF+ VVPT+YT S + TI +NQ++VT R
Sbjct: 243 FYPKLINPLDQTITTTANQFYKSQYFMSVVPTIYTVSSPNPLSSKSTIFTNQYAVTHEDR 302
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+ +T+PG+FF YD+ P+ +T E SFL F V I+ GV
Sbjct: 303 KINE---RTVPGIFFKYDIEPLMLTIEERRDSFLRFAIKVVNILSGVLVA 349
>gi|452988546|gb|EME88301.1| hypothetical protein MYCFIDRAFT_25415 [Pseudocercospora fijiensis
CIRAD86]
Length = 380
Score = 103 bits (256), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 85/169 (50%), Gaps = 11/169 (6%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+ C IYG + NKV G+FH A G + + H+ FN SH+IN+L+FG
Sbjct: 181 ADSCRIYGTMHGNKVQGDFHITARGHGYLEFAEHL------DHSKFNFSHRINELSFGPF 234
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSNQFSVTEHFRSSEQGR 228
+P + NPLD T + +QYF+ VVPTVY TD + N F T + +EQ R
Sbjct: 235 YPSLENPLDNTFATTDINYYKFQYFLSVVPTVYTTDARALRLLDNNFVFTNQYAVTEQSR 294
Query: 229 LQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ +PG+F +D+ PI +T EE SF + +V G+ G
Sbjct: 295 KVSENFVPGIFIKFDMEPIGLTIAEEWSSFPALFIRIVNVVSGLLVAGG 343
>gi|240275142|gb|EER38657.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Ajellomyces capsulatus H143]
gi|325094499|gb|EGC47809.1| COPII-coated vesicle protein [Ajellomyces capsulatus H88]
Length = 401
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 95/191 (49%), Gaps = 32/191 (16%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E+ + C IYG LE NKV G+FH A G + + G H+ D+FN SH + +L+FG
Sbjct: 188 EKADSCRIYGSLEGNKVQGDFHITARGHGYPEYGEHL------SHDAFNFSHMVTELSFG 241
Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 242 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 299
Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
G TI +NQ++ T + +PG+FF Y++ PI + +EE S L L + +
Sbjct: 300 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGSLLALLVRLVNV 358
Query: 266 VGGVFTVSGII 276
+ GV G +
Sbjct: 359 LAGVVVAGGWL 369
>gi|351707253|gb|EHB10172.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Heterocephalus glaber]
Length = 211
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/137 (42%), Positives = 78/137 (56%), Gaps = 10/137 (7%)
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
H H DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT
Sbjct: 80 HAHLAALVNHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITVVPT-- 137
Query: 203 TDVSGHTIQ----SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
HT + ++QFSVTE R + + G+F YDLS + VT TEEH+ F
Sbjct: 138 ---KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQ 194
Query: 258 FLTNVCAIVGGVFTVSG 274
F +C IVGG+F+ +G
Sbjct: 195 FFVRLCGIVGGIFSTTG 211
>gi|12006037|gb|AAG44724.1|AF267855_1 HT034 [Homo sapiens]
Length = 199
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/143 (40%), Positives = 82/143 (57%), Gaps = 10/143 (6%)
Query: 161 INKLAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
I+KL+FG ++ G N L G P + Y +K+VPTVY D SG S Q+
Sbjct: 59 IHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQY 118
Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
+V E+ S GR+ +P ++F YDLSPI V +TE F+T +CAI+GG FTV+
Sbjct: 119 TVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVA 176
Query: 274 GIIDAFIYHGQRAIKKKIEIGKF 296
GI+D+ I+ A KKI++GK
Sbjct: 177 GILDSCIFTASEAW-KKIQLGKM 198
>gi|389640739|ref|XP_003718002.1| hypothetical protein MGG_00949 [Magnaporthe oryzae 70-15]
gi|351640555|gb|EHA48418.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae 70-15]
gi|440464580|gb|ELQ33987.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae Y34]
gi|440481695|gb|ELQ62250.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Magnaporthe oryzae P131]
Length = 376
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/196 (35%), Positives = 104/196 (53%), Gaps = 23/196 (11%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R+ + C I+G L++NKV G+FH A G + + G H+ +FN SH +N+
Sbjct: 176 RLWGATPDSCRIFGSLDLNKVQGDFHITARGHGYIEFGDHL------DHSAFNFSHIVNE 229
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQSNQFSV 217
+FG+ +P +VNPLD T E +QYF+ VVPT+Y+ S TI +NQ++V
Sbjct: 230 FSFGDFYPSLVNPLDKTVNTCEKNFHKFQYFLSVVPTLYSVKSSTGAFGYSTIFTNQYAV 289
Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 270
TE +SSE + +PG+FF YD+ PI + E + L FL V I+ G F
Sbjct: 290 TE--QSSEISEMN-VPGIFFKYDIEPILLDIEESRDTILVFLIKVINILSGAMVAGHWGF 346
Query: 271 TVSGIIDAFIYHGQRA 286
T+S I + +RA
Sbjct: 347 TMSEWIKEVLGKRRRA 362
>gi|145235453|ref|XP_001390375.1| COPII-coated vesicle protein (Erv41) [Aspergillus niger CBS 513.88]
gi|134058058|emb|CAK38286.1| unnamed protein product [Aspergillus niger]
gi|350632895|gb|EHA21262.1| hypothetical protein ASPNIDRAFT_191708 [Aspergillus niger ATCC
1015]
Length = 399
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/237 (31%), Positives = 108/237 (45%), Gaps = 48/237 (20%)
Query: 63 AESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
A +D + EVR+ R+K G L D +D C+ IYG
Sbjct: 156 AREADAHVHHVLGEVRKNPRRKFAKGPRLRRGDTVDSCR-----------------IYGS 198
Query: 120 LEVNKVAGNFHF-APGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
LE NKV G+FH A G + G H+ H + FN SH + +L+FG H+P ++NPL
Sbjct: 199 LEGNKVQGDFHITARGHGYRNFGEHLDHGV-------FNFSHMVTELSFGPHYPTLLNPL 251
Query: 178 DGVRWTQETPSGMYQYFIKVVPTVY------------------TDVSGHTIQSNQFSVTE 219
D T ET YQYF+ VVPT+Y T+ + + + +NQ++ T
Sbjct: 252 DKTIATTETHYYKYQYFLSVVPTLYSKGASALDTYTNHPDLIATNRNRNLVFTNQYAATT 311
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ +PG+FF Y++ PI + +EE SFL L + V GV G +
Sbjct: 312 QATELPENPY-FIPGIFFKYNIEPILLMISEERTSFLSLLIRLVNTVSGVMVTGGWV 367
>gi|322710423|gb|EFZ01998.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium
anisopliae ARSEF 23]
Length = 372
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 97/184 (52%), Gaps = 13/184 (7%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ +R + +G + C IYG L++NKV G+FH A G + G H+
Sbjct: 163 DIVALGQRRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL---- 218
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
+ FN SH I++L+FG ++P +VNPLD E +QY++ VVPT Y+ V
Sbjct: 219 --DHEQFNFSHIISELSFGSYYPSLVNPLDRTLNIAENHFHKFQYYVSVVPTRYS-VGSS 275
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+I +NQ++VTE + + +PGVF YD+ PI ++ E+ L F+ + ++ G
Sbjct: 276 SIFTNQYAVTEQSKGVSE---YNVPGVFVKYDIEPILLSVNEDRDGILMFVVKLINVLSG 332
Query: 269 VFTV 272
V
Sbjct: 333 VLVA 336
>gi|261193579|ref|XP_002623195.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239588800|gb|EEQ71443.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
gi|239613876|gb|EEQ90863.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ER-3]
gi|327349942|gb|EGE78799.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ATCC 18188]
Length = 401
Score = 102 bits (253), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 97/204 (47%), Gaps = 30/204 (14%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + C IYG L NKV G+FH A G + + G H+ DSFN SH I +L+FG
Sbjct: 188 ENADSCRIYGSLVGNKVQGDFHITARGHGYFEFGEHL------SHDSFNFSHMITELSFG 241
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS--------------------G 207
H+ ++NPLD T YQY++ +VPT+YT G
Sbjct: 242 PHYSTLLNPLDKTISTTPAHFHKYQYYMSIVPTIYTRAGVVDPYSQALPDPSTITPSQRG 301
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
+TI +NQ++VT RS E + +PG+FF Y + PI + +EE S L L + ++
Sbjct: 302 NTIFTNQYAVTS--RSHELPDAEYDVPGIFFKYTIEPILLVVSEERGSLLALLVRLVNVL 359
Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
GV G + +KK+
Sbjct: 360 AGVVVAGGWLFQIFTWAMDNLKKR 383
>gi|426372082|ref|XP_004052960.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Gorilla gorilla
gorilla]
Length = 354
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/147 (39%), Positives = 84/147 (57%), Gaps = 6/147 (4%)
Query: 133 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 192
P ++ H H +S+N SH+I+ L+FGE P ++NPLDG + M+Q
Sbjct: 166 PPRAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQ 225
Query: 193 YFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFT 249
YFI VVPT ++T +S T +QFSVTE R + + G+F YDLS + VT T
Sbjct: 226 YFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVT 282
Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
EEH+ F F +C IVGG+F+ +G++
Sbjct: 283 EEHMPFWQFFVRLCGIVGGIFSTTGML 309
>gi|344301277|gb|EGW31589.1| hypothetical protein SPAPADRAFT_62204 [Spathaspora passalidarum
NRRL Y-27907]
Length = 353
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/174 (32%), Positives = 90/174 (51%), Gaps = 12/174 (6%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
QR+ E C+I+G + +N+V G+F G D++A D N SH I +
Sbjct: 146 QRVNEN-APACHIFGSIPINQVKGDFRIT------AKGYGYRDVIAAPIDKLNFSHVIQE 198
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF-- 221
++GE +P + NPLD E Y Y KVVPT Y + G +++NQ+SVTE+
Sbjct: 199 FSYGEFYPFINNPLDATGKVTEEKFQKYMYSAKVVPTSYEKL-GLIVETNQYSVTENHQV 257
Query: 222 --RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
++S+ G +PG++ YD PIK+ E+ + F+ F+ + I GG+ +
Sbjct: 258 LQKNSQTGVPIGVPGIYIKYDFEPIKMVIKEKRMPFMQFVAKLATIAGGILITA 311
>gi|339233696|ref|XP_003381965.1| conserved hypothetical protein [Trichinella spiralis]
gi|316979152|gb|EFV61980.1| conserved hypothetical protein [Trichinella spiralis]
Length = 331
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 4/172 (2%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFGEHF 170
+ C I+G+ +NK+ G ++ V I A Q + FN SH+I K FG
Sbjct: 144 DACRIHGYFLMNKLRGKLRIKFKETVRLEAVSNFIIFARRQNEGFNFSHRIEKFGFGPRI 203
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGR 228
G++NPLDG + M+ Y+I+VVPT TD++G ++Q+SVT R +QG
Sbjct: 204 AGIINPLDGFQKESFDRRDMFYYYIQVVPTKITDLNGMETFTSQYSVTHKRRIIDHDQGS 263
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ G+F ++D +P+ V + S F +CAIVGG+F + I A +
Sbjct: 264 HGSC-GIFIYFDFAPMMVLIRKSKTSLFVFALRICAIVGGIFACTDFIIALM 314
>gi|170587366|ref|XP_001898447.1| HT034 [Brugia malayi]
gi|158594071|gb|EDP32661.1| HT034, putative [Brugia malayi]
Length = 286
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 100/190 (52%), Gaps = 18/190 (9%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
GC G +++KV GNFH + H D Q +++++ H I+ + FG+
Sbjct: 110 GCRFEGKFDISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDVST 157
Query: 172 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
G NPL + S + Y +K+VP+VY D++G+ S Q++ +
Sbjct: 158 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 217
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P ++F Y+L PI + +TE F F+T++CA+VGG FTV+GIIDA ++ +
Sbjct: 218 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 276
Query: 288 KKKIEIGKFS 297
+K ++GK S
Sbjct: 277 YRKHQMGKLS 286
>gi|255944653|ref|XP_002563094.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587829|emb|CAP85889.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 396
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 26/185 (14%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+ C IYG LE NKV G+FH A G + ++ H+ SF+ SH I +L+FG H
Sbjct: 189 ADACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHSSFDFSHMITELSFGPH 242
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------------------HTIQ 211
+P + NPLD E +QYF+ VVPT+Y+ G T+
Sbjct: 243 YPTLQNPLDKTIAETEEHYYKFQYFLSVVPTLYSRGKGALDAYTRSPDAAASRYGRDTVF 302
Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
+NQ++ T + + + +PG+FF Y++ PI + +EE SFL L V + GV
Sbjct: 303 TNQYAATSQSSAIPESPM-VVPGIFFKYNIEPILLLVSEERASFLSLLVRVINTISGVLV 361
Query: 272 VSGII 276
G +
Sbjct: 362 TGGWL 366
>gi|154343635|ref|XP_001567763.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134065095|emb|CAM43209.1| hypothetical protein, unknown function [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 101 bits (252), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 98/193 (50%), Gaps = 19/193 (9%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
EGC + G+++V KV GNFH + H H + N H I+ L+FG
Sbjct: 128 SAAEGCRLEGYIKVGKVPGNFHISSHGRQHLLMTHF-------PNGTNAEHSIHHLSFGT 180
Query: 169 ------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
++PLDG E P +YQYF+ +VPT+Y + S T + QF+ T
Sbjct: 181 LDVKKLDKKAQLHPLDGKEHRSEVPK-IYQYFLDIVPTIY-ESSFSTAHTYQFTGTSSSS 238
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
++ V F Y +SPI V ++ VS HFLT VCAI+GGV+TV+G++ F++
Sbjct: 239 PVPSSQMA---AVVFQYQMSPITVRYSSARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHS 295
Query: 283 GQRAIKKKIEIGK 295
+++I +GK
Sbjct: 296 SAAQFQRRI-LGK 307
>gi|300121843|emb|CBK22417.2| unnamed protein product [Blastocystis hominis]
Length = 251
Score = 101 bits (251), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 107/225 (47%), Gaps = 23/225 (10%)
Query: 60 CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ----CK--REGFLQRIKEEEGEG 113
CYGA ++ CCN C + EAY +GW+ P + Q C+ R L G
Sbjct: 35 CYGA-GAEGQCCNTCSAIVEAYNSRGWS---PHFVLQFSPLCRNSRPSVLSF-----KSG 85
Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G ++V++VAG+ H G V+D + SH I +FG+H PG
Sbjct: 86 CMIWGAIDVHQVAGDIHIQTTTGMIDILGAPVYDAEIISK--LKSSHFIEHFSFGKHIPG 143
Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSSEQGRL 229
V NPL+G R+ + + Y I+++P +Y + G I+SN+ SV E G
Sbjct: 144 VENPLNGRRFLANQLTS-HAYQIEILPAIY-ERGGVEIRSNEISVYETDKVVTVEPSGTA 201
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
PG+FF Y +SP + E+ F + +C ++GG+ V G
Sbjct: 202 DVEPGLFFKYRISPFEHVIREDRKEFWSLVVRLCGVMGGMMAVGG 246
>gi|268577857|ref|XP_002643911.1| Hypothetical protein CBG02175 [Caenorhabditis briggsae]
Length = 282
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/190 (33%), Positives = 102/190 (53%), Gaps = 19/190 (10%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
GC E+NKV GNFH S H + Q D +++ H I+ + FG+
Sbjct: 107 GCRFESRFEINKVPGNFHL----STHSATT--------QPDGYDMRHIIHSIKFGDDVSH 154
Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+ G +PL R +E+ ++Y +K+VP+V+ D SG+ + S Q++ +
Sbjct: 155 KNLKGSFDPLAN-REAKESGLNTHEYILKIVPSVHEDYSGNILNSYQYTYGHKSYVTYHH 213
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ +P V+F Y+L PI + TE SF FLT++CA+VGG FTV+GIID+ + +
Sbjct: 214 SGKIIPAVWFKYELQPITLKQTEHRQSFYIFLTSICAVVGGTFTVAGIIDSTFFTISEMV 273
Query: 288 KKKIEIGKFS 297
KK+ ++GK +
Sbjct: 274 KKQ-QMGKLT 282
>gi|367038975|ref|XP_003649868.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
gi|346997129|gb|AEO63532.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
Length = 380
Score = 100 bits (250), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 62/174 (35%), Positives = 89/174 (51%), Gaps = 15/174 (8%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R+ E + C IYG LE+NKV G+FH A G + G H+ ++FN SH I++
Sbjct: 181 RLWGAEPDSCRIYGSLELNKVQGDFHITARGHGYMAFGDHL------DHNAFNFSHIISE 234
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTIQSNQFSVT 218
L+FG P + NPLD +QYF+ VVPT Y+ + +I +NQ++VT
Sbjct: 235 LSFGPFLPSLANPLDRTVNIATAHFHKFQYFLSVVPTTYSVGRPGALGARSIFTNQYAVT 294
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
E S++ T+PG+F YD+ PI + E F FL V +V GV
Sbjct: 295 EQ---SQEVPDTTIPGIFVKYDIEPILLNIVETRDGFFVFLLRVINVVSGVLVA 345
>gi|340514865|gb|EGR45124.1| predicted protein [Trichoderma reesei QM6a]
Length = 372
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 95/184 (51%), Gaps = 13/184 (7%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ +++ + + G + C +YG L++NKV G+FH A G + G H+
Sbjct: 163 DIVSLSRKKAKWAKTPKPRGRTDSCRMYGSLDLNKVQGDFHITARGHGYSGIGGHL---- 218
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
D FN SH I++L++G +P ++NPLD T +QY++ VVPTVY S
Sbjct: 219 --DHDKFNFSHIISELSYGPFYPSLINPLDRTVNTAIVHFHKFQYYLSVVPTVYI-ASHR 275
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+ +NQ++VTE ++ +PG+FF YD+ PI ++ E F FL + + G
Sbjct: 276 IVNTNQYAVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEETRDGFFAFLLKLVNVFSG 332
Query: 269 VFTV 272
V
Sbjct: 333 VMVA 336
>gi|322697212|gb|EFY88994.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium acridum
CQMa 102]
Length = 372
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 59/184 (32%), Positives = 96/184 (52%), Gaps = 13/184 (7%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ +R + +G + C IYG L++NKV G+FH A G + G H+
Sbjct: 163 DIVALGQRRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL---- 218
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
FN SH I++L+FG ++P +VNPLD E +QY++ VVPT Y+ V
Sbjct: 219 --DHSQFNFSHIISELSFGSYYPSLVNPLDRTINIAENHFHKFQYYVSVVPTRYS-VGSS 275
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+I +NQ++VTE + + +PG+F YD+ PI ++ E+ L F+ + ++ G
Sbjct: 276 SIFTNQYAVTEQSKGVSE---YNVPGIFVKYDIEPILLSVNEDRDGILMFVVKLINVLSG 332
Query: 269 VFTV 272
V
Sbjct: 333 VLVA 336
>gi|225558748|gb|EEH07032.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 401
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 94/191 (49%), Gaps = 32/191 (16%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E+ + C IYG LE NKV G+FH A G + + G H+ D+FN SH + +L+FG
Sbjct: 188 EKADSCRIYGSLEGNKVQGDFHITARGHGYPEFGEHL------SHDAFNFSHMVTELSFG 241
Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
H+P ++NPLD + TP+ +QY++ VVPT+YT
Sbjct: 242 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 299
Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
G TI +NQ++ T + +PG+FF Y++ PI + +EE L L + +
Sbjct: 300 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGGLLALLVRLVNV 358
Query: 266 VGGVFTVSGII 276
+ GV G +
Sbjct: 359 LAGVVVAGGWL 369
>gi|425765498|gb|EKV04175.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum PHI26]
gi|425783511|gb|EKV21358.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
digitatum Pd1]
Length = 396
Score = 100 bits (249), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 90/184 (48%), Gaps = 26/184 (14%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G + ++ H+ +FN SH I +L+FG H+
Sbjct: 190 DACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHSAFNFSHMITELSFGPHY 243
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY---------------TDVSGH---TIQS 212
P + NPLD E +QYF+ +VPT+Y T + H T+ +
Sbjct: 244 PTLQNPLDKTIAETEEHYYKFQYFLSIVPTLYSRGKSALDLYTRSPETLAARHGRNTVFT 303
Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
NQ++ T + + + +PG+FF YD+ PI + +EE FL L V V GV
Sbjct: 304 NQYAATSQSSAIPESPM-VVPGIFFKYDIEPILLLVSEERAGFLSLLIRVINTVSGVLVT 362
Query: 273 SGII 276
G +
Sbjct: 363 GGWL 366
>gi|366998832|ref|XP_003684152.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
gi|357522448|emb|CCE61718.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
Length = 349
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 90/172 (52%), Gaps = 12/172 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC++YG + VN+VAG G D +D + +H +N+ +FG+
Sbjct: 155 ELTGCHVYGSVTVNRVAGEMQIT------AKGYGYRDRKRAPKDLIDFNHVVNEFSFGDF 208
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF----RSS 224
+P + NPLDG + +P Y YF+ VVPT Y + G I +NQ+S+ E+ S+
Sbjct: 209 YPYIENPLDGTCKMYPNSPFSSYNYFMSVVPTFYQKL-GAEIDTNQYSIREYHVDLKNSN 267
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+L T+PG+F YD P+ + ++ ++FL F+ + AI+ V ++ I
Sbjct: 268 VNAKLSTIPGIFLKYDFEPLAIIISDVRLTFLQFIVRLVAILSFVLYIASWI 319
>gi|426200953|gb|EKV50876.1| hypothetical protein AGABI2DRAFT_113626 [Agaricus bisporus var.
bisporus H97]
Length = 542
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 8/165 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG + V +V N H + S HV + N+SH I + +FG +
Sbjct: 173 DGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV------DHNQMNLSHVITEFSFGPY 226
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP +V PLD + YQYF+ VVPT Y +++NQ+SVT + R E +
Sbjct: 227 FPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSPLRTNQYSVTHYTRQVEHNK- 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
PG+FF +DL P+ +T ++ + + L ++GGVF G
Sbjct: 286 -GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFVCMG 329
>gi|409083992|gb|EKM84349.1| hypothetical protein AGABI1DRAFT_32491 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 542
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 8/165 (4%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G C IYG + V +V N H + S HV + N+SH I + +FG +
Sbjct: 173 DGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV------DHNQMNLSHVITEFSFGPY 226
Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
FP +V PLD + YQYF+ VVPT Y +++NQ+SVT + R E +
Sbjct: 227 FPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSPLRTNQYSVTHYTRQVEHNK- 285
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
PG+FF +DL P+ +T ++ + + L ++GGVF G
Sbjct: 286 -GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFVCMG 329
>gi|340914937|gb|EGS18278.1| hypothetical protein CTHT_0063020 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 388
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 86/169 (50%), Gaps = 13/169 (7%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C IYG LE+NKV G+FH A G + + G H +FN SH I++L+FG
Sbjct: 192 QADSCRIYGSLELNKVQGDFHITARGHGYLEGGNAQH----LDHSAFNFSHIISELSFGP 247
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTIQSNQFSVTEHFRS 223
P + NPLD +QYF+ +VPT Y+ ++ +I +NQ++VTE
Sbjct: 248 FLPSLSNPLDRTVNLASHHFHRFQYFLSIVPTTYSVGRPGEMGSQSIFTNQYAVTEQSHP 307
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
+ + +PG+FF YD+ PI + E S FL V IV GV
Sbjct: 308 VSE---RNIPGIFFKYDIEPILLNIVETRDSVFKFLVKVVNIVSGVLVA 353
>gi|392564830|gb|EIW58008.1| DUF1692-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 539
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/180 (30%), Positives = 83/180 (46%), Gaps = 8/180 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C ++G + +V N H + S HV L N+SH I + +FG
Sbjct: 175 QPDGSACRVFGTITAKRVTANLHITTLGHGYASQTHVDHKL------MNLSHVITEFSFG 228
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+FP + PLD P YQY++ VVPT Y + +NQ+SVT + R +
Sbjct: 229 PYFPDITQPLDNSFELTSEPFVAYQYYLHVVPTTYIAPRTKPLNTNQYSVTHYTRVLDHH 288
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
R PG+FF +DL P+K+T + SF+ ++GGVF G H A+
Sbjct: 289 R--GTPGIFFKFDLEPMKLTIHQRTTSFVQLFIRTVGVIGGVFVCMGYAVKITGHAVDAV 346
>gi|414586932|tpg|DAA37503.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
protein [Zea mays]
Length = 63
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 46/54 (85%), Positives = 53/54 (98%)
Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
++VTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 10 LQVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 63
>gi|395326723|gb|EJF59129.1| hypothetical protein DICSQDRAFT_156384 [Dichomitus squalens
LYAD-421 SS1]
Length = 559
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 80/168 (47%), Gaps = 10/168 (5%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
+ +G C IYG + +V N H + S HV H + N+SH I + +F
Sbjct: 178 QPDGSACRIYGTITAKRVTANLHVTTLGHGYASHEHVDHKFM-------NLSHVITEFSF 230
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G +FP + PLD P YQYF+ VVPT Y + +NQ+SVT + R +
Sbjct: 231 GPYFPDITQPLDNSFEMAHDPFVAYQYFLHVVPTTYIAPRSKPLHTNQYSVTHYTRVLDH 290
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
R PG+FF +DL PI +T + S FL +VGGVF G
Sbjct: 291 HR--GTPGIFFKFDLEPIHMTIHQRTTSLAAFLLRCAGVVGGVFVCMG 336
>gi|308198100|ref|XP_001386838.2| predicted protein [Scheffersomyces stipitis CBS 6054]
gi|149388859|gb|EAZ62815.2| putative ER to golgi transport [Scheffersomyces stipitis CBS 6054]
Length = 352
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/160 (35%), Positives = 82/160 (51%), Gaps = 9/160 (5%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E C+I+G + V+ V G+FH + HV ++ N SH I + +FG+
Sbjct: 150 EGAPACHIFGSIPVSHVKGDFHITAKGLGYSDRSHV------PLEALNFSHVIQEFSFGD 203
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE--HFRSSEQ 226
+P + NPLD E P Y YF KVVPT+Y + G + +NQ+S+TE H E
Sbjct: 204 FYPFINNPLDASGKLTEEPLISYSYFAKVVPTLYQRL-GLVVDTNQYSLTENNHVFKLEH 262
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
R +PG+FF YD PIK+ E + F+ F+ + IV
Sbjct: 263 KRPTGIPGIFFKYDFEPIKLIIIERRLPFIQFVARLATIV 302
>gi|440794754|gb|ELR15909.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 306
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/191 (32%), Positives = 94/191 (49%), Gaps = 16/191 (8%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGK----SFHQSGVHVHDIL-----AFQRDS--FNISH 159
+ C + G + V K+ G F + + S + S ++ H DS FN++H
Sbjct: 118 ADRCLLTGHMAVRKIRGQFQISSRRFNPFSIYGSSLNKHTPTEDHPHPHPEDSLPFNVTH 177
Query: 160 KINKLAFGEHFPGVVNPLDGVRWT-QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
+I +L+FG V PLDG+ T +E Y YF+++VP Y G ++S F+ T
Sbjct: 178 RIRELSFGPKVLPDVGPLDGIVQTMREGERSQYSYFLQIVPASYHYADGRVVESYSFAFT 237
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
H + R + PGVF+ YD SP + E SF HF+T CA++GG F V G++ A
Sbjct: 238 MH----TESRSELAPGVFWKYDFSPYATSLREVPKSFSHFITRCCAVIGGTFVVFGLLSA 293
Query: 279 FIYHGQRAIKK 289
+ A KK
Sbjct: 294 LASRLETAAKK 304
>gi|258573091|ref|XP_002540727.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900993|gb|EEP75394.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 398
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 105/236 (44%), Gaps = 47/236 (19%)
Query: 64 ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+ D+ + EVR ++++K G L + D +D C+ IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRKFPKGPKLKSKDAMDSCR-----------------IYGSL 199
Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
E NKV GNFH G+ D F + N +H I +L+FG + ++NPLD
Sbjct: 200 EGNKVQGNFHIT------ARGLGYWDPSGFHLEGLNFTHLITELSFGPRYSTLLNPLDKT 253
Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSVTEH 220
+ YQY++ VVPT+YT +TI +NQ++VT
Sbjct: 254 VAGTKDAFYKYQYYLSVVPTIYTRAGTVDPYNQELPDPSTITSRQRKNTIFTNQYAVTSQ 313
Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ Q ++ +PG+FF +D+ PI + +EE S L L + +V GV G +
Sbjct: 314 SHAIPQ-NVRAVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWV 368
>gi|296821254|ref|XP_002850059.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
gi|238837613|gb|EEQ27275.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
Length = 399
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 78/274 (28%), Positives = 119/274 (43%), Gaps = 52/274 (18%)
Query: 44 QRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRKK---GWALSNPDLIDQCK 98
QR GG E+ + E +ED + EVR +KK L D +D C+
Sbjct: 135 QRGGGSPEYQTLSKEDPFRLEEQEEDLHVEHVLGEVRRGRKKKFPKAPKLKKSDAVDSCR 194
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
++G LE NKV GN H A G + + G + S N
Sbjct: 195 -----------------VFGSLEGNKVQGNLHITARGFGYLEWGQPTNP------HSLNF 231
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------- 208
+H I +L+FG H+ ++NPLD T YQY + VVPT+YT SGH
Sbjct: 232 THLITELSFGPHYARLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-SGHIDPNHRSLP 290
Query: 209 ------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
T+ +NQ++VT + Q R++++PG+FF Y++ PI + ++E S L
Sbjct: 291 DPSSITAKDSKTTVSTNQYAVTS-YSQPVQPRIESIPGIFFKYNIEPILLIVSQERDSLL 349
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
L + +V GV G + A++K+
Sbjct: 350 ALLVRLVNVVSGVLVTGGWLFQIGSWAVEAMRKR 383
>gi|403216157|emb|CCK70655.1| hypothetical protein KNAG_0E04020 [Kazachstania naganishii CBS
8797]
Length = 351
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 55/171 (32%), Positives = 97/171 (56%), Gaps = 12/171 (7%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V G F G+ D+ A ++ N +H IN+ +FG+
Sbjct: 154 EFNGCHIFGSIPVNRVRGEFQIT------AKGLGYRDMNAAPKEKINFAHVINEWSFGDF 207
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-FRSSEQG 227
+P + NPLD ++ ++ P + Y++ VVPT+Y + G + +NQ+SV+E+ F S+++
Sbjct: 208 YPYIDNPLDATAKFDKDDPLTAFVYYLSVVPTIYQKL-GAEVDTNQYSVSEYRFNSTDKT 266
Query: 228 RLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGI 275
T +PG+FF Y+ + + T+ +SFL F+ + AI+ V+ S I
Sbjct: 267 FRDTGYVPGIFFRYNFESLSIVMTDRRLSFLQFIVRLVAIMSFAVYIASWI 317
>gi|67901384|ref|XP_680948.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|40742675|gb|EAA61865.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
gi|259484020|tpe|CBF79887.1| TPA: COPII-coated vesicle protein (Erv41), putative
(AFU_orthologue; AFUA_2G01530) [Aspergillus nidulans
FGSC A4]
Length = 394
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 99/207 (47%), Gaps = 30/207 (14%)
Query: 93 LIDQCKREG---FLQRIKEEEGE---GCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVH 145
++++ +R G F + K G+ C IYG LE NKV G+FH A G + H+
Sbjct: 164 VLNELRRNGKRKFAKGPKLRRGDVVDSCRIYGSLEGNKVQGDFHITARGHGYRDGREHL- 222
Query: 146 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-- 203
+FN SH I +L+FG H+P + NPLD T E YQYF+ +VPT+Y+
Sbjct: 223 -----DHSAFNFSHIITELSFGPHYPSLHNPLDKTIATTEFHYYKYQYFLSIVPTIYSRN 277
Query: 204 --------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
+ + I +NQ++ T + + +PG+FF Y++ PI + +
Sbjct: 278 QNLRLDALPSSSSARSNKNLIFTNQYAATSQSDAIPESPY-VIPGIFFKYNIEPIMLLIS 336
Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
EE FL+ L + V GV G +
Sbjct: 337 EERTGFLNLLIRIVNTVSGVLVTGGWV 363
>gi|115433364|ref|XP_001216819.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114189671|gb|EAU31371.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 449
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 98/207 (47%), Gaps = 31/207 (14%)
Query: 94 IDQCKREGFLQRIKEEEGEG---CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 149
+ + R+ F + K G+ C IYG LE NKV G+FH A G + H+
Sbjct: 218 VRRNPRKKFPKSPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDFAPHL----- 272
Query: 150 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------ 203
+FN SH I +L+FG H+P ++NPLD ET +QYF+ VVPT+Y+
Sbjct: 273 -DHQTFNFSHMITELSFGPHYPTLLNPLDKTIAETETHYYKFQYFLSVVPTIYSKGNRVL 331
Query: 204 -----------DVSGHT---IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
D S H + +NQ++ T + + +PG+FF Y++ PI + +
Sbjct: 332 DTYSIAPPTLHDNSRHNKNLVFTNQYAATSQSDALPESPF-FVPGIFFKYNIEPILLLIS 390
Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
EE SFL L + V GV G +
Sbjct: 391 EERGSFLSLLIRLVNTVSGVMVTGGWL 417
>gi|213408569|ref|XP_002175055.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
gi|212003102|gb|EEB08762.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
yFS275]
Length = 331
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 102/189 (53%), Gaps = 12/189 (6%)
Query: 94 IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAF 150
+ + +R+ F ++ K + G C YG + V++ G H APG + S + +
Sbjct: 133 LRRTRRKKFNKKSKTLPDGGSACRFYGAVTVHRTQGLLHITAPGWGYGMSNIPL------ 186
Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
++ N +H I++L+FG+++P +VN LDG + + +QY+ ++PT YT + +
Sbjct: 187 --NALNFTHAIDELSFGDYYPSLVNALDGSYGFTDEHAFAFQYYTSIIPTTYTS-TFRNV 243
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
Q+NQ++VTE+ + G PG+F YD+ P+ + E + S + + + AI GG+
Sbjct: 244 QTNQYAVTENSVRRQTGFRSDPPGIFISYDIEPLGIHIRETYPSLGNTILRILAISGGLV 303
Query: 271 TVSGIIDAF 279
TV+ ++ F
Sbjct: 304 TVTTWVERF 312
>gi|225685292|gb|EEH23576.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 386
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/190 (33%), Positives = 94/190 (49%), Gaps = 30/190 (15%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + C IYG LE NKV G+FH A G + + G H+ +FN SH I +L+FG
Sbjct: 173 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------DHHAFNFSHMITELSFG 226
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 227 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTIDPYSQVLPDPSTISPSQRK 286
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
+TI +NQ++VT RS E +Q +PG+FF Y++ PI + +EE S L L + ++
Sbjct: 287 NTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLALLVRLVNVM 344
Query: 267 GGVFTVSGII 276
GV G +
Sbjct: 345 SGVVVAGGWL 354
>gi|396485364|ref|XP_003842153.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
gi|312218729|emb|CBX98674.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
Length = 486
Score = 97.8 bits (242), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 59/188 (31%), Positives = 89/188 (47%), Gaps = 34/188 (18%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E + C I+G +E NKV G+FH A G + + GVH+ +FN SH I +L+FG
Sbjct: 267 ETDSCRIFGSIEGNKVQGDFHITARGHGYIEYGVHL------DHKTFNFSHIIRELSFGP 320
Query: 169 HFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD--------------------- 204
++P + NPLD TP +QYF+ +VPT+YTD
Sbjct: 321 YYPSLTNPLDNTIAITPTPDDHFYKFQYFLSIVPTIYTDDPSLIPYLDILNRYGKNPDLF 380
Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
S H +++NQ++VT + +PGVF +D+ PI + EE F L +
Sbjct: 381 NSAHAVKTNQYAVTSQSHPVSE---YYVPGVFVKFDIEPIMLNVVEEWGGFWRLLVRLVN 437
Query: 265 IVGGVFTV 272
++ GV
Sbjct: 438 VISGVMVA 445
>gi|406607484|emb|CCH41148.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
[Wickerhamomyces ciferrii]
Length = 359
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 99/196 (50%), Gaps = 23/196 (11%)
Query: 89 SNPDLIDQCKREGFLQRI------KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 142
+ PDL D+ +G + K+ C+IYG + VNKV+G+FH ++
Sbjct: 140 NTPDL-DEVMAQGIIAEFRDRGDAKDSGAPACHIYGSIPVNKVSGDFHITAQGYGYRGNS 198
Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
H + D N +H I++ +FGE +P + NPLD + YQY++ VVPTVY
Sbjct: 199 RSHVGI----DGLNFTHIISEFSFGEFYPYIHNPLDATVQITKEHLQSYQYYLSVVPTVY 254
Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLH 257
+ G I++NQ+S +S Q +L + +PG+FF YD PI + ++ + F
Sbjct: 255 KKL-GVEIETNQYS------TSLQKKLYSFENKGVPGLFFKYDFEPISLIVEDKRIPFST 307
Query: 258 FLTNVCAIVGGVFTVS 273
FL + I GG+ V+
Sbjct: 308 FLVRLATIYGGIIVVA 323
>gi|449542382|gb|EMD33361.1| hypothetical protein CERSUDRAFT_117979 [Ceriporiopsis subvermispora
B]
Length = 530
Score = 97.4 bits (241), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 78/180 (43%), Gaps = 6/180 (3%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C ++G + KV N H H H H N+SH I + +FG
Sbjct: 174 QPDGSACRVFGSITAKKVTANLHIT--TLGHGYATHSH----VDHSKMNLSHVITEFSFG 227
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
HFP + PLD P YQYF+ VVPT Y + ++Q+SVT + R +
Sbjct: 228 PHFPDITQPLDNSFEVAHDPFVAYQYFLHVVPTTYIAPRSSPLHTHQYSVTHYTRILDPS 287
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+ PG+FF +DL P+ + + S + ++GGVF G H A+
Sbjct: 288 HHRHTPGIFFKFDLDPLAIKIEQRTTSLVQLAIRCVGVIGGVFVCMGYAVKITTHAVDAV 347
>gi|392594239|gb|EIW83563.1| DUF1692-domain-containing protein [Coniophora puteana RWD-64-598
SS2]
Length = 506
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 55/183 (30%), Positives = 90/183 (49%), Gaps = 16/183 (8%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C IYG L V KV N H + S +HV N+SH I + +FG
Sbjct: 169 QPDGSACRIYGTLAVKKVTANLHVTTLGHGYTSHMHV------DHTKMNLSHVITEFSFG 222
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
+FP + PLD + P +QY++ VVPT Y +++NQ+SVT + +++
Sbjct: 223 PYFPDISQPLDYSFEVAKDPYTAFQYYMHVVPTNYIAPRSKPLETNQYSVTHYTHIYKTP 282
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+G +PG+FF +DL P+ ++ + S + ++GGVFT + F+
Sbjct: 283 HEG----IPGIFFKFDLDPMVLSIHQRTTSLTALIIRCVGVIGGVFTCA---TYFVRASM 335
Query: 285 RAI 287
RA+
Sbjct: 336 RAV 338
>gi|453088947|gb|EMF16987.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 404
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 92/194 (47%), Gaps = 38/194 (19%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E + C IYG + NKV G+FH A G + + G H+ +FN SH+I +L+FG
Sbjct: 184 EADSCRIYGSMHGNKVKGDFHITARGHGYMEFGQHL------DHSTFNFSHRITELSFGP 237
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD------------------------ 204
++P + NPLD T E+ +QY++ VVPT+YT
Sbjct: 238 YYPSLTNPLDNTFATTESNFYKFQYYLSVVPTIYTADAKALRKIDKYHESPTSGDDGLSQ 297
Query: 205 ----VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
S +T+ +NQ++VTE + ++PG+F +D+ PI++T E S L
Sbjct: 298 QPKRYSKNTVFTNQYAVTEQSHPVSE---SSVPGIFVKFDIEPIQLTIAENWSSVPALLI 354
Query: 261 NVCAIVGGVFTVSG 274
+ +V G+ G
Sbjct: 355 RIVNVVSGLLVAGG 368
>gi|255726548|ref|XP_002548200.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
gi|240134124|gb|EER33679.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
Length = 355
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 96/197 (48%), Gaps = 18/197 (9%)
Query: 90 NPDLIDQCKREGFLQRIKE------EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
PDL D+ +E + E C+I+G + V +V G+F ++ H
Sbjct: 126 TPDL-DEIMQESLRAEFRSQGARVNEGAPACHIFGSIPVTQVRGDFRITAKGFGYRDRSH 184
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
V ++FN SH I + +FGE +P + NPLD E Y Y+ KVVPT+Y
Sbjct: 185 V------PIEAFNFSHVIQEFSFGEFYPFINNPLDATGKITEEKLQTYLYYAKVVPTMYE 238
Query: 204 DVSGHTIQSNQFSVTEH---FRSSEQG-RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+ G I +NQ+S+TE + EQ R +PG++F YD PIK+ E+ + F F+
Sbjct: 239 QL-GLEIDTNQYSLTESQHVIQVDEQTKRPNGIPGIYFRYDFEPIKLVIREKRIPFFQFI 297
Query: 260 TNVCAIVGGVFTVSGII 276
+ I GG+ +G +
Sbjct: 298 AKLGTIGGGIMIAAGYL 314
>gi|320580226|gb|EFW94449.1| COPii-coated vesicle-associated protein, putative [Ogataea
parapolymorpha DL-1]
Length = 901
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 84/167 (50%), Gaps = 12/167 (7%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+E C I+G + VN+V G H G D + N +H I++ +FG
Sbjct: 707 DEGAPACRIFGAIPVNRVKGELHIT------AKGYGYRDRTRIPAEGLNFTHAISEFSFG 760
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
E FP + NPLD T + ++Y I VVPT+Y + G I +NQ+S+ S +
Sbjct: 761 EFFPYLDNPLDMTLKTTDAHLHTFKYHINVVPTLYRKL-GVEIDTNQYSL-----SLTES 814
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ +PG+FF Y+ PIK+ E +SF F+ + I+GG+ V+G
Sbjct: 815 SGKYVPGIFFQYEFEPIKLVVEETRLSFWQFVVRLATIMGGILVVAG 861
>gi|154305556|ref|XP_001553180.1| hypothetical protein BC1G_08547 [Botryotinia fuckeliana B05.10]
Length = 381
Score = 97.1 bits (240), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 89/161 (55%), Gaps = 26/161 (16%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
G+ C +YG LEVNKV G+FH A G + + G H+ +FN SH IN+L+FG
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNFSHIINELSFGPF 238
Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT--------IQSNQFSVT- 218
+P ++NPLD R TP+ YQYF+ VVPT+Y+ +++NQ++VT
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSVVPTLYSLSPSTFSPSSSPTLLRTNQYAVTS 296
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
EH +++PG+FF YD+ P+ +T E FL F
Sbjct: 297 QEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|47219772|emb|CAG03399.1| unnamed protein product [Tetraodon nigroviridis]
Length = 378
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/224 (30%), Positives = 98/224 (43%), Gaps = 62/224 (27%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK-----------SFHQSGVHVHDILAFQR-------- 152
C I+G L VNKVAGNFH GK S H + V L R
Sbjct: 130 RACRIHGHLYVNKVAGNFHITVGKYVTSLLGYSVVSLHSIPIGVTLFLLLSRSIPHPRGH 189
Query: 153 ---------DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE--------TP-------- 187
DS+N SH+I+ L+FGE PG+++PLDG TP
Sbjct: 190 AHLAALVSHDSYNFSHRIDHLSFGEDLPGIISPLDGTEKVSADCTAVLSLTPLHRCDFFL 249
Query: 188 ----------------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
+ ++QYFI +VPT + + +++Q+SVTE R+
Sbjct: 250 PRLFFKMCDFRFSLLANHIFQYFITIVPT-KLNTYKVSAETHQYSVTEQDRAINHAAGSH 308
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ G+F YD+S + V TE+H+ FL +C IVGG+F+ +
Sbjct: 309 GVSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIVGGIFSTTA 352
>gi|146079597|ref|XP_001463805.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|398011570|ref|XP_003858980.1| hypothetical protein, conserved [Leishmania donovani]
gi|134067893|emb|CAM66174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
gi|322497192|emb|CBZ32265.1| hypothetical protein, conserved [Leishmania donovani]
Length = 368
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 105/224 (46%), Gaps = 22/224 (9%)
Query: 65 SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
+++ CC+ CE V + Y++ G + + I QC E QR GC + G L++ K
Sbjct: 148 AAELKCCDTCESVLDLYKELGKGIPGTEYIPQC-LEQLYQR-----ASGCTVMGSLDLKK 201
Query: 125 VAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLD 178
V F P ++ H + D++ + SH I KL G E F GV PL
Sbjct: 202 VPVTVIFGPRRTGH--FYSLKDVI-----RLDTSHFIRKLRIGDETVERFSKNGVAEPLS 254
Query: 179 GVRWTQETPSGMYQYFIKVVPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
G + + +T S +Y +KVVPT Y T + ++S R+ G +P V
Sbjct: 255 GHKSSSKTYSET-RYLVKVVPTTYRKTKTKNAKASTYEYSAQWSRRTIVVGFAGAVPAVL 313
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
F ++ +PI+V E F HFL +C IVGG+F V G ID +
Sbjct: 314 FEFEPAPIQVNNVFERQPFSHFLVQLCGIVGGLFVVLGFIDNVV 357
>gi|353236810|emb|CCA68797.1| related to ERV41-component of copii vesicles involved in transport
between the ER and golgi complex [Piriformospora indica
DSM 11827]
Length = 559
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 9/166 (5%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+G C +YG V K+ GNFH G + H D+ N+SH I + +FG
Sbjct: 198 DGGACRVYGSFAVRKLTGNFHITTLGHGYGGHNAHA------SHDNINMSHVITEFSFGP 251
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
++P +V PLD T + +QYFI VVPT Y + ++Q+SVT + + E
Sbjct: 252 YYPDIVQPLDYSFETTQEHFVAFQYFITVVPTTYVAPRSKPLHTHQYSVTHYVK--ELPH 309
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
Q PG+FF YD+ P+ + + + FL + ++GGV+ G
Sbjct: 310 SQGTPGIFFKYDIDPVALEIHQRTTTLTQFLVRIVGVIGGVWVCFG 355
>gi|297803392|ref|XP_002869580.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
gi|297315416|gb|EFH45839.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
lyrata]
Length = 480
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 39/204 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC I G++ V KV GN + +SG H +F N+SH +N L+FG+
Sbjct: 293 GCRIEGYIRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGQRIMP 342
Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ G+ + LDG + + P+ ++++++V T ++SN
Sbjct: 343 QKFSELKRLSPYLGLSHDRLDGRPFINQRDLGPNVTIEHYLQIVKT-------EVVKSNG 395
Query: 215 FSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
++ E + + + LP F ++LSP++V TE SF HF+TNVCAI+GGVFT
Sbjct: 396 QALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFT 455
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+D+ ++H + KKIE+GK
Sbjct: 456 VAGILDSILHHSM-TLMKKIELGK 478
>gi|391338468|ref|XP_003743580.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Metaseiulus occidentalis]
Length = 292
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 98/202 (48%), Gaps = 32/202 (15%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+G+GCN +NKV GNFH S H + Q D ++SH+I+ L FGE
Sbjct: 109 DGKGCNFVSKFTINKVPGNFHV----STHAAKT--------QPDDIDMSHEIHSLTFGEQ 156
Query: 170 F--------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----- 216
G N L + + Y +K+VPTVY SG ++ Q++
Sbjct: 157 LIYELGDDIKGSFNALQNHDRLKADGKESHDYVMKIVPTVYELSSGDSLVGYQYTHAHKS 216
Query: 217 -VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
+T F + GR+ +P ++F YDL+PI V + FLTNVCAIVGG FTV GI
Sbjct: 217 YITLSFSA---GRI--IPAIWFKYDLNPITVRYHRRTQPLYSFLTNVCAIVGGTFTVVGI 271
Query: 276 IDAFIYHGQRAIKKKIEIGKFS 297
I++ + +K E+GK S
Sbjct: 272 INSICFTAGEVF-RKFEMGKLS 292
>gi|451847161|gb|EMD60469.1| hypothetical protein COCSADRAFT_98785 [Cochliobolus sativus ND90Pr]
Length = 395
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/190 (31%), Positives = 91/190 (47%), Gaps = 36/190 (18%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L NKV G+FH A G + + G H+ + SFN SH I +++FG ++
Sbjct: 178 DSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------EHSSFNFSHIIREMSFGPYY 231
Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD-----------VS---------- 206
P + NPLD TP+ +QY++ +VPT+YTD VS
Sbjct: 232 PSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPALMPIMESMVSTNDQPSSNMF 291
Query: 207 --GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
H I++NQ++VT + +PG+F +D+ PI + EE SF + +
Sbjct: 292 RMAHAIKTNQYAVTSQSHKVDDSY---VPGIFVKFDIEPIMLAIVEESKSFWKLVITLVN 348
Query: 265 IVGGVFTVSG 274
+V GV G
Sbjct: 349 VVSGVMVAGG 358
>gi|326470603|gb|EGD94612.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
Length = 399
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 30/192 (15%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
K + + C ++G LE NKV GN H A G + + G A S N +H I +L+
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
FG H+ ++NPLD + YQY++ VVPT+YT SGH
Sbjct: 240 FGPHYGRLLNPLDKTVSSTSINFYKYQYYLSVVPTIYTK-SGHIDPNRRSLPDASTITAK 298
Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
T+ +NQ++VT + Q R+ + PG+FF Y++ PI + ++E S L + +
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLALMVRLVN 357
Query: 265 IVGGVFTVSGII 276
+V GV G +
Sbjct: 358 VVSGVLVTGGWL 369
>gi|347828541|emb|CCD44238.1| similar to endoplasmic reticulum-Golgi intermediate compartment
protein 2 [Botryotinia fuckeliana]
Length = 381
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 89/161 (55%), Gaps = 26/161 (16%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
G+ C +YG LEVNKV G+FH A G + + G H+ +FN SH IN+L+FG
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNFSHIINELSFGPF 238
Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT--------IQSNQFSVT- 218
+P ++NPLD R TP+ YQYF+ +VPT+Y+ +++NQ++VT
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSPTLLRTNQYAVTS 296
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
EH +++PG+FF YD+ P+ +T E FL F
Sbjct: 297 QEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|167523643|ref|XP_001746158.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775429|gb|EDQ89053.1| predicted protein [Monosiga brevicollis MX1]
Length = 1400
Score = 96.3 bits (238), Expect = 1e-17, Method: Composition-based stats.
Identities = 51/149 (34%), Positives = 84/149 (56%), Gaps = 7/149 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ E +GC ++G + V +V+ NFHF+ GKS H + H H + + + N SH+I++ +F
Sbjct: 165 DAEPDGCRVHGTMPVARVSSNFHFSAGKSVHHASGHAHVPIDPNQKTINFSHRIDRFSFS 224
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTE--HFRSS 224
G + LDG ++ ++QYF+KVVPT + +SNQ+SVTE H ++
Sbjct: 225 SEQRGAM-ALDGDMKVSDSNKQLFQYFLKVVPTTTKRMDEAEPFRSNQYSVTEQHHILAA 283
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
+ + LPG+ F Y++ PI V E+ V
Sbjct: 284 NE---RKLPGIHFKYEIEPIGVLVHEQAV 309
>gi|149241719|ref|XP_001526345.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146450468|gb|EDK44724.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 353
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 94/177 (53%), Gaps = 12/177 (6%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
QRI E C+I+G + VN+V G+F GK F S +H LA + N +H I +
Sbjct: 146 QRINEG-APACHIFGSIPVNQVKGDFRIT-GKGFGYSD-RLHVPLA----ALNFTHVIQE 198
Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
++GE FP + NPLD E Y Y +VVPT+Y + G + +NQ+S+TE+
Sbjct: 199 FSYGEFFPFLNNPLDATGKVTEEKLQAYIYNAQVVPTLYEKL-GLEVDTNQYSLTENHHV 257
Query: 224 SE----QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+ R Q +PG++F Y+ PIK+T E+ + F F+ + I GG+ +G +
Sbjct: 258 IKLDEISNRPQGVPGIYFRYEFEPIKLTIREKRIPFFQFVARLGTICGGLLVAAGYL 314
>gi|403413226|emb|CCL99926.1| predicted protein [Fibroporia radiculosa]
Length = 546
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 89/190 (46%), Gaps = 15/190 (7%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+++G C IYG + K N H + S HV N+SH IN+ +FG
Sbjct: 177 QKDGSACRIYGTITAKKATANLHITTIGHGYASRDHV------DHKYMNLSHVINEFSFG 230
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSE 225
FP +V PLD P YQY++ VVPT Y + ++Q+SVT + R S+
Sbjct: 231 PFFPEIVQPLDNSFELALDPFVAYQYYLHVVPTTYIAPRSTPLHTHQYSVTHYTRTMSTH 290
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
QG PG+FF +DL P+ +T + + FL +VGG+F G + G R
Sbjct: 291 QG----TPGIFFKFDLEPMHLTIHQRTTTLAQFLIRCVGVVGGIFVCMGYA---VRVGTR 343
Query: 286 AIKKKIEIGK 295
A++ + +
Sbjct: 344 AVEAATGVDR 353
>gi|22328963|ref|NP_567765.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|75213708|sp|Q9T042.1|PDI54_ARATH RecName: Full=Protein disulfide-isomerase 5-4; Short=AtPDIL5-4;
AltName: Full=Protein disulfide-isomerase 7; Short=PDI7;
AltName: Full=Protein disulfide-isomerase 8-2;
Short=AtPDIL8-2; Flags: Precursor
gi|4490704|emb|CAB38838.1| putative protein [Arabidopsis thaliana]
gi|7269561|emb|CAB79563.1| putative protein [Arabidopsis thaliana]
gi|15450832|gb|AAK96687.1| putative protein [Arabidopsis thaliana]
gi|20259836|gb|AAM13265.1| putative protein [Arabidopsis thaliana]
gi|332659897|gb|AEE85297.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 480
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 105/204 (51%), Gaps = 39/204 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 293 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 342
Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ G+ + LDG + + P+ ++++++V T ++SN
Sbjct: 343 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKT-------EVVKSNG 395
Query: 215 FSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
++ E + + + LP F ++LSP++V TE SF HF+TNVCAI+GGVFT
Sbjct: 396 QALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFT 455
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+D+ ++H + KKIE+GK
Sbjct: 456 VAGILDSILHHSM-TLMKKIELGK 478
>gi|156841160|ref|XP_001643955.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
gi|156114586|gb|EDO16097.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 349
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 97/167 (58%), Gaps = 13/167 (7%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+GC+IYG +++N+VAG F A G + +G D + F +H IN+ +FG+ +
Sbjct: 157 DGCHIYGSVKLNRVAGELQFTAKGWGYRDNGRAPLDQIDF-------NHVINEFSFGDFY 209
Query: 171 PGVVNPLDGVRWTQETPS-GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
P + NPLDG ++ S Y Y VVPT++ + G + +NQ+S+ E+ + + G++
Sbjct: 210 PYIDNPLDGTAKIEKQKSISRYIYSTSVVPTIFQKL-GAEVDTNQYSLAEYHTAPKDGKI 268
Query: 230 Q---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
+ ++PG+FF YD P+ + +++ +SF+ F+ + AI+ + ++
Sbjct: 269 KLTTSIPGIFFRYDFEPLSIVISDKRLSFVQFIVRLVAILSFILYMA 315
>gi|378726952|gb|EHY53411.1| hypothetical protein HMPREF1120_01605 [Exophiala dermatitidis
NIH/UT8656]
Length = 326
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/208 (30%), Positives = 94/208 (45%), Gaps = 47/208 (22%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG LE NKV G+FH A G + + G+ H FN SH IN+L+FG H+
Sbjct: 86 DSCRIYGSLEGNKVQGDFHITARGHGYMEFGMQQH----LDHSRFNFSHHINELSFGPHY 141
Query: 171 PGVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT-------------------------- 203
PG++NPLD T + YQY++ +VPT++T
Sbjct: 142 PGLLNPLDKTSAVTTDVHFMRYQYYLSIVPTIFTKRRVSTSSGALDPAAIPQPPTLDLTP 201
Query: 204 ----DVSG--------HTIQSNQFSVTEHFRSSEQGRL---QTLPGVFFFYDLSPIKVTF 248
D G H + ++ T + ++ Q R T+PGVFF YD+ PI +
Sbjct: 202 NDHRDKDGVVRHVPNPHAGRDSKSVFTNQYAATSQSREVPGNTVPGVFFKYDIEPILLIV 261
Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+E SFL + + ++ GV G +
Sbjct: 262 SERRSSFLGLIVRLVNVISGVLVAGGWM 289
>gi|238480964|ref|NP_680742.2| protein PDI-like 5-4 [Arabidopsis thaliana]
gi|332659898|gb|AEE85298.1| protein PDI-like 5-4 [Arabidopsis thaliana]
Length = 532
Score = 95.9 bits (237), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 102/201 (50%), Gaps = 33/201 (16%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 345 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 394
Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 395 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQALV-EA 453
Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ T H S LP F ++LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 454 YEYTAH---SSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFTVAG 510
Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
I+D+ ++H + KKIE+GK
Sbjct: 511 ILDSILHHSM-TLMKKIELGK 530
>gi|169860063|ref|XP_001836668.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
gi|116502344|gb|EAU85239.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
[Coprinopsis cinerea okayama7#130]
Length = 516
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 80/163 (49%), Gaps = 8/163 (4%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ + C I+G + V KV N H + S HV L N+SH I + +FG
Sbjct: 169 QADASACRIWGTMYVKKVTANLHVTTLGHGYASYEHVDHHL------MNLSHVIQEFSFG 222
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
HFP +V PLD YQYF+ VVPT Y +++NQ+SVT + R E
Sbjct: 223 PHFPEIVQPLDNSFEATHEHFIAYQYFLHVVPTTYVAPRTAPLETNQYSVTHYTRVLEHN 282
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
R PG+FF ++L P+K+T + + L + ++GGVF
Sbjct: 283 R--GTPGIFFKFELDPLKITQYQRTTTLLQLMIRCVGVIGGVF 323
>gi|358388143|gb|EHK25737.1| hypothetical protein TRIVIDRAFT_33251 [Trichoderma virens Gv29-8]
Length = 370
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/184 (30%), Positives = 95/184 (51%), Gaps = 15/184 (8%)
Query: 92 DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
D++ +++ + +G + C +YG L++N+V G+FH A G + G H+
Sbjct: 163 DIVALSRKKAKWAKTPSPKGRPDSCRMYGSLDLNRVQGDFHITARGHGY--GGQHL---- 216
Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
D FN SH I+++++G +P +VNPLD + +QY++ VVPTVY +
Sbjct: 217 --DHDKFNFSHIISEMSYGPFYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANNR 273
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
+ +NQ++VTE ++ +PG+FF YD+ PI ++ E F FL + I G
Sbjct: 274 IVNTNQYAVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEESRDGFFTFLVKIVNIFSG 330
Query: 269 VFTV 272
V
Sbjct: 331 VMVA 334
>gi|451997913|gb|EMD90378.1| hypothetical protein COCHEDRAFT_27091 [Cochliobolus heterostrophus
C5]
Length = 395
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 37/207 (17%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C IYG L NKV G+FH A G + + G H+ SFN SH I +++FG ++
Sbjct: 178 DSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------DHSSFNFSHIIREMSFGPYY 231
Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD-----------VS---------- 206
P + NPLD TP+ +QY++ +VPT+YTD VS
Sbjct: 232 PSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPSLMPLMESVVSTNDQPSSNMF 291
Query: 207 --GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
H I++NQ++VT S + +PG+F +D+ PI + EE SF L +
Sbjct: 292 RMAHAIKTNQYAVTSQ---SHKVDDTYVPGIFVKFDIEPIMLAIVEESKSFWKLLITLVN 348
Query: 265 IVGGVFTV-SGIIDAFIYHGQRAIKKK 290
+V GV S + F + + K+K
Sbjct: 349 VVSGVMVAGSWVWQMFDWASEFVGKRK 375
>gi|21618302|gb|AAM67352.1| unknown [Arabidopsis thaliana]
Length = 317
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC + G++ V KV GN + +SG H +F N+SH +N L+FG
Sbjct: 130 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 179
Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ G+ + LDG + + P+ ++++++V T +G +
Sbjct: 180 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQAL---- 235
Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
E+ S LP F ++LSP++V TE SF HF+TNVCAI+GG FTV+G
Sbjct: 236 VEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGAFTVAG 295
Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
I+D+ ++H + KKIE+GK
Sbjct: 296 ILDSILHHSM-TLMKKIELGK 315
>gi|328352874|emb|CCA39272.1| Peroxisomal membrane protein PEX28 [Komagataella pastoris CBS 7435]
Length = 849
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 17/189 (8%)
Query: 94 IDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 147
+D+ RE L +E+ + C+I+G + VNKV G FH GK G D
Sbjct: 644 LDEVMRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDR 697
Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
+++ N +H I++ +FGE +P + NPLD T + Y++ VVPT Y + G
Sbjct: 698 SIVPKEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-G 756
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
I + Q+S+T +E L PG+FF Y PI ++ E+ +SF+ FL + I G
Sbjct: 757 IVIDTTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICG 812
Query: 268 GVFTVSGII 276
G+ V+ I
Sbjct: 813 GIMVVAKWI 821
>gi|345325542|ref|XP_001508860.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Ornithorhynchus anatinus]
Length = 372
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 53/116 (45%), Positives = 69/116 (59%), Gaps = 5/116 (4%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ + C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE
Sbjct: 165 QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGE 224
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR 222
PG++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE R
Sbjct: 225 LVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERER 277
>gi|260950511|ref|XP_002619552.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
gi|238847124|gb|EEQ36588.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
Length = 347
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/168 (30%), Positives = 87/168 (51%), Gaps = 8/168 (4%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E+ C+I+G + VN V G F P S ++ D + ++N SH I++ +FG+
Sbjct: 150 EDAPACHIFGTIPVNHVRGEFFIVPKGSMYR------DRSSIDPKAYNFSHVISEFSFGD 203
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
+P + NPLD E Y+YF K+VPT Y + G + + Q+S+TE + + R
Sbjct: 204 FYPFITNPLDFTAKVTEENRQAYRYFAKLVPTHYEKL-GLVVDTYQYSLTE-IHNVDHNR 261
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
PG+FF Y PIK+T E+ + F F+ + ++ G+ +G +
Sbjct: 262 GIPPPGIFFDYSFEPIKLTIREKRIGFFAFVARLMTVLSGLLIAAGYL 309
>gi|452847826|gb|EME49758.1| hypothetical protein DOTSEDRAFT_58941 [Dothistroma septosporum
NZE10]
Length = 402
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 69/238 (28%), Positives = 104/238 (43%), Gaps = 49/238 (20%)
Query: 75 EEVREAYRKKGWALSNPDL---IDQCKREGFLQRIKE----EEGEGCNIYGFLEVNKVAG 127
E +R Y KG D+ + KR+ ++ + + C IYG + NKV G
Sbjct: 140 ERIRSGYDGKGAEYEEEDVHNYLGAAKRQKKFKKTPGLPWGAQADSCRIYGSMHGNKVQG 199
Query: 128 NFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET 186
+FH A G + + G H+ +FN SH +N+L+FG +P + NPLD T
Sbjct: 200 DFHITARGHGYMEFGAHL------DHSTFNFSHTVNELSFGPFYPSLTNPLDNT--VATT 251
Query: 187 PSGMY--QYFIKVVPTVYTD----------------------------VSGHTIQSNQFS 216
P Y QY++ VVPT+YT S +T+ +NQ++
Sbjct: 252 PDHFYKFQYYLSVVPTIYTTDAKTLRKIDKHHESPSSGEDGLSQYPHRYSRNTVFTNQYA 311
Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
VTE S + +PGVF +D+ PI +T EE S L + +V G+ G
Sbjct: 312 VTEQ---SHRVPENAVPGVFIKFDIEPIGLTIAEEWSSIPALLIRLVNVVSGLLVAGG 366
>gi|402224967|gb|EJU05029.1| DUF1692-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
Length = 517
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/155 (32%), Positives = 81/155 (52%), Gaps = 11/155 (7%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
++G C +YG +EV KV N H G +H + H ++ N+SH I + +FG
Sbjct: 174 KDGSACRVYGSMEVKKVQANLHITTLGHGYHSNEHTDHSLM-------NLSHIITEFSFG 226
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
+FP +V PLD + + P +QYF+ VVPT Y G +++NQ+SV H + + G
Sbjct: 227 PYFPDIVQPLDYTIESSDDPFTAFQYFLTVVPTEYRTSKG-VVKTNQYSVGSHMQHIQHG 285
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
R P +FF YDL P+ + + + + FL +
Sbjct: 286 R--GTPVIFFKYDLEPLSLIVEQRTTTLIQFLIRL 318
>gi|406868300|gb|EKD21337.1| copii-coated vesicle protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 382
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/160 (38%), Positives = 84/160 (52%), Gaps = 16/160 (10%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
K E + C I+G LEVNKV G H +Q H +FN SH +++L+F
Sbjct: 184 KSAEMDSCRIFGNLEVNKVQGELHITARGHGYQELAAGH----LDHHAFNFSHVVSELSF 239
Query: 167 GEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVY-----TDVSGHTIQSNQFSVTE 219
G +P + NPLD R TP+ +QYF+ VVPTVY T S T+ +NQ++VTE
Sbjct: 240 GPFYPSLHNPLD--RTVSTTPNNFHKFQYFLSVVPTVYSVDSSTTYSSQTLFTNQYAVTE 297
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+ ++PG+FF YD P+ +T E SFL FL
Sbjct: 298 QSHVVSEF---SVPGIFFKYDFEPMLLTVQESRDSFLRFL 334
>gi|393231429|gb|EJD39021.1| DUF1692-domain-containing protein [Auricularia delicata TFB-10046
SS5]
Length = 518
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 8/152 (5%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
G C ++G + V KV N H + S H + N+SH I++ +FG
Sbjct: 178 GSACRVFGSMFVKKVTANLHITTAGHGYSSNAHTDHTM------MNLSHIISEFSFGPFM 231
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
P + PLD + + P YQYF+ VVPT Y + +++NQ+SVT + R E GR
Sbjct: 232 PDISQPLDNLFEVAKEPFTAYQYFLTVVPTTYVAPRSYPMRTNQYSVTNYKRVFEHGR-- 289
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
PG+FF +D+ P+++T + +F + +
Sbjct: 290 ATPGIFFKFDIDPMQLTVIQRTTTFTQLIIRI 321
>gi|156065931|ref|XP_001598887.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980]
gi|154691835|gb|EDN91573.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 421
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/161 (37%), Positives = 88/161 (54%), Gaps = 26/161 (16%)
Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
G+ C +YG LEVNKV G+FH A G + + G H+ ++FN SH IN+L+FG
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHITAKGHGYPELGQHL------DHNAFNFSHIINELSFGPF 238
Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHTI--------QSNQFSVT- 218
+P ++NPLD R TP+ YQYF+ +VPT+Y+ ++NQ++VT
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSPSLLRTNQYAVTS 296
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
EH + +PG+FF YD+ P+ +T E FL F
Sbjct: 297 QEHIVGE-----RNVPGIFFKYDIEPLLLTVEESRDGFLRF 332
>gi|315054535|ref|XP_003176642.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
gi|311338488|gb|EFQ97690.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
Length = 399
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 92/192 (47%), Gaps = 30/192 (15%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
K + + C ++G LE NKV GN H A G + + G A S N +H I +L+
Sbjct: 186 KSDVVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
FG H+ ++NPLD T YQY + VVPT+YT SGH
Sbjct: 240 FGPHYGRLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-SGHMDPSRRSLPDSSTITAK 298
Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
T+ +NQ++VT + Q R+ + PG+FF Y++ PI + ++E S L + +
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLGLMIRLVN 357
Query: 265 IVGGVFTVSGII 276
+V GV G +
Sbjct: 358 VVSGVLVTGGWL 369
>gi|448521200|ref|XP_003868450.1| Erv41 protein [Candida orthopsilosis Co 90-125]
gi|380352790|emb|CCG25546.1| Erv41 protein [Candida orthopsilosis]
Length = 352
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/208 (28%), Positives = 102/208 (49%), Gaps = 23/208 (11%)
Query: 85 GWALSNPDL------IDQCKREGF------LQRIKEEEGEGCNIYGFLEVNKVAGNFHFA 132
G++++NP+ +D+ +E L R E C+I+G + VN+V G F
Sbjct: 114 GFSINNPNDFHETPDLDEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGEFRIT 173
Query: 133 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 192
G+ D ++ N SH I + ++G+ FP + NPLD E +Y
Sbjct: 174 ------AKGLGYKDRSFVPVEALNFSHVIQEFSYGDFFPFLNNPLDATGKVTEENLQIYL 227
Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTF 248
Y KVVPT+Y + G + + Q+S+TE+ + + Q +PG++F Y+ PIK+
Sbjct: 228 YHSKVVPTLYEKL-GLEVDTTQYSLTENHHIVKVNPHSKKPQGIPGIYFAYEFEPIKLII 286
Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGII 276
E+ + FL F+ + IVGG+ +G +
Sbjct: 287 REKRIPFLQFIAKLGTIVGGIIVAAGYL 314
>gi|66773206|ref|NP_080631.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
isoform 2 [Mus musculus]
gi|12854944|dbj|BAB30175.1| unnamed protein product [Mus musculus]
Length = 302
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTE 219
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTE 274
>gi|326479518|gb|EGE03528.1| COPII-coated vesicle protein [Trichophyton equinum CBS 127.97]
Length = 399
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 92/192 (47%), Gaps = 30/192 (15%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
K + + C ++G LE NKV GN H A G + + G A S N +H I +L+
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
FG H+ ++NPLD + YQY + VVPT+YT SGH
Sbjct: 240 FGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLPDASTITAK 298
Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
T+ +NQ++VT + Q R+ + PG+FF Y++ PI + ++E S L + +
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLALMVRLVN 357
Query: 265 IVGGVFTVSGII 276
+V GV G +
Sbjct: 358 VVSGVLVTGGWL 369
>gi|148678795|gb|EDL10742.1| ERGIC and golgi 2, isoform CRA_b [Mus musculus]
Length = 310
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 5/110 (4%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+ C I+G L VNKVAGNFH GK+ H H DS+N SH+I+ L+FGE P
Sbjct: 176 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 235
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTE 219
G++NPLDG + M+QYFI VVPT ++T +S T +QFSVTE
Sbjct: 236 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTE 282
>gi|410082748|ref|XP_003958952.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
gi|372465542|emb|CCF59817.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
Length = 354
Score = 94.4 bits (233), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 49/169 (28%), Positives = 90/169 (53%), Gaps = 11/169 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+++G + VN+V G G+ D D N +H IN+L+FG+
Sbjct: 160 EFNGCHVFGSIPVNRVTGELQIT------AKGMGYPDREKAPIDEVNFAHVINELSFGDF 213
Query: 170 FPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
+P + NPLD ++ QE P Y Y + V+PT+Y + G + +NQ+SV+E+ +
Sbjct: 214 YPYIDNPLDNSAKFDQENPISAYVYHMNVIPTIYQKL-GAEVDTNQYSVSEYHYTEADNA 272
Query: 229 LQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
++ +PG+F Y+ P+ + T++ +SF+ F+ + AI+ + ++
Sbjct: 273 IRKAGRVPGIFLKYNFEPLSIVVTDKRLSFIQFVIRLVAILSFIVYIAS 321
>gi|123361353|ref|XP_001295947.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121875215|gb|EAX83017.1| hypothetical protein TVAG_111750 [Trichomonas vaginalis G3]
Length = 338
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 77/291 (26%), Positives = 133/291 (45%), Gaps = 36/291 (12%)
Query: 8 HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD 67
HLD+ LDS G+ D + ++++ ++ L + + C SCY +
Sbjct: 61 HLDI--------LDSIGHKQLLVNDTLKWRRVNQ--EKGFMELYNKKKQCHSCYDF-YDN 109
Query: 68 EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
CCN CE+++E Y + P+ QCK E + K + E C++ G + VN+V G
Sbjct: 110 RFCCNGCEKLKEIYHSNN-KTATPENWTQCKPEN---KQKFDPNEKCHVKGKISVNRVPG 165
Query: 128 NFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET 186
+FH A G+S G H H +L +Q +F+ H I L FG + P +PL G
Sbjct: 166 SFHLAIGQSIEDYG-HQHILLDDYQTITFD--HDIIDLRFGANIPMTSHPLRGTHIKSTG 222
Query: 187 PSGMYQYFIKVVPTVYTDVSGHTIQSNQ-----FSVTEHFRSSEQGRLQTLPGVFFFYDL 241
+Y + + P V+ G I+ +S+T H +PG++F+Y
Sbjct: 223 EPLATEYNLIITPIVFY-ADGQYIEKGFEYVYFYSMTYHL----------VPGIYFYYSF 271
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
+P + T + SF FL + ++ G++ + ++ F+ + KKK+E
Sbjct: 272 TPYTIAVTWQSRSFRSFLISTGGLLSGIYAIFSMVSTFLEKSDQK-KKKVE 321
>gi|225461068|ref|XP_002281649.1| PREDICTED: protein disulfide isomerase-like 5-4 [Vitis vinifera]
gi|297735969|emb|CBI23943.3| unnamed protein product [Vitis vinifera]
Length = 482
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 104/204 (50%), Gaps = 38/204 (18%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC I GF+ V KV GN + +SG H +F N+SH I+ L+FG P
Sbjct: 294 GCRIEGFVRVKKVPGNLVISA-----RSGSH-----SFDPSQMNMSHVISHLSFGRKIAP 343
Query: 172 GVVNPLDGV-------------RWTQETPSG-----MYQYFIKVVPTVYTDVSGHTIQSN 213
V++ + V R PS +++++VV T H +
Sbjct: 344 RVMSDMKRVLPYIGGSHDRLNGRSYISHPSDSNANVTIEHYLQVVKTEVITTRDHKL--- 400
Query: 214 QFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
V E+ ++ +Q+L P F ++LSP++V TE SF HF+TNVCAI+GGVFT
Sbjct: 401 ---VEEYEYTAHSSLVQSLYIPVAKFHFELSPMQVLVTENRKSFWHFITNVCAIIGGVFT 457
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+D+ +++ R + KKIE+GK
Sbjct: 458 VAGILDSVLHNTMR-LMKKIELGK 480
>gi|401416963|ref|XP_003872975.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322489202|emb|CBZ24457.1| conserved hypothetical protein [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 368
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 64/221 (28%), Positives = 105/221 (47%), Gaps = 26/221 (11%)
Query: 70 CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
CC+ CE V + Y++ G + + + QC + + ++ GCN+ G L++ KV
Sbjct: 153 CCDTCESVLDLYKELGKGIPGTEYLPQCLEQLY------QQASGCNVVGSLDLKKVHVTV 206
Query: 130 HFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLDGVR 181
F P ++ F+ + D++ + SH I KL G E F GV PL G +
Sbjct: 207 IFGPRRTGRFYS----LKDVI-----RLDTSHSIRKLRIGDEAVERFSKNGVAEPLSGHK 257
Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF--RSSEQGRLQTLPGVFFFY 239
+T S +Y +KVVPT Y +++ + + + R+ G +P V F +
Sbjct: 258 SFSKTYSET-RYLVKVVPTTYRKTKKRNAKASTYEYSAQWSKRTIVVGFAGAVPAVLFEF 316
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+ +PI+V E F HF+ +C IVGG+F V G ID +
Sbjct: 317 EPAPIQVNNVFERQPFSHFVVQLCGIVGGLFVVLGFIDNVV 357
>gi|302808800|ref|XP_002986094.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
gi|300146242|gb|EFJ12913.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
Length = 475
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 82/283 (28%), Positives = 130/283 (45%), Gaps = 48/283 (16%)
Query: 35 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
G P I + H + EH S YG +D + + EA K L+ D
Sbjct: 217 GFPSIRIFRKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269
Query: 95 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
K G ++R G GC I GF+ KV GN + SG H +F +
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVPGNIIISA-----HSGSH-----SFDASA 315
Query: 155 FNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYFI 195
N++H +++ +FG +P + + D V + + + + +++
Sbjct: 316 MNMTHYVSQFSFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYL 375
Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEEH 252
+VV T + + +FS+ E + +S +Q +P F Y+LSP++V E
Sbjct: 376 QVVKTEVVSLQ----KRKEFSLLEQYDYTSHSNTVQNTNVPVAKFHYELSPMQVLVKENP 431
Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ + HG + KKIE+GK
Sbjct: 432 KSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 473
>gi|444706692|gb|ELW48018.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Tupaia chinensis]
Length = 821
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 5/108 (4%)
Query: 191 YQYFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 248
+ Y +K+VPTVY D SG S Q++V E+ S GR+ +P ++F YDLSPI V +
Sbjct: 716 HDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKY 773
Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
TE F+T +CAI+GG FTV+GI+D+ I+ A KK+++GK
Sbjct: 774 TERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW-KKVQLGKM 820
>gi|189207969|ref|XP_001940318.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187976411|gb|EDU43037.1| conserved hypothetical protein [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 394
Score = 94.0 bits (232), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 59/193 (30%), Positives = 89/193 (46%), Gaps = 37/193 (19%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E + C IYG L+ NKV G+FH A G + + G H+ SFN SH I +++FG
Sbjct: 174 ETDSCRIYGSLDGNKVQGDFHITARGHGYIEFGQHL------DHSSFNFSHIIREMSFGP 227
Query: 169 HFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDV-------------------- 205
++P + NPLD TP +QY++ +VPT+YTD
Sbjct: 228 YYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPSLIPLLELVGSTSNHPGAA 287
Query: 206 ----SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
H I++NQ++VT S + +PG+F +D+ PI + EE F +
Sbjct: 288 SMFHGAHAIKTNQYAVTSQ---SHKVPENYVPGIFVKFDIEPIVLRVVEEWGGFWRLIVT 344
Query: 262 VCAIVGGVFTVSG 274
+ +V GV G
Sbjct: 345 LINVVSGVMVAGG 357
>gi|254572003|ref|XP_002493111.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
gi|238032909|emb|CAY70932.1| Protein localized to COPII-coated vesicles, forms a complex with
Erv46p [Komagataella pastoris GS115]
Length = 333
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 17/189 (8%)
Query: 94 IDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 147
+D+ RE L +E+ + C+I+G + VNKV G FH GK G D
Sbjct: 128 LDEVMRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDR 181
Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
+++ N +H I++ +FGE +P + NPLD T + Y++ VVPT Y + G
Sbjct: 182 SIVPKEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-G 240
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
I + Q+S+T +E L PG+FF Y PI ++ E+ +SF+ FL + I G
Sbjct: 241 IVIDTTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICG 296
Query: 268 GVFTVSGII 276
G+ V+ I
Sbjct: 297 GIMVVAKWI 305
>gi|209877186|ref|XP_002140035.1| hypothetical protein [Cryptosporidium muris RN66]
gi|209555641|gb|EEA05686.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
Length = 384
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 71/249 (28%), Positives = 108/249 (43%), Gaps = 27/249 (10%)
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK---REGFLQRIKEEEG-E 112
CGSCY S CCN C EV +Y++ L +QCK RE + I
Sbjct: 117 CGSCYNP-SKKNHCCNTCSEVIRSYQEDNIKLPQKINFEQCKFDPRERLEKAISAPLNIS 175
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC I + + KV G + + + + + DI + +N S+ + L +G+ PG
Sbjct: 176 GCKIKVDINIPKVKGRIEISHKRWMNYNEMTNLDIS--EAHLYNFSYIVKYLHYGDDLPG 233
Query: 173 VVNPLDGVRWTQETP-------------SGMYQYFIKVVPTVYTDV-SGHTIQSNQFSV- 217
+ N + + Q + +PT + + S T +QFSV
Sbjct: 234 INNIWNNQEYIQTAKFTHNKESDNLFLEDAHLDIDMHCIPTQFNSINSKKTKIGHQFSVR 293
Query: 218 --TEHFRSSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
++ GR +LPG++ YD +P V TE SFL FLT CAI+GG+F
Sbjct: 294 KQSKQVNVLNNGRFVPETSLPGIYINYDFTPFIVKITESRRSFLSFLTECCAIIGGIFAF 353
Query: 273 SGIIDAFIY 281
S +ID F++
Sbjct: 354 SSMIDIFMF 362
>gi|330935325|ref|XP_003304912.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
gi|311318248|gb|EFQ86993.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
Length = 395
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 74/260 (28%), Positives = 113/260 (43%), Gaps = 54/260 (20%)
Query: 44 QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
Q G LE G+ G S E+ + E++ +A+++K S
Sbjct: 124 QWTGRNLERGTHELGTEAGDAPSWEEAWDVREQLGKAHKRK---FSK------------T 168
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKIN 162
RI+ + C IYG L+ NKV G+FH A G + + G H+ SFN SH I
Sbjct: 169 PRIRGNP-DSCRIYGSLDGNKVQGDFHITARGHGYMEFGEHL------DHSSFNFSHIIR 221
Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMY---QYFIKVVPTVYTD----------VS--- 206
+++FG ++P + NPLD TP + QY++ +VPT+YTD VS
Sbjct: 222 EMSFGPYYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPTLIPYLEAVSSTA 281
Query: 207 ------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
I++NQ++VT S + +PGVF +D+ PI + EE
Sbjct: 282 GNHPGAASIFHGARAIKTNQYAVTSQ---SHKVPENYVPGVFVKFDIEPIMLAVVEEWSG 338
Query: 255 FLHFLTNVCAIVGGVFTVSG 274
F + + +V GV G
Sbjct: 339 FWRLIVTLVNVVSGVMVAGG 358
>gi|255563725|ref|XP_002522864.1| thioredoxin domain-containing protein, putative [Ricinus communis]
gi|223537948|gb|EEF39562.1| thioredoxin domain-containing protein, putative [Ricinus communis]
Length = 478
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 76/241 (31%), Positives = 116/241 (48%), Gaps = 40/241 (16%)
Query: 79 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKS 136
E+ K +L P ++ K E Q K GC I G++ V KV GN +
Sbjct: 252 ESLVKTMESLVAPIQLESLKSENATQSTKRPAPLTGGCRIEGYVRVKKVPGNLIISA--- 308
Query: 137 FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVN--------------PLDG-- 179
+SG H +F N+SH I+ L+FG P V+N L+G
Sbjct: 309 --RSGAH-----SFDPSQMNMSHVISHLSFGLKVSPKVMNEAKRLVPYIGGSHDKLNGRS 361
Query: 180 -VRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT---LPG 234
V + ++++++V T V T S S + + E + + L +P
Sbjct: 362 FVNHRDVDANVTIEHYLQIVKTEVVTRRS-----SREHKLLEEYEYTAHSSLVQSVYIPA 416
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
F ++LSP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ ++H R + KK+E+G
Sbjct: 417 AKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGILDSILHHTVR-LMKKVELG 475
Query: 295 K 295
K
Sbjct: 476 K 476
>gi|224117462|ref|XP_002317580.1| predicted protein [Populus trichocarpa]
gi|222860645|gb|EEE98192.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 30/217 (13%)
Query: 98 KREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
K E Q +K GC I G++ V KV GN + SG H +F
Sbjct: 277 KPENATQHVKRPAPSAGGCRIEGYVRVKKVPGNLMISA-----LSGAH-----SFDSKQM 326
Query: 156 NISHKINKLAFG-EHFPGVV--------------NPLDGVRWTQETPSGMYQYFIKVVPT 200
N+SH I+ +FG + P V+ + L+G + G +
Sbjct: 327 NLSHVISHFSFGMKVLPRVMSDVKRLLPYIGRSHDKLNGRSFINHRDVGANVTIEHYLQV 386
Query: 201 VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHF 258
V T+V S + + E+ ++ QT +P F ++LSP++V TE SF HF
Sbjct: 387 VKTEVVTRRSSSERKLIEEYEYTAHSSLSQTVYMPTAKFHFELSPMQVLITENSKSFSHF 446
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+TNVCAI+GGVFTV+GI+D+ ++H R + KK+E+GK
Sbjct: 447 ITNVCAIIGGVFTVAGILDSILHHTVRMM-KKVELGK 482
>gi|195130281|ref|XP_002009580.1| GI15435 [Drosophila mojavensis]
gi|193908030|gb|EDW06897.1| GI15435 [Drosophila mojavensis]
Length = 433
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 104/190 (54%), Gaps = 4/190 (2%)
Query: 103 LQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
LQ+I + E + C ++G L +NKVAG H G H ++ F+R N +H+
Sbjct: 184 LQQISQMESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHR 243
Query: 161 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
IN+L+FG++ +V PL+G + QYFIKVVPT + TI + Q++VTE+
Sbjct: 244 INRLSFGQYSRRIVQPLEGDETIIREEATTVQYFIKVVPTEIRH-TFSTISTFQYAVTEN 302
Query: 221 FRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
R + R PG++F YD S +K+ + + + + F+ +C+I+ G+ +SG ++A
Sbjct: 303 VRKLDAERNSYGSPGIYFKYDWSALKIVVSHDRDNLVTFVIRLCSIISGIIVISGAVNAL 362
Query: 280 IYHGQRAIKK 289
+ QR + +
Sbjct: 363 LVAIQRRLLR 372
>gi|169614774|ref|XP_001800803.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
gi|111060809|gb|EAT81929.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
Length = 404
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/190 (30%), Positives = 88/190 (46%), Gaps = 35/190 (18%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C I+G L+ NKV G+FH A G + + G D +FN SH I +++FG ++
Sbjct: 177 DACRIFGSLDGNKVQGDFHITARGHGYQEFGEQHLD-----HKTFNFSHIIREMSFGPYY 231
Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSG-------------------- 207
P + NPLD T T +QY++ +VPT+YTD G
Sbjct: 232 PSLTNPLDNTIATTPTDQDHFYKFQYYLSIVPTIYTDNPGLLPLLESVNRDPSAHPAKSI 291
Query: 208 ---HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
H I++NQ++VT + + +PGVF +D+ PI + EE F L +
Sbjct: 292 FSTHAIKTNQYAVTSQSHTVPE---NYVPGVFVKFDIEPIMLAVVEEWGGFWRLLVRIVN 348
Query: 265 IVGGVFTVSG 274
+V GV G
Sbjct: 349 VVSGVMVAGG 358
>gi|393221326|gb|EJD06811.1| DUF1692-domain-containing protein [Fomitiporia mediterranea MF3/22]
Length = 537
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 77/154 (50%), Gaps = 12/154 (7%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ +G C +YG ++ KV N H ++S HV N+SH I +FG
Sbjct: 172 KPDGGACRVYGSIQAKKVTANLHITTAGHGYRSMHHV------DHSQMNLSHVITDFSFG 225
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSE 225
+FP + PL P YQYF+ VVPT Y +G + ++Q+SVT + R E
Sbjct: 226 PYFPDMAQPLKNTFELTHEPFIAYQYFLSVVPTTYIASNGKQVHTSQYSVTHYTRVLQHE 285
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
QG PG+FF YDL P+++T ++ + + FL
Sbjct: 286 QG----TPGIFFKYDLEPLQMTIHQKTTTLVQFL 315
>gi|327307836|ref|XP_003238609.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
gi|326458865|gb|EGD84318.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
Length = 399
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 73/260 (28%), Positives = 112/260 (43%), Gaps = 52/260 (20%)
Query: 44 QRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRKK---GWALSNPDLIDQCK 98
+R GG E+ + E +ED + EVR + +KK L D +D C+
Sbjct: 135 RRSGGSPEYQTLNKEDTFRLEEQEEDLHVEHVLGEVRRSRKKKFPKAPKLKRSDAVDSCR 194
Query: 99 REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
++G LE NKV GN H A G + + G + S N
Sbjct: 195 -----------------VFGSLEGNKVQGNLHITARGFGYFEWGRTTNP------HSLNF 231
Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------- 208
+H I +L+FG H+ ++NPLD + YQY + VVPT+YT SGH
Sbjct: 232 THLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLP 290
Query: 209 ------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
T+ +NQ++VT + Q R+ PG+FF Y++ PI + ++E S L
Sbjct: 291 DASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIEPILLIVSQEWDSLL 349
Query: 257 HFLTNVCAIVGGVFTVSGII 276
+ + +V GV G +
Sbjct: 350 ALMVRLVNVVSGVLVTGGWL 369
>gi|45190741|ref|NP_984995.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|44983720|gb|AAS52819.1| AER136Wp [Ashbya gossypii ATCC 10895]
gi|374108218|gb|AEY97125.1| FAER136Wp [Ashbya gossypii FDAG1]
Length = 340
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/169 (31%), Positives = 89/169 (52%), Gaps = 10/169 (5%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
+E E GC+IYG + VN+V G H P + S V D N++H N+ +F
Sbjct: 147 EEFEFNGCHIYGSIPVNRVKGELHITPKGWRYSSRQRV------PHDEINLTHIFNEFSF 200
Query: 167 GEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
GE FP + N LD V R+ Q+ + + YF+ V+PT+Y + G + +NQ+SV+ + +
Sbjct: 201 GEFFPYIDNTLDQVGRYAQQRLTR-FHYFVSVLPTIYRKM-GAVVDTNQYSVSHNDITYT 258
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
RL T PG+F Y+ + V ++ +SF FL + ++ + ++
Sbjct: 259 SSRLYT-PGIFILYNFEALTVVVQDKRISFWAFLIRLVTMLSFIVYIAA 306
>gi|343473351|emb|CCD14737.1| hypothetical protein, unlikely [Trypanosoma congolense IL3000]
Length = 141
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 17/134 (12%)
Query: 179 GVRWTQETPSGMYQYFIKVVPTVY---TDVS-GHTIQSNQFSVTEHFRSS---------- 224
GV E G + YF+KVVPT+Y T +S G ++SNQ+SVT HF +S
Sbjct: 6 GVENPSEDLIGRFAYFVKVVPTLYQVRTLMSLGRVVESNQYSVTHHFTASWDAADQNNQT 65
Query: 225 -EQGRLQTLPGVFFFYDLSPIKVTFTEEHV--SFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
+ +PGVF YD+SPI+V+ H S +H + +CA+ GGV+TV G+ID+ +
Sbjct: 66 NRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVMGLIDSMFF 125
Query: 282 HGQRAIKKKIEIGK 295
H R +++KI GK
Sbjct: 126 HSIRRVQEKINRGK 139
>gi|302675040|ref|XP_003027204.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
gi|300100890|gb|EFI92301.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
Length = 528
Score = 91.7 bits (226), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 10/153 (6%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
E G C ++G LEV KV N H + S H H ++ N++H I++ +F
Sbjct: 168 EPHGSACRVWGSLEVKKVTANLHITTAGHGYASREHADHKVM-------NLTHVISEFSF 220
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G HFP +V PLD + P YQY++ VVPT Y + +NQ+SVT + + E
Sbjct: 221 GPHFPDIVQPLDYTFEVAKDPFVAYQYYLHVVPTTYIAPRSAPLSTNQYSVTHYKKVFEH 280
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
Q PG+FF +D+ P+ + + SF
Sbjct: 281 N--QATPGIFFKFDIDPLAIQIHQRTTSFARLF 311
>gi|50303625|ref|XP_451754.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|49640886|emb|CAH02147.1| KLLA0B04950p [Kluyveromyces lactis]
Length = 341
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 52/165 (31%), Positives = 86/165 (52%), Gaps = 11/165 (6%)
Query: 113 GCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
GC+I+G + VNKV G H A G + + A +D N +H IN+L+FG+ +P
Sbjct: 153 GCHIFGSVPVNKVKGELHITAHGWGYRSAS-------AIPKDQINFNHVINELSFGDFYP 205
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
+ NPLD + Y YF +VPT+Y + G + +NQ++++E E +
Sbjct: 206 YIDNPLDNTAKFSDEKIKAYYYFTSIVPTLYKKM-GAEVDTNQYALSET-EYGESSKATG 263
Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG-VFTVSGI 275
+PG+F Y P+K+ ++ + F F+ + AI+ V+T S I
Sbjct: 264 VPGIFIRYQFEPMKIIISDMRIGFFQFIIRLVAILSFIVYTASWI 308
>gi|50293697|ref|XP_449260.1| hypothetical protein [Candida glabrata CBS 138]
gi|49528573|emb|CAG62234.1| unnamed protein product [Candida glabrata]
Length = 352
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/174 (31%), Positives = 91/174 (52%), Gaps = 25/174 (14%)
Query: 113 GCNIYGFLEVNKVAGNFHFA------PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
GC+I+G + VN+V G PGK ++ + +H IN+L+F
Sbjct: 162 GCHIFGSVPVNRVKGELQITASGYGYPGKRA-------------PKEEIDFAHAINELSF 208
Query: 167 GEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FR 222
G+ +P + NPLD R+ +E P Y Y+I VPT+Y + G I++ Q+SV ++
Sbjct: 209 GDFYPYIDNPLDKTARFDKEHPLSAYMYYISAVPTMYKKL-GVEIETFQYSVNDYKYSMT 267
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGI 275
++ ++ +PG+FF Y P+ + T+ +SFL F+ + AI+ +F VS I
Sbjct: 268 DADPATVRKIPGIFFRYGFEPLSIEITDVRISFLQFIVRLVAILSFFMFVVSWI 321
>gi|384244593|gb|EIE18093.1| protein disulfide isomerase [Coccomyxa subellipsoidea C-169]
Length = 479
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 56/213 (26%)
Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE- 168
GC + GF+ V KV G HF +PG SF + N+SH +N L FG
Sbjct: 291 GCALSGFVLVKKVPGALHFLAKSPGHSF-------------DYQAMNMSHVVNYLYFGNK 337
Query: 169 ------------HFPGV----VNPLDGVRWTQETPSGMYQYFIKVV----------PTVY 202
H G+ + L G + ++++++VV P +
Sbjct: 338 PSPRRHQSLAKLHPAGLSDDWADKLAGQDFFSRAAKATFEHYMQVVLTTIEPSKHRPELS 397
Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
D +T+ S+ + + +P F YDLSPI++ +E+ ++ HF+T
Sbjct: 398 YDAYEYTVHSHTYDTAD------------IPAAKFTYDLSPIQILVSEKRRAWYHFVTTT 445
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
CAI+GGVFTV+GI+D ++ G R KK+E+GK
Sbjct: 446 CAIIGGVFTVAGIVDGLVHTGAR-FAKKVELGK 477
>gi|255082155|ref|XP_002508296.1| predicted protein [Micromonas sp. RCC299]
gi|226523572|gb|ACO69554.1| predicted protein [Micromonas sp. RCC299]
Length = 507
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 70/257 (27%), Positives = 118/257 (45%), Gaps = 37/257 (14%)
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
+ Y + + E EE+ A++ A + D ++ Q +K+ +G GC++ G
Sbjct: 268 TSYHGDRTVEAITTFAEELLPAWK----ATDHKDTELAIRQPVETQTVKKIDGPGCSVTG 323
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVNPL 177
F+ V KV G+ H +F +S N+SH ++ FG+ P L
Sbjct: 324 FVLVKKVPGHLWVTATSKSH----------SFHAESMNMSHVVHHFYFGQQLTPQRKRYL 373
Query: 178 DGVRWTQETPSG------------------MYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
D ++ P G ++++++ V T SG N + T+
Sbjct: 374 DRFHSREKDPKGDWHDKLAGGTFTSEEDNVTHEHYLQTVLTTIKP-SGSPAPFNVYEYTQ 432
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
H S + LP F +D SP++++ +EE F HF+T + AIVGGV++V GI D F
Sbjct: 433 HSHSLRSEK--ELPRAKFHFDPSPVQISVSEERQKFYHFITTLMAIVGGVYSVMGIADGF 490
Query: 280 IYHGQRAIKKKIEIGKF 296
+++ +A KKK E+GKF
Sbjct: 491 VHNSIQAWKKK-ELGKF 506
>gi|449468488|ref|XP_004151953.1| PREDICTED: protein disulfide-isomerase 5-4-like [Cucumis sativus]
Length = 481
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 113/223 (50%), Gaps = 37/223 (16%)
Query: 93 LIDQCKRE-GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 151
L D+ E G ++R G GC I G++ V KV G+ A H +F
Sbjct: 274 LEDKSNNETGNVKRPAPSAG-GCRIEGYVRVKKVPGSLVIAARSESH----------SFD 322
Query: 152 RDSFNISHKINKLAFGEH--------------FPGVV-NPLDGVRWTQETPSG---MYQY 193
N+SH I+ L+FG + G+ + L+G + + G ++
Sbjct: 323 ASQMNMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEH 382
Query: 194 FIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 252
++++V T V T SG ++ ++ T H S+ +P V F + LSP++V TE
Sbjct: 383 YLQIVKTEVLTRRSGKLLE--EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQ 437
Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+DA +++ R + KK+E+GK
Sbjct: 438 KSFSHFITNVCAIIGGVFTVAGILDALLHNTIR-LMKKVELGK 479
>gi|356517290|ref|XP_003527321.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC + G++ V KV GN + H +F N+SH IN L+FG+ P
Sbjct: 293 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHFINNLSFGKKVTP 342
Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ + L+G +T G +++I++V T +G+ + +
Sbjct: 343 RAMSDVKLLIPYIGSSHDRLNGRSFTNTHDLGANVTIEHYIQIVKTEVVTRNGYKLI-EE 401
Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 402 YEYTAH---SSVAHSVDIPAAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 458
Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
I+D+ +++ R + KK+E+GK
Sbjct: 459 ILDSILHNTIRMM-KKVELGK 478
>gi|356549839|ref|XP_003543298.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 65/204 (31%), Positives = 101/204 (49%), Gaps = 39/204 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC I G++ V KV GN F+ + H +F N+SH IN L+FG P
Sbjct: 293 GCRIDGYVRVKKVPGNLIFSARSNAH----------SFDASQMNMSHVINHLSFGRKVSP 342
Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
V+ + L+G + G ++++++V T I
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTMEHYLQIVKT-------EVITRKD 395
Query: 215 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
+ + E + + + Q+L P F +LSP++V TE SF HF+TNVCAIVGG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIVGGIFT 455
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+DA +++ R + KK+E+GK
Sbjct: 456 VAGIMDAILHNTIR-LMKKVELGK 478
>gi|296086862|emb|CBI33029.3| unnamed protein product [Vitis vinifera]
Length = 139
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 40/66 (60%), Positives = 52/66 (78%)
Query: 20 LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVRE 79
+D+ GN + +QD IG P+I+K LQRHGGRLE N YCGSCYGAE +D+DC N+C+E RE
Sbjct: 73 IDAHGNEVAVKQDEIGGPQIEKLLQRHGGRLERNGKYCGSCYGAEVTDDDCGNSCDEDRE 132
Query: 80 AYRKKG 85
Y+K+G
Sbjct: 133 TYKKRG 138
>gi|440798302|gb|ELR19370.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
Length = 328
Score = 90.9 bits (224), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 38/201 (18%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFA------------------------PGKSFHQSGVH 143
+ E GC+I G++ V KV GNFH + + F+ SGV
Sbjct: 116 DSELSGCSIAGYINVPKVPGNFHLSTHGRNVQAQDIDMQHNINSFFFTDSPRVFYPSGVS 175
Query: 144 VHDILAFQRDSFNISHKINKLA----FGEHFPGVVNPLDGV-RWTQETPSGM---YQYFI 195
V A++ N+ ++N A + G+ PLDG+ + + +G+ Y+Y+I
Sbjct: 176 VP---AWRNWHSNVVAELNAQARDQDTDDDVVGLFRPLDGITKANSQRKNGVGVSYEYYI 232
Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 255
++VPT+ G T + QF+ + ++ +G+ P V+F YD+SPI V T S
Sbjct: 233 QIVPTILEFPDGRTKHTYQFTYNFNDVATPEGKT---PSVYFKYDISPITVKITRGRGSL 289
Query: 256 LHFLTNVCAIVGGVFTVSGII 276
HFL +CAIVGG+FTVSG+I
Sbjct: 290 GHFLLQLCAIVGGIFTVSGLI 310
>gi|119928709|ref|XP_001256294.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 1-like [Bos taurus]
Length = 144
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 48/106 (45%), Positives = 67/106 (63%), Gaps = 5/106 (4%)
Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 250
Y +K+VPTVY D SG S Q++V E+ S GR+ +P ++F YDLSPI V +TE
Sbjct: 41 YILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTE 98
Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
F+T +CAI+GG FTV+GI+D+ I+ A KKI++GK
Sbjct: 99 RRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA-WKKIQLGKM 143
>gi|449489976|ref|XP_004158474.1| PREDICTED: protein disulfide-isomerase 5-3-like [Cucumis sativus]
Length = 224
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 104/202 (51%), Gaps = 35/202 (17%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC I G++ V KV G+ A H +F N+SH I+ L+FG
Sbjct: 37 GCRIEGYVRVKKVPGSLVIAARSESH----------SFDASQMNMSHIISHLSFGRKISP 86
Query: 170 -----------FPGVV-NPLDGVRWTQETPSG---MYQYFIKVVPT-VYTDVSGHTIQSN 213
+ G+ + L+G + + G ++++++V T V T SG ++
Sbjct: 87 KAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQIVKTEVLTRRSGKLLE-- 144
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
++ T H S+ +P V F + LSP++V TE SF HF+TNVCAI+GGVFTV+
Sbjct: 145 EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQKSFSHFITNVCAIIGGVFTVA 201
Query: 274 GIIDAFIYHGQRAIKKKIEIGK 295
GI+DA +++ R + KK+E+GK
Sbjct: 202 GILDALLHNTIR-LMKKVELGK 222
>gi|195042004|ref|XP_001991346.1| GH12601 [Drosophila grimshawi]
gi|193901104|gb|EDV99970.1| GH12601 [Drosophila grimshawi]
Length = 434
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 85/157 (54%), Gaps = 2/157 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ F+R N +H+IN+L+FG
Sbjct: 190 EAKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFDDHWMIEFRRMPANFTHRINRLSFG 249
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G T + QYFIKVVPT + T+ + Q++VTE+ R +
Sbjct: 250 QYSRRIVQPLEGDETTITEEATTVQYFIKVVPTEIQQ-TFSTVSTFQYAVTENVRKLDSE 308
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
R PG++F YD S +KV + + FL F+ +C
Sbjct: 309 RNSYGSPGIYFKYDWSALKVVISHDRDYFLTFVIRLC 345
>gi|354545468|emb|CCE42196.1| hypothetical protein CPAR2_807450 [Candida parapsilosis]
Length = 351
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 92/197 (46%), Gaps = 18/197 (9%)
Query: 90 NPDLIDQCKREGF------LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
PDL D+ +E L R E C+I+G + VN+V G+F G
Sbjct: 126 TPDL-DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGDFRIT------AKGFG 178
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
D ++ N SH I + ++G+ +P + NPLD E Y Y KVVPT+Y
Sbjct: 179 YRDRSFVPLEALNFSHVIQEFSYGDFYPFLNNPLDATGKVTEENLQTYLYHAKVVPTLYE 238
Query: 204 DVSGHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+ G + + Q+S+TE+ R Q + G++F Y+ PIK+ E+ + FL F+
Sbjct: 239 KL-GLEVDTTQYSLTENHHVVKVDPHSKRPQEISGIYFAYEFEPIKLIIREKRIPFLQFI 297
Query: 260 TNVCAIVGGVFTVSGII 276
+ I GGV +G +
Sbjct: 298 AKLGTIAGGVVVAAGYL 314
>gi|168012320|ref|XP_001758850.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162689987|gb|EDQ76356.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 487
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 100/202 (49%), Gaps = 35/202 (17%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----- 167
GC + GF+ V KV G + SG H +F S N++H + +FG
Sbjct: 300 GCRVEGFVRVKKVPGELMISA-----HSGSH-----SFDATSMNMTHYVGFFSFGRKTSW 349
Query: 168 -------EHFPGV---VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS----N 213
E P + ++ L G + E + + ++++VV T ++ H Q
Sbjct: 350 RSVHWVNEMLPALDSNIDRLTGQVFPSEYENITHDHYLQVVKTEV--ITLHRKQDLRVLE 407
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q+ T H S + +P V F Y+LSP++V E SF HFLTN+CAI+GGVFTV+
Sbjct: 408 QYDYTAH---SNMIQSTKVPVVKFHYELSPMQVLVKENPKSFSHFLTNLCAIIGGVFTVA 464
Query: 274 GIIDAFIYHGQRAIKKKIEIGK 295
GIID+ + H I KK+E+GK
Sbjct: 465 GIIDSML-HNAMHIMKKVELGK 485
>gi|440293957|gb|ELP87004.1| hypothetical protein EIN_318630 [Entamoeba invadens IP1]
Length = 316
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 92/188 (48%), Gaps = 22/188 (11%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQ--------------SGVHVHDILAFQRDSFNIS 158
GC ++G ++V++V+G FH A GK ++ + +H H + SFN +
Sbjct: 117 GCRMHGTMKVSRVSGEFHVAFGKIAYRQQRTNQVITATQKHTQMHTHQFTMQEMKSFNPT 176
Query: 159 HKINKLAFGEHFPGVVN-----PLDGVRWTQE-TPSGMYQYFIKVVPTVYTDVSGHTIQS 212
H IN LAF P PL+G +T + + Y Y+I V+PT+ HT +S
Sbjct: 177 HFINNLAFSNT-PSYTTHAGETPLNGKEYTLKGYDNARYTYYINVIPTL-NKYPTHTTRS 234
Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
Q S+ E F G T PGVFF Y+LSP V SF H + + AI+GGV+ +
Sbjct: 235 YQLSINERFVPVTYGPTFTQPGVFFKYELSPYIVINEMMDHSFAHSIASTAAIIGGVWII 294
Query: 273 SGIIDAFI 280
G I F+
Sbjct: 295 FGWISRFL 302
>gi|226479782|emb|CAX73187.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Schistosoma japonicum]
Length = 410
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/161 (31%), Positives = 80/161 (49%), Gaps = 3/161 (1%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDSFNISHKINKLAFGEHF 170
+ C I G L V KV GN H GK G +H+H + + N SH+IN +FG+
Sbjct: 182 DACRIVGTLFVKKVEGNIHILLGKPLEGLGNLHLHVAPFLSKTNLNFSHRINHFSFGDLV 241
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
G ++PL+ + S +QYF+ +VPT + H ++ Q++ T R+ +
Sbjct: 242 NGQIHPLEAIESITAVASTSFQYFVTMVPTKVVN-QFHVTETYQYAATVQNRTIDHASDS 300
Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+PG+FF YD P+ V T + F T + A+ GG+F
Sbjct: 301 HGIPGIFFIYDTFPLVVKITYDRELLGTFFTRLAALAGGIF 341
>gi|344250048|gb|EGW06152.1| UPF0474 protein C5orf41-like [Cricetulus griseus]
Length = 745
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 87/186 (46%), Gaps = 25/186 (13%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
+I G GC G +NKV GNFH V H A Q + +++H I+KL
Sbjct: 125 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 172
Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 173 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 232
Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
E+ S GR+ +P ++F YDLSPI V +TE F+T A VF +G+
Sbjct: 233 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTREAAEWFVFWGTGM-- 288
Query: 278 AFIYHG 283
YHG
Sbjct: 289 --AYHG 292
>gi|67482091|ref|XP_656395.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
gi|56473591|gb|EAL51010.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
gi|449705171|gb|EMD45274.1| Hypothetical protein EHI5A_018710 [Entamoeba histolytica KU27]
Length = 315
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 97/200 (48%), Gaps = 20/200 (10%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
GC +YG ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMYGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
H IN L+F V PL+G ++T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKKFTLSGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q SV E G T PGVFF Y+LSP V SF H L +V AI+GGV +
Sbjct: 235 QLSVNERDVPVTYGASFTQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294
Query: 274 GIIDAFIYHGQRAIKKKIEI 293
G++ + +E+
Sbjct: 295 GLLSRLFDSKHELVTSVVEM 314
>gi|356543934|ref|XP_003540413.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 480
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 39/204 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC I G++ V KV GN + + H +F N+SH IN L+FG
Sbjct: 293 GCRIDGYVRVKKVPGNLIISARSNAH----------SFDASQMNMSHVINHLSFGRKVSL 342
Query: 173 VV---------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
V + L+G + G ++++++V T I +
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTIEHYLQIVKT-------EVITRKE 395
Query: 215 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
+ + E + + + Q+L P F +LSP++V TE SF HF+TNVCAI+GG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIIGGIFT 455
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+DA I+H + KK+E+GK
Sbjct: 456 VAGIMDA-IFHNTIRLMKKVELGK 478
>gi|157865526|ref|XP_001681470.1| conserved hypothetical protein [Leishmania major strain Friedlin]
gi|68124767|emb|CAJ02321.1| conserved hypothetical protein [Leishmania major strain Friedlin]
Length = 365
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 69/226 (30%), Positives = 103/226 (45%), Gaps = 26/226 (11%)
Query: 65 SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
++ CC+ CE V Y++ G + + I QC E QR GC + G L++ K
Sbjct: 148 AAASKCCDTCESVLGLYKELGRGVPGTEYIPQC-LEQLYQR-----ASGCAVMGSLDLKK 201
Query: 125 VAGNFHFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNP 176
V F P ++ F+ + D++ + SH I KL G E F GV
Sbjct: 202 VPVTVIFGPRRTGQFYS----LKDVI-----RLDTSHFIRKLRIGDETVERFSKNGVAER 252
Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
L G + + +T S +Y +KVVPT Y T + ++S R+ G +P
Sbjct: 253 LSGHKSSSKTYSET-RYLVKVVPTTYRKTKTKNAKASTYEYSAQWSRRTILVGFAGAVPA 311
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
V F ++ +PI+V E F HFL +C IVGG+F V G ID +
Sbjct: 312 VLFEFEPAPIQVNNVFERQPFSHFLVQLCGIVGGLFVVLGFIDNVV 357
>gi|302800507|ref|XP_002982011.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
gi|300150453|gb|EFJ17104.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
Length = 476
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 129/284 (45%), Gaps = 49/284 (17%)
Query: 35 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
G P I + H + EH S YG +D + + EA K L+ D
Sbjct: 217 GFPSIRIFHKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269
Query: 95 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA-GNFHFAPGKSFHQSGVHVHDILAFQRD 153
K G ++R G GC I GF+ KV GN + SG H +F
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVVPGNIIISA-----HSGSH-----SFDAS 315
Query: 154 SFNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYF 194
+ N++H +++ FG +P + + D V + + + + ++
Sbjct: 316 AMNMTHYVSQFTFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHY 375
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEE 251
++VV T + + +FS+ E + +S +Q +P F Y+LSP++V E
Sbjct: 376 LQVVKTEVVSLR----KRKEFSLLEQYDYTSHSNTIQNTNVPVAKFHYELSPMQVLVKEN 431
Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ + HG + KKIE+GK
Sbjct: 432 PKSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 474
>gi|224000371|ref|XP_002289858.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975066|gb|EED93395.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 338
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 103/220 (46%), Gaps = 22/220 (10%)
Query: 91 PDLIDQCKR--EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
P +ID K GF + + +GC + G ++V +V G + + IL
Sbjct: 123 PTVIDYKKAAVSGF-KDVNTARRQGCTLVGTIKVPRVGGTMSISVSPEAWRRAT---SIL 178
Query: 149 AF------QRDSF-----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFI 195
+F +D F N++H ++ + FG+ FP NPL GV + SG + +
Sbjct: 179 SFGVDLGKDQDMFHGKLPNVTHYVHDITFGDPFPPGSNPLKGVHHVMDNGSGVALANVAV 238
Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ---GRLQTLPGVFFFYDLSPIKVTFTEEH 252
K+VPT Y ++ Q SV+ H E R LPG+ YD +P+ V E
Sbjct: 239 KLVPTTYKRTIYSAKETYQASVSRHIVQPETLAAQRSTLLPGLMLTYDFTPLAVRHVESR 298
Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
++L FL+++ IVGGVF G++ + + +A+ KK++
Sbjct: 299 ENWLVFLSSLVGIVGGVFVTVGLVSGCLVNSAQAVAKKMD 338
>gi|302659461|ref|XP_003021421.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
gi|291185318|gb|EFE40803.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
Length = 427
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 93/214 (43%), Gaps = 46/214 (21%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS---------------FHQSGVH 143
K + + C ++G LE NKV GN H F G++ H +
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSMSLLQPIITCIHGDAKN 245
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
+ D L N +H I +L+FG H+ ++NPLD + YQY + VVPT+YT
Sbjct: 246 LTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYT 305
Query: 204 DVSGH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
SGH T+ +NQ++VT + Q R+ PG+FF Y++
Sbjct: 306 K-SGHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIE 363
Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
PI + ++E S L + + +V GV G +
Sbjct: 364 PILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397
>gi|302508773|ref|XP_003016347.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
gi|291179916|gb|EFE35702.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
Length = 427
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 60/214 (28%), Positives = 93/214 (43%), Gaps = 46/214 (21%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS---------------FHQSGVH 143
K + + C ++G LE NKV GN H F G++ H +
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSMSLLQPIITCIHGDAKN 245
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
+ D L N +H I +L+FG H+ ++NPLD + YQY + VVPT+YT
Sbjct: 246 LTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYT 305
Query: 204 DVSGH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
SGH T+ +NQ++VT + Q R+ PG+FF Y++
Sbjct: 306 K-SGHIDPNRRSLPDTSTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIE 363
Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
PI + ++E S L + + +V GV G +
Sbjct: 364 PILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397
>gi|356545151|ref|XP_003541008.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
Length = 453
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 33/201 (16%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC + G++ V KV GN + H +F N+SH IN L+FG+ P
Sbjct: 266 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHVINNLSFGKKVTP 315
Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ + L+G + G +++I++V T G+ + +
Sbjct: 316 RAMSDVKLLIPYIGSSHDRLNGRSFINTRDLGANVTIEHYIQIVKTEVVTRKGYKLI-EE 374
Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
+ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVFTV+G
Sbjct: 375 YEYTAH---SSVAHSLDIPVAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 431
Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
I+D+ +++ R + KKIE+GK
Sbjct: 432 ILDSILHNTIRMV-KKIELGK 451
>gi|324499844|gb|ADY39943.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Ascaris suum]
Length = 429
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 89/175 (50%), Gaps = 8/175 (4%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
+++EG C ++G + VNKV G+ GK G+ H + ++ NISH+I +L
Sbjct: 217 QKDEGTACRVHGRVRVNKVKGDSVIITAGKGAGIDGLFAH--VDGASNAGNISHRIARLH 274
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTE-HFR 222
FG G++ PL G E+ Y+YF+KVVPT ++ G + Q+SVT+ H R
Sbjct: 275 FGPWIGGLLTPLAGTEQISESGIDEYRYFLKVVPTRIFHSGFFGGSTMRYQYSVTKTHKR 334
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
S GR P + Y+ + + V E S +C++VGGVF S I++
Sbjct: 335 PS--GREHMHPAIAIHYEFAALVVEVRETQTSLFQLFVRLCSVVGGVFATSSILN 387
>gi|224126339|ref|XP_002319814.1| predicted protein [Populus trichocarpa]
gi|222858190|gb|EEE95737.1| predicted protein [Populus trichocarpa]
Length = 484
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 97/200 (48%), Gaps = 28/200 (14%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP 171
GC I G++ V KV GN + +SG H +F N+SH I+ +FG + P
Sbjct: 294 GCRIEGYVRVKKVPGNLVISA-----RSGAH-----SFDSAQMNLSHVISHFSFGMKVLP 343
Query: 172 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
V+ + L+G + G + V T+V + +
Sbjct: 344 RVMSDVKRLIPHIGRSHDKLNGRSFINHRDVGANVTIEHYLQVVKTEVVTRRSSAEHKLI 403
Query: 218 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
E+ ++ QT +P F ++LSP++V TE SF HF+TNVCAI+GGVFTV+GI
Sbjct: 404 EEYEYTAHSSLAQTVYMPTAKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGI 463
Query: 276 IDAFIYHGQRAIKKKIEIGK 295
+D+ I H + KK+E+GK
Sbjct: 464 LDS-ILHNTFRMMKKVELGK 482
>gi|403371798|gb|EJY85783.1| hypothetical protein OXYTRI_16231 [Oxytricha trifallax]
Length = 333
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 40/209 (19%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK------SFHQSGVHVHDILAFQRDSFNISHKINKLA 165
EGC+IYG + +N+V GNFH + Q G H F+ S+KI+ ++
Sbjct: 142 EGCHIYGNILINRVPGNFHISTHAFNDILMGLMQEGHH-----------FDFSYKIDHIS 190
Query: 166 FGE--HFPGV---------VNPLDG-----VRWTQETPSGMY-QYFIKVVPTVYTDVSGH 208
FG+ +F + ++PLDG R + P + +++ VP+ + DVSG
Sbjct: 191 FGKRNNFDMIRRKFRDHQLISPLDGKSETAPRDNKNFPKSLEGNFYLIAVPSYFKDVSGG 250
Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
Q Q + +H + F Y+LSPI V F+++ S FL ++CAI+GG
Sbjct: 251 VYQVYQLTANDHTNFGTGNNILK-----FNYELSPITVGFSQDRESIALFLVHICAIIGG 305
Query: 269 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
VFT IIDA I+ + KK IGK S
Sbjct: 306 VFTAVSIIDAIIHKSFSLLFKK-RIGKLS 333
>gi|297830752|ref|XP_002883258.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
gi|297329098|gb|EFH59517.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
lyrata]
Length = 483
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 106/204 (51%), Gaps = 36/204 (17%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC + G++ V KV GN + SG H +F N+SH ++ L+FG P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHLSFGRMISP 342
Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPT-VYTDVSG--HTIQ 211
++ + LDG + + G ++++++V T V T SG H++
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQIVKTEVITRRSGQEHSLI 402
Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
++ T H S + LP F ++LSP+++ TE SF HF+TN+CAI+GGVFT
Sbjct: 403 -EEYEYTAH---SSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFT 458
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+D+ I+H + KK+E+GK
Sbjct: 459 VAGILDS-IFHNTVRLIKKVELGK 481
>gi|365759132|gb|EHN00939.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
gi|401842937|gb|EJT44934.1| ERV41-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 285
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 47/157 (29%), Positives = 87/157 (55%), Gaps = 11/157 (7%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC+I+G + VN+V+G K F + H + + N +H IN+ +FG+ +P
Sbjct: 93 GCHIFGSVPVNRVSGVLQIT-AKGFGYADSHRASL-----EDLNFAHVINEFSFGDFYPY 146
Query: 173 VVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ- 230
+ NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++ ++ ++
Sbjct: 147 IDNPLDNTAQFDQDEPLTTYLYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLNKDSSVKG 205
Query: 231 --TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 206 NRRVPGIFFKYNFEPLSIVVSDVRISFIQFLVRLVAI 242
>gi|431918151|gb|ELK17379.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
[Pteropus alecto]
Length = 313
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 79/166 (47%), Gaps = 21/166 (12%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+I G GC G +NKV GNFH V H A Q + +++H I+K
Sbjct: 135 MKIPLNGGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 182
Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
L+FG+ G N L G P + Y +K+VPTVY D SG S Q++V
Sbjct: 183 LSFGDTLQVRNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 242
Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
E+ S GR+ +P ++F YDLSPI V +TE F+T V
Sbjct: 243 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTV 286
>gi|444732203|gb|ELW72509.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Tupaia chinensis]
Length = 250
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/171 (37%), Positives = 87/171 (50%), Gaps = 8/171 (4%)
Query: 92 DLIDQCKR-EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 150
DL Q K + LQ I+ E ++ + + G P H G H H
Sbjct: 63 DLSPQQKEWQRMLQVIQSRLQEEHSLQDVIFKSAFKGTTALPPRAIPHPRG-HAHLAALV 121
Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGH 208
DS+N SH+I+ L+FGE PG++NPLDG + M+QYFI VVPT ++T +S
Sbjct: 122 NHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAD 181
Query: 209 TIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
T +QFSVTE R + + G+F YDLS + VT TEEH+ F F
Sbjct: 182 T---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQF 229
>gi|367012766|ref|XP_003680883.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
gi|359748543|emb|CCE91672.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
Length = 348
Score = 87.8 bits (216), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 52/167 (31%), Positives = 86/167 (51%), Gaps = 12/167 (7%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
+GC+IYG + VN+VAG G D N SH IN+ ++G+ FP
Sbjct: 156 DGCHIYGSVPVNRVAGELQIT------AKGWGYQDFEKAPVSEINFSHVINEFSYGDFFP 209
Query: 172 GVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQG 227
+ NPLD M Y Y +VPTVY + G + +NQ++V+E +S+++G
Sbjct: 210 YIDNPLDNTAKISIVDRLMGYLYDTSIVPTVYEKL-GAYVDTNQYAVSERQFDQKSTKRG 268
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
T+PG+FF YD P+ ++ + +SF+ F+ + A++ V ++
Sbjct: 269 S-TTVPGIFFRYDFEPLSISIKDRRLSFIQFIIRLVALLSFVVYIAS 314
>gi|325185550|emb|CCA20033.1| thioredoxinlike protein putative [Albugo laibachii Nc14]
Length = 503
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 12/193 (6%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK---SFHQSGVHVHDI---LAFQRDSFNISHKINKLA 165
EGC + G L VN+V F SF G++V + L+F + + S K +L+
Sbjct: 316 EGCEVSGSLNVNRVPSRLVFTARSKDLSFDLRGINVTHVVHHLSFGQVTRKQSTKSTQLS 375
Query: 166 FG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+HFP LDG + E + ++F+ V+ + + + + + RS+
Sbjct: 376 MSFDHFP-----LDGKTFRTENENITVEHFLSVIGVDHMEAKSKHMGLVERTYQIVARSN 430
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ LP F +D+SP+ + + + F FLT++CAIVGG+ T+ G +DA YH
Sbjct: 431 QYNATDMLPAALFTFDISPLVIQMSSDSTPFYRFLTSLCAIVGGMVTIIGFVDAGAYHAM 490
Query: 285 RAIKKKIEIGKFS 297
+IK+K ++GK +
Sbjct: 491 NSIKRKRQLGKLN 503
>gi|217072996|gb|ACJ84858.1| unknown [Medicago truncatula]
gi|388501234|gb|AFK38683.1| unknown [Medicago truncatula]
Length = 243
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC + G++ V KV G+ + H +F N+SH IN L+FG+
Sbjct: 56 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 102
Query: 173 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 210
V P +D W T++ + +++I+VV T G+ +
Sbjct: 103 -VTPRAMIDVKHWIPYLGINHDRLNGRSFVNTRDLEGNVTIEHYIQVVKTEVITRKGYKL 161
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
++ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVF
Sbjct: 162 -IEEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 217
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
TV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 218 TVAGILDSILHNTIKAM-KKIEIGK 241
>gi|357474735|ref|XP_003607653.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508708|gb|AES89850.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 477
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC + G++ V KV G+ + H +F N+SH IN L+FG+
Sbjct: 290 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 336
Query: 173 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 210
V P +D W T++ + +++I+VV T G+ +
Sbjct: 337 -VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKL 395
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
++ T H S +P F +LSP++V TE SF HF+TNVCAI+GGVF
Sbjct: 396 I-EEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 451
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
TV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 452 TVAGILDSILHNTIKAM-KKIEIGK 475
>gi|428175103|gb|EKX43995.1| hypothetical protein GUITHDRAFT_159761 [Guillardia theta CCMP2712]
Length = 475
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 99/206 (48%), Gaps = 28/206 (13%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
+ G GC + G L V + APG Q+ V D F ++ ++SH +N L+
Sbjct: 282 VDSHNGVGCMVSGLLHVQR-------APGMLKVQA---VSDSHEFNWETMDVSHTVNHLS 331
Query: 166 FGE------------HFPGVVNPLDGVRWT--QETPSGMYQYFIKVVPTVYTDVSGHTI- 210
FG H V LD +T Q P+ +++++KVV T S +
Sbjct: 332 FGPFLSETAWMVLPPHIAASVGSLDDRSFTSDQHVPT-THEHYVKVVRHEVTPPSSWKVA 390
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
Q + H S+ + +P V YD+ PI V F E+ +F HF+TN+CAIVGGVF
Sbjct: 391 QITSYGYVVH--SNNIQKAGEVPTVRINYDILPIIVQFHEKKQAFYHFVTNLCAIVGGVF 448
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGKF 296
TV+GII + + ++KK E+GK
Sbjct: 449 TVAGIIASLMDKSINLMRKKQELGKL 474
>gi|18402672|ref|NP_566664.1| protein PDI-like 5-3 [Arabidopsis thaliana]
gi|75273652|sp|Q9LJU2.1|PDI53_ARATH RecName: Full=Protein disulfide-isomerase 5-3; Short=AtPDIL5-3;
AltName: Full=Protein disulfide-isomerase 12;
Short=PDI12; AltName: Full=Protein disulfide-isomerase
8-1; Short=AtPDIL8-1; Flags: Precursor
gi|11994143|dbj|BAB01164.1| unnamed protein product [Arabidopsis thaliana]
gi|15215847|gb|AAK91468.1| AT3g20560/K10D20_9 [Arabidopsis thaliana]
gi|332642877|gb|AEE76398.1| protein PDI-like 5-3 [Arabidopsis thaliana]
Length = 483
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 97/200 (48%), Gaps = 28/200 (14%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
GC + G++ V KV GN + SG H +F N+SH ++ +FG P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHFSFGRMISP 342
Query: 172 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
++ + LDG + + G + TV T+V +
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQTVKTEVITRRSGQEHSLI 402
Query: 218 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
E+ ++ QT LP F ++LSP+++ TE SF HF+TN+CAI+GGVFTV+GI
Sbjct: 403 EEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFTVAGI 462
Query: 276 IDAFIYHGQRAIKKKIEIGK 295
+D+ I+H + KK+E+GK
Sbjct: 463 LDS-IFHNTVRLVKKVELGK 481
>gi|123408947|ref|XP_001303296.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884664|gb|EAX90366.1| hypothetical protein TVAG_036780 [Trichomonas vaginalis G3]
Length = 364
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/253 (29%), Positives = 115/253 (45%), Gaps = 34/253 (13%)
Query: 46 HGGRLEHNE-TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGF 102
H R ++ T CG C + + CCN C++V E D I QC +
Sbjct: 130 HSARFNTSKVTECGFCNATKGLKDKYKCCNTCQQVLEV----AQVFRVVD-IPQCSDK-- 182
Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS-FHQSGVHVHDILAFQRD--SFNISH 159
++ +K+ + EGC I G E K+ FH +PG S + GVH HD+ +F D N+S+
Sbjct: 183 VKELKKMQNEGCRIKGNFETIKIKAEFHISPGYSVIDEDGVHAHDVSSFIDDVSELNLSY 242
Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVT 218
K+N FG+ + LDG Q+ Y VYT DVS ++N +S T
Sbjct: 243 KLNHCRFGDQNH---SQLDGFSTIQKQIGYFY--------AVYTIDVS----ENNDYS-T 286
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+ + G L +PG+ F YD I + +H +N+ ++ GGV + I+D
Sbjct: 287 AYMEQVDNGTL--VPGIVFKYDFGIITAKSFPDRPPLIHLFSNLVSMAGGVAMIFYILDY 344
Query: 279 FIYHG--QRAIKK 289
++ QR I K
Sbjct: 345 ALFSSIKQRKIHK 357
>gi|195402035|ref|XP_002059616.1| GJ14724 [Drosophila virilis]
gi|194147323|gb|EDW63038.1| GJ14724 [Drosophila virilis]
Length = 434
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 3/156 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ F+R N +H+IN+L+FG
Sbjct: 196 ESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHRINRLSFG 255
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G S QYF+KVVPT + TI + Q++VTE+ S
Sbjct: 256 QYSRRIVQPLEGDETIIHEESTTVQYFLKVVPTEIQH-TFSTISTFQYAVTENVHSERNS 314
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
PG++F YD S +K+ + + L F+ +C
Sbjct: 315 YGS--PGIYFKYDWSALKIVVSHDRDYLLTFVIRLC 348
>gi|194768867|ref|XP_001966532.1| GF22223 [Drosophila ananassae]
gi|190617296|gb|EDV32820.1| GF22223 [Drosophila ananassae]
Length = 448
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 97/183 (53%), Gaps = 2/183 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 200 ETKYDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 259
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + + QYF+KVVPT + TI + Q+SVTE+ R +
Sbjct: 260 QYSRRIVQPLEGDETIIQEEATTVQYFLKVVPTEIRQ-TFSTINTFQYSVTENVRKLDSE 318
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
R PG++F YD S +K+ + F+ +C+I+ G+ +SG I++ + QR
Sbjct: 319 RNSYGSPGIYFKYDWSALKIVVDNDRDHLATFVIRLCSIISGIIVISGAINSLLIAIQRR 378
Query: 287 IKK 289
+ +
Sbjct: 379 LLR 381
>gi|159483443|ref|XP_001699770.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
gi|158281712|gb|EDP07466.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
Length = 474
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)
Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHV-HDILAFQ---RDSFNISHKINKLA 165
GCN+ GF+ V KV G HF + G SF + +++ H I +F R S ++ +L
Sbjct: 286 GCNLAGFVMVKKVPGTVHFVARSEGHSFDHTWMNMTHMIHSFHVGTRPSPRKYQQLKRLH 345
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV-PTVYTDVSGHTIQSNQFSVTEHFRSS 224
+ L + E ++++++VV T+ S HT + + T H S
Sbjct: 346 PAGLTADWADKLHDQLFVSEHTQSTHEHYLQVVLTTIEPRHSRHTGNYDAYEYTAHSHSY 405
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
+ ++P F YDLSPI++ E + FLT CAI+GGVFTV+GI+DA +Y
Sbjct: 406 QS---DSIPSARFTYDLSPIQILVHETSKPWYQFLTTSCAIIGGVFTVAGILDALLYQSF 462
Query: 285 RAIKKKIEIGK 295
+ + KK+ +GK
Sbjct: 463 KVV-KKLNLGK 472
>gi|256052432|ref|XP_002569774.1| ptx1 protein [Schistosoma mansoni]
gi|353229921|emb|CCD76092.1| putative ptx1 protein [Schistosoma mansoni]
Length = 460
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 85/166 (51%), Gaps = 5/166 (3%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDSF-NISHKINKLA 165
+ + C I G L V KV GN H GK + G +H+H ++ F S N SH+IN +
Sbjct: 228 DRNSDACRIVGTLFVKKVGGNIHILFGKPLNGFGNLHLH-VVPFSGQSLQNFSHRINHFS 286
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG+ G ++PL+ V + +QYF+ +VPT + H ++ Q++ T R+ +
Sbjct: 287 FGDLVNGQIHPLEAVESVTDIAFTSFQYFVTMVPTKVVN-HFHITETYQYAATLQNRTID 345
Query: 226 Q-GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+PG+FF YD+ P+ V T + F T + A+ GG+F
Sbjct: 346 HDAGSHGIPGIFFVYDIFPLVVKITYDRELLGTFFTRLAALAGGIF 391
>gi|363748002|ref|XP_003644219.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
gi|356887851|gb|AET37402.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
DBVPG#7215]
Length = 340
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 84/165 (50%), Gaps = 10/165 (6%)
Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
+GC+IYG + VNKV+G A G ++ + +L N SH IN+L+FG+ F
Sbjct: 152 DGCSIYGSVPVNKVSGELQITAKGWTYMSTRRTPFSVL-------NFSHVINELSFGDFF 204
Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
P + N LDGV + P Y YF V+PT Y + G + +NQ+SV +SS L
Sbjct: 205 PYIDNTLDGVGRIADEPLKAYYYFTSVLPTAYKKM-GAEVHTNQYSVDAIEKSSSSHALG 263
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
G+ Y+ +KV +E + F F+ + AI+ V ++ +
Sbjct: 264 P-TGITISYNFEALKVIIKDERIGFTQFIVRLVAILSFVVYLASL 307
>gi|194911936|ref|XP_001982403.1| GG12755 [Drosophila erecta]
gi|190648079|gb|EDV45372.1| GG12755 [Drosophila erecta]
Length = 441
Score = 85.5 bits (210), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 58/183 (31%), Positives = 96/183 (52%), Gaps = 2/183 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 194 ESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTIQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
R PG++F YD S +K+ + L F +C+I+ G+ +SG I+A + QR
Sbjct: 313 RNSYGSPGIYFKYDWSALKIVVDNDRDHLLTFAIRLCSIISGIIVISGAINALLLGIQRR 372
Query: 287 IKK 289
+ +
Sbjct: 373 LLR 375
>gi|339244785|ref|XP_003378318.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
gi|316972786|gb|EFV56437.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
[Trichinella spiralis]
Length = 334
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/131 (35%), Positives = 73/131 (55%), Gaps = 6/131 (4%)
Query: 169 HFPGVVNPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
+ PG NPL ++P Y Y +K+VPTVY +++G+ + Q++
Sbjct: 130 NLPGNFNPLMNAE-VLDSPVDNFPFSYDYILKIVPTVYENIAGNMKHAYQYTYARKTYIE 188
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
QT P ++F YD +PI V + E FLT++CAI+GG FTV+G+ID+F +
Sbjct: 189 MSFTGQTNPTLWFRYDFTPITVKYHERRQPLYIFLTSICAIIGGTFTVAGLIDSFFFTAS 248
Query: 285 RAIKKKIEIGK 295
+ + KK+E+GK
Sbjct: 249 Q-LYKKVELGK 258
>gi|254579156|ref|XP_002495564.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
gi|238938454|emb|CAR26631.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
Length = 353
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/185 (29%), Positives = 96/185 (51%), Gaps = 26/185 (14%)
Query: 92 DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 151
++ D+ +R+ F C+I+G ++VN+VAG Q H +F
Sbjct: 144 NMFDEEERDAF---------NSCHIFGSVQVNRVAGEL---------QITAKGHGYSSFM 185
Query: 152 R---DSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSG 207
R + + SH IN+L++GE +P + NPLD ++ + P + Y +VPT+Y + G
Sbjct: 186 RAPPEEIDFSHVINELSYGEFYPYIDNPLDSTAKFVPDAPRTTFVYDTAIVPTIYEKL-G 244
Query: 208 HTIQSNQFSVTEHFRSSE--QGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
I +NQ++V+E+ + E QG+ PG+F YD P+ + ++ +SF+ F+ + A
Sbjct: 245 AKIDTNQYAVSEYHINPEAQQGKGPIRFPGIFLRYDFEPLSIHISDVRLSFIQFVVRLVA 304
Query: 265 IVGGV 269
I+ V
Sbjct: 305 ILSFV 309
>gi|323303637|gb|EGA57425.1| Erv41p [Saccharomyces cerevisiae FostersB]
Length = 284
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G KS + + +H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDXRLSFIQFLVRLVAI 241
>gi|123435131|ref|XP_001308935.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121890639|gb|EAX96005.1| hypothetical protein TVAG_369150 [Trichomonas vaginalis G3]
Length = 353
Score = 85.1 bits (209), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/224 (25%), Positives = 100/224 (44%), Gaps = 15/224 (6%)
Query: 57 CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
C C+ + + CCN C+ ++E Y+ P+ QC+ R E C +
Sbjct: 127 CYPCFKVQFHNYTCCNGCDRLKENYKLNNLT-PEPEKWPQCQTNA---RPDINSSEKCLV 182
Query: 117 YGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDIL-AFQRDSFNISHKINKLAFGEHFPGVV 174
G + VN+V G+FH A G++ + G H+H++L F +F SH I + FG
Sbjct: 183 KGKVSVNRVRGSFHIAAGRNIYLNDGSHIHELLDDFPNLAF--SHAIEHIRFGPRIITAK 240
Query: 175 NPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
PL V +E + + Y + V P ++ + +S +++V H + P
Sbjct: 241 QPLQNLVMRAKENLTVTHDYSLLVTPVIFVADNQFIEKSFEYTVYLHPVQDKD------P 294
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
G++F Y +P + T SF FL + G++ ++ IID
Sbjct: 295 GIYFDYQFTPYTIQITWISRSFRGFLISTAGFTAGLYAIASIID 338
>gi|409048375|gb|EKM57853.1| hypothetical protein PHACADRAFT_116248 [Phanerochaete carnosa
HHB-10118-sp]
Length = 546
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 75/153 (49%), Gaps = 10/153 (6%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
+ G C +YG + V KV N H + S HV H+++ N+SH I + +F
Sbjct: 174 KPSGSACRVYGSVAVKKVTANLHVTTLGHGYASRQHVDHNLM-------NLSHVITEFSF 226
Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
G +FP + PLD E YQY++ VVPT Y + ++Q+SVT + R +
Sbjct: 227 GPYFPDITQPLDNSFELTEDSFVSYQYYLHVVPTTYIAPRSRPLHTHQYSVTHYTRVLKH 286
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
+PG+FF +D+ P+ +T + S L L
Sbjct: 287 N--NGIPGIFFKFDVDPMSLTIHQRTTSLLQLL 317
>gi|171693749|ref|XP_001911799.1| hypothetical protein [Podospora anserina S mat+]
gi|170946823|emb|CAP73627.1| unnamed protein product [Podospora anserina S mat+]
Length = 180
Score = 84.7 bits (208), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 8/126 (6%)
Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-----DVS 206
SFN SH IN+L+FG + P ++NPLD + S +QYF+ +VPTVY+ S
Sbjct: 15 SFNFSHIINELSFGPYLPSLINPLDQTVNSAPEHSHFHRFQYFLSIVPTVYSLGHPDSYS 74
Query: 207 GHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+I +NQ++VTE E +Q +PG+F YD+ PI + E+ SF FL V I
Sbjct: 75 SRSIFTNQYAVTEQSAPIPENMEMQMIPGIFVKYDIEPILLNIVEDRDSFFVFLIKVVNI 134
Query: 266 VGGVFT 271
+ G
Sbjct: 135 LSGAMV 140
>gi|357452761|ref|XP_003596657.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355485705|gb|AES66908.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 482
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 73/257 (28%), Positives = 117/257 (45%), Gaps = 40/257 (15%)
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
S YG +D E + ++ + + L+ D ++ + +R G GC I G
Sbjct: 244 SYYGDRDTDS-LVKTMENILASFPSEYYKLALEDKLNVTEDS---KRPAPSSG-GCRIEG 298
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-------- 170
++ V KV GN + H +F N+SH ++ L+FG+
Sbjct: 299 YVRVKKVPGNLIISARSDAH----------SFDASQMNMSHAVHHLSFGKKLSPKLMSDV 348
Query: 171 ----PGVVNP---LDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
P V N LDG+ + G ++++++V T G+ + ++ T H
Sbjct: 349 QRLIPYVGNSHDRLDGLSFINSHDFGANVTLEHYLQIVKTEVITRQGYQL-VEEYEYTAH 407
Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKV--TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
S +P F LSP++V TE+H SF HF+TNVCAIVGGVFTV+GI ++
Sbjct: 408 ---SSLAHSLHVPVARFHLQLSPMQVCVLITEDHKSFSHFITNVCAIVGGVFTVAGITES 464
Query: 279 FIYHGQRAIKKKIEIGK 295
I H + +K+E+GK
Sbjct: 465 -ILHNTIRLMRKVELGK 480
>gi|145510182|ref|XP_001441024.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408263|emb|CAK73627.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/194 (26%), Positives = 90/194 (46%), Gaps = 14/194 (7%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS---------FHQSGVHVHDILAFQRDSF 155
R E +GC + G L++N+V G F P +S H + H ++F
Sbjct: 130 RTAVAEKQGCEVVGSLKINRVKGKISFGPHRSHTYIGAVGNLHLPLDYSHKFVSFTFGDE 189
Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
N K+ + + ++ + S +++FI ++PT YT ++ T +
Sbjct: 190 NALKKVKSMFKQGQLESLAGSQRIKKYELASQSMQHEHFIHIIPTHYTLLNKQT-----Y 244
Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
SV ++ + + R V YD +P VT+ + LHFL +CA++GG+FTVS +
Sbjct: 245 SVYQYTANHNEVRSHNYANVQLRYDFAPTTVTYWQTKEDILHFLVQICAVIGGIFTVSSM 304
Query: 276 IDAFIYHGQRAIKK 289
I+A +Y R++ K
Sbjct: 305 IEASVYKVMRSVLK 318
>gi|167382848|ref|XP_001736294.1| hypothetical protein [Entamoeba dispar SAW760]
gi|165901464|gb|EDR27547.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
Length = 315
Score = 84.3 bits (207), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 64/200 (32%), Positives = 96/200 (48%), Gaps = 20/200 (10%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
GC ++G ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
H IN L+F V PL+G +T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q SV+E G PGVFF Y+LSP V SF H L +V AIVGGV +
Sbjct: 235 QLSVSERDIPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIVGGVLIII 294
Query: 274 GIIDAFIYHGQRAIKKKIEI 293
G + + + +E+
Sbjct: 295 GWLSKLFDSNRELVTSVVEM 314
>gi|295663046|ref|XP_002792076.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
gi|226279251|gb|EEH34817.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
Length = 392
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 39/190 (20%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + C IYG LE NKV G+FH A G + + G H+ ++L+FG
Sbjct: 188 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEYGEHLDH---------------HELSFG 232
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 233 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRTGTIDPYSQVLPDPSTISPSQRK 292
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
+TI +NQ++VT RS E +Q +PG+FF Y + PI + +EE S L L + ++
Sbjct: 293 NTIFTNQYAVTS--RSHELPDVQFYVPGIFFKYSIEPILLIISEERGSLLALLVRLVNVM 350
Query: 267 GGVFTVSGII 276
GV G +
Sbjct: 351 AGVVVAGGWL 360
>gi|323307814|gb|EGA61076.1| Erv41p [Saccharomyces cerevisiae FostersO]
Length = 284
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G KS + + +H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDIRLSFIQFLVRLVAI 241
>gi|226294628|gb|EEH50048.1| endoplasmic reticulum-Golgi intermediate compartment protein
[Paracoccidioides brasiliensis Pb18]
Length = 392
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 89/190 (46%), Gaps = 39/190 (20%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + C IYG LE NKV G+FH A G + + G H+ ++L+FG
Sbjct: 188 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHLDH---------------HELSFG 232
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
H+ ++NPLD T YQY++ +VPT+YT
Sbjct: 233 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTVDPYSQVLPDPSTISPSQRK 292
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
+TI +NQ++VT RS E +Q +PG+FF Y++ PI + +EE S L L + ++
Sbjct: 293 NTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLALLVRLVNVM 350
Query: 267 GGVFTVSGII 276
GV G +
Sbjct: 351 AGVVVAGGWL 360
>gi|195639434|gb|ACG39185.1| PDIL5-4 - Zea mays protein disulfide isomerase [Zea mays]
Length = 485
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 79/279 (28%), Positives = 125/279 (44%), Gaps = 51/279 (18%)
Query: 45 RHGGRLEHNETYCG-SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD----LIDQCKR 99
R G ++ N+ + Y E E E K+ AL+ D +D KR
Sbjct: 228 RKGSDIKENQGHHDHESYYGERDTESLVAAMETYVANIPKEAHALALEDKSNKTVDPAKR 287
Query: 100 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
+ GC I GF+ V +V G+ + +SG H +F N+SH
Sbjct: 288 PAPM-------ASGCRIEGFVRVKRVPGSVVISA-----RSGSH-----SFDPSQINVSH 330
Query: 160 KINKLAFGE---------------HFPGVVNPLDGVRWT----QETPSGMYQYFIKVVPT 200
+ + +FG+ + G + L G +T + + +++++VV T
Sbjct: 331 YVTQFSFGKRLSPRMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKT 390
Query: 201 -VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFL 256
+ T S S + V E + + L +P V F ++ SP++V TE SF
Sbjct: 391 ELVTQRS-----SKELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEVPKSFS 445
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
HF+TNVCAI+GGVFTV+GI+D+ I+H + KKIE+GK
Sbjct: 446 HFITNVCAIIGGVFTVAGILDS-IFHNTLRMVKKIELGK 483
>gi|207342541|gb|EDZ70277.1| YML067Cp-like protein [Saccharomyces cerevisiae AWRI1631]
gi|323336174|gb|EGA77445.1| Erv41p [Saccharomyces cerevisiae Vin13]
gi|323347070|gb|EGA81345.1| Erv41p [Saccharomyces cerevisiae Lalvin QA23]
Length = 284
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 90 EFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241
>gi|151946097|gb|EDN64328.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
gi|190408176|gb|EDV11441.1| hypothetical protein SCRG_01831 [Saccharomyces cerevisiae RM11-1a]
gi|259148509|emb|CAY81754.1| Erv41p [Saccharomyces cerevisiae EC1118]
Length = 352
Score = 84.0 bits (206), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 158 EFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309
>gi|407037175|gb|EKE38536.1| hypothetical protein ENU1_163530 [Entamoeba nuttalli P19]
Length = 315
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 95/200 (47%), Gaps = 20/200 (10%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
GC ++G ++V++V+G FH A GK SF Q + H+H + SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175
Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
H IN L+F V PL+G +T + Y+I V+PT++ S +T+++
Sbjct: 176 HYINHLSFSNILGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
Q SV E G PGVFF Y+LSP V SF H L +V AI+GGV +
Sbjct: 235 QLSVNERDVPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294
Query: 274 GIIDAFIYHGQRAIKKKIEI 293
G++ + +E+
Sbjct: 295 GLLSRLFDSKHELVTSVVEM 314
>gi|195469521|ref|XP_002099686.1| GE16580 [Drosophila yakuba]
gi|194187210|gb|EDX00794.1| GE16580 [Drosophila yakuba]
Length = 430
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 57/183 (31%), Positives = 96/183 (52%), Gaps = 2/183 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
R PG++F YD S +K+ + + F +C+I+ G+ +SG I+A + QR
Sbjct: 313 RNSYGSPGIYFKYDWSALKIMVDNDRDHLVTFAIRLCSIISGIIVISGAINALLLGIQRR 372
Query: 287 IKK 289
+ +
Sbjct: 373 LLR 375
>gi|123437985|ref|XP_001309782.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121891523|gb|EAX96852.1| hypothetical protein TVAG_470170 [Trichomonas vaginalis G3]
Length = 344
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 70/242 (28%), Positives = 111/242 (45%), Gaps = 21/242 (8%)
Query: 57 CGSCYGAESSD-EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
CGSCYG E ++ CCN CE+V + K G L+N QC E + KE+ C
Sbjct: 119 CGSCYGTEFAEGSRCCNTCEDVVSHHIKAGRPLTNVTTWQQCINEKYDFTGKEK----CQ 174
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
I+G V+ + G P S ++ F + N++H I+ + FG F
Sbjct: 175 IFGNHHVSAIDGGIRILPRFSSNEE--------PFTK-LLNLTHYIDHITFGTSFGP--Q 223
Query: 176 PLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV-TEHFRSSEQGRLQTLP 233
PLD Q P Y+Y +K VPTV + G Q++V + +++ RL
Sbjct: 224 PLDDALIVQSEPGQFHYRYDLKAVPTVMHNQDGSITHGFQYAVDSAKIPITDRTRLGE-- 281
Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
G+FF Y + + V + + ++ + I GG F ++ +ID+F Y ++ K+ I
Sbjct: 282 GIFFNYYFATVAVVGKPDRFTIYILISRLFCIFGGGFFLARLIDSFGYR-IHTMEGKMRI 340
Query: 294 GK 295
GK
Sbjct: 341 GK 342
>gi|162462518|ref|NP_001105762.1| protein disulfide isomerase12 [Zea mays]
gi|59861281|gb|AAX09970.1| protein disulfide isomerase [Zea mays]
gi|414590455|tpg|DAA41026.1| TPA: putative thioredoxin superfamily protein [Zea mays]
Length = 483
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)
Query: 15 IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 72
I ++D V R++ I G P I + R G ++ N+ + Y E E
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256
Query: 73 NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 130
E K+ AL + +D KR + GC I GF+ V +V G+
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309
Query: 131 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 175
+ +SG H +F N+SH + + +FG+ + G +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359
Query: 176 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
L G +T + + +++++VV T + T S S + V E + + L
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414
Query: 231 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+P V F ++ SP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ I+H +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473
Query: 288 KKKIEIGK 295
KKIE+GK
Sbjct: 474 VKKIELGK 481
>gi|224030141|gb|ACN34146.1| unknown [Zea mays]
Length = 483
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)
Query: 15 IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 72
I ++D V R++ I G P I + R G ++ N+ + Y E E
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256
Query: 73 NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 130
E K+ AL + +D KR + GC I GF+ V +V G+
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309
Query: 131 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 175
+ +SG H +F N+SH + + +FG+ + G +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359
Query: 176 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
L G +T + + +++++VV T + T S S + V E + + L
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414
Query: 231 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
+P V F ++ SP++V TE SF HF+TNVCAI+GGVFTV+GI+D+ I+H +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473
Query: 288 KKKIEIGK 295
KKIE+GK
Sbjct: 474 VKKIELGK 481
>gi|392297516|gb|EIW08616.1| Erv41p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 352
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 158 EFNGCHIFGSIPVNRVSGELQII-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309
>gi|323332255|gb|EGA73665.1| Erv41p [Saccharomyces cerevisiae AWRI796]
gi|323352959|gb|EGA85259.1| Erv41p [Saccharomyces cerevisiae VL3]
gi|365763687|gb|EHN05213.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 250
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/165 (30%), Positives = 84/165 (50%), Gaps = 10/165 (6%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
R E GC+I+G + VN+V+G KS G + FN H IN+
Sbjct: 50 NRAHLPEFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINE 103
Query: 164 LAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-- 220
+FG+ +P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 104 FSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRY 162
Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 163 LYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 207
>gi|256269733|gb|EEU05000.1| Erv41p [Saccharomyces cerevisiae JAY291]
Length = 353
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 47/159 (29%), Positives = 82/159 (51%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+I+G + VN+V+G + G + FN H IN+ +FG+
Sbjct: 159 EFNGCHIFGSIPVNRVSGELQITA----NSLGYVASRKAPLEELKFN--HVINEFSFGDF 212
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 213 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 271
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 272 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 310
>gi|6323573|ref|NP_013644.1| Erv41p [Saccharomyces cerevisiae S288c]
gi|2497084|sp|Q04651.1|ERV41_YEAST RecName: Full=ER-derived vesicles protein ERV41
gi|558408|emb|CAA86254.1| unnamed protein product [Saccharomyces cerevisiae]
gi|285813935|tpg|DAA09830.1| TPA: Erv41p [Saccharomyces cerevisiae S288c]
Length = 352
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+++G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 158 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309
>gi|558407|emb|CAA86253.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 284
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+++G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 90 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241
>gi|349580221|dbj|GAA25381.1| K7_Erv41p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 352
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC+++G + VN+V+G KS G + FN H IN+ +FG+
Sbjct: 158 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
+P + NPLD ++ Q+ P Y Y+ VVPT++ + G + +NQ+SV ++
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270
Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
+ +PG+FF Y+ P+ + ++ +SF+ FL + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309
>gi|442614645|ref|NP_001259099.1| CG4293, isoform E [Drosophila melanogaster]
gi|440216271|gb|AGB94945.1| CG4293, isoform E [Drosophila melanogaster]
Length = 439
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 81/156 (51%), Gaps = 2/156 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R E+
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTIYAFQYAVTENVRKLERN 312
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
+ PG++F YD S +K+ + + F +C
Sbjct: 313 SYGS-PGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 347
>gi|115472445|ref|NP_001059821.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|75118816|sp|Q69SA9.1|PDI54_ORYSJ RecName: Full=Protein disulfide isomerase-like 5-4;
Short=OsPDIL5-4; AltName: Full=Protein disulfide
isomerase-like 8-1; Short=OsPDIL8-1; Flags: Precursor
gi|50508559|dbj|BAD30858.1| thioredoxin family-like protein [Oryza sativa Japonica Group]
gi|113611357|dbj|BAF21735.1| Os07g0524100 [Oryza sativa Japonica Group]
gi|215704615|dbj|BAG94243.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218199742|gb|EEC82169.1| hypothetical protein OsI_26259 [Oryza sativa Indica Group]
gi|222637167|gb|EEE67299.1| hypothetical protein OsJ_24505 [Oryza sativa Japonica Group]
Length = 485
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 103/224 (45%), Gaps = 44/224 (19%)
Query: 94 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
+D KR L GC I GF+ V KV G+ + +SG H +F
Sbjct: 282 VDPAKRPAPLT-------SGCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPS 324
Query: 154 SFNISHKINKLAFGEHFPGV-------VNPLDG------------VRWTQETPSGMYQYF 194
N+SH + + +FG+ + P G V+ + +++
Sbjct: 325 QINVSHYVTQFSFGKRLSAKMFNELKRLTPYVGGHHDRLAGQSYIVKHGDVNANVTIEHY 384
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 251
+++V T + S + + E + + L +P V F ++ SP++V TE
Sbjct: 385 LQIVKTELVTLRS----SKELKLVEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440
Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ I+H + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRLVKKVELGK 483
>gi|393908149|gb|EJD74928.1| hypothetical protein LOAG_17836 [Loa loa]
Length = 430
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 86/175 (49%), Gaps = 6/175 (3%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
++ EG C I+G + VNKV G+ F + GK G+ H NISH+I +
Sbjct: 222 EKNEGTACRIHGRMRVNKVKGDSFIISTGKGLDVDGIFAH--FGGVSSPSNISHRIERFN 279
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
FG G+V PL G+ ET ++YF+K+VPT ++ + G + + Q+SVT +
Sbjct: 280 FGPRIYGLVTPLAGIEQISETGVDEFRYFLKIVPTRIYHSGLFGGSTLTYQYSVT-FMKK 338
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 339 TPKKDVHKHTAIIIHYEFAATVIEVRHVQSSLLQMLVRLCSAVGGVFATSILLNS 393
>gi|402595088|gb|EJW89014.1| hypothetical protein WUBG_00081 [Wuchereria bancrofti]
Length = 578
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 6/176 (3%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
++ EG C I+G + VNKV G+ F + GK G+ H + N+SH+I +
Sbjct: 372 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGLSNPGNVSHRIERFN 429
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
FG G+V PL G+ ET ++YF+KVVPT ++ + G + + Q+SVT +
Sbjct: 430 FGPTIYGLVTPLAGIEQISETGMDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 488
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 489 TPKKDVHKHAAIIIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNSI 544
>gi|366987569|ref|XP_003673551.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
gi|342299414|emb|CCC67168.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
Length = 355
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/168 (28%), Positives = 83/168 (49%), Gaps = 11/168 (6%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC+I+G L VN+VAG G D D +H IN+ +FG+ +P
Sbjct: 164 GCHIFGSLPVNRVAGELQIT------AKGYGYADRERTPMDQIKFNHVINEFSFGDFYPY 217
Query: 173 VVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGR 228
+ NPLD ++ ETP Y Y + V+PT + + G + + Q+SV E+ + S R
Sbjct: 218 IDNPLDKSAKFDLETPKTAYSYDLSVIPTTFRKL-GTEVNTFQYSVAEYHYKGKDSPVPR 276
Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
+PG+FF Y+ + + ++ ++F+ F+ + AI+ ++ I
Sbjct: 277 SGRVPGIFFDYNFESLSIIVSDSRLNFIQFIIRLIAILSFALYIASWI 324
>gi|301089326|ref|XP_002894975.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262104295|gb|EEY62347.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 102
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/86 (51%), Positives = 54/86 (62%)
Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 255
+VVPT YT +S I +NQFS TEHFR + LP V F Y SPI + V F
Sbjct: 5 QVVPTEYTFLSASRIITNQFSATEHFRQLTPVSDKGLPMVSFSYTFSPIMFRIEQYRVGF 64
Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIY 281
L FLT+VCAIVGGVFT+ GI+D+ +
Sbjct: 65 LQFLTSVCAIVGGVFTILGIMDSLAF 90
>gi|303275141|ref|XP_003056869.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461221|gb|EEH58514.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 604
Score = 81.6 bits (200), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 59/209 (28%), Positives = 99/209 (47%), Gaps = 40/209 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC I G + VN+V G F+ + H G H+I D N++H + L+FG+ PG
Sbjct: 402 GCIIEGSVRVNRVPGAFYV----TAHSKG---HNI---NVDVVNMTHVLRHLSFGKTVPG 451
Query: 173 VVN-------------PLD-----GVRWTQET-----PSGMYQYFIKVVPTVYTDVSGHT 209
+ P D V +ET P ++++++KVV + + G
Sbjct: 452 RPSYVPRHMRRVWSKIPKDMGGRFAVAGAEETFASAEPYTVHEHYLKVVSHAFEPIDGDA 511
Query: 210 IQ-------SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
+Q SN+F + E P + F YD+SP++V EE L + +
Sbjct: 512 VQLYEYTFNSNRFKLAPAAYGDEDDAHVDGPMIKFSYDVSPMRVVLREETKPVLDWTLGM 571
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
CA++GGV+T SG+++AFI +G +K+++
Sbjct: 572 CALMGGVYTCSGLLEAFISNGVSVVKRRV 600
>gi|154335780|ref|XP_001564126.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134061160|emb|CAM38182.1| conserved hypothetical protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 309
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 98/224 (43%), Gaps = 22/224 (9%)
Query: 65 SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
++ CC++C+ V E Y+ + QC + + E GCN+ G L++ K
Sbjct: 89 AAASKCCDSCDSVFELYKDLEKEFPGIEYFPQCLEQLY------ERARGCNVIGSLDLKK 142
Query: 125 VAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLD 178
V F P ++ + + D++ + SH I KL G E F GV PL
Sbjct: 143 VPVTVIFGPRRTGRRYSLK--DVI-----RLDTSHVIKKLRIGDEAVERFSKHGVAEPLC 195
Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE--QGRLQTLPGVF 236
G +T S +Y +KVVPT Y +++ + + S G +P V
Sbjct: 196 GHERFSKTYSET-RYLVKVVPTTYRKTRTRDAKASTYEYSAQCSSQAIVVGFSGVVPAVL 254
Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
F ++ + I+V E HFL +C IVGG+F V G ID+ +
Sbjct: 255 FAFEPAAIQVNNVFERQPVSHFLVQLCGIVGGLFVVLGFIDSTV 298
>gi|255714272|ref|XP_002553418.1| KLTH0D16324p [Lachancea thermotolerans]
gi|238934798|emb|CAR22980.1| KLTH0D16324p [Lachancea thermotolerans CBS 6340]
Length = 340
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 84/164 (51%), Gaps = 10/164 (6%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
+ +E GC+++G + VN V G+ P V D D+ N+SH IN+ +F
Sbjct: 147 ESKEFNGCHVFGTITVNMVKGDLIIIPRSQ------SVRDFGRMPPDAINLSHVINEFSF 200
Query: 167 GEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
G+ +P + NPLD R T E + + Y VVPT++ + G + +NQ+S++E +
Sbjct: 201 GDFYPYIDNPLDRSARITAEHTTS-FHYHTSVVPTIFQKL-GAEVNTNQYSLSETKHETP 258
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
L+ +P + F Y + +T +E +SF F+ + AI+ +
Sbjct: 259 PSGLR-VPAIIFSYSFEALTITIRDERISFWQFIVRLVAILSFI 301
>gi|195564437|ref|XP_002105825.1| GD16474 [Drosophila simulans]
gi|194203186|gb|EDX16762.1| GD16474 [Drosophila simulans]
Length = 441
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 2/174 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
R PG++F YD S +K+ + + F +C+I+ G+ +SG I+A +
Sbjct: 313 RNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGIIVISGAINALL 366
>gi|195165324|ref|XP_002023489.1| GL20164 [Drosophila persimilis]
gi|194105594|gb|EDW27637.1| GL20164 [Drosophila persimilis]
Length = 445
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 2/157 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 199 ETKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG 258
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 259 QYSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQ-TFTTINTFQYAVTENVRKLDSE 317
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
R PG++F YD S +K+ + + + F +C
Sbjct: 318 RNSYGSPGIYFKYDWSALKIVVSNDRDHLVTFAIRLC 354
>gi|198468706|ref|XP_001354796.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
gi|198146533|gb|EAL31851.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
Length = 445
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 2/157 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 199 ETKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG 258
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 259 QYSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQ-TFTTINTFQYAVTENVRKLDSE 317
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
R PG++F YD S +K+ + + + F +C
Sbjct: 318 RNSYGSPGIYFKYDWSALKIVVSNDRDHLVTFAIRLC 354
>gi|308487907|ref|XP_003106148.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
gi|308254138|gb|EFO98090.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
Length = 427
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/175 (30%), Positives = 88/175 (50%), Gaps = 14/175 (8%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ---RDSFNISHKINKLA 165
E+G+ C ++G +V K GK + +L F+ + NISH+I K
Sbjct: 221 EDGKACRLHGKFKVRK---------GKEEKIVMSISNPLLMFEHQEKQPGNISHRIEKFN 271
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
FG PG+V PL G E+ +Y+YFIK+VPT HT+ + Q+SVT + +
Sbjct: 272 FGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTHTL-AYQYSVTFLKKQLK 330
Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
+G + G+ F Y+ + + + V+ +L +C+I+GGV+ S II+ +
Sbjct: 331 EGE-HSHGGILFEYEFTANVIEVHKTSVTLFSYLIRICSILGGVYATSTIINNVV 384
>gi|366997520|ref|XP_003678522.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
gi|342304394|emb|CCC72184.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
Length = 347
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 14/159 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E C+I+G + VN+VAG F HQ +V D +H IN+ +FG+
Sbjct: 158 EYSACHIFGSIPVNRVAGEFQITTIDR-HQPIENVVDF----------THVINEFSFGDF 206
Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE-HFRSSEQG 227
FP V NPLD ++ + YQY + VVPT+Y + G I +NQ+S++E H+++
Sbjct: 207 FPYVDNPLDSTAKYVPDEKLTSYQYHLSVVPTIYNKM-GVLINTNQYSLSEYHYKNITNA 265
Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
+ PG+F Y+ + + + + F FL + AI+
Sbjct: 266 NDKNSPGIFIKYNFESLTIIVNDRRLGFTQFLIRLIAIL 304
>gi|195347402|ref|XP_002040242.1| GM19035 [Drosophila sechellia]
gi|194121670|gb|EDW43713.1| GM19035 [Drosophila sechellia]
Length = 437
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 2/174 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 190 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 249
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 250 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 308
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
R PG++F YD S +K+ + + F +C+I+ G+ +SG I+A +
Sbjct: 309 RNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGIIVISGAINALL 362
>gi|323448816|gb|EGB04710.1| hypothetical protein AURANDRAFT_55105 [Aureococcus anophagefferens]
Length = 324
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 27/194 (13%)
Query: 104 QRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKIN 162
QR+ E +E GC + G + VN+V GNFH +S H H++ A N+SH +N
Sbjct: 133 QRMLEIKEHPGCMVSGHVLVNRVPGNFHIE-ARSIH------HNLNAAMT---NLSHVVN 182
Query: 163 KLAFG-----------EHFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 209
L+FG +P V+PLDG + ++ ++ KVV T + +V G
Sbjct: 183 HLSFGTPLAKDMQRKVSKYPQFQSVHPLDGGIFVSRDYHQVHHHYSKVVSTHF-EVGGMM 241
Query: 210 IQSNQFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
+S + + S+ + P F YDLSP+ V + + + F+T+VCAI+G
Sbjct: 242 TKSREIVGYQMLAQSQIMHYNEMDVPEAKFSYDLSPMAVLVSSKGRRWYDFVTSVCAIIG 301
Query: 268 GVFTVSGIIDAFIY 281
G FTV GI+DA +Y
Sbjct: 302 GTFTVVGIVDAVLY 315
>gi|343476464|emb|CCD12449.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 224
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 65/139 (46%), Gaps = 10/139 (7%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D GE D+ D K R+DS D + +PL + + C SC
Sbjct: 90 IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
YGAE + DCC+ C++VR A+ ++ W D+ I QC +E EGCN++
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200
Query: 120 LEVNKVAGNFHFAPGKSFH 138
V +V N HF PG+ F+
Sbjct: 201 FRVPRVTENIHFVPGRMFY 219
>gi|170588701|ref|XP_001899112.1| hypothetical protein [Brugia malayi]
gi|158593325|gb|EDP31920.1| conserved hypothetical protein [Brugia malayi]
Length = 430
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 87/175 (49%), Gaps = 6/175 (3%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
++ EG C I+G + VNKV G+ F + GK G+ H + N+SH+I +
Sbjct: 223 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGVSNPGNLSHRIERFN 280
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
FG G+V PL G+ ET ++YF+KVVPT ++ + G + + Q+SVT +
Sbjct: 281 FGPTIYGLVTPLAGIEQISETGIDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 339
Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
+ + + + Y+ + + S L L +C+ VGGVF S ++++
Sbjct: 340 TPKKDVHKHAAIVIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNS 394
>gi|357122608|ref|XP_003563007.1| PREDICTED: protein disulfide isomerase-like 5-4-like [Brachypodium
distachyon]
Length = 485
Score = 80.9 bits (198), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 44/224 (19%)
Query: 94 IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
+D KR + GC + GF+ V KV G+ + +SG H +F
Sbjct: 282 VDPAKRPAPMT-------SGCRVEGFVRVKKVPGSVIISA-----RSGSH-----SFDPS 324
Query: 154 SFNISHKINKLAFGEHF-PGVVNPLDG------------------VRWTQETPSGMYQYF 194
N+SH + + +FG P + + L V+ + +++
Sbjct: 325 QINVSHYVTQFSFGNRLSPNMFSELKRLIPYVGGHHDRLAGQSYIVKHGDNNANVTIEHY 384
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 251
+++V T + S + V E + + L +P V F ++ SP++V TE
Sbjct: 385 LQIVKTELVTLRS----SKELKVFEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440
Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ +++ R + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDSILHNTLRLV-KKVELGK 483
>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1070
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
+KVVPT Y +S + +NQ SVTE+F S + P V+F YDLSPI T EE +
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 573
Query: 255 FLHFLTNVCAIVGGVFTVSG 274
FLHF+T +CA++GG F ++G
Sbjct: 574 FLHFITRLCAVLGGTFAMTG 593
>gi|18921097|ref|NP_569847.1| CG4293, isoform A [Drosophila melanogaster]
gi|24638890|ref|NP_726677.1| CG4293, isoform B [Drosophila melanogaster]
gi|85724768|ref|NP_001033816.1| CG4293, isoform D [Drosophila melanogaster]
gi|85724770|ref|NP_001033817.1| CG4293, isoform C [Drosophila melanogaster]
gi|2961397|emb|CAA18090.1| EG:65F1.1 [Drosophila melanogaster]
gi|7290051|gb|AAF45518.1| CG4293, isoform A [Drosophila melanogaster]
gi|7290052|gb|AAF45519.1| CG4293, isoform B [Drosophila melanogaster]
gi|15292011|gb|AAK93274.1| LD35174p [Drosophila melanogaster]
gi|84798360|gb|ABC67159.1| CG4293, isoform C [Drosophila melanogaster]
gi|84798361|gb|ABC67160.1| CG4293, isoform D [Drosophila melanogaster]
gi|220955778|gb|ACL90432.1| CG4293-PA [synthetic construct]
Length = 441
Score = 80.5 bits (197), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 2/157 (1%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E + + C ++G L +NKVAG H G H ++ +R N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253
Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
++ +V PL+G + QYF+KVVPT + TI + Q++VTE+ R +
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTIYAFQYAVTENVRKLDSE 312
Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
R PG++F YD S +K+ + + F +C
Sbjct: 313 RNSYGSPGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 349
>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
Length = 1061
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
+KVVPT Y +S + +NQ SVTE+F S + P V+F YDLSPI T EE +
Sbjct: 501 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 559
Query: 255 FLHFLTNVCAIVGGVFTVSG 274
FLHF+T +CA++GG F ++G
Sbjct: 560 FLHFITRLCAVLGGTFAMTG 579
>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
Length = 1594
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)
Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
+KVVPT Y +S + +NQ SVTE+F S + P V+F YDLSPI T EE +
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 573
Query: 255 FLHFLTNVCAIVGGVFTVSG 274
FLHF+T +CA++GG F ++G
Sbjct: 574 FLHFITRLCAVLGGTFAMTG 593
>gi|326503558|dbj|BAJ86285.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 485
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC I GF+ V KV G+ + +SG H +F N+SH + +FG+
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343
Query: 170 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
FP G + L G V+ + ++++++V T + S
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYVVKHGDVNANVTIEHYLQIVKTELVTLR----YSK 399
Query: 214 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+ V E + + L +P V F ++ SP++V TE SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
TV+GI+D+ +++ R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483
>gi|146163751|ref|XP_001012240.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila]
gi|146145943|gb|EAR91995.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila
SB210]
Length = 331
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 98/191 (51%), Gaps = 32/191 (16%)
Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF---NISHKINKLAFGE 168
EGC I G++ + KV GNFH S+H ++ I + + D++ N+++KIN L FGE
Sbjct: 138 EGCRINGYINLKKVPGNFHI----SYHAKMDVMNRIASTKPDTYSKINLNYKINHLGFGE 193
Query: 169 H--FPGVVNPLDGVRWTQETPSGMYQY---------------FIKVVPTVYTDVSGH-TI 210
+ + + G QET + Y + ++K++P Y H ++
Sbjct: 194 NTNHMATIFKIMGRTLFQETNTNDYPHDDTKYINPGKNDYDNYLKILPCRYDSNKLHMSV 253
Query: 211 QSNQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
++++ T +SS + +P +FF Y++SPI V ++ + SF HFL + AIVGG
Sbjct: 254 SRYKYAMYSTHTPKSSTE-----IPTIFFRYEISPINVYYSTKSKSFYHFLVQIFAIVGG 308
Query: 269 VFTVSGIIDAF 279
+F V GI ++
Sbjct: 309 IFAVMGIFNSL 319
>gi|50545267|ref|XP_500171.1| YALI0A17600p [Yarrowia lipolytica]
gi|49646036|emb|CAG84103.1| YALI0A17600p [Yarrowia lipolytica CLIB122]
Length = 337
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)
Query: 114 CNIYGFLEVNKVAGNFHF--APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
C I G + +N V G P + + + D N++H I++L+FG++FP
Sbjct: 151 CRISGSVPINHVEGALQIFNLPDNQYFINPMKA-------SDGLNLTHAIHELSFGDYFP 203
Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-TIQSNQFSVTEHFRSSEQGRLQ 230
V+NPLDGV + P YQYF+ VP Y+ SG I + Q++V + ++ Q
Sbjct: 204 KVLNPLDGVSTVTDEPLMSYQYFLSAVPVEYS--SGRKKIHTYQYAVKKQ-TTNLQEHFV 260
Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
T P +FF Y P+ + + + F+ + +I+GG F V G ++I G +K
Sbjct: 261 TRPAIFFHYKYEPVTLKIQDSRETLTVFVVKLLSILGG-FVVCG---SWIVRGGEKAYEK 316
Query: 291 IEIGK 295
I +GK
Sbjct: 317 I-VGK 320
>gi|255074657|ref|XP_002501003.1| predicted protein [Micromonas sp. RCC299]
gi|226516266|gb|ACO62261.1| predicted protein [Micromonas sp. RCC299]
Length = 515
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 97/215 (45%), Gaps = 42/215 (19%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC I G VN+V G F+ P H D N++H + L+FG+H PG
Sbjct: 313 GCIIDGSFRVNRVPGAFYVTPHSMGHN----------LNPDVINMTHTVKHLSFGKHVPG 362
Query: 173 -----------VVNPL-----------DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
V N + D + E P+ ++++++K+V + + G +
Sbjct: 363 RPSYVPRNLRRVWNRVPKDLGGRFAAGDEATFYSEEPNTVHEHYLKIVSRTFEPLEGQAV 422
Query: 211 Q-------SNQFSVTEHFRSS-EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
Q SN+F + + + + P + F YD+SP+ V E L ++ +
Sbjct: 423 QLYEYTFNSNRFRLNPPLAADGDPDQHVDGPMIKFSYDVSPMSVVLKEVKKPLLDWILGM 482
Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
CA++GGV+T +G+++ F+ A+K++ +GK S
Sbjct: 483 CALLGGVYTCAGLLETFLQSSVCAVKRR--VGKIS 515
>gi|12321801|gb|AAG50943.1|AC079284_18 hypothetical protein [Arabidopsis thaliana]
Length = 451
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)
Query: 35 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
G P I + G R +H S YG +D EE+ + +K+ L+ +
Sbjct: 188 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 242
Query: 95 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
D K GC I G++ KV G + SG H +F
Sbjct: 243 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 292
Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 196
N+SH + L FG + + L+G + E + +++++
Sbjct: 293 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 352
Query: 197 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
++ T V + SG H++ ++ T H S R P F ++LSP++V +E
Sbjct: 353 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 408
Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ + R + KKIE+GK
Sbjct: 409 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 449
>gi|299469370|emb|CBG91903.1| putative PDI-like protein [Triticum aestivum]
gi|299469398|emb|CBG91917.1| putative PDI-like protein [Triticum aestivum]
Length = 485
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 62/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
GC I GF+ V KV G+ + +SG H +F N+SH + +FG+
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343
Query: 170 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
FP G + L G V+ + ++++++V T + +
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLR----YAK 399
Query: 214 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+ V E + + L +P V F ++ SP++V TE SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459
Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
TV+GI+D+ +++ R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483
>gi|303279378|ref|XP_003058982.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226460142|gb|EEH57437.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 486
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 95/208 (45%), Gaps = 30/208 (14%)
Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
+ ++ +G GC++ GF+ KV G+ + H +F + N++H +N
Sbjct: 291 ESVRAVKGPGCSVTGFVLAKKVPGHVWITANSNSH----------SFHPEEMNMTHTVNH 340
Query: 164 LAFGEHF----------------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
L FG + L GV + + ++++++ V T +G
Sbjct: 341 LFFGNQLGRNKLKALERRERGASSNWHDKLAGVTFRSLQTNVTHEHYLQTVLTTLRP-AG 399
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
+ + + T+H + R LP F ++ SP++V TEE F HF+T + AIVG
Sbjct: 400 SYVAYHAYEYTQHSHALVTTR--ELPRAKFHFNPSPVQVVVTEEREPFYHFITTLMAIVG 457
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
GV++V GI D F+ H + +K E+GK
Sbjct: 458 GVYSVCGIADGFV-HNTLNMMRKFELGK 484
>gi|145350046|ref|XP_001419434.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579665|gb|ABO97727.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 513
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 58/228 (25%), Positives = 109/228 (47%), Gaps = 28/228 (12%)
Query: 79 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 135
EA +++ L P +D KR G GC I GF+ V KV G+ + P
Sbjct: 301 EAAQEENMKLRLPASVDMQKRI---------IGPGCAITGFVLVKKVPGHLWISASSPDH 351
Query: 136 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 187
SFH +++ ++ + F H+++ K GE + L R+
Sbjct: 352 SFHGETMNMTHVV----NHFYFGHQLSDERRRYLEKFHAGEKAGDWHDRLASERFVSNAA 407
Query: 188 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
++++++ V T T +T+ + + T+H + + LP F Y SP+++
Sbjct: 408 HVSHEHYLQTVLTTITPRGRYTLPFSVYEYTQHSHAVHE----PLPKAKFHYQPSPMQIV 463
Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+EE ++F F+T++ AI+GGV++V GI D +++ +++K+E+GK
Sbjct: 464 VSEEKMAFYSFITSLMAIIGGVYSVMGIADGVLFNSLALVRRKLELGK 511
>gi|308807242|ref|XP_003080932.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
gi|116059393|emb|CAL55100.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
Length = 533
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 112/228 (49%), Gaps = 28/228 (12%)
Query: 79 EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 135
EA R+ + L P +D +R G GC I GF+ V KV G+ + P
Sbjct: 321 EAAREANFNLQLPASVDVQRRI---------MGPGCAITGFVLVKKVPGHLWISASSPDH 371
Query: 136 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 187
SFH +++ ++ + F H+++ K GE + L G + E+
Sbjct: 372 SFHGQNMNMTHVV----NHFYFGHQLSDDRRRYLEKFHAGEKAGDWHDRLAGQTFVSESA 427
Query: 188 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
++++++ TV T ++ + FSV E+ + + + LP F Y SP+++
Sbjct: 428 HISHEHYLQ---TVLTSIAPRGRFALPFSVYEYTQHAHAVH-EPLPKAKFHYQPSPMQIA 483
Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
+EE ++F F+T++ AI+GGV++V GI D +++ ++KK+E+GK
Sbjct: 484 VSEERMAFYSFITSLMAIIGGVYSVMGIADGVLFNSIALVRKKLELGK 531
>gi|32566449|ref|NP_510494.2| Protein C18B12.6 [Caenorhabditis elegans]
gi|25809204|emb|CAA20929.2| Protein C18B12.6 [Caenorhabditis elegans]
Length = 428
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 41/130 (31%), Positives = 73/130 (56%), Gaps = 2/130 (1%)
Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
++ S NISH+I K FG PG+V PL G E+ +Y+YFIK+VPT +T+
Sbjct: 257 EKQSGNISHRIEKFNFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFSYTM 316
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
+ Q+SVT + ++G + G+ F Y+ + + + ++ + +L +C+I+GGV+
Sbjct: 317 -AYQYSVTFLKKQLKEGE-HSHGGILFEYEFTANVIEVHKTSITLISYLIRICSILGGVY 374
Query: 271 TVSGIIDAFI 280
S I++ +
Sbjct: 375 ATSTIVNNIL 384
>gi|299115405|emb|CBN74236.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 447
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 59/199 (29%), Positives = 92/199 (46%), Gaps = 30/199 (15%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
R+K++ GC + GF+ VN+V GNFH + H + + NISH + L
Sbjct: 264 RLKQDY-PGCQLSGFIMVNRVPGNFHIEARSALH----------SIDPTAANISHVVKTL 312
Query: 165 AFGEHFP---------GV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
FG P GV + L+ ++ ++ ++IKVV T ++
Sbjct: 313 KFGTQVPVRGRRVIESGVELEGLPALEDRVYSIDSLHTAPHHYIKVVSTFVGGLAKTDNL 372
Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
Q V+ EQ ++ P F YDLSP+ V + + FLT+V AIVGG FT
Sbjct: 373 QYQMMVSSQTMPYEQDQV---PEAKFSYDLSPMSVHIKQRRRKWYDFLTSVLAIVGGTFT 429
Query: 272 VSGIIDAFIYHGQRAIKKK 290
V G++D ++ R +K+K
Sbjct: 430 VVGVLDNILF---RVVKQK 445
>gi|302841900|ref|XP_002952494.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
gi|300262133|gb|EFJ46341.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
nagariensis]
Length = 478
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/197 (29%), Positives = 93/197 (47%), Gaps = 23/197 (11%)
Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVH----VHDILAFQRDSFNISHKINKLA 165
GCN+ GF+ V KV G + G SF + ++ VH R S ++ +L
Sbjct: 289 GCNLAGFVMVKKVPGTLTVVARSEGHSFDHTWMNMTHLVHTFHVGTRPSPRKYQQLKRL- 347
Query: 166 FGEHFPGVVNPLDGVRWTQ------ETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVT 218
P D W + E P +++++++V T + S H+ + + T
Sbjct: 348 ----HPAGEGEGDLFWWREKREKRGEHPQSTHEHYLQIVLTSIEPRRSRHSGNYDAYEYT 403
Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
H S + +P F YDLSPI++ E + FLT CAI+GGVFTV+GI+DA
Sbjct: 404 AH---SHTYQSDAIPSARFTYDLSPIQILVQETARPWYQFLTTSCAIIGGVFTVAGILDA 460
Query: 279 FIYHGQRAIKKKIEIGK 295
+Y + + KK+ +GK
Sbjct: 461 LLYQSFKVV-KKLNLGK 476
>gi|42562656|ref|NP_175508.2| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
gi|332194483|gb|AEE32604.1| protein Disulfide Isomerase (PDIa) family, redox active TRX
domain-containing protein [Arabidopsis thaliana]
Length = 484
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)
Query: 35 GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
G P I + G R +H S YG +D EE+ + +K+ L+ +
Sbjct: 221 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 275
Query: 95 DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
D K GC I G++ KV G + SG H +F
Sbjct: 276 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 325
Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 196
N+SH + L FG + + L+G + E + +++++
Sbjct: 326 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 385
Query: 197 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
++ T V + SG H++ ++ T H S R P F ++LSP++V +E
Sbjct: 386 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 441
Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ + R + KKIE+GK
Sbjct: 442 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 482
>gi|397568633|gb|EJK46248.1| hypothetical protein THAOC_35093 [Thalassiosira oceanica]
Length = 601
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 97/220 (44%), Gaps = 60/220 (27%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
E+E GC I GFL V++ GNFH H H+ N+SH IN L+FG
Sbjct: 403 EDEHPGCQISGFLLVDRAPGNFHIQAQSKNHDLAAHM----------TNVSHIINHLSFG 452
Query: 168 EHFP------GVVN----------PLDGVRWTQETPSGMYQYFIKVVPTVY--------- 202
+ F G+ N P DG + + +++KV+ T +
Sbjct: 453 KPFSKYFIKEGLKNTPAGFLDTTRPFDGNVYVTHNEHEAHHHYLKVITTEFEPQRDTKKQ 512
Query: 203 ------------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 250
+ +QS+Q S+ R +P F YDLSPI V++++
Sbjct: 513 YGKKKGFYKPPEPQRAYQILQSSQLSLY---------RNDIVPEAKFTYDLSPIAVSYSK 563
Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
++ ++ + T++ AI+GG FTV G++++ +Y A+ KK
Sbjct: 564 KYRAWYDYFTSLMAIIGGTFTVVGMVESSLY----AVSKK 599
>gi|123499008|ref|XP_001327531.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121910461|gb|EAY15308.1| hypothetical protein TVAG_394520 [Trichomonas vaginalis G3]
Length = 357
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 58/229 (25%), Positives = 103/229 (44%), Gaps = 31/229 (13%)
Query: 59 SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
SCY A ++ C C++V +A++ + I QC + I+E + EGC +
Sbjct: 143 SCYAANNTK--VCKTCKDVVQAHKNQELLPPPLSTIAQCASTAAI--IQEMKDEGCKLTS 198
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHD--ILAFQRDSFNISHKINKLAFGE---HFPGV 173
+ ++A FH APG ++ G H H+ IL + N++H I F F
Sbjct: 199 AFQTVRLASEFHVAPGYNYLYKGWHSHNTTILGSESKDLNLTHIIRSFRFNRVDGKF--- 255
Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTL 232
PLD V Q T G ++ VY+ D+ +T +N++ + + + S
Sbjct: 256 --PLDNVTSIQ-TGKGSWR-------VVYSADIMDNTYTANKYELMDPPKFSS------- 298
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
GV+F Y ++P+ + FLH T + ++G V ++D+F++
Sbjct: 299 -GVYFRYAINPVSAIDYYDTEPFLHLCTRLLTVIGAVLAAFRLLDSFLF 346
>gi|223995687|ref|XP_002287517.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976633|gb|EED94960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 457
Score = 77.8 bits (190), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 56/193 (29%), Positives = 92/193 (47%), Gaps = 36/193 (18%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
GC I GFL V++ GNFH H H+ N+SH IN L+FG+ F
Sbjct: 277 GCQISGFLLVDRAPGNFHIQAQSKGHDLAAHMT----------NVSHIINHLSFGKPFSK 326
Query: 171 -----------PG---VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 216
PG P DG + + + +++KV+ T + G Q+++++
Sbjct: 327 YFLKDGLKNTPPGFLETTKPFDGNVYITQNEHEAHHHYLKVITTEFEPEKG--AQNSKYN 384
Query: 217 VTEHFRS-----SEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
E R+ S Q L +P F YDLSPI V++ +++ + + T++ AI+GG
Sbjct: 385 KKEPSRAYQILQSSQLSLYRSDIVPEAKFTYDLSPIAVSYNKKYRHWYDYFTSLMAIIGG 444
Query: 269 VFTVSGIIDAFIY 281
FTV G++++ I+
Sbjct: 445 TFTVVGMLESGIH 457
>gi|145549492|ref|XP_001460425.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428255|emb|CAK93028.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 92/200 (46%), Gaps = 26/200 (13%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
R E +GC + G L+VN+V G F +S+ G + + SHK
Sbjct: 130 RTAINEKQGCEVIGNLKVNRVRGKISFGAHRSYSYIGA-----VGNLNLPLDYSHKFVSF 184
Query: 165 AFGEHFP----------GVVNPLDGVRWTQE----TPSGMYQYFIKVVPTVYTDVSGHTI 210
+FG+ G ++ G + ++ + S +++FI ++PT YT ++
Sbjct: 185 SFGDEDALKKVKSLFQQGQLDSFAGTQRIKKPELASQSMQHEHFISIIPTHYTLLNKQVY 244
Query: 211 QSNQFSVTEH-FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
Q++ + RS+ G +Q YD +P VT+ + LHF +CA++GG+
Sbjct: 245 SVYQYTANHNEVRSNNYGNVQ------LRYDFAPTTVTYWQTKEDILHFYVQICAVIGGI 298
Query: 270 FTVSGIIDAFIYHGQRAIKK 289
FTVS +I+A +Y R + K
Sbjct: 299 FTVSSMIEACVYKVMRMLLK 318
>gi|341884627|gb|EGT40562.1| hypothetical protein CAEBREN_07459 [Caenorhabditis brenneri]
Length = 428
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 15/176 (8%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKL 164
E+G+ C ++G +V K GK + +L F + N ISH+I K
Sbjct: 221 EDGKACRLHGKFKVRK---------GKEEKIVMSISNPLLMFDHQAENQPGNISHRIEKF 271
Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
FG PG+V PL G E+ +Y+YFIK+VPT +T+ + Q+SVT +
Sbjct: 272 NFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTYTM-AYQYSVTFLKKQL 330
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
++G + G+ F Y+ + + + V+ +L +C+I+GGV+ S I++ +
Sbjct: 331 KEGE-HSHGGILFEYEFNANVIEVHKTSVTLFSYLIRICSILGGVYATSTIVNNIV 385
>gi|145540599|ref|XP_001455989.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124423798|emb|CAK88592.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 97/199 (48%), Gaps = 35/199 (17%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+GE C + GF +VNKV GNFH + + +H D+ F++ + H I +L FGE
Sbjct: 138 QGEQCQLKGFFQVNKVPGNFHVSYHAHHYLLQRIHQRDLSVFRK--MKLDHSIYELRFGE 195
Query: 169 HFPGVVNPLDGVR------------WTQ---ETPSGM---YQYFIKVVPTVYTDVSGHTI 210
+ +R W Q P G Y+Y+I +P + D +
Sbjct: 196 -----ITTTSKMRKYSKSLQKFQNSWKQIVKSAPEGEKQDYEYYIDALPVRFYDENERNY 250
Query: 211 QS-NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
Q+ ++S+ E ++ R T + ++F Y +SP+ + ++ + S HF+ + AI+GG
Sbjct: 251 QTLYKYSINE----AQMPRTFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQLLAIIGG 306
Query: 269 VFTVSGIIDAFIYHGQRAI 287
VF V GI+++ + Q+AI
Sbjct: 307 VFAVIGILNSIV---QKAI 322
>gi|397641928|gb|EJK74922.1| hypothetical protein THAOC_03372 [Thalassiosira oceanica]
Length = 583
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/203 (28%), Positives = 91/203 (44%), Gaps = 43/203 (21%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC + G L VN+V GNFH KS + H++ A N++H++N ++FGE
Sbjct: 385 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHISFGEP 434
Query: 170 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
+ NP+D + + ++IKVV T
Sbjct: 435 ITKLPYHMENTPFMRKVKRVLKQVPEEHKQFNPMDDQEYITTQFHQAFHHYIKVVSTHLN 494
Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
S T+ N + ++ EQ ++ +P F YD+SP+ V +E + +
Sbjct: 495 MGSSSTV--NDVNSITVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDY 552
Query: 259 LTNVCAIVGGVFTVSGIIDAFIY 281
LT++CAI+GG FT G+IDA +Y
Sbjct: 553 LTSLCAIIGGTFTTLGLIDATLY 575
>gi|268581819|ref|XP_002645893.1| Hypothetical protein CBG07646 [Caenorhabditis briggsae]
Length = 426
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 68/125 (54%), Gaps = 2/125 (1%)
Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
NISH+I K FG PG+V PL G E+ +Y+YFIK+VPT +T+ + Q+
Sbjct: 262 NISHRIEKFNFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTYTL-AYQY 320
Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
SVT + ++G + G+ F Y+ + + + + +L +C+I+GGV+ S I
Sbjct: 321 SVTFLKKQLKEGE-HSHGGILFEYEFTANVIEVHKTSTTLFSYLIRICSILGGVYATSTI 379
Query: 276 IDAFI 280
I+ +
Sbjct: 380 INNIV 384
>gi|224013158|ref|XP_002295231.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969193|gb|EED87535.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 492
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 57/205 (27%), Positives = 89/205 (43%), Gaps = 43/205 (20%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
E GC + G L VN+V GNFH KS + H++ A N++H++N L+FGE
Sbjct: 290 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHLSFGEP 339
Query: 170 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
+ NP+D + + ++IKVV T
Sbjct: 340 ITKLPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIKVVSTHLN 399
Query: 204 --DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFL 256
S N + ++ EQ ++ +P F YD+SP+ V +E +
Sbjct: 400 MGSSSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWY 459
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIY 281
+LT++CAI+GG FT G+IDA +Y
Sbjct: 460 DYLTSLCAIIGGTFTTLGLIDATLY 484
>gi|328700149|ref|XP_003241164.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 2 [Acyrthosiphon pisum]
gi|328700151|ref|XP_001951220.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 1 [Acyrthosiphon pisum]
gi|328700153|ref|XP_003241165.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like isoform 3 [Acyrthosiphon pisum]
Length = 289
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 9/199 (4%)
Query: 27 IESRQDGIGAPKIDKPLQRHG--GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 84
+ S D IGA +D Q G L+ ++T+ + E +RE Y
Sbjct: 84 VASTCDSIGADIVDTTGQNMMLFGELKTDDTWWEMTKEQQQHFEKMRKFNAYLREEYHSM 143
Query: 85 GWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
L D + K + F++ K + C I+G L +NKV GNFH PGKS G H
Sbjct: 144 KDILWMFDDYNTLKNKIFVRTDKPNTLPDACRIHGSLILNKVIGNFHITPGKSLIVPGGH 203
Query: 144 VHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
VH F ++ N SH+IN+ +FG G++ PL+G + + Y+YFI VV
Sbjct: 204 VHLTGPFFGSEATNFSHRINQFSFGVPTKGIIYPLEGELYETNENAVSYKYFIDVVA--- 260
Query: 203 TDVSGHT--IQSNQFSVTE 219
TDV + I++ Q+S +
Sbjct: 261 TDVKSRSNEIKTYQYSAKD 279
>gi|145543941|ref|XP_001457656.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124425473|emb|CAK90259.1| unnamed protein product [Paramecium tetraurelia]
Length = 322
Score = 75.5 bits (184), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 88/187 (47%), Gaps = 22/187 (11%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFNISHKINKLAFGE 168
+GE C GF VNKV GNFH + H +H D+ +++ + H I +L FG+
Sbjct: 138 QGEQCQFKGFFSVNKVPGNFHISYHAHHHLIQRIHQRDLSTYRK--LKLDHTIYELRFGD 195
Query: 169 H--------FPGVVNPLDGVRW---TQETPSGM---YQYFIKVVPTVYTDVSGHTIQS-N 213
+ +P + W + P G Y+Y+I +P + D Q+
Sbjct: 196 NSSSFKMKKYPKSLQKFQS-SWNSIAKTAPEGEKQDYEYYINALPVRFYDDKERNYQTLY 254
Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
++S+ E + + ++F Y +SP+ + ++ + S HF+ + AIVGGVF V
Sbjct: 255 KYSINE---AQMTRSFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQLLAIVGGVFAVI 311
Query: 274 GIIDAFI 280
GI+++ I
Sbjct: 312 GIVNSII 318
>gi|154415829|ref|XP_001580938.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121915161|gb|EAY19952.1| hypothetical protein TVAG_402060 [Trichomonas vaginalis G3]
Length = 359
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 71/276 (25%), Positives = 113/276 (40%), Gaps = 35/276 (12%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
+D G + LDV +DI KR+ I+ + + + C C
Sbjct: 89 LDSIGVEMLDVSNDIKFKRMSVDNRFIDYSNESL-------------------KDICLPC 129
Query: 61 YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
+G + E CCN C+EV+ + +G NP DQC K++ E C I G +
Sbjct: 130 HGLKPEGE-CCNTCDEVKAIFEARGEDF-NPLPFDQCMGN---VNFKKDMSESCLIEGTI 184
Query: 121 EVNKVAGNFHFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
K G FH APG++ F ++G H HD S H I++ G+ + V +P+
Sbjct: 185 HTFKSPGQFHIAPGRNTKFRRTG-HQHDTGLSPEAS--CPHTIHEFYVGQKYDNVRSPIR 241
Query: 179 G--VRWTQETPS-GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
G R P +Y FI V + D +T S ++S + G PG+
Sbjct: 242 GKIFRDRDSLPRIYLYDLFITKVLHTFNDALQYT--SYEYSYNLGAKIFNPGSFYQ-PGI 298
Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
+F Y SP+ + + + FL ++ G+F
Sbjct: 299 YFKYMFSPMTIVERSISKNPMRFLVTSVGVLAGIFA 334
>gi|219125194|ref|XP_002182871.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405665|gb|EEC45607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 467
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 61/222 (27%), Positives = 94/222 (42%), Gaps = 40/222 (18%)
Query: 84 KGWALSNPDLIDQCKREGFLQRIKEEEGE--GCNIYGFLEVNKVAGNFHFAPGKSFHQSG 141
K W D D + E Q ++ + GC + G L VN+V GNFH H
Sbjct: 254 KEWHSKASDSADPAEVEKKRQLYQQNRPDHPGCQVSGHLMVNRVPGNFHLEAKSKSHNLN 313
Query: 142 VHVHDILAFQRDSFNISHKINKLAFGE--------------HFP---GVVNPLDGVRWTQ 184
+ N+SH +N L+FGE P P+DG +
Sbjct: 314 AAMT----------NLSHVVNHLSFGEPIDENNRKSKRILKQVPEEHRQFAPMDGQAFLT 363
Query: 185 ETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFY 239
+ + ++IKVV T + S+ + ++ EQ ++ +P F Y
Sbjct: 364 KAFHQAFHHYIKVVSTHLN------MGSSDANSMLTYQFLEQSQIVFYDDVNVPEARFSY 417
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
DLSP+ V +E + +LT++CAI+GG FT G+IDA +Y
Sbjct: 418 DLSPMSVVVEKEGRKWYDYLTSLCAIIGGTFTTLGLIDATLY 459
>gi|223646904|gb|ACN10210.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
gi|223672767|gb|ACN12565.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Salmo salar]
Length = 238
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 33/67 (49%), Positives = 42/67 (62%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
C I+G L VNKVAGNFH GK+ H H D++N SH+I+ L+FGE PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228
Query: 173 VVNPLDG 179
++NPLDG
Sbjct: 229 IINPLDG 235
>gi|428185569|gb|EKX54421.1| hypothetical protein GUITHDRAFT_99900 [Guillardia theta CCMP2712]
Length = 475
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 55/237 (23%)
Query: 93 LIDQCKREGFLQRI--KEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
L+ Q + R+ KE++GE GC + G L V + APG Q+
Sbjct: 260 LMKQVNLQAPKSRVVDKEQDGEKESHNGVGCMVAGMLHVQR-------APGSIILQA--- 309
Query: 144 VHDILAFQRDSFNISHKINKLAFGEHF---PGVVNP---------LDGVRWTQE--TPSG 189
V D F + ++SH +N L+FG VV P LD ++ E TP+
Sbjct: 310 VSDGHEFNWATMDVSHTVNHLSFGPFLSETAWVVMPPDIAQAVGSLDDKKFLSEERTPT- 368
Query: 190 MYQYFIKVVPTVY----------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
++++++KVV V + G+ + +N+ R +P Y
Sbjct: 369 VWEHYVKVVKNVVELPRSWGIPPVEAHGYVVHTNKVQ-----------RYAEVPTARINY 417
Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
D+ PI V S HFLT +CAIVGGVFTVSGI + + G ++ K IGK
Sbjct: 418 DILPIIVHVKTSRESNYHFLTKLCAIVGGVFTVSGIFASMVEGGIASLTHKETIGKL 474
>gi|357474783|ref|XP_003607677.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
gi|355508732|gb|AES89874.1| Endoplasmic reticulum-Golgi intermediate compartment protein
[Medicago truncatula]
Length = 156
Score = 75.1 bits (183), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 85/163 (52%), Gaps = 31/163 (19%)
Query: 155 FNISHKINKLAFGEHFPGVVNP---LDGVRW------------------TQETPSGM-YQ 192
N+SH IN L+FG+ V P +D W T++ + +
Sbjct: 1 MNMSHVINHLSFGKK----VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIE 56
Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 252
++I+VV T G+ + ++ T H S +P F +LSP++V TE
Sbjct: 57 HYIQVVKTEVITRKGYKLIE-EYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQ 112
Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
SF HF+TNVCAI+GGVFTV+GI+D+ +++ +A+ KKIEIGK
Sbjct: 113 KSFSHFITNVCAIIGGVFTVAGILDSILHNTIKAM-KKIEIGK 154
>gi|444316650|ref|XP_004178982.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
gi|387512022|emb|CCH59463.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
Length = 355
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 83/166 (50%), Gaps = 14/166 (8%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
E +GC+++G + VN+V G F A G + ++++ N H IN+ +FG
Sbjct: 160 ELDGCHVFGQIPVNRVQGELQFTAKGYGYMNWERTPYELI-------NFDHVINEFSFGN 212
Query: 169 HFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
FP + NPLD + + P + Y VVP+ Y + G + + Q+SV+++ +
Sbjct: 213 FFPYIDNPLDNTAKINLDDPVTSWIYDTSVVPSYYRKL-GAEVDTFQYSVSQYSYNGTSL 271
Query: 228 RLQT----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
+ T +PG+FF YD + + T+ +SF FL + AI+ V
Sbjct: 272 QKMTSSTSVPGIFFKYDFEALSLVLTDHRISFFQFLIRLVAILSFV 317
>gi|297847442|ref|XP_002891602.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
gi|297337444|gb|EFH67861.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
lyrata]
Length = 484
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/204 (29%), Positives = 98/204 (48%), Gaps = 36/204 (17%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC I G++ KV G + SG H +F N+SH + L+FG
Sbjct: 294 GCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQMNMSHIVTHLSFGTMVSE 343
Query: 173 VV---------------NPLDGVRWTQETP---SGMYQYFIKVVPT-VYTDVSG--HTIQ 211
+ + L+G + + + ++++++V T V + SG H++
Sbjct: 344 RLWTDMKRLLPYLGQSHDRLNGKSFINQRKFDVNVTIEHYLQIVKTEVISRRSGKEHSLI 403
Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
++ T H S P F ++LSP++V +E SF HF+TNVCAI+GGVFT
Sbjct: 404 -EEYEYTAH---SSVAHSYHYPEAKFHFELSPMQVLISENPKSFSHFITNVCAIIGGVFT 459
Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
V+GI+D+ + R + KKIE+GK
Sbjct: 460 VAGILDSIFQNTVRMV-KKIELGK 482
>gi|443921357|gb|ELU41041.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 579
Score = 73.9 bits (180), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 61/195 (31%), Positives = 90/195 (46%), Gaps = 47/195 (24%)
Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF---------- 166
FL +NKV GNFHF+PG+SF H +D++ + +D + H I++ F
Sbjct: 317 FLRINKVTGNFHFSPGRSFLSQRGHAYDLVPYLKDGNHHDFGHYIHEFHFEGDREIEDRW 376
Query: 167 -----GEHFPGVV----NPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFS 216
G + V PLDG+ E PS M QYF+KVV T + G ++++Q+S
Sbjct: 377 REGNRGTEWRARVGSDKQPLDGL----EQPSNWMIQYFLKVVSTEVRHLDGDLVRAHQYS 432
Query: 217 VTEHFRSSEQGRLQTLPGVFF--FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
VT + R PG F D + IK T +CAIVGGV T++
Sbjct: 433 VTNYERDIR-------PGHEFDPLRDANGIKTTH------------GLCAIVGGVLTLAS 473
Query: 275 IIDAFIYHGQRAIKK 289
I D+ + I++
Sbjct: 474 IADSVAFASLNKIEE 488
>gi|388517493|gb|AFK46808.1| unknown [Lotus japonicus]
Length = 156
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 82/159 (51%), Gaps = 23/159 (14%)
Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDG---VRWTQETPSGMYQYFIK 196
N+SH +N L FG+ H + L+G V + +++I+
Sbjct: 1 MNMSHVVNHLTFGKKVTPRAISDMQRLIPHIGSSHDRLNGRSFVNTHNLEANVTIEHYIQ 60
Query: 197 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
+V T +G+ + + + T H S +P F +LSP++V TE SF
Sbjct: 61 IVKTEVVTRNGYKLIED-YEYTAH---SSVAHSLDIPVAKFHLELSPMQVLITENQKSFS 116
Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
HF+TNVCAI+GGVFTV+GI+D+ +++ R I KK+E+GK
Sbjct: 117 HFITNVCAIIGGVFTVAGIVDSILHNTIRMI-KKVELGK 154
>gi|384486505|gb|EIE78685.1| hypothetical protein RO3G_03389 [Rhizopus delemar RA 99-880]
Length = 188
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 67/120 (55%), Gaps = 13/120 (10%)
Query: 84 KGWALSNPDLIDQCKR----EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
K A+ +P I++ R + + +I ++ G C IYG L+VNKVA N H +
Sbjct: 72 KYQAIEDPKYINEIIRAANGKSYDHQIAKDMG-ACRIYGSLKVNKVASNLHITSDGHGYA 130
Query: 140 SGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 198
S VH H++L N +H+I++L+FGE +P ++NPLD ET M+QY++ VV
Sbjct: 131 SRVHTSHEVL-------NFTHRIDELSFGEFYPNLINPLDNSMEIAETHFEMFQYYLSVV 183
>gi|71409118|ref|XP_806922.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
gi|70870803|gb|EAN85071.1| hypothetical protein, conserved [Trypanosoma cruzi]
Length = 310
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 31/193 (16%)
Query: 1 MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
+D++G +L+V ++FK +D+QGN I +RQ G+G + + +CG
Sbjct: 109 LDVTGTVNLNVTRNLFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPSSPQFCG 164
Query: 59 SCYGAE------SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
C+ E + CCN C +V AY ++G + ++QC E L RI
Sbjct: 165 RCFINEHQVSMMENKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYE--LSRI----NP 218
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP 171
GCN G L V K G FAP + G + D++ F+ SH INKL+ G EH
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIRDVM-----RFDSSHIINKLSIGDEHVT 271
Query: 172 -----GVVNPLDG 179
GV +PL+G
Sbjct: 272 RFSRRGVQHPLNG 284
>gi|365986066|ref|XP_003669865.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
gi|343768634|emb|CCD24622.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
Length = 353
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 87/180 (48%), Gaps = 23/180 (12%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPG----KSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
GC+I+G + VN+VAG +H++ + + N +H IN+ +FGE
Sbjct: 162 GCHIFGSVNVNQVAGELQVTAKGHGYADYHRAPL----------EKVNFAHVINEFSFGE 211
Query: 169 HFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
FP + NPLD ++ + P Y Y V+P +Y + G + + Q+SV EH + S
Sbjct: 212 FFPYIDNPLDNSAKFNMDDPLTAYVYDTSVIPMIYRKM-GAEVDTFQYSVAEHQYKSKES 270
Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGII---DAFI 280
+PG+FF Y+ + + ++ + F+ F+ + AI+ V+ S + D FI
Sbjct: 271 SSSNSFRVPGIFFQYNFENLSIVVSDRRLGFIQFIVRLVAILSFAVYIASWLFILADMFI 330
>gi|219130117|ref|XP_002185219.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403398|gb|EEC43351.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 421
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 24/212 (11%)
Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR------------ 152
+ + ++G+GC I G + V VAG F K Q + + +
Sbjct: 210 KFETKKGQGCTIEGHIRVPVVAGKFEITLNKRTWQQAASILNRQMLMQVLGATSEHTSSN 269
Query: 153 ----DSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSG---MYQYFIKVVPT-VYT 203
D +N +H I+ + FG+ FP + PL+ R G + + I++VPT T
Sbjct: 270 DELGDRYNSTHFIHYIRFGDSFPLNIEKPLEKRRHIFRNKYGAMAVQEMKIELVPTYTST 329
Query: 204 DVSGHTIQSNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
+ + Q+ Q SV + E Q +LPG+ YD SP+ V T + L FL+
Sbjct: 330 WLPTSSRQTYQASVVDSTIEPEHMAQAGASSLPGLAVQYDFSPLTVYHTGGRDNILVFLS 389
Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
++ +IVGGVF G++ + H +A+ KKI+
Sbjct: 390 SLVSIVGGVFVTVGLVSGCLVHSAQAVAKKID 421
>gi|300123978|emb|CBK25249.2| unnamed protein product [Blastocystis hominis]
Length = 109
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 61/92 (66%), Gaps = 2/92 (2%)
Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTE 250
YF+K++P + + G T +S ++SVTE+ + ++ +T PGV+F Y ++PI++T E
Sbjct: 10 YFLKLIPVEHISLFGGTSRSYEYSVTEYTQLLDKPSYFSRTSPGVYFKYQITPIRLTKRE 69
Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
+ FL + T +C+IVGGV T+SGII + + H
Sbjct: 70 SRIGFLQYYTTLCSIVGGVITISGIIQSLLTH 101
>gi|219111363|ref|XP_002177433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411968|gb|EEC51896.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 520
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 94/201 (46%), Gaps = 32/201 (15%)
Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
+ E GCNI G L +++V GNFH +S H HD++ N+SH ++ L+ G
Sbjct: 333 DAEHPGCNIAGHLLLDRVPGNFHIQ-ARSPH------HDLVPHMT---NVSHVVHHLSIG 382
Query: 168 EH------------FPGVVN----PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
E P V P++G + + Y +++KV+ T +V G
Sbjct: 383 EPVAERLIEQEKVILPEDVKRKLKPMNGNAYVTKELHEAYHHYLKVITT---NVDGLKFG 439
Query: 212 SNQFSVTEHFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
+ +SS+ R +P F +DLSP+ V++ + + T++ AI+GG
Sbjct: 440 KRDLRAYQILQSSQLSFYRNDIIPEAKFVFDLSPVAVSYRTTSRRWYDYFTSILAIIGGT 499
Query: 270 FTVSGIIDAFIYHGQRAIKKK 290
FTV G++++ I H A K++
Sbjct: 500 FTVVGLLESTI-HATVARKRR 519
>gi|301101702|ref|XP_002899939.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262102514|gb|EEY60566.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 101
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 52/75 (69%), Gaps = 1/75 (1%)
Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
S+ Q QT P F +D+SP+ V T +++ F HF+T++CA++GGVFT+ ++D+ ++H
Sbjct: 28 STTQYEDQT-PSALFTFDISPLVVQITTDNIPFYHFITHLCAVIGGVFTILSLVDSGVFH 86
Query: 283 GQRAIKKKIEIGKFS 297
+IKKK ++GK S
Sbjct: 87 AMNSIKKKQQLGKLS 101
>gi|327354451|gb|EGE83308.1| hypothetical protein BDDG_06252 [Ajellomyces dermatitidis ATCC
18188]
Length = 113
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 41/97 (42%), Positives = 57/97 (58%), Gaps = 13/97 (13%)
Query: 206 SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV 253
SG +I+++Q+SVT H RS + G RL + +PGVF YD+SP+KV E
Sbjct: 13 SGGSIETHQYSVTSHKRSVDGGNDAEEGHKERLHSQGGIPGVFVNYDISPMKVINREART 72
Query: 254 -SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
+F FLT VCA++GG TV+ ID +Y G +KK
Sbjct: 73 KTFSGFLTGVCAVIGGTLTVAAAIDRALYEGSVRVKK 109
>gi|300122875|emb|CBK23882.2| unnamed protein product [Blastocystis hominis]
Length = 109
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 60/92 (65%), Gaps = 2/92 (2%)
Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTE 250
YF+K++P + G T +S ++SVTE+ + ++ +T PGV+F Y ++PI++T E
Sbjct: 10 YFLKLIPVEQISLFGGTSRSYEYSVTEYTQLLDKPSYFSRTSPGVYFKYQITPIRLTKRE 69
Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
+ FL + T +C+IVGGV T+SGII + + H
Sbjct: 70 SRIGFLQYYTTLCSIVGGVITISGIIQSLLTH 101
>gi|403330686|gb|EJY64240.1| hypothetical protein OXYTRI_24846 [Oxytricha trifallax]
Length = 345
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/187 (29%), Positives = 92/187 (49%), Gaps = 23/187 (12%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
++ EGC + G + +NKV GNFH S H G V I + + +H +N L+FG+
Sbjct: 153 DDQEGCMVEGTVIINKVPGNFHL----STHSFGEVVQKIYMNGK-KLDFTHTVNHLSFGD 207
Query: 169 ----------HFPGVVNPLDG--VRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQ 214
+ +DG V Q G + Y++ + Y D +G + Q
Sbjct: 208 DKQMKSIQSKYNEKYTFDMDGTYVDQNQHLYQGQLLANYYLDINQVDYLDATGIFYKLLQ 267
Query: 215 FSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
++SS+ Q LP +FF Y+LSP+K+ +T + S+ F + AI+GG++ V+
Sbjct: 268 ---GFKYKSSKSIMAQMGLPAIFFRYELSPVKLQYTMTYKSWSEFFIEISAIIGGMYVVA 324
Query: 274 GIIDAFI 280
GII++F+
Sbjct: 325 GIIESFL 331
>gi|118357982|ref|XP_001012239.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila]
gi|89294006|gb|EAR91994.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila
SB210]
Length = 323
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 101/209 (48%), Gaps = 27/209 (12%)
Query: 96 QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
Q K E L++IK +E C I+G L +N + G+F F + G+ +
Sbjct: 128 QQKIEEVLEQIKNKEQ--CRIHGQLLLNTIPGSFKF---RILQMKGLDEQLL-----KQL 177
Query: 156 NISHKINKLAFG--------EHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
NI+HKINKL+FG E G+ D R+ E Y +IK++P
Sbjct: 178 NINHKINKLSFGDTIKTKKIEKVLGLDKSDSEAFDESRYNYEYRCS-YDNYIKILPLNAE 236
Query: 204 DVS--GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
++ G+ I++N F T + + + + + V F Y +SPI + + ++ SF F+
Sbjct: 237 NIKELGY-IRTNSFRFTMYQQVIPKEQTDIIE-VSFNYQVSPINIVYQTKNKSFYSFVVQ 294
Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
VCAI+GG+F V G+I+ + + +I K
Sbjct: 295 VCAIIGGIFCVFGVINTLVLNIISSINSK 323
>gi|361132020|gb|EHL03635.1| hypothetical protein M7I_0279 [Glarea lozoyensis 74030]
Length = 235
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 76/180 (42%), Gaps = 56/180 (31%)
Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEHFP 171
I G L VNKV GNFH APG+SF +HVHD+ + SH I+ L FG P
Sbjct: 38 IEGALRVNKVIGNFHIAPGRSFSNGNMHVHDLNNYFDTPVEGGHVFSHTIHHLRFGPQLP 97
Query: 172 -------GV---------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS--------- 206
G +NPLD + T P+ + YF+KVV T Y +
Sbjct: 98 EELTKKLGTKTNLWTNHHLNPLDDTKQTTTEPAYNFMYFVKVVSTSYLPLGWETQAYKSQ 157
Query: 207 -----------GH----TIQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFY 239
GH +++++Q+SVT H RS G + +PGVFF Y
Sbjct: 158 LGSEWVGIGSYGHQHDGSVETHQYSVTSHRRSLNGGDDASEGHKEKVHARGGIPGVFFSY 217
>gi|412994089|emb|CCO14600.1| predicted protein [Bathycoccus prasinos]
Length = 528
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/208 (24%), Positives = 93/208 (44%), Gaps = 29/208 (13%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
++ GC+I GF+ V KV G+ F A K+ H +F D N++H+++
Sbjct: 330 VQTRASTGCSITGFVLVKKVPGHVFFTADAKNGH----------SFDVDKLNVTHQVHHF 379
Query: 165 AFGEHFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
FG+ + L + P ++++++ V T +
Sbjct: 380 YFGQQLSASRQKYMARFHRGEKEGDWHDKLANDFVVSKNPRTSHEHYLQTVLTTMQPLGP 439
Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
N + T+H S + +T P F + SP+++ E+ F F+T + AIVG
Sbjct: 440 FAQPFNVYEYTQHTHSVKTPDGET-PRAKFHFTPSPVQILGVEKRREFYQFITTLMAIVG 498
Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
GV++V GIID +++ K+K+++GK
Sbjct: 499 GVYSVVGIIDGLMHNTSLMFKRKMQLGK 526
>gi|299116076|emb|CBN74492.1| DEAD box helicase [Ectocarpus siliculosus]
Length = 865
Score = 70.9 bits (172), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 53/214 (24%), Positives = 95/214 (44%), Gaps = 39/214 (18%)
Query: 99 REGFLQR-IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
R+GF + + +++ GC + G + VN+V GNFH H F + N+
Sbjct: 669 RKGFPEVGLHDDKWPGCMVTGHIMVNRVPGNFHIEAASKSH----------TFHGATTNL 718
Query: 158 SHKINKLAFGEHFPGVVN--------------PLDGVRWTQETPSGMYQYFIKVVPTVY- 202
SH ++ ++FG P PLDG + ++++VV ++Y
Sbjct: 719 SHIVHHMSFGNDPPRRTQTKINRLTEDLRQNAPLDGNVYVANAYHQAPHHYLRVVGSMYH 778
Query: 203 -----TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
T G+ I +N ++ E+ +P F Y++SP+ V E +
Sbjct: 779 LSPMKTPWHGYQIVAN----SQMMLYDEE----EVPEARFSYNISPMSVLVRSEKRPWYD 830
Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
F+T V AIVGG F++ G++DA ++ R +++
Sbjct: 831 FVTKVLAIVGGTFSMVGLVDAAVFRASRKAGRQL 864
>gi|312374049|gb|EFR21698.1| hypothetical protein AND_16520 [Anopheles darlingi]
Length = 252
Score = 70.5 bits (171), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 46/74 (62%)
Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
I + + C I+G L +NKVAGNFH GK+ H + H+H F N SH+IN+ +
Sbjct: 163 IPQRPHDACRIHGVLTLNKVAGNFHITVGKTIHFARGHIHLNSIFANTQTNFSHRINRFS 222
Query: 166 FGEHFPGVVNPLDG 179
FG+H G+++PL+G
Sbjct: 223 FGDHTAGIIHPLEG 236
>gi|313227239|emb|CBY22386.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 104/247 (42%), Gaps = 60/247 (24%)
Query: 78 REAYRKKGWALSNPDLIDQCKREG--------------------FLQRIKE-----EEGE 112
R K ++++P++ DQ REG + ++ + E +
Sbjct: 138 RAKLLKMKESMTDPNMRDQLLREGHDVKHLEFSRKKNKKMMEQGMMHKVVQINLDPNEPQ 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----------------- 155
GC ++G +E+ K+AG ++ G+ L+ D+
Sbjct: 198 GCRVWGSVELQKIAGTIKI---QAGGFGGMGGIPGLSGGLDAIMGMFMMPMMGMGAQIQD 254
Query: 156 ----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
N SH+I+ +FG+ G+V LDG QE + Y +KVVP TD+ Q
Sbjct: 255 GKKANFSHRIDHFSFGDPSSGLVYGLDGDIQIQEKENDDTTYVVKVVP---TDLKTFKFQ 311
Query: 212 SN--QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
Q++VT+H S++ P V YD S + V+ TE SF+ LT + I+GG+
Sbjct: 312 QKAYQYAVTQHVGKSDK------PAVTIKYDFSGLGVSITEYRESFVGLLTRLAGILGGI 365
Query: 270 FTVSGII 276
SGI+
Sbjct: 366 AASSGIL 372
>gi|313241668|emb|CBY33893.1| unnamed protein product [Oikopleura dioica]
Length = 380
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 63/247 (25%), Positives = 104/247 (42%), Gaps = 60/247 (24%)
Query: 78 REAYRKKGWALSNPDLIDQCKREG--------------------FLQRIKE-----EEGE 112
R K ++++P++ DQ REG + ++ + E +
Sbjct: 138 RAKLLKMKESMTDPNMRDQLLREGHDVKHLEFSRKKNKKMMEQGMMHKVVQINLDPNEPQ 197
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----------------- 155
GC ++G +E+ K+AG ++ G+ L+ D+
Sbjct: 198 GCRVWGSVELQKIAGTIKI---QAGGFGGMGGIPGLSGGLDAIMGMFMMPMMGMGAQIQD 254
Query: 156 ----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
N SH+I+ +FG+ G+V LDG QE + Y +KVVP TD+ Q
Sbjct: 255 GKKANFSHRIDHFSFGDPSSGLVYGLDGDIQIQEKENDDTTYVVKVVP---TDLKTFKFQ 311
Query: 212 SN--QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
Q++VT+H S++ P V YD S + V+ TE SF+ LT + I+GG+
Sbjct: 312 QKAYQYAVTQHVGKSDK------PAVTIKYDFSGLGVSITEYRESFVGLLTRLAGILGGI 365
Query: 270 FTVSGII 276
SGI+
Sbjct: 366 AASSGIL 372
>gi|303278158|ref|XP_003058372.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459532|gb|EEH56827.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 399
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 56/101 (55%), Gaps = 18/101 (17%)
Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF------NISHKINKL 164
GEGC ++G L+V +VAGNFH + VH D R +F N+SH +++L
Sbjct: 158 GEGCRVHGRLKVQRVAGNFHVS---------VHGEDARTL-RATFEHPRNVNMSHAVHRL 207
Query: 165 AFGEHFPGVVNPLDGVRWTQE--TPSGMYQYFIKVVPTVYT 203
+FG+ FP +PL G T +G Y+YF+KVVP YT
Sbjct: 208 SFGKSFPRKEDPLSGFTRTTRHANETGTYKYFLKVVPVTYT 248
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 31/72 (43%), Positives = 46/72 (63%)
Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
++N +SVTE + ++ +LP V+F YDLSPI VT ++ SF HFL A VGG +
Sbjct: 318 RTNLYSVTETYIPTKNWNGGSLPAVYFIYDLSPIAVTISDARKSFGHFLARTVAGVGGAY 377
Query: 271 TVSGIIDAFIYH 282
++G+ID I+H
Sbjct: 378 AIAGLIDRMIHH 389
>gi|298714834|emb|CBJ25733.1| similar to Endoplasmic reticulum-Golgi intermediate compartment
protein 1 (ER-Golgi intermediate compartment 32 kDa
protein) (ERGIC-32) [Ectocarpus siliculosus]
Length = 320
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/129 (35%), Positives = 66/129 (51%), Gaps = 9/129 (6%)
Query: 156 NISHKINKLAFGEHFPGVV----NPLDGVRWTQETPSGMYQYFIKVVPTVY-----TDVS 206
N++HKI+ FG G V N L + E SG+ +Y +KVVP + +V+
Sbjct: 178 NMTHKIHDFGFGPPVKGPVGVGRNSLARSTFVSEEGSGLVKYSLKVVPISHRRMHGAEVN 237
Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
HT SN V E + L GV F YD + + V +T+ S +T+VCAIV
Sbjct: 238 THTYSSNVAFVPEAAVLQDLSSSSLLLGVEFSYDFTSVMVKYTDARRSMFELITSVCAIV 297
Query: 267 GGVFTVSGI 275
GG++TVSG+
Sbjct: 298 GGIYTVSGL 306
>gi|414879928|tpg|DAA57059.1| TPA: hypothetical protein ZEAMMB73_408305, partial [Zea mays]
Length = 75
Score = 68.9 bits (167), Expect = 3e-09, Method: Composition-based stats.
Identities = 26/49 (53%), Positives = 38/49 (77%)
Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
P V+F YDLSPI VT EE +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 11 PAVYFLYDLSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 59
>gi|443920575|gb|ELU40475.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
solani AG-1 IA]
Length = 506
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 60/129 (46%), Gaps = 8/129 (6%)
Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
+ C ++G + V KV N H ++S H L N++H IN+ +FG
Sbjct: 168 PDASACRVFGTVAVKKVTANLHITTLGHGYRSAEHTDHTL------MNLTHVINEFSFGP 221
Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
P + PLD +QYFI VVPT Y + +NQ+SVT + R+ E GR
Sbjct: 222 FIPDLSQPLDYSFEVTHEHFTAFQYFITVVPTTYQVPGQDPLHTNQYSVTHYTRNIEHGR 281
Query: 229 LQTLPGVFF 237
PG+FF
Sbjct: 282 --GTPGIFF 288
>gi|123407515|ref|XP_001303026.1| hypothetical protein [Trichomonas vaginalis G3]
gi|121884369|gb|EAX90096.1| hypothetical protein TVAG_396530 [Trichomonas vaginalis G3]
Length = 234
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 54/223 (24%), Positives = 98/223 (43%), Gaps = 15/223 (6%)
Query: 55 TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
T CGSCYGA + CCN+C+EV +A++K + +I QC+ + C
Sbjct: 13 TECGSCYGASNG---CCNSCKEVLDAFQKIEKSHPPTAMIQQCRNT--FSDADSLINDSC 67
Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
+ L V G+F G++ + D L +++ N +H + + G +
Sbjct: 68 TLGITLTVPHTHGSFFITIGQNTTNTSA---DYLGVPKENLNFTHSFDFFSMGGGYHPAQ 124
Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
+ ++ +E Y+I+ T + SVT + R ++ LPG
Sbjct: 125 ILQNYMKVQKEYGRYKAMYYIRA-----TRILNDYDTQYSLSVTSYDRYRDESS-DKLPG 178
Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
VF YD+SP+ + + + + + ++ AI+GG+F +ID
Sbjct: 179 VFINYDISPLILQYVLDRPIY-QIIIDMMAIIGGIFAFGLLID 220
>gi|365991164|ref|XP_003672411.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
gi|343771186|emb|CCD27168.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
Length = 341
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 95/200 (47%), Gaps = 23/200 (11%)
Query: 92 DLIDQCKREGFLQRIKEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 144
+++ Q F RI E E C+++G ++VN++ G + S ++
Sbjct: 128 EVLTQAIPYEFGMRIDERPPEDDMPNINACHLFGSVDVNRLPGILEISTN-----STGNI 182
Query: 145 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYT 203
+D + + +H IN+L+FGE FP + NPLD + + P Y Y++ V+PT+Y
Sbjct: 183 ND------NGKSFAHVINELSFGEFFPFIDNPLDNTAKVLPDQPLTTYSYYLTVIPTIYE 236
Query: 204 DVSGHTIQSNQFSVTEH-FRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
+ G + +NQ+S+ E F+ + QT + YD + + + + F+ FL
Sbjct: 237 KL-GKRVNTNQYSLNEFIFKHIYNVKSQTQYDEAIRIHYDFDALSIFMHDTRLDFIQFLV 295
Query: 261 NVCAIVGGVFTVSGIIDAFI 280
+ AI+ V ++ + FI
Sbjct: 296 RLVAILSFVVYIASWVFRFI 315
>gi|410046954|ref|XP_003952285.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
intermediate compartment protein 2 [Pan troglodytes]
Length = 333
Score = 67.8 bits (164), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 6/90 (6%)
Query: 190 MYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKV 246
M+QYFI VVPT ++T +S T +QFSVTE R + + G+F YDLS + V
Sbjct: 202 MFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMV 258
Query: 247 TFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
T TEEH+ F F +C IVGG+F+ +G++
Sbjct: 259 TVTEEHMPFWQFFVRLCGIVGGIFSTTGML 288
>gi|118386954|ref|XP_001026594.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila]
gi|89308361|gb|EAS06349.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila
SB210]
Length = 712
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 60/181 (33%), Positives = 79/181 (43%), Gaps = 25/181 (13%)
Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
+ E C IYG V KV GNFH SFH G+ +L FN+ H I+ L F
Sbjct: 545 QREKCQIYGHFYVKKVPGNFHV----SFHNEGL----LLMNSNLIFNLRHTIHTLEFTTE 596
Query: 170 --------FPGVVNPLDGVRWTQETPS-GM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
+ NPLD T P GM Y++KVV TV+ ++ +N +S T
Sbjct: 597 DGSLTLGKYTKSSNPLDK---TIHNPGHGMDTDYYLKVVNTVFENMLSE--HNNIYSFTS 651
Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
S R LP V F Y+ PI V + S F+ +CAIVGG +S I
Sbjct: 652 LETSG--VRDFRLPSVNFRYEFDPITVLHYRKSRSLTQFIVTLCAIVGGSIAISKYIYTL 709
Query: 280 I 280
+
Sbjct: 710 L 710
Score = 38.5 bits (88), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 16/108 (14%)
Query: 2 DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRH------------GGR 49
D+SG D+ + K RLD G I A I K Q+ +
Sbjct: 89 DVSGAHLEDMHWTVHKIRLDQFGKFINYD----SANDIKKQEQKFYPGNPFFEAVKTNNQ 144
Query: 50 LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 97
+++ + SCYGAE + C C +V A+ ++GW + I QC
Sbjct: 145 VQNQFSNSVSCYGAELYEGQICLTCSDVLIAFAQRGWPQPMKEQISQC 192
>gi|393908150|gb|EJD74929.1| hypothetical protein, variant [Loa loa]
Length = 368
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/115 (34%), Positives = 61/115 (53%), Gaps = 5/115 (4%)
Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
++ EG C I+G + VNKV G+ F + GK G+ H NISH+I +
Sbjct: 222 EKNEGTACRIHGRMRVNKVKGDSFIISTGKGLDVDGIFAH--FGGVSSPSNISHRIERFN 279
Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVT 218
FG G+V PL G+ ET ++YF+K+VPT ++ + G + + Q+SVT
Sbjct: 280 FGPRIYGLVTPLAGIEQISETGVDEFRYFLKIVPTRIYHSGLFGGSTLTYQYSVT 334
>gi|307110923|gb|EFN59158.1| hypothetical protein CHLNCDRAFT_138016 [Chlorella variabilis]
Length = 360
Score = 65.9 bits (159), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 55/97 (56%), Gaps = 13/97 (13%)
Query: 199 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
P + D +T+QS++++ +H + F Y +SPI++ TE+ F
Sbjct: 275 PELQFDAYEYTVQSHKYNAEDHASAK------------FTYKMSPIQIVVTEQPKQLYKF 322
Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
LT +CA++GGVFTV+GI+D + H I KK+++GK
Sbjct: 323 LTAICAVIGGVFTVAGILDGMV-HQVNKIAKKVDLGK 358
>gi|308804553|ref|XP_003079589.1| acyl-CoA thioester hydrolase-like (ISS) [Ostreococcus tauri]
gi|116058044|emb|CAL54247.1| acyl-CoA thioester hydrolase-like (ISS) [Ostreococcus tauri]
Length = 1155
Score = 65.5 bits (158), Expect = 3e-08, Method: Composition-based stats.
Identities = 53/230 (23%), Positives = 92/230 (40%), Gaps = 63/230 (27%)
Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
GC+I G +N+V G F+F P H G +++H + L+FG H PG
Sbjct: 934 GCSINGQFSINRVPGAFYFHPRSRSHTIG------------DVDMTHVVKHLSFGTHAPG 981
Query: 173 --------------VVNPLD-GVRWTQETPSGM-----------YQYFIKVVPTVYTDVS 206
+ P D G R+ + M + +++ V+P Y V
Sbjct: 982 GPRRFVPRHLRKAWKLIPKDAGGRFAGKLSKPMQFDADTSGRTVFDHYVHVIPRTYHPVG 1041
Query: 207 GHTIQSNQFSVTEH----------------FRS---------SEQGRLQTLPGVFFFYDL 241
I +++ + H +R+ ++ R P + F YD+
Sbjct: 1042 DEPIHIYEYTFSSHAFKLRDDAAERELSRNYRTGGEIDREFGTDDFRRPDGPSIRFSYDI 1101
Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
S + V E H + L ++ AI+GG+ T S ++ F+Y RA+K++I
Sbjct: 1102 SAMGVVTREVHKNLLEWILGCSAILGGLVTCSVGLERFVYASSRAVKRRI 1151
>gi|260826492|ref|XP_002608199.1| hypothetical protein BRAFLDRAFT_90361 [Branchiostoma floridae]
gi|229293550|gb|EEN64209.1| hypothetical protein BRAFLDRAFT_90361 [Branchiostoma floridae]
Length = 336
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 38/96 (39%), Positives = 52/96 (54%), Gaps = 10/96 (10%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPI 244
M+QYFI++VPT + + QF+VTE R S G + G+FF YDL+ I
Sbjct: 188 MFQYFIQIVPT-RVNTRQAQADTGQFAVTERERVINHDSGSHG----VAGIFFKYDLTSI 242
Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
V TEE F L +C IVGG+F SG++ F+
Sbjct: 243 MVKVTEERQPFSQLLIRLCGIVGGIFATSGMLHGFV 278
>gi|30268567|emb|CAD89902.1| hypothetical protein [Homo sapiens]
Length = 132
Score = 65.5 bits (158), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 54/92 (58%), Gaps = 10/92 (10%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQ----SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPI 244
M+QYFI VVPT HT + ++QFSVTE R + + G+F YDLS +
Sbjct: 1 MFQYFITVVPT-----KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSL 55
Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
VT TEEH+ F F +C IVGG+F+ +G++
Sbjct: 56 MVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 87
>gi|354507876|ref|XP_003515980.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
protein 2-like [Cricetulus griseus]
gi|344235439|gb|EGV91542.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
[Cricetulus griseus]
Length = 132
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 38/92 (41%), Positives = 54/92 (58%), Gaps = 10/92 (10%)
Query: 190 MYQYFIKVVPTVYTDVSGHTIQ----SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPI 244
M+QYFI VVPT HT + ++QFSVTE R + + G+F YDLS +
Sbjct: 1 MFQYFITVVPT-----KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSL 55
Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
VT TEEH+ F F +C I+GG+F+ +G++
Sbjct: 56 MVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 87
>gi|11907610|gb|AAG41243.1|AF210626_1 Fun9 [Eremothecium gossypii]
Length = 138
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 16/135 (11%)
Query: 170 FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
P PL+G + P+G + YF KVVP Y ++G +S +FSVT H R
Sbjct: 3 LPANPGPLNG--RAMKVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSVTAHDRPVHG 60
Query: 227 GRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGI 275
GR P F +++SP+KV E++ S + F+ N +GGV V +
Sbjct: 61 GRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVGTV 120
Query: 276 IDAFIYHGQRAIKKK 290
+D YH QR + K
Sbjct: 121 LDRVTYHTQRTLMGK 135
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.321 0.139 0.428
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,987,611,959
Number of Sequences: 23463169
Number of extensions: 219448077
Number of successful extensions: 433412
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1023
Number of HSP's successfully gapped in prelim test: 68
Number of HSP's that attempted gapping in prelim test: 429642
Number of HSP's gapped (non-prelim): 1189
length of query: 297
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 156
effective length of database: 9,050,888,538
effective search space: 1411938611928
effective search space used: 1411938611928
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)