BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 022435
         (297 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255545672|ref|XP_002513896.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223546982|gb|EEF48479.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 386

 Score =  583 bits (1504), Expect = e-164,   Method: Compositional matrix adjust.
 Identities = 272/297 (91%), Positives = 290/297 (97%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKRLDS GNVIE+RQDGIGAPKI+ PLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIIKKRLDSHGNVIEARQDGIGAPKIENPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+SDEDCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDEDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+LAFQ+DSFNISHKIN+LAFG++FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDSFNISHKINRLAFGDYFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            WTQETPSGMYQYFIKVVPTVYTDVSG+TIQSNQFSVTEHFRS+E GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGYTIQSNQFSVTEHFRSAEAGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKAIKKKMEIGKFS 386


>gi|224082148|ref|XP_002306582.1| predicted protein [Populus trichocarpa]
 gi|222856031|gb|EEE93578.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  577 bits (1487), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 267/297 (89%), Positives = 289/297 (97%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKRLD  GNVIE+RQDGIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIIKKRLDFHGNVIEARQDGIGAPKIEKPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+SDEDCCN+CE+VREAYRKKGWA++NPDL+DQCKREGFLQ+IK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDEDCCNSCEDVREAYRKKGWAVTNPDLMDQCKREGFLQKIKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFNI+HKIN+L FGE+FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNITHKINRLTFGEYFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR ++ GRLQ+LPGVFFFYD
Sbjct: 270 QWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRGTDIGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D FIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDTFIYHGQKAIKKKMEIGKFS 386


>gi|356552872|ref|XP_003544786.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 386

 Score =  574 bits (1479), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 265/297 (89%), Positives = 288/297 (96%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKRLDS GNVIE+RQ+GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIIKKRLDSHGNVIETRQEGIGAPKIEKPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SD+DCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCN+YGFL
Sbjct: 150 YGAEESDDDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNVYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN+SH IN+LAFGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNLSHHINRLAFGEYFPGVVNPLDNV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR+ + GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRTGDVGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEE+VSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 330 LSPIKVTFTEENVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQRAIKKKMELGKFN 386


>gi|356548103|ref|XP_003542443.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 386

 Score =  570 bits (1468), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 262/297 (88%), Positives = 287/297 (96%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQ LDVKHDI KKRLDS+GNVIE+RQ+GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQRLDVKHDIIKKRLDSRGNVIETRQEGIGAPKIEKPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG+E SD+DCCN+CE+VREAYRKKGWALSNPDLIDQCKREGFLQRIK+EEGEGCN+YGFL
Sbjct: 150 YGSEVSDDDCCNSCEDVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEEGEGCNVYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN+SH IN+L FGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNLSHHINRLTFGEYFPGVVNPLDNV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR+ + GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRTGDMGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEE+VSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+E+GKF+
Sbjct: 330 LSPIKVTFTEENVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQRAIKKKMELGKFN 386


>gi|225459342|ref|XP_002285801.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|302141938|emb|CBI19141.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  568 bits (1464), Expect = e-160,   Method: Compositional matrix adjust.
 Identities = 261/297 (87%), Positives = 287/297 (96%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDV+HDI KKR+D+ G+VIE+RQDGIG+PKI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVRHDIIKKRIDAHGSVIEARQDGIGSPKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+SD+DCCNNCEEVREAYRKKGWA+SNPDLIDQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 150 YGAEASDDDCCNNCEEVREAYRKKGWAMSNPDLIDQCKREGFLQRIKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS +HVHD+LAFQ+DSFNISHKIN+LAFG++FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNIHVHDLLAFQKDSFNISHKINRLAFGDYFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q TPSGMYQYFIKVVPTVYT VSGHTI +NQFSVTEHFR++E GRLQ+LPGVFFFYD
Sbjct: 270 QWIQATPSGMYQYFIKVVPTVYTHVSGHTISTNQFSVTEHFRNAELGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYH Q+AIKKKIEIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHSQKAIKKKIEIGKFS 386


>gi|357489473|ref|XP_003615024.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355516359|gb|AES97982.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 386

 Score =  562 bits (1449), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 257/297 (86%), Positives = 288/297 (96%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDV+HDI KKR+DS GNVIE+RQDGIG+P I+KPLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVRHDIIKKRIDSHGNVIETRQDGIGSPNIEKPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+SDE+CCN+CEEVREAYRKKGWALS+PD IDQCKREGFL+RIKEEEGEGCN+YGFL
Sbjct: 150 YGAEASDEECCNSCEEVREAYRKKGWALSSPDSIDQCKREGFLERIKEEEGEGCNVYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ++SFN+SH IN++AFG++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKESFNLSHHINRIAFGDYFPGVVNPLDRV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            WTQETPSGMYQYFIKVVPT+YTDVSG+TIQSNQFSVTEHFR+++ GRLQ+LPGVFFFYD
Sbjct: 270 HWTQETPSGMYQYFIKVVPTMYTDVSGNTIQSNQFSVTEHFRTADVGRLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGG+FTVSGI+D+FIYHGQ+AIKKK+E+GKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGIFTVSGILDSFIYHGQKAIKKKMELGKFS 386


>gi|449465886|ref|XP_004150658.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
 gi|449518819|ref|XP_004166433.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 386

 Score =  560 bits (1442), Expect = e-157,   Method: Compositional matrix adjust.
 Identities = 260/297 (87%), Positives = 281/297 (94%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKRLDS GN IE+R DGIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIIKKRLDSHGNAIEARPDGIGAPKIEKPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAES+D+DCCN+CEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK+E+GEGCNIYGFL
Sbjct: 150 FGAESADDDCCNSCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKDEDGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+LAFQ+DSFNISHKIN+LAFGE+FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDSFNISHKINRLAFGEYFPGVVNPLDSV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W QETPS  YQYFIKVVPTVY  VSG+TIQSNQFSVTEH R++E GRLQ+LP VFFFYD
Sbjct: 270 QWKQETPSATYQYFIKVVPTVYNSVSGYTIQSNQFSVTEHVRTAEVGRLQSLPAVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+ IKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKVIKKKMEIGKFS 386


>gi|224066933|ref|XP_002302286.1| predicted protein [Populus trichocarpa]
 gi|222844012|gb|EEE81559.1| predicted protein [Populus trichocarpa]
          Length = 377

 Score =  556 bits (1434), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 263/297 (88%), Positives = 282/297 (94%), Gaps = 9/297 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKRLDS GNVIESRQDGIGAPKI+KPLQRHGGRLEHNETYC   
Sbjct: 90  MDISGEQHLDVKHDIIKKRLDSHGNVIESRQDGIGAPKIEKPLQRHGGRLEHNETYC--- 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
                 DEDCCN+CEEVREAY+KKGWA++NPDL+DQCKREGFLQRIK+EEGEGCNIYGFL
Sbjct: 147 ------DEDCCNSCEEVREAYQKKGWAVTNPDLMDQCKREGFLQRIKDEEGEGCNIYGFL 200

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QSGVHVHD+LAFQ+DSFN SHKIN+LAFGE+FPGVVNPLDGV
Sbjct: 201 EVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDSFNTSHKINRLAFGEYFPGVVNPLDGV 260

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR ++ GRLQ+LPGVFFFYD
Sbjct: 261 QWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRGADIGRLQSLPGVFFFYD 320

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI+D+FIYHGQ+AIKKK+EIGKFS
Sbjct: 321 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGILDSFIYHGQKAIKKKMEIGKFS 377


>gi|297846654|ref|XP_002891208.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337050|gb|EFH67467.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 386

 Score =  556 bits (1432), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 254/297 (85%), Positives = 282/297 (94%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 386


>gi|238478737|ref|NP_001154394.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|12324714|gb|AAG52317.1|AC021666_6 unknown protein; 24499-21911 [Arabidopsis thaliana]
 gi|27808598|gb|AAO24579.1| At1g36050 [Arabidopsis thaliana]
 gi|110736190|dbj|BAF00066.1| hypothetical protein [Arabidopsis thaliana]
 gi|332193720|gb|AEE31841.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  551 bits (1420), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 252/297 (84%), Positives = 280/297 (94%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 90  MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 386


>gi|240254210|ref|NP_564467.5| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|332193719|gb|AEE31840.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 489

 Score =  544 bits (1401), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 248/293 (84%), Positives = 276/293 (94%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 90  MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ + DCCN+CE+VREAYRKKGW ++NPDLIDQCKREGFLQR+K+EEGEGCNIYGFL
Sbjct: 150 YGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+DSFNISHKIN+L +G++FPGVVNPLD V
Sbjct: 210 EVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPGVVNPLDKV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSNQFSVTEH +SSE G+LQ+LPGVFFFYD
Sbjct: 270 EWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           LSPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EI
Sbjct: 330 LSPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEI 382


>gi|38347102|emb|CAE02574.2| OSJNBa0006M15.17 [Oryza sativa Japonica Group]
 gi|116309990|emb|CAH67017.1| H0523F07.5 [Oryza sativa Indica Group]
 gi|218194960|gb|EEC77387.1| hypothetical protein OsI_16129 [Oryza sativa Indica Group]
          Length = 386

 Score =  530 bits (1366), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 241/297 (81%), Positives = 273/297 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHDIFK+R+D  GNVI ++QD +G  K+++PLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGGMKVEQPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEGEGCNIYGFL
Sbjct: 150 YGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+SHKINKL+FG+ FPGVVNPLDG 
Sbjct: 210 EVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNVSHKINKLSFGQRFPGVVNPLDGA 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q +  GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRSSE GR+Q +PGVFFFYD
Sbjct: 270 QWMQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSSESGRIQAVPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAIKKKMEIGKFN 386


>gi|226494692|ref|NP_001148795.1| LOC100282412 [Zea mays]
 gi|194696974|gb|ACF82571.1| unknown [Zea mays]
 gi|195622210|gb|ACG32935.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|414586929|tpg|DAA37500.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 386

 Score =  524 bits (1350), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 237/297 (79%), Positives = 269/297 (90%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G  K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG 
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDGA 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W Q +  GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGVFFFYD
Sbjct: 270 NWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 386


>gi|242076030|ref|XP_002447951.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
 gi|241939134|gb|EES12279.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
          Length = 386

 Score =  524 bits (1349), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 237/297 (79%), Positives = 269/297 (90%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G  K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDAVGGMKMEAPLQHHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+ SD  CCN+CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDGQCCNSCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG 
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDGA 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W Q +  GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGVFFFYD
Sbjct: 270 SWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 330 LSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 386


>gi|6598578|gb|AAF18633.1|AC006228_4 F5J5.4 [Arabidopsis thaliana]
          Length = 440

 Score =  523 bits (1346), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 252/348 (72%), Positives = 280/348 (80%), Gaps = 51/348 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE HLDVKHDI K+RLDS GN IE+RQDGIGA KI+ PLQ+HGGRL HNETYCGSC
Sbjct: 93  MDISGELHLDVKHDIIKRRLDSNGNTIEARQDGIGATKIENPLQKHGGRLGHNETYCGSC 152

Query: 61  YGAES---------------------------SDEDCCNNCEEVREAYRKKGWALSNPDL 93
           YGAE+                            + DCCN+CE+VREAYRKKGW ++NPDL
Sbjct: 153 YGAEAVIVLSLYLTLWSMVSQLSSEVCFFPVQEEHDCCNSCEDVREAYRKKGWGVTNPDL 212

Query: 94  IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
           IDQCKREGFLQR+K+EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD+LAFQ+D
Sbjct: 213 IDQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKD 272

Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           SFNISHKIN+L +G++FPGVVNPLD V W+Q+TP+ MYQYFIKVVPTVYTD+ GHTIQSN
Sbjct: 273 SFNISHKINRLTYGDYFPGVVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSN 332

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG----- 268
           QFSVTEH +SSE G+LQ+LPGVFFFYDLSPIKVTFTEEH+SFLHFLTNVCAIVGG     
Sbjct: 333 QFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGISLIS 392

Query: 269 -------------------VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
                              VFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 393 IYHNNTCWLTHIKIRNETCVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 440


>gi|225448309|ref|XP_002264644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|296085664|emb|CBI29463.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  521 bits (1341), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 232/297 (78%), Positives = 271/297 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQH D+KHDI KKR+D+ GNV+  RQDGIG P+I+KPLQRHGGRLEHNE YCGSC
Sbjct: 90  MDISGEQHHDIKHDIVKKRIDAHGNVVAVRQDGIGGPQIEKPLQRHGGRLEHNEKYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE +D+DCCN+C+EVREAYRKKGW ++NPDLIDQCKREGF+Q++KEEEGEGCN+YGFL
Sbjct: 150 YGAEVTDDDCCNSCDEVREAYRKKGWGMTNPDLIDQCKREGFVQKVKEEEGEGCNVYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHF+PGK F+QS +HV+D+LA  +D +NISH+INKLAFG+HFPGVVNPLDG 
Sbjct: 210 EVNKVAGNFHFSPGKGFYQSNIHVNDLLAISKDGYNISHRINKLAFGDHFPGVVNPLDGA 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q+ P GMYQYFIKVVPT+YTD+ GHTIQSNQFSVTEHFRS+E GR  +LPGV+FFYD
Sbjct: 270 QWFQDAPDGMYQYFIKVVPTIYTDIRGHTIQSNQFSVTEHFRSAEPGRPHSLPGVYFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVT  EEH SFLHF+TN+CAIVGG+FTVSGIID+F+YHG RAIKKK+E+GKFS
Sbjct: 330 LSPIKVTSKEEHSSFLHFMTNICAIVGGIFTVSGIIDSFVYHGHRAIKKKMELGKFS 386


>gi|224032113|gb|ACN35132.1| unknown [Zea mays]
 gi|414586931|tpg|DAA37502.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 391

 Score =  517 bits (1332), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 237/302 (78%), Positives = 269/302 (89%), Gaps = 5/302 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G  K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ-----CKREGFLQRIKEEEGEGCN 115
           YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQ     CKREGFLQ IK+EEGEGCN
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQVEPSDCKREGFLQSIKDEEGEGCN 209

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           IYGF+EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVN
Sbjct: 210 IYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVN 269

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           PLDG  W Q +  GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRS E GR+Q LPGV
Sbjct: 270 PLDGANWVQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSGESGRMQALPGV 329

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGK
Sbjct: 330 FFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGK 389

Query: 296 FS 297
           F+
Sbjct: 390 FN 391


>gi|357163897|ref|XP_003579883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 386

 Score =  508 bits (1307), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 233/297 (78%), Positives = 263/297 (88%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI ++QD +G  K++KPLQ HGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDVFKQRIDANGNVIATKQDAVGGMKVEKPLQMHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE   E CCN+CE+VREAYRKKGW +SNPD IDQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAEEPGEQCCNSCEDVREAYRKKGWGVSNPDSIDQCKREGFLQTIKDEEGEGCNIYGFV 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           E+NKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKINKL+FGE FPGVVNPLDG 
Sbjct: 210 EINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINKLSFGEPFPGVVNPLDGA 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W Q +P GMYQYF+KVVPTVY+ ++   I SNQFSVTEH RSSE  R+Q LPGVFFFYD
Sbjct: 270 HWFQHSPYGMYQYFVKVVPTVYSHINEQIILSNQFSVTEHARSSESVRMQALPGVFFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTE HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAI KK EIGKF+
Sbjct: 330 LSPIKVTFTERHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKREIGKFN 386


>gi|18395087|ref|NP_564162.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|9454530|gb|AAF87853.1|AC073942_7 Contains similarity to a PR00989 protein from Homo sapiens
           gi|7959731. EST gb|AI995648 comes from this gene
           [Arabidopsis thaliana]
 gi|13878151|gb|AAK44153.1|AF370338_1 unknown protein [Arabidopsis thaliana]
 gi|21281042|gb|AAM44956.1| unknown protein [Arabidopsis thaliana]
 gi|21553754|gb|AAM62847.1| unknown [Arabidopsis thaliana]
 gi|332192089|gb|AEE30210.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  504 bits (1297), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 229/297 (77%), Positives = 272/297 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG  KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHF PG+SFHQSG   HD+L FQ+ ++NISHK+N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHKVNRLAFGDFFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q   SG+YQYFIKVVP++YTDV  +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKV F E+HV FLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVIFEEQHVEFLHFLTNVCAIVGGIFTVSGIVDSFIYHGQRAIKKKMEIGKFN 386


>gi|297850670|ref|XP_002893216.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339058|gb|EFH69475.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 386

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 228/297 (76%), Positives = 271/297 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG  KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHF PG+SFHQSG   HD+L FQ+ ++NISH +N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHTVNRLAFGDFFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q   SG+YQYFIKVVP++YTDV  +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKV F E+HV FLHFLTNVCAIVGG+FTVSGI+D+FIYHGQRAIKKK+EIGKF+
Sbjct: 330 LSPIKVIFEEQHVEFLHFLTNVCAIVGGIFTVSGIVDSFIYHGQRAIKKKMEIGKFN 386


>gi|326497521|dbj|BAK05850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 391

 Score =  499 bits (1284), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 228/297 (76%), Positives = 262/297 (88%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI ++QD +G  K++KPLQ HGGRLEHNETYCGSC
Sbjct: 95  MDISGQEHLDVKHDVFKQRIDAHGNVIATKQDAVGGMKVEKPLQHHGGRLEHNETYCGSC 154

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+ S E CCN+CE+VREAYRKKGW +SNPD IDQCK EGFLQ IK+EEGEGCNIYGFL
Sbjct: 155 YGAQESPEQCCNSCEDVREAYRKKGWGVSNPDSIDQCKSEGFLQTIKDEEGEGCNIYGFL 214

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           E+NKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKINKL+FGE FPGV+NPLDG 
Sbjct: 215 EINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNLSHKINKLSFGEPFPGVINPLDGA 274

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q +  GM QYF+KVVPTVY+ ++   I SNQFSVTEH RS + GR+Q LPGVFFFYD
Sbjct: 275 QWIQHSSYGMAQYFVKVVPTVYSHINEQIILSNQFSVTEHSRSGDSGRVQALPGVFFFYD 334

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LSPIKVTFTE HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAI KK E+GKF+
Sbjct: 335 LSPIKVTFTERHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKRELGKFT 391


>gi|224059030|ref|XP_002299683.1| predicted protein [Populus trichocarpa]
 gi|222846941|gb|EEE84488.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  473 bits (1218), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 213/297 (71%), Positives = 260/297 (87%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +DISGEQH D++HDI KKR+++ G+VIE RQDGIGAPKIDKPLQ+HGGRLEHNE YCGSC
Sbjct: 90  IDISGEQHHDIRHDITKKRINAHGDVIEVRQDGIGAPKIDKPLQKHGGRLEHNEEYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE SD+ CCN+C+EVREAYRKKGWAL+N DLIDQC REGF+Q IK+EEGEGCNI G L
Sbjct: 150 FGAEMSDDHCCNSCDEVREAYRKKGWALTNMDLIDQCIREGFVQMIKDEEGEGCNINGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVN+VAGNFHF PGKSFHQS   + D+L  Q++S+NISH+IN+LAFG++FPGVVNPLDG+
Sbjct: 210 EVNRVAGNFHFVPGKSFHQSNFQLLDLLDMQKESYNISHRINRLAFGDYFPGVVNPLDGI 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +    T +G+ Q+FIKVVPT+YTD+ G T+ SNQ+SVTEHF  SE  RL +LPGV+F YD
Sbjct: 270 QLMHGTQNGVQQFFIKVVPTIYTDIRGRTVHSNQYSVTEHFTKSELMRLDSLPGVYFIYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            SPIKVTF EEH SFLHF+T++CAI+GG+FT++GI+D+FIYHG+RAIKKK+EIGKFS
Sbjct: 330 FSPIKVTFKEEHTSFLHFMTSICAIIGGIFTIAGIVDSFIYHGRRAIKKKMEIGKFS 386


>gi|356512071|ref|XP_003524744.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 431

 Score =  473 bits (1216), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 214/297 (72%), Positives = 259/297 (87%), Gaps = 2/297 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLD++H+I KKR+D+ GNVIE R+DGIGAPKI++PLQ+HGGRL H+E YCGSC
Sbjct: 137 MDISGEQHLDIRHNIVKKRIDANGNVIEERKDGIGAPKIERPLQKHGGRLGHDEKYCGSC 196

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE SDE CCN+CEEVREAYRKKGWA++N DLIDQC+REG++QR+K+EEGEGCN+ G L
Sbjct: 197 FGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLIDQCQREGYVQRVKDEEGEGCNLQGSL 256

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFA GKSF QS + + D+LA Q + +NISH+INKL+FG HFPG+VNPLDGV
Sbjct: 257 EVNKVAGNFHFATGKSFLQSAIFLADLLALQDNHYNISHRINKLSFGHHFPGLVNPLDGV 316

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q    GMYQYFIKVVPT+YTD+ G  I SNQ+SVTEHF+SSE G    +PGVFFFYD
Sbjct: 317 KWVQGPAHGMYQYFIKVVPTIYTDIRGRVIHSNQYSVTEHFKSSELG--VAVPGVFFFYD 374

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           +SPIKV F EEH+ FLHFLTN+CAI+GGVFTV+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 375 ISPIKVNFKEEHIPFLHFLTNICAIIGGVFTVAGIIDSSIYYGQRTIKRKMELGKFT 431


>gi|363806898|ref|NP_001242045.1| uncharacterized protein LOC100781612 [Glycine max]
 gi|255644390|gb|ACU22700.1| unknown [Glycine max]
          Length = 384

 Score =  472 bits (1215), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 214/297 (72%), Positives = 256/297 (86%), Gaps = 2/297 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLD++H+I KKR+D+ GNVIE R+DGIGAPKI+KPLQ+HGGRL H+E YCGSC
Sbjct: 90  MDISGEQHLDIRHNIVKKRIDANGNVIEERKDGIGAPKIEKPLQKHGGRLGHDEKYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE SDE CCN+CEEVREAYRKKGWA++N DLIDQC+REG++QR+K+EEGEGCN+ G L
Sbjct: 150 FGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLIDQCQREGYVQRVKDEEGEGCNLQGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFA GKSF QS + + D+LA Q + +NISH+INKL+FG HFPG+VNPLDGV
Sbjct: 210 EVNKVAGNFHFATGKSFLQSAIFLADVLALQDNHYNISHRINKLSFGHHFPGLVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           RW Q    GMYQYFIKVVPT+YTD+ G  I SNQ+SVTEHF+SSE G    +PGVFFFYD
Sbjct: 270 RWVQGPTHGMYQYFIKVVPTIYTDIRGRVIHSNQYSVTEHFKSSELG--VAVPGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           +SPIKV F EEH  FLHFLTN+CAI+GGV  V+GIID+ IY+GQR IK+K+E+GKF+
Sbjct: 328 ISPIKVNFKEEHTPFLHFLTNICAIIGGVLAVAGIIDSSIYYGQRTIKRKMELGKFT 384


>gi|449449715|ref|XP_004142610.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 385

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 222/298 (74%), Positives = 257/298 (86%), Gaps = 3/298 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKR+D QGNVI+SR DGIG+ +I++PLQ+HGGRL+ NETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIVKKRIDYQGNVIDSRPDGIGSTEIERPLQKHGGRLKQNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA  S EDCCN+C++VREAY +KGWALS+PDLIDQCKREGF QR+K EEGEGCNIYGFL
Sbjct: 150 YGA--SGEDCCNSCQDVREAYHRKGWALSHPDLIDQCKREGFFQRVKNEEGEGCNIYGFL 207

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           EVNKVAGNFHFAPG+ F  S   +H+ LA FQ D+FNISH+IN+L FG+ FPGVVNPLDG
Sbjct: 208 EVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWDAFNISHRINRLTFGDDFPGVVNPLDG 267

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           V+W Q T SGM+QYFIKVVPTVY  V+G  I+SNQFSVT+H R  +    Q L GVFFFY
Sbjct: 268 VQWNQGTLSGMFQYFIKVVPTVYKAVNGKAIKSNQFSVTQHLRGIDGESFQALHGVFFFY 327

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           DLSPIKVTFTEEH+SF HFLTNVCAIVGGVFT+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 328 DLSPIKVTFTEEHISFFHFLTNVCAIVGGVFTISGILDSIIYHGQKAIKKKMALGKFT 385


>gi|302790744|ref|XP_002977139.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
 gi|302820940|ref|XP_002992135.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
 gi|300140061|gb|EFJ06790.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
 gi|300155115|gb|EFJ21748.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
          Length = 386

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 219/297 (73%), Positives = 255/297 (85%), Gaps = 1/297 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESR-QDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD+SGEQHLDVKH+IFKKRLD  G V++   Q+ IG PKIDKPLQ+HGGRLEHNETYCGS
Sbjct: 89  MDVSGEQHLDVKHNIFKKRLDPSGKVVQPPVQEDIGGPKIDKPLQKHGGRLEHNETYCGS 148

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           C+GAE SD++CCN+CEEVREAYRK+GWA+ N DLIDQCKREG+L +IKEEEGEGCNIYG 
Sbjct: 149 CFGAEQSDDECCNSCEEVREAYRKRGWAIHNADLIDQCKREGWLTKIKEEEGEGCNIYGS 208

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFHFAPGKSF Q  VHVHD+ +  ++ FN+SH IN+L+FG  FPGVVNPLD 
Sbjct: 209 LEVNKVAGNFHFAPGKSFSQQHVHVHDVQSLHKEKFNVSHYINELSFGARFPGVVNPLDK 268

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
            +  Q+ PS MYQYFIKVVPT YTD++GH I +NQFSVT+HF++ E    ++LPGVFFFY
Sbjct: 269 EKRIQKFPSAMYQYFIKVVPTAYTDMTGHKIVTNQFSVTDHFKAVEGLNGRSLPGVFFFY 328

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           +LSPIKV FTE   SFLHFLTNVCAI+GGVFTVSGIID+FIYHG RAIKKK+EIGK+
Sbjct: 329 ELSPIKVLFTERKTSFLHFLTNVCAIIGGVFTVSGIIDSFIYHGHRAIKKKMEIGKY 385


>gi|449510462|ref|XP_004163672.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 3-like [Cucumis
           sativus]
          Length = 385

 Score =  469 bits (1207), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 221/298 (74%), Positives = 256/298 (85%), Gaps = 3/298 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQHLDVKHDI KKR+D QGNVI+SR DGIG+ +I++PLQ+HGGRL+ NETYCGSC
Sbjct: 90  MDISGEQHLDVKHDIVKKRIDYQGNVIDSRPDGIGSTEIERPLQKHGGRLKQNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA  S EDCCN+C++VREAY +KGWALS+PDLIDQCKREGF QR+K EEGEGCNIYGFL
Sbjct: 150 YGA--SGEDCCNSCQDVREAYHRKGWALSHPDLIDQCKREGFFQRVKNEEGEGCNIYGFL 207

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           EVNKVAGNFHFAPG+ F  S   +H+ LA FQ D+FNISH+IN+L FG+ FPGVVNPLDG
Sbjct: 208 EVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWDAFNISHRINRLTFGDDFPGVVNPLDG 267

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           V+W Q T SGM+QYFIKVVPTVY  V+G  I+SNQFSVT+H R  +    Q L G FFFY
Sbjct: 268 VQWNQGTLSGMFQYFIKVVPTVYKAVNGKAIKSNQFSVTQHLRGIDGESFQALHGXFFFY 327

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           DLSPIKVTFTEEH+SF HFLTNVCAIVGGVFT+SGI+D+ IYHGQ+AIKKK+ +GKF+
Sbjct: 328 DLSPIKVTFTEEHISFFHFLTNVCAIVGGVFTISGILDSIIYHGQKAIKKKMALGKFT 385


>gi|168014180|ref|XP_001759631.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689170|gb|EDQ75543.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 382

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 214/292 (73%), Positives = 250/292 (85%), Gaps = 1/292 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGE HLDVKH+IFKKRLD  G VIE +RQ+ I  PK+DKPLQ+HGGRLEHNETYCGS
Sbjct: 87  MDISGEAHLDVKHNIFKKRLDVNGKVIEPARQESINQPKLDKPLQKHGGRLEHNETYCGS 146

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           C+GAE+ ++ CCNNCEEVREAYRKKGWAL+NPDLIDQCKREGFLQ+IK+E+GEGCN+YG 
Sbjct: 147 CFGAETEEDHCCNNCEEVREAYRKKGWALNNPDLIDQCKREGFLQKIKDEDGEGCNVYGT 206

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LE NKVAGNFHFAPGKSF Q+ +HVHD++AF +DSFN+SHKIN+++FG  +PG VNPLD 
Sbjct: 207 LEANKVAGNFHFAPGKSFQQANMHVHDLMAFGKDSFNVSHKINEISFGVRYPGAVNPLDK 266

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           +   Q T  GMYQYFIKVVPTVYTD  G  I +NQF+VT+HF+    G    LPGVFFFY
Sbjct: 267 LERIQTTTHGMYQYFIKVVPTVYTDTRGRKISTNQFAVTDHFKGVGPGEDHALPGVFFFY 326

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           DLSPIKV FTE+ +SF HFLTNVCAIVGGVF+VSGIIDAF+YHGQ+ IKK++
Sbjct: 327 DLSPIKVKFTEKRMSFFHFLTNVCAIVGGVFSVSGIIDAFVYHGQKQIKKRL 378


>gi|168024878|ref|XP_001764962.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683771|gb|EDQ70178.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 215/298 (72%), Positives = 255/298 (85%), Gaps = 2/298 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGEQHL+V+H+IFKKRLD  G V+ + + D I APK+ KPLQ+HGGRLEHNETYCGS
Sbjct: 89  MDISGEQHLNVRHNIFKKRLDVHGKVVNAPKPDAINAPKVQKPLQKHGGRLEHNETYCGS 148

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           C+GAESSD++CCNNCEEVREAYRKKGWAL+N DLIDQC REGF++R+KEE GEGCNIYG 
Sbjct: 149 CFGAESSDDECCNNCEEVREAYRKKGWALTNADLIDQCHREGFIERVKEEAGEGCNIYGK 208

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFHFAPGKSF QS +H+ D++ F  DSFN+SH IN+L+FG HFPG VNPLD 
Sbjct: 209 LEVNKVAGNFHFAPGKSFQQSAMHLLDLMGFITDSFNVSHTINELSFGAHFPGAVNPLDK 268

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           V   Q+  +GMYQYFIKVVPTVYTD+ G  I +NQFSVTEH+ + + G  + +PGVFFFY
Sbjct: 269 VTNIQKDLNGMYQYFIKVVPTVYTDIKGRKISTNQFSVTEHYTAGDHGP-RFVPGVFFFY 327

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           DLSPIKV F+EE  SFLHFLTNVCAIVGGV++++GIID+F+YHG RAIKKK+E+GK S
Sbjct: 328 DLSPIKVKFSEERPSFLHFLTNVCAIVGGVYSIAGIIDSFVYHGHRAIKKKMELGKLS 385


>gi|222628979|gb|EEE61111.1| hypothetical protein OsJ_15023 [Oryza sativa Japonica Group]
          Length = 369

 Score =  466 bits (1198), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 221/306 (72%), Positives = 251/306 (82%), Gaps = 25/306 (8%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHDIFK+R+D  GNVI ++QD +G                 N  Y G  
Sbjct: 80  MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGG----------------NGPYSGMA 123

Query: 61  YGAES---------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
            G  +         SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEG
Sbjct: 124 AGLNTMRPIVALVMSDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEG 183

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGCNIYGFLEVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+SHKINKL+FG+ FP
Sbjct: 184 EGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNVSHKINKLSFGQRFP 243

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
           GVVNPLDG +W Q +  GMYQYFIKVVPTVYTD++ H I SNQFSVTEHFRSSE GR+Q 
Sbjct: 244 GVVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIILSNQFSVTEHFRSSESGRIQA 303

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           +PGVFFFYDLSPIKVTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YHGQRAIKKK+
Sbjct: 304 VPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAIKKKM 363

Query: 292 EIGKFS 297
           EIGKF+
Sbjct: 364 EIGKFN 369


>gi|449438787|ref|XP_004137169.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 386

 Score =  464 bits (1194), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 207/296 (69%), Positives = 258/296 (87%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +DISGEQHLD++H+I KKR+D  G VIE+R DGIGAPKI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  IDISGEQHLDIRHNIIKKRIDHLGTVIEARPDGIGAPKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE+SD+DCCN+CEEVREAYRKKGWA++N DLIDQC+RE F+Q++K+EEGEGCNI G L
Sbjct: 150 FGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQCQREDFIQKVKDEEGEGCNIEGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAG+FHF PGKSF+QS  +   +LA Q   +N+SH+IN+LAFG H+ G+VNPLDGV
Sbjct: 210 EVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNVSHRINRLAFGNHYDGLVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W     + M+QYF+KVVPT+Y ++ G T+ SNQ+SVTEHF+S E G  Q++PGVFF+YD
Sbjct: 270 HWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSVTEHFKSVEFGSSQSIPGVFFYYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           LSP+KVT+TEEHV FLHF+T++CAI+GGVF+V+GIIDAFIYHGQR +KKK+EIGKF
Sbjct: 330 LSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIIDAFIYHGQRKMKKKVEIGKF 385


>gi|224073341|ref|XP_002304080.1| predicted protein [Populus trichocarpa]
 gi|222841512|gb|EEE79059.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 205/296 (69%), Positives = 255/296 (86%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +DISGEQHLD++HDI KKR+++ G+VIE RQ+GIGAPKID+PLQ HGGRL HNE YCGSC
Sbjct: 90  IDISGEQHLDIRHDISKKRINAHGDVIEVRQEGIGAPKIDRPLQSHGGRLGHNEEYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +G E S +DCCN CEEVREAYR+KGWA++N DLIDQCKREGF+Q IK+EEGEGCNI G L
Sbjct: 150 FGGEMSHDDCCNTCEEVREAYRRKGWAMTNMDLIDQCKREGFIQMIKDEEGEGCNINGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVN+VAG+FHFAP KSFH S   + D+L  Q+DS+NISH+IN+LAFG++FPGVVNPL G+
Sbjct: 210 EVNRVAGSFHFAPWKSFHLSNFLIQDLLDLQKDSYNISHRINRLAFGDYFPGVVNPLAGI 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +   +TP+G+ Q+FIKVVPT+YTD+ G T+ SNQ+S TEHF+ SE   L +LPGV+FFYD
Sbjct: 270 QLMHDTPNGVQQFFIKVVPTIYTDIRGRTVHSNQYSATEHFKKSELTPLDSLPGVYFFYD 329

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV F EEH+SFLHF+T++CAI+GG+FT++GIID+FIY+GQRAI KK+ IGKF
Sbjct: 330 FSPIKVIFKEEHISFLHFMTSICAIIGGIFTIAGIIDSFIYYGQRAITKKVGIGKF 385


>gi|217071774|gb|ACJ84247.1| unknown [Medicago truncatula]
          Length = 384

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 206/296 (69%), Positives = 257/296 (86%), Gaps = 2/296 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE+H D+ H+I K+R+D+ G VIE+R++GIGAPKI++PLQ+HGGRLEH+E YCGSC
Sbjct: 90  MDISGERHHDILHNIMKQRIDANGKVIEARKEGIGAPKIERPLQKHGGRLEHDEKYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE SD+ CCNNCEEVREAYRKKGWAL+N DLIDQC+REGF+Q++K+EEGEGCNI+G L
Sbjct: 150 FGAEESDDHCCNNCEEVREAYRKKGWALTNIDLIDQCQREGFVQKVKDEEGEGCNIHGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFA G+SF QS + + D+LA Q + +NISH+INKL+FG H+PG+VNPLDG+
Sbjct: 210 EVNKVAGNFHFATGQSFLQSAIFLTDLLALQDNHYNISHQINKLSFGHHYPGLVNPLDGI 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q    GM QYFIKVVPTVYTD+ G  I SNQ+SVTEHF+SSE G    +PGVFFFYD
Sbjct: 270 KWVQGNDHGMCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELG--AAVPGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           +SPIKV F EEH+ FLHFLTN+CAI+GG+FT++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 328 ISPIKVNFKEEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIGKY 383


>gi|357112459|ref|XP_003558026.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 387

 Score =  458 bits (1179), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 211/299 (70%), Positives = 256/299 (85%), Gaps = 4/299 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQH D++HDIFKKR+D  GNVIESR+DG+G+PKI++PLQ HGGRL+HNE YCGSC
Sbjct: 89  MDVSGEQHYDIRHDIFKKRIDHLGNVIESRKDGVGSPKIERPLQNHGGRLDHNEAYCGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG+E SD+ CCN+CEEVR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGCNI+GF+
Sbjct: 149 YGSEESDDQCCNSCEEVRDAYRKKGWALTNVESIDQCKREGFVQRLKDEQGEGCNIHGFV 208

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +VNKVAGNFHFAPGK   QS   + D+L FQ +++NISHKINKL+FG+ FPGVVNPLDGV
Sbjct: 209 DVNKVAGNFHFAPGKHLDQSFNFLQDMLNFQPENYNISHKINKLSFGKEFPGVVNPLDGV 268

Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            W QE     +GMYQYF+KVVPT+YTD+ G  I SNQFSVTEHFR +  G  +  PGV+F
Sbjct: 269 EWKQEQATGLTGMYQYFVKVVPTIYTDIRGRKIHSNQFSVTEHFREA-IGFPRPPPGVYF 327

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK 
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 386


>gi|226498912|ref|NP_001150650.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|194699894|gb|ACF84031.1| unknown [Zea mays]
 gi|195640862|gb|ACG39899.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
          Length = 387

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 209/299 (69%), Positives = 255/299 (85%), Gaps = 4/299 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQH D++HDI KKR+D  GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89  MDVSGEQHYDIRHDIIKKRIDHLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SD+ CCN+CEEVR+AYRKKGWA++N +LIDQCKREG++QR+K+E+GEGC I+GF+
Sbjct: 149 YGAEESDDQCCNSCEEVRDAYRKKGWAVNNVELIDQCKREGYVQRLKDEQGEGCTIHGFV 208

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VNKVAGNFHFAPGKS  QS   + D+L  Q +++NISHKINKL+FGE FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNLQPETYNISHKINKLSFGEEFPGVVNPLDGV 268

Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            W Q+     +GMYQYF+KVVPT+YTD+ G  I SNQFSVTEHFR +  G  +  PGV+F
Sbjct: 269 EWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIHSNQFSVTEHFREA-IGYPRPPPGVYF 327

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+E+GK 
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMELGKL 386


>gi|108707873|gb|ABF95668.1| Serologically defined breast cancer antigen NY-BR-84, putative,
           expressed [Oryza sativa Japonica Group]
          Length = 387

 Score =  457 bits (1177), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 210/299 (70%), Positives = 257/299 (85%), Gaps = 4/299 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89  MDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG+E SD+ CCN+CE+VR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGC+I+GF+
Sbjct: 149 YGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEIDQCKREGFVQRLKDEQGEGCSIHGFV 208

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VNKVAGNFHFAPGKS  QS   + D+L FQ++++NISHKINKL+FG  FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNFQQENYNISHKINKLSFGVEFPGVVNPLDGV 268

Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            W QE     +GMYQYF+KVVPT+YTD+ G  I SNQFSVTEHFR +  G  +  PGV+F
Sbjct: 269 EWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKINSNQFSVTEHFREA-IGYPRPPPGVYF 327

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK 
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 386


>gi|242035905|ref|XP_002465347.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
 gi|241919201|gb|EER92345.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
          Length = 387

 Score =  450 bits (1158), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 206/299 (68%), Positives = 252/299 (84%), Gaps = 4/299 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQH D++HDI KKR+D  GNVIESR+D +GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89  MDVSGEQHYDIRHDITKKRIDHLGNVIESRKDRVGAPKIERPLQKHGGRLDHNEVYCGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE +D+ CCN+CEEVR+ YRKKGWA++N +LIDQCKREG++QR+K+E GEGC I+GF+
Sbjct: 149 YGAEETDDQCCNSCEEVRDVYRKKGWAINNVELIDQCKREGYVQRLKDETGEGCTIHGFV 208

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VNKVAGNFHFAPGKS  QS   + D+L  Q +++NISHKINKL+FGE FPGVVNPLDGV
Sbjct: 209 NVNKVAGNFHFAPGKSLDQSFNFLQDLLNIQPETYNISHKINKLSFGEEFPGVVNPLDGV 268

Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            W Q+     +GMYQYF+KVVPT+YTD+ G  I SNQFSVTEHFR +  G  +  PGV+F
Sbjct: 269 EWIQDNSNGLTGMYQYFVKVVPTIYTDIRGRKIYSNQFSVTEHFREA-IGYPRPPPGVYF 327

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+E+GK 
Sbjct: 328 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMELGKL 386


>gi|168004517|ref|XP_001754958.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694062|gb|EDQ80412.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  449 bits (1155), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 205/298 (68%), Positives = 249/298 (83%), Gaps = 2/298 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGE HLDV+H+I+KKRLD  G  +++ + D I APK+ KPLQ+HGGRLE +ETYCGS
Sbjct: 89  MDISGELHLDVRHNIYKKRLDVHGKAVDAPKPDAINAPKVQKPLQKHGGRLEDHETYCGS 148

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           C+GAESSD+ CCN+CEEVREAYRKKGWAL+N DLIDQC REGF++RIKEE GEGCNIYG 
Sbjct: 149 CFGAESSDDQCCNSCEEVREAYRKKGWALTNTDLIDQCHREGFIERIKEEAGEGCNIYGK 208

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNF  APGKSF QS +H+ D++ F  DSFN+SH IN+L+FG +FPG VNPLD 
Sbjct: 209 LEVNKVAGNFQIAPGKSFQQSAMHLLDLMGFVTDSFNVSHTINELSFGAYFPGAVNPLDK 268

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           V   Q+  +GM+QYFIKVVPTVYTD+ G  I +NQFSV EH+ + + G  + +PGVFFFY
Sbjct: 269 VTSIQKDQNGMFQYFIKVVPTVYTDIKGRKISTNQFSVMEHYTAGDHGP-RVIPGVFFFY 327

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           DL+PIKV FTEE  SFLHFLTNVCAI+GG++T++GI+D+FIYHG RAIKKK+E+GK S
Sbjct: 328 DLTPIKVKFTEERPSFLHFLTNVCAIIGGIYTIAGIVDSFIYHGHRAIKKKMELGKLS 385


>gi|115464597|ref|NP_001055898.1| Os05g0490200 [Oryza sativa Japonica Group]
 gi|50080302|gb|AAT69636.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579449|dbj|BAF17812.1| Os05g0490200 [Oryza sativa Japonica Group]
 gi|218197014|gb|EEC79441.1| hypothetical protein OsI_20422 [Oryza sativa Indica Group]
 gi|222632053|gb|EEE64185.1| hypothetical protein OsJ_19017 [Oryza sativa Japonica Group]
          Length = 384

 Score =  444 bits (1141), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 204/296 (68%), Positives = 248/296 (83%), Gaps = 2/296 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQH D++HDI K+RLD+ GNVIE+R++GIG  KI+ PLQ+HGGRL   E YCG+C
Sbjct: 90  MDISGEQHHDIRHDIEKRRLDAHGNVIEARKEGIGGAKIESPLQKHGGRLSKGEEYCGTC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++GEGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V+KVAGN HFAPGK F++S ++V ++ A +   FNI+HKINKL+FG  FPGVVNPLDG 
Sbjct: 210 DVSKVAGNLHFAPGKGFYESNINVPELSALEH-GFNITHKINKLSFGTEFPGVVNPLDGA 268

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +WTQ    G YQYFIKVVPT+YTD+ G  I SNQFSVTEHFR     R +  PGVFFFYD
Sbjct: 269 QWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNI-RPKPQPGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEENSSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383


>gi|226494401|ref|NP_001141198.1| uncharacterized protein LOC100273285 [Zea mays]
 gi|194703210|gb|ACF85689.1| unknown [Zea mays]
 gi|238011828|gb|ACR36949.1| unknown [Zea mays]
 gi|413945823|gb|AFW78472.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 384

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 204/296 (68%), Positives = 245/296 (82%), Gaps = 2/296 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQH D++HDI K RLD+ GNVIE+R+  IG  KI++PLQ+HGGRL+  E YCG+C
Sbjct: 90  MDISGEQHQDIRHDIEKIRLDAHGNVIEARKVSIGGAKIERPLQKHGGRLDKGEQYCGTC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V+KVAGNFHFAPGK F++S + V + L+     FNI+HKINKL+FG  FPGVVNPLDG 
Sbjct: 210 DVSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNITHKINKLSFGTEFPGVVNPLDGA 268

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +WTQ    G YQYFIKVVPT+YTD+ GH I SNQFSVTEHFR     R +  PGVFFFYD
Sbjct: 269 QWTQPASDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNV-RPKPQPGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV FTEE  S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEESRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383


>gi|242088319|ref|XP_002439992.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
 gi|241945277|gb|EES18422.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
          Length = 384

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 204/296 (68%), Positives = 247/296 (83%), Gaps = 2/296 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQH D++HDI K+RLDS GNVIE+R++GIG  KI++PLQ+HGGRL+  E YCG+C
Sbjct: 90  MDISGEQHHDIRHDIEKRRLDSHGNVIEARKEGIGGAKIERPLQKHGGRLDKGEQYCGTC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL
Sbjct: 150 YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V+KVAGNFHFAPGK F++S + V + L+     FNI+HKINKL+FG  FPGVVNPLDG 
Sbjct: 210 DVSKVAGNFHFAPGKGFYESNIDVPE-LSVLEGGFNITHKINKLSFGTEFPGVVNPLDGA 268

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q    G YQYFIKVVPT+YTD+ GH I SNQFSVTEHFR       +  PGVFFFYD
Sbjct: 269 QWIQPASDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNI-LPKPQPGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 328 FSPIKVIFTEENRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383


>gi|212721670|ref|NP_001132255.1| uncharacterized protein LOC100193691 [Zea mays]
 gi|194693892|gb|ACF81030.1| unknown [Zea mays]
 gi|223949235|gb|ACN28701.1| unknown [Zea mays]
 gi|413949703|gb|AFW82352.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 384

 Score =  441 bits (1133), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 203/295 (68%), Positives = 246/295 (83%), Gaps = 2/295 (0%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI K+RL+S GNVIE+R++GIG  K+++PLQ+HGGRL+  E YCG+CY
Sbjct: 91  DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F+ R+K ++ EGCN+ GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFIDRVKTQQDEGCNVLGFLD 210

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           V+KVAGNFHFAPGK F++S + V + L+     FNISHKINKL+FG  FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNISHKINKLSFGTEFPGVVNPLDGAQ 269

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 241
           WTQ    G YQYFIKVVPT+YTD+ G  I SNQFSVTEHFR     R ++ PGVFFFYD 
Sbjct: 270 WTQPASDGTYQYFIKVVPTIYTDIRGRGIHSNQFSVTEHFRDGNV-RPKSQPGVFFFYDF 328

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           SPIKV FTEE+ S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 329 SPIKVIFTEENRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 383


>gi|326510689|dbj|BAJ87561.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514988|dbj|BAJ99855.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326533080|dbj|BAJ93512.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 383

 Score =  439 bits (1130), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 206/296 (69%), Positives = 246/296 (83%), Gaps = 5/296 (1%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI KKRLDS GNVIESR++GIG  KI+KPLQ+HGGRL   E YCG+CY
Sbjct: 91  DISGEQHQDIRHDIEKKRLDSHGNVIESRKEGIGGTKIEKPLQKHGGRLGKGEEYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           V+KVAGNFHFAPGK +++S V + ++ A     FNI+HKINKL+FG  FPG VNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGYYESNVDMPELSA--EGGFNITHKINKLSFGTEFPGAVNPLDGAQ 268

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
           WTQ    G YQYFIKVVPT+Y D+ G  I SNQFSVTEHFR    Q R Q  PGVFFFYD
Sbjct: 269 WTQPASDGTYQYFIKVVPTIYNDIRGRKIDSNQFSVTEHFRDGNVQPRPQ--PGVFFFYD 326

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV FTEE+ SFLH+LTN+CAIVGG+FTV+GIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 327 FSPIKVIFTEENRSFLHYLTNLCAIVGGIFTVAGIIDSFIYHGQKALKKKMEIGKY 382


>gi|357133202|ref|XP_003568216.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 384

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 205/296 (69%), Positives = 247/296 (83%), Gaps = 4/296 (1%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI KKRL+S GNVIESR++GIG  KI++PLQ+HGGRL+  E YCG+CY
Sbjct: 91  DISGEQHQDIRHDIEKKRLNSHGNVIESRKEGIGGAKIERPLQKHGGRLDKGEQYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAE SDE CCN+C+EVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCDEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           V+KVAGNFHFAPG+ F++S V V ++ + +   FNI+HKINKL+FG  FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGRGFYESNVDVPELSSLE-GGFNITHKINKLSFGTEFPGVVNPLDGAQ 269

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
           WTQ    G YQYFIKVVPT YTD  G  I SNQFSVTEHFR      R Q  PGVFFFYD
Sbjct: 270 WTQPASDGTYQYFIKVVPTNYTDTRGRKIDSNQFSVTEHFRDGNVHPRPQ--PGVFFFYD 327

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            SPIKV FTEE+ SFLH+LTN+CAIVGG+FTVSGIID+FIYHGQ+A+KKK+EIGK+
Sbjct: 328 FSPIKVIFTEENKSFLHYLTNLCAIVGGIFTVSGIIDSFIYHGQKALKKKMEIGKY 383


>gi|168019656|ref|XP_001762360.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686438|gb|EDQ72827.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 380

 Score =  437 bits (1123), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 204/298 (68%), Positives = 245/298 (82%), Gaps = 7/298 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGEQHL+V+H+IFKKRLD  G  I++ + D I APK+ +PLQ+HGGRLEHNETYCGS
Sbjct: 89  MDISGEQHLNVRHNIFKKRLDVHGKAIDAPKPDAINAPKVQRPLQKHGGRLEHNETYCGS 148

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           C+GA SSD++CCN+CEEVREAYRKKGWAL N D+IDQC REGF++R+KEE GEGCNIYG 
Sbjct: 149 CFGAASSDDECCNSCEEVREAYRKKGWALINIDIIDQCHREGFIERVKEEAGEGCNIYGK 208

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFH APGK F QS +H+ D+L  + DSFN+SH +N+L+FG HFPG VNPLD 
Sbjct: 209 LEVNKVAGNFHIAPGKLFQQSAMHLLDLLGIRSDSFNVSHIVNELSFGAHFPGRVNPLDK 268

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           +   Q+  +GMYQYFIKVVPTVYTD+ G  I +NQFSVTEH+ + + G  + +PGVFFFY
Sbjct: 269 ITSIQKDQNGMYQYFIKVVPTVYTDIRGSEIATNQFSVTEHYTAGDHGP-RVVPGVFFFY 327

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           DLSPIKV FTE+  SFLHFLT VCAIVG     + IID+FIYHG RA+KKK+E+GKFS
Sbjct: 328 DLSPIKVKFTEKRPSFLHFLTTVCAIVG-----ASIIDSFIYHGHRAVKKKMELGKFS 380


>gi|413945824|gb|AFW78473.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
           partial [Zea mays]
          Length = 284

 Score =  419 bits (1077), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 195/285 (68%), Positives = 235/285 (82%), Gaps = 2/285 (0%)

Query: 12  KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCC 71
           +HDI K RLD+ GNVIE+R+  IG  KI++PLQ+HGGRL+  E YCG+CYGAE SDE CC
Sbjct: 1   RHDIEKIRLDAHGNVIEARKVSIGGAKIERPLQKHGGRLDKGEQYCGTCYGAEESDEQCC 60

Query: 72  NNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF 131
           N+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++ EGCN++GFL+V+KVAGNFHF
Sbjct: 61  NSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHF 120

Query: 132 APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY 191
           APGK F++S + V + L+     FNI+HKINKL+FG  FPGVVNPLDG +WTQ    G Y
Sbjct: 121 APGKGFYESNIDVPE-LSLLEGGFNITHKINKLSFGTEFPGVVNPLDGAQWTQPASDGTY 179

Query: 192 QYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 251
           QYFIKVVPT+YTD+ GH I SNQFSVTEHFR     R +  PGVFFFYD SPIKV FTEE
Sbjct: 180 QYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNV-RPKPQPGVFFFYDFSPIKVIFTEE 238

Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
             S LH+LTN+CAIVGGVFTVSGIID+FIYHGQ+A+KKK+E+GK+
Sbjct: 239 SRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELGKY 283


>gi|79318328|ref|NP_001031077.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|332192090|gb|AEE30211.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 338

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 187/246 (76%), Positives = 224/246 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE+HLDV+HDI K+RLDS GNVIE++QDGIG  KI+KPLQ+HGGRLEHNETYCGSC
Sbjct: 90  MDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHTKIEKPLQKHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GAE+SD+ CCN+CEEVREAYRKKGWALS+P+ IDQCKREGF+Q++K+EEGEGCN++GFL
Sbjct: 150 FGAEASDDACCNSCEEVREAYRKKGWALSDPESIDQCKREGFVQKVKDEEGEGCNVHGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHF PG+SFHQSG   HD+L FQ+ ++NISHK+N+LAFG+ FPGVVNPLDGV
Sbjct: 210 EVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGNYNISHKVNRLAFGDFFPGVVNPLDGV 269

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +W Q   SG+YQYFIKVVP++YTDV  +TIQSNQFSVTEHF++ E GR+Q+ PGVFF+YD
Sbjct: 270 QWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQSNQFSVTEHFQNMEAGRMQSPPGVFFYYD 329

Query: 241 LSPIKV 246
           LSPIKV
Sbjct: 330 LSPIKV 335


>gi|326506194|dbj|BAJ86415.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 363

 Score =  407 bits (1046), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 192/277 (69%), Positives = 227/277 (81%), Gaps = 5/277 (1%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI KKRLDS GNVIESR++GIG  KI+KPLQ+HGGRL   E YCG+CY
Sbjct: 91  DISGEQHQDIRHDIEKKRLDSHGNVIESRKEGIGGTKIEKPLQKHGGRLGKGEEYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K + GEGC+++GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQHGEGCSVHGFLD 210

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           V+KVAGNFHFAPGK +++S V + ++ A     FNI+HKINKL+FG  FPG VNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGYYESNVDMPELSA--EGGFNITHKINKLSFGTEFPGAVNPLDGAQ 268

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE-QGRLQTLPGVFFFYD 240
           WTQ    G YQYFIKVVPT+Y D+ G  I SNQFSVTEHFR    Q R Q  PGVFFFYD
Sbjct: 269 WTQPASDGTYQYFIKVVPTIYNDIRGRKIDSNQFSVTEHFRDGNVQPRPQ--PGVFFFYD 326

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
            SPIKV FTEE+ SFLH+LTN+CAIVGG+FTV+GIID
Sbjct: 327 FSPIKVIFTEENRSFLHYLTNLCAIVGGIFTVAGIID 363


>gi|255578837|ref|XP_002530273.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223530205|gb|EEF32113.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 265

 Score =  407 bits (1045), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 183/246 (74%), Positives = 217/246 (88%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDI GEQH D+KH+I KKR+++ G+VIE R++GIGAPKI+KPLQRHGGRLEHNETYCGSC
Sbjct: 1   MDIMGEQHFDIKHNITKKRINAHGDVIEVRKEGIGAPKIEKPLQRHGGRLEHNETYCGSC 60

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SD+DCCN+C+EVREAYRKKGWAL+  DLIDQCKREGF+Q++K+EEGEGCNIYG L
Sbjct: 61  YGAEMSDDDCCNSCDEVREAYRKKGWALTGVDLIDQCKREGFIQKVKDEEGEGCNIYGSL 120

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHF+PGK  HQS   + D+L FQ DS+NISH IN+LAFG++FPGVVNPLDGV
Sbjct: 121 EVNKVAGNFHFSPGKGLHQSSFFIQDLLVFQGDSYNISHTINRLAFGDYFPGVVNPLDGV 180

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
            W  ETP+GM+QYF+KVVPT+YTD+ G T++SNQ+SVTEHF+ SE  RL + PGVFFFYD
Sbjct: 181 PWVHETPNGMHQYFLKVVPTIYTDIRGRTVRSNQYSVTEHFKKSEFARLDSPPGVFFFYD 240

Query: 241 LSPIKV 246
            SPIKV
Sbjct: 241 FSPIKV 246


>gi|449528843|ref|XP_004171412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Cucumis sativus]
          Length = 355

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 180/259 (69%), Positives = 226/259 (87%)

Query: 38  KIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 97
           +I+KPLQ+HGGRLEHNETYCGSC+GAE+SD+DCCN+CEEVREAYRKKGWA++N DLIDQC
Sbjct: 96  EIEKPLQKHGGRLEHNETYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQC 155

Query: 98  KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           +RE F+Q++K+EEGEGCNI G LEVNKVAG+FHF PGKSF+QS  +   +LA Q   +N+
Sbjct: 156 QREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNV 215

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           SH+IN+LAFG H+ G+VNPLDGV W     + M+QYF+KVVPT+Y ++ G T+ SNQ+SV
Sbjct: 216 SHRINRLAFGNHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSV 275

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           TEHF+S E G  Q++PGVFF+YDLSP+KVT+TEEHV FLHF+T++CAI+GGVF+V+GIID
Sbjct: 276 TEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIID 335

Query: 278 AFIYHGQRAIKKKIEIGKF 296
           AFIYHGQR +KKK+EIGKF
Sbjct: 336 AFIYHGQRKMKKKVEIGKF 354


>gi|218192721|gb|EEC75148.1| hypothetical protein OsI_11348 [Oryza sativa Indica Group]
 gi|222624836|gb|EEE58968.1| hypothetical protein OsJ_10656 [Oryza sativa Japonica Group]
          Length = 355

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/299 (63%), Positives = 232/299 (77%), Gaps = 36/299 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQH D++HDI KKR+D+ GNVIESR+DG+GAPKI++PLQ+HGGRL+HNE YCGSC
Sbjct: 89  MDVSGEQHYDIRHDIIKKRIDNLGNVIESRKDGVGAPKIERPLQKHGGRLDHNEVYCGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG+E SD+ CCN+CE+VR+AYRKKGWAL+N + IDQCKREGF+QR+K+E+GEGC+I+GF+
Sbjct: 149 YGSEESDDQCCNSCEDVRDAYRKKGWALTNIEEIDQCKREGFVQRLKDEQGEGCSIHGFV 208

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VNK                                ISHKINKL+FG  FPGVVNPLDGV
Sbjct: 209 NVNK--------------------------------ISHKINKLSFGVEFPGVVNPLDGV 236

Query: 181 RWTQETP---SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            W QE     +GMYQYF+KVVPT+YTD+ G  I SNQFSVTEHFR +  G  +  PGV+F
Sbjct: 237 EWIQEHTNGLTGMYQYFVKVVPTIYTDIRGRKINSNQFSVTEHFREA-IGYPRPPPGVYF 295

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+GIID+F+YHG RAIKKK+EIGK 
Sbjct: 296 FYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVAGIIDSFVYHGHRAIKKKMEIGKL 354


>gi|413949704|gb|AFW82353.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 398

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 180/266 (67%), Positives = 217/266 (81%), Gaps = 2/266 (0%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI K+RL+S GNVIE+R++GIG  K+++PLQ+HGGRL+  E YCG+CY
Sbjct: 91  DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F+ R+K ++ EGCN+ GFL+
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCAREDFIDRVKTQQDEGCNVLGFLD 210

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           V+KVAGNFHFAPGK F++S + V + L+     FNISHKINKL+FG  FPGVVNPLDG +
Sbjct: 211 VSKVAGNFHFAPGKGFYESNIDVPE-LSLLEGGFNISHKINKLSFGTEFPGVVNPLDGAQ 269

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDL 241
           WTQ    G YQYFIKVVPT+YTD+ G  I SNQFSVTEHFR     R ++ PGVFFFYD 
Sbjct: 270 WTQPASDGTYQYFIKVVPTIYTDIRGRGIHSNQFSVTEHFRDGNV-RPKSQPGVFFFYDF 328

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVG 267
           SPIKV FTEE+ S LH+LTN+CAIVG
Sbjct: 329 SPIKVIFTEENRSLLHYLTNLCAIVG 354


>gi|384252531|gb|EIE26007.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 386

 Score =  368 bits (944), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 177/302 (58%), Positives = 231/302 (76%), Gaps = 14/302 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGE HLDV HD++K+RLDS G VI +S +     P++D  L         NET CGS
Sbjct: 90  MDISGEMHLDVDHDVYKRRLDSNGVVIPDSIEKHQVGPELDDTLLHKA-----NETECGS 144

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA + DE+CCNNCEEVR AYR+KGW  ++P  I QC +EGF+++++ +EGEGC+++G 
Sbjct: 145 CYGA-APDEECCNNCEEVRAAYRRKGWGFTDPQQISQCAKEGFVEKLRAQEGEGCHMWGS 203

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           L VNKVAGNFHFAPGKSF Q  +HVHD++ FQ  +F++SH+I+KL+FG  +PG+ NPLD 
Sbjct: 204 LAVNKVAGNFHFAPGKSFQQGPMHVHDLVPFQGVTFDLSHRIDKLSFGHEYPGMTNPLDR 263

Query: 180 V---RWTQETPSGM---YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
           V   ++    P G+   YQYF+KVVPT+Y +   HTI SNQ+SVTEHF+ S+  + Q LP
Sbjct: 264 VNLPKFNTRNPQGLPGAYQYFLKVVPTIYVNSHNHTINSNQYSVTEHFKGSQDFQAQ-LP 322

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVFF+YDLSPIKV + E  +SFLHFLT+VCAIVGG+FTV+GI+DAFIYHG +AIKKK+++
Sbjct: 323 GVFFYYDLSPIKVKYHETRMSFLHFLTSVCAIVGGIFTVAGIVDAFIYHGHQAIKKKVDL 382

Query: 294 GK 295
           GK
Sbjct: 383 GK 384


>gi|215704311|dbj|BAG93745.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 261

 Score =  363 bits (931), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 168/259 (64%), Positives = 206/259 (79%), Gaps = 2/259 (0%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQH D++HDI K+RLD+ GNVIE+R++GIG  KI+ PLQ+HGGRL   E YCG+C
Sbjct: 1   MDISGEQHHDIRHDIEKRRLDAHGNVIEARKEGIGGAKIESPLQKHGGRLSKGEEYCGTC 60

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQC RE F++R+K ++GEGCN++GFL
Sbjct: 61  YGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVHGFL 120

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V+KVAGN HFAPGK F++S ++V ++ A +   FNI+HKINKL+FG  FPGVVNPLDG 
Sbjct: 121 DVSKVAGNLHFAPGKGFYESNINVPELSALEH-GFNITHKINKLSFGTEFPGVVNPLDGA 179

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
           +WTQ    G YQYFIKVVPT+YTD+ G  I SNQFSVTEHFR     R +  PGVFFFYD
Sbjct: 180 QWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNI-RPKPQPGVFFFYD 238

Query: 241 LSPIKVTFTEEHVSFLHFL 259
            SPIKV   E +   + F+
Sbjct: 239 FSPIKVVTMERNSYVVMFI 257


>gi|126291179|ref|XP_001371602.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Monodelphis domestica]
          Length = 383

 Score =  353 bits (907), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 166/297 (55%), Positives = 218/297 (73%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H+++K+RLD  G  + +  +     ++ K  ++       +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I +L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRRLSFGEDYPGIVNPLDDT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  VSG  ++SNQFSVT H + +  G +  Q LPGVF  
Sbjct: 266 NITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NGLIGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKIE+GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 381


>gi|449265747|gb|EMC76893.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3,
           partial [Columba livia]
          Length = 330

 Score =  353 bits (906), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 172/301 (57%), Positives = 216/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  GN +    E  + G    K+  P      R       
Sbjct: 36  MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 88

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 89  CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 148

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG  +PG+VNP
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 208

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LDG   T +  S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPG
Sbjct: 209 LDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLLGDQGLPG 267

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAIVGG+FTV+G ID+ IYH  RAI+KKIE+G
Sbjct: 268 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 327

Query: 295 K 295
           K
Sbjct: 328 K 328


>gi|224077228|ref|XP_002191084.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Taeniopygia guttata]
          Length = 383

 Score =  351 bits (901), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 170/301 (56%), Positives = 217/301 (72%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++G+Q LDV+H++FK+RLD  GN +    E  + G    K+  P      R       
Sbjct: 89  MDVAGDQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDSIEQCKREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG  +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LDG   T +  S M+QYF+KVVPTVY  V G  +++NQFSVT+H + +  G L  Q LPG
Sbjct: 262 LDGTAVTAQQASMMFQYFVKVVPTVYRKVDGEVVRTNQFSVTQHEKIA-NGLLGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HF+T VCAIVGG+FTV+G ID+ IYH  RAI+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFVTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|148222292|ref|NP_001091124.1| ERGIC and golgi 3 [Xenopus laevis]
 gi|120538715|gb|AAI29573.1| LOC100036873 protein [Xenopus laevis]
          Length = 384

 Score =  351 bits (900), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 167/297 (56%), Positives = 216/297 (72%), Gaps = 5/297 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD     + S  D     K+++ +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADKHELGKLEEHVVLDPKTLDPNR--CESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN+C++VREAYR+KGWA   PD I+QCKREGF Q+++E++ EGC IYGFL
Sbjct: 147 YGAETEDFSCCNSCDDVREAYRRKGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQIYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H+I  L+FG  +PG+VNPLDG 
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGRDYPGLVNPLDGT 266

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
                  S M+QYF+K+VPTVY  V G  +++NQFSVT H + +  G +  Q LPGVF  
Sbjct: 267 SIVAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLPGVFVL 325

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+ +IDA IYH  RAI+KKIE+GK
Sbjct: 326 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVASLIDALIYHSTRAIQKKIELGK 382


>gi|363741418|ref|XP_003642491.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Gallus gallus]
 gi|363741445|ref|XP_003642499.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Gallus gallus]
          Length = 383

 Score =  350 bits (899), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 170/301 (56%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  GN +    E  + G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG  +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGRDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LDG   T +  S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H  F HFLT VCAIVGG+FTV+G ID+ IYH  RAI+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|13384938|ref|NP_079792.1| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Mus
           musculus]
 gi|37999778|sp|Q9CQE7.1|ERGI3_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3; AltName: Full=Serologically defined breast
           cancer antigen NY-BR-84 homolog
 gi|12844094|dbj|BAB26233.1| unnamed protein product [Mus musculus]
 gi|12851518|dbj|BAB29073.1| unnamed protein product [Mus musculus]
 gi|26341008|dbj|BAC34166.1| unnamed protein product [Mus musculus]
 gi|27882157|gb|AAH43720.1| ERGIC and golgi 3 [Mus musculus]
 gi|148674217|gb|EDL06164.1| ERGIC and golgi 3, isoform CRA_d [Mus musculus]
          Length = 383

 Score =  350 bits (898), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|354477966|ref|XP_003501188.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Cricetulus griseus]
 gi|344246673|gb|EGW02777.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Cricetulus griseus]
          Length = 383

 Score =  350 bits (898), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVAV-FDPNSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|348564091|ref|XP_003467839.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cavia porcellus]
          Length = 383

 Score =  350 bits (898), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 168/297 (56%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDLKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKIE+GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGK 381


>gi|157820783|ref|NP_001100003.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Rattus norvegicus]
 gi|149030853|gb|EDL85880.1| ERGIC and golgi 3 (predicted) [Rattus norvegicus]
          Length = 383

 Score =  350 bits (897), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 169/297 (56%), Positives = 218/297 (73%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTV-FDPDSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|284004911|ref|NP_001164802.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Oryctolagus cuniculus]
 gi|217038333|gb|ACJ76626.1| serologically defined breast cancer antigen 84 isoform b
           (predicted) [Oryctolagus cuniculus]
          Length = 383

 Score =  348 bits (894), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 167/297 (56%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|359322740|ref|XP_864582.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Canis lupus familiaris]
          Length = 383

 Score =  348 bits (893), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 166/297 (55%), Positives = 216/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSL---NPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNPLDRT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT+VCAIVGG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTSVCAIVGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|426241390|ref|XP_004014574.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Ovis aries]
          Length = 383

 Score =  347 bits (891), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 216/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|395830112|ref|XP_003788179.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Otolemur garnettii]
          Length = 383

 Score =  347 bits (891), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 166/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|260815243|ref|XP_002602383.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
 gi|229287692|gb|EEN58395.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
          Length = 397

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 169/311 (54%), Positives = 222/311 (71%), Gaps = 19/311 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDI+GEQ +DV H++FK+R+D QGN++ E  ++ +G P  D+ +Q            C S
Sbjct: 92  MDIAGEQQIDVDHNLFKRRMDLQGNILDEPEKEDLGDPS-DEFMQAIKKLENKTADVCES 150

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGAE+ D  CCN CE+VREAYR+KGWA +NPD I+QCKREG+ +++K+++ EGC +YG+
Sbjct: 151 CYGAETEDLKCCNTCEDVREAYRRKGWAFNNPDTIEQCKREGWSEKLKQQKNEGCQVYGY 210

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVH--------VHDILAFQRDSFNISHKINKLAFGEHFP 171
           LEVNKVAGNFHFAPGKSF Q  VH        VHD+  F  + FN+SH +N L+FG   P
Sbjct: 211 LEVNKVAGNFHFAPGKSFQQHHVHVSCFYHPIVHDLQPFGGEKFNLSHHVNHLSFGTDIP 270

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQ 226
           G VNPLDG     +  S MYQYF+K+VPT+Y  +SG  +++NQFSVT+H +     S EQ
Sbjct: 271 GRVNPLDGHMVAAKQGSMMYQYFVKIVPTIYKKISGQEVRTNQFSVTKHQKQVTASSGEQ 330

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           G    LPGVF  Y+LSP+ V FTE+  SF+HFLT VCAIVGGVFTV+G+ID+ IYH  RA
Sbjct: 331 G----LPGVFVLYELSPMMVQFTEKQRSFMHFLTGVCAIVGGVFTVAGLIDSLIYHSARA 386

Query: 287 IKKKIEIGKFS 297
           I++KI++GK S
Sbjct: 387 IQQKIDLGKAS 397


>gi|417399979|gb|JAA46966.1| Putative copii vesicle protein [Desmodus rotundus]
          Length = 383

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKAEMKVFDPNSLDPER------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  + G  +++NQFSVT H + +  G L  Q LPG
Sbjct: 262 LDHTNVTALQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|126291176|ref|XP_001371575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Monodelphis domestica]
          Length = 388

 Score =  347 bits (890), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 166/302 (54%), Positives = 218/302 (72%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H+++K+RLD  G  + +  +     ++ K  ++       +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I +L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRRLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  VSG  ++SNQFSVT H + +  G +  Q LP
Sbjct: 266 PLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NGLIGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKIE+
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIEL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|61555014|gb|AAX46646.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
          Length = 346

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 52  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 104

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 105 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 164

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 165 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 224

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 225 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 283

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 284 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 343

Query: 295 K 295
           K
Sbjct: 344 K 344


>gi|296199725|ref|XP_002747286.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Callithrix jacchus]
 gi|403281165|ref|XP_003932068.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Saimiri boliviensis boliviensis]
          Length = 383

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 166/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|194044515|ref|XP_001929457.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Sus scrofa]
 gi|350594868|ref|XP_003483992.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Sus scrofa]
          Length = 383

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 216/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVAS-GLMGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|326931697|ref|XP_003211962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Meleagris gallopavo]
          Length = 411

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 170/306 (55%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  GN +    E  + G    K+  P      R       
Sbjct: 112 MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 164

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 165 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 224

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG  +P
Sbjct: 225 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGRDYP 284

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLDG   T +  S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 285 GIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 343

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H  F HFLT VCAIVGG+FTV+G ID+ IYH  RAI+K
Sbjct: 344 QGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQK 403

Query: 290 KIEIGK 295
           KIE+GK
Sbjct: 404 KIELGK 409


>gi|344279905|ref|XP_003411726.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Loxodonta africana]
          Length = 386

 Score =  347 bits (889), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 92  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 144

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 145 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 204

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 205 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 264

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 265 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 323

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 324 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 383

Query: 295 K 295
           K
Sbjct: 384 K 384


>gi|164448602|ref|NP_001029525.2| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
           taurus]
 gi|75057944|sp|Q5EAE0.1|ERGI3_BOVIN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|59857621|gb|AAX08645.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|59857623|gb|AAX08646.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|59857741|gb|AAX08705.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|110665562|gb|ABG81427.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 383

 Score =  346 bits (888), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|301762088|ref|XP_002916455.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Ailuropoda melanoleuca]
          Length = 383

 Score =  346 bits (888), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|410953936|ref|XP_003983624.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Felis catus]
          Length = 383

 Score =  346 bits (888), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|426391505|ref|XP_004062113.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Gorilla gorilla gorilla]
 gi|7959731|gb|AAF71038.1|AF116721_14 PRO0989 [Homo sapiens]
          Length = 346

 Score =  346 bits (888), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 52  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 108

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 109 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 168

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 169 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 228

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 229 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 287

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 288 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 344


>gi|95767501|gb|ABF57305.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 376

 Score =  346 bits (888), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 82  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 134

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 135 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 194

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 195 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 254

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 255 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 313

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 314 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 373

Query: 295 K 295
           K
Sbjct: 374 K 374


>gi|95767625|gb|ABF57320.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 380

 Score =  346 bits (887), Expect = 7e-93,   Method: Compositional matrix adjust.
 Identities = 168/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 86  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 138

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 139 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 198

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 199 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 258

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 259 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 317

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 318 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 377

Query: 295 K 295
           K
Sbjct: 378 K 378


>gi|109092202|ref|XP_001098982.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Macaca mulatta]
          Length = 383

 Score =  346 bits (887), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLKTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|431894341|gb|ELK04141.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pteropus alecto]
          Length = 383

 Score =  346 bits (887), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFTQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  + G  +++NQFSVT H + +  G L  Q LPG
Sbjct: 262 LDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 321 VFVLYELSPMVVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|335774962|gb|AEH58414.1| endoplasmic reticulum-golgi intermediat compartment protein 3-like
           protein [Equus caballus]
          Length = 354

 Score =  346 bits (887), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 215/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 60  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 112

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 113 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 172

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 173 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 232

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 233 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 291

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 292 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 351

Query: 295 K 295
           K
Sbjct: 352 K 352


>gi|410262554|gb|JAA19243.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  346 bits (887), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|395510083|ref|XP_003759313.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3, partial [Sarcophilus harrisii]
          Length = 335

 Score =  346 bits (887), Expect = 9e-93,   Method: Compositional matrix adjust.
 Identities = 169/306 (55%), Positives = 218/306 (71%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H+++K+RLD  G+ +    E  + G    K+  P      R       
Sbjct: 36  MDVAGEQQLDVEHNLYKQRLDKDGHPVTTEAERHELGKEEEKVFDPSSLDPER------- 88

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 89  CESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 148

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I +L+FGE +P
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRRLSFGEDYP 208

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V+G  ++SNQFSVT H + +  G +  
Sbjct: 209 GIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVNGEVLRSNQFSVTRHEKVA-NGLIGD 267

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 268 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 327

Query: 290 KIEIGK 295
           KIE+GK
Sbjct: 328 KIELGK 333


>gi|7706278|ref|NP_057050.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Homo sapiens]
 gi|332858219|ref|XP_003316930.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Pan troglodytes]
 gi|397523795|ref|XP_003831904.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Pan paniscus]
 gi|37999823|sp|Q9Y282.1|ERGI3_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3; AltName: Full=Serologically defined breast
           cancer antigen NY-BR-84
 gi|4689108|gb|AAD27763.1|AF077030_1 hypothetical 43.2 kDa protein [Homo sapiens]
 gi|4929577|gb|AAD34049.1|AF151812_1 CGI-54 protein [Homo sapiens]
 gi|7671663|emb|CAB89412.1| ERGIC and golgi 3 [Homo sapiens]
 gi|14602515|gb|AAH09765.1| ERGIC and golgi 3 [Homo sapiens]
 gi|15559308|gb|AAH14014.1| ERGIC and golgi 3 [Homo sapiens]
 gi|119596605|gb|EAW76199.1| ERGIC and golgi 3, isoform CRA_a [Homo sapiens]
 gi|124249802|gb|ABM92879.1| endoplasmic reticulum-localized protein ERp43 [Homo sapiens]
 gi|312152490|gb|ADQ32757.1| ERGIC and golgi 3 [synthetic construct]
 gi|380785591|gb|AFE64671.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|383419067|gb|AFH32747.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|384947602|gb|AFI37406.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|410342895|gb|JAA40394.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  345 bits (886), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|327271489|ref|XP_003220520.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Anolis carolinensis]
          Length = 383

 Score =  345 bits (884), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 167/301 (55%), Positives = 214/301 (71%), Gaps = 14/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  +    E  + G     I  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG  +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHIIKHLSFGRDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LDG   + +  S M+QYF+KVVPT+Y  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 262 LDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA-NGLIGDQGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+KKIE+G
Sbjct: 321 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|363741420|ref|XP_003642492.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Gallus gallus]
 gi|363741447|ref|XP_003642500.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Gallus gallus]
          Length = 388

 Score =  344 bits (883), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 170/306 (55%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  GN +    E  + G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELGKEEEKVFDPNSLDADR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG  +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGRDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLDG   T +  S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H  F HFLT VCAIVGG+FTV+G ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KIE+GK
Sbjct: 381 KIELGK 386


>gi|354477968|ref|XP_003501189.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Cricetulus griseus]
          Length = 388

 Score =  344 bits (883), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 169/302 (55%), Positives = 218/302 (72%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVAV-FDPNSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIKHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|197100234|ref|NP_001126130.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pongo abelii]
 gi|75041559|sp|Q5R8G3.1|ERGI3_PONAB RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|55730450|emb|CAH91947.1| hypothetical protein [Pongo abelii]
          Length = 383

 Score =  344 bits (882), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 164/297 (55%), Positives = 214/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VRE YR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVRETYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|410218732|gb|JAA06585.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  343 bits (881), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 164/297 (55%), Positives = 214/297 (72%), Gaps = 6/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++F +RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFNQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 265

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 266 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 325 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 381


>gi|47575764|ref|NP_001001226.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Xenopus (Silurana) tropicalis]
 gi|82185697|sp|Q6NVS2.1|ERGI3_XENTR RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|45708932|gb|AAH67932.1| ERGIC and golgi 3 [Xenopus (Silurana) tropicalis]
          Length = 384

 Score =  343 bits (881), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 163/297 (54%), Positives = 215/297 (72%), Gaps = 5/297 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD     + S  D     K ++ +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADRHELGKSEEHVVFDPKSLDPNR--CESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN C++VREAYR++GWA   PD I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETDDFSCCNTCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQVYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H+I  L+FG  +PG+VNPLDG 
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIRHLSFGRDYPGLVNPLDGS 266

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
                  S M+QYF+K+VPTVY  V G  +++NQFSVT H + +  G +  Q LPGVF  
Sbjct: 267 SVAAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLPGVFVL 325

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+G+ID+ +Y+  RAI+KKIE+GK
Sbjct: 326 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLVYYSTRAIQKKIELGK 382


>gi|348521804|ref|XP_003448416.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Oreochromis niloticus]
          Length = 384

 Score =  343 bits (880), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 163/299 (54%), Positives = 214/299 (71%), Gaps = 5/299 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD +   +    +     K D         L+ +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELGKADDGEVFDPSTLDPDR--CESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN C++VREAYR++GWA  + D I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIEQCKREGFTQKMQEQKNEGCQVYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG+ +PG+VNPLDG 
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHLIKHLSFGKDYPGLVNPLDGT 266

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  Q LPGVF  
Sbjct: 267 DVTAPQASMMYQYFVKIVPTIYMKTDGEVVKTNQFSVTRHEKVA-NGLIGDQGLPGVFVL 325

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+KKIE+GK S
Sbjct: 326 YELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKTS 384


>gi|148225661|ref|NP_001087591.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Xenopus laevis]
 gi|82181499|sp|Q66KH2.1|ERGI3_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|51513379|gb|AAH80394.1| MGC83277 protein [Xenopus laevis]
          Length = 389

 Score =  342 bits (878), Expect = 9e-92,   Method: Compositional matrix adjust.
 Identities = 166/302 (54%), Positives = 218/302 (72%), Gaps = 10/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD     + S  D     K ++ +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDLDKKPVTSEADRHELGKSEEQVVFDPKTLDPNR--CESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN+C++VREAYR+KGWA   PD I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETDDFSCCNSCDDVREAYRRKGWAFKTPDSIEQCKREGFSQKMQEQKNEGCQVYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H+I  L+FG+ +PG+VN
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHEIKHLSFGKDYPGLVN 266

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLDG        S M+QYF+K+VPTVY  V G  +++NQFSVT H + +  G +  Q LP
Sbjct: 267 PLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKMT-NGLIGDQGLP 325

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IY+  RAI+KKIE+
Sbjct: 326 GVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYYSTRAIQKKIEL 385

Query: 294 GK 295
           GK
Sbjct: 386 GK 387


>gi|387015776|gb|AFJ50007.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3-like
           [Crotalus adamanteus]
          Length = 372

 Score =  342 bits (878), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 168/297 (56%), Positives = 218/297 (73%), Gaps = 17/297 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD         +D +G    ++ L  +   L+     C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLD---------KDELGK---EEELFFNPNSLDPER--CESC 134

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCNNC++VREAYR++GWA  NPD I+QCKREGF ++++E++ EGC +YGFL
Sbjct: 135 YGAESEDIKCCNNCDDVREAYRRRGWAFKNPDTIEQCKREGFSEKMQEQKNEGCKVYGFL 194

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ ++  D+ NI+H I  L+FG+ +PG+VNPLDG 
Sbjct: 195 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSYGLDNINITHFIRHLSFGKDYPGLVNPLDGT 254

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPGVF  
Sbjct: 255 IVTAHQASMMFQYFVKVVPTVYMKVDGEMVRTNQFSVTRHEKIA-NGLIGDQGLPGVFVL 313

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  RAI+KKIE+GK
Sbjct: 314 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARAIQKKIELGK 370


>gi|359322742|ref|XP_851879.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Canis lupus familiaris]
          Length = 388

 Score =  342 bits (877), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 166/302 (54%), Positives = 216/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSL---NPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LP
Sbjct: 266 PLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT+VCAIVGG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|395830114|ref|XP_003788180.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Otolemur garnettii]
 gi|197215642|gb|ACH53034.1| ERGIC and golgi 3 (predicted) [Otolemur garnettii]
          Length = 388

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 166/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFNPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|190402265|gb|ACE77675.1| ERGIC and golgi 3 (predicted) [Sorex araneus]
          Length = 388

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 167/302 (55%), Positives = 217/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     KI+  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGVPVSSEAERHELGKIEVKV-FDPDSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCQREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LP
Sbjct: 266 PLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|259155256|ref|NP_001158869.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Salmo salar]
 gi|223647782|gb|ACN10649.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Salmo salar]
          Length = 388

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 169/308 (54%), Positives = 218/308 (70%), Gaps = 19/308 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPK--IDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  GN +  E+ +  +G  +  I  P +    R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGNPVTTEAEKHDLGQEEGEIFDPSKLDPER------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC I
Sbjct: 142 CESCYGAETEDLKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQI 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG  +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIKHLSFGRDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLDG        S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDGTDVAAPQASMMYQYFVKIVPTIYVKWDGEVVKTNQFSVTRHEKVA-NGLIGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V FTE+  SF HFLT VCAIVGGVFTV+G+ID+ IYH  +AI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIVGGVFTVAGLIDSLIYHSAKAIQK 380

Query: 290 KIEIGKFS 297
           KIE+GK S
Sbjct: 381 KIELGKAS 388


>gi|426241392|ref|XP_004014575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Ovis aries]
          Length = 388

 Score =  341 bits (875), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 168/306 (54%), Positives = 216/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|296199723|ref|XP_002747285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Callithrix jacchus]
 gi|403281167|ref|XP_003932069.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Saimiri boliviensis boliviensis]
 gi|166831592|gb|ABY90117.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Callithrix jacchus]
          Length = 388

 Score =  341 bits (874), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 166/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|440797665|gb|ELR18746.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 383

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 164/297 (55%), Positives = 211/297 (71%), Gaps = 8/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE  LDV+H+IFKKRL + G  +   +  + A     P    G  LE  E  CGSC
Sbjct: 91  MDVSGEHQLDVEHNIFKKRLAADGRPLGIEKGELEAAATPSP----GQELEPIE--CGSC 144

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG+E     CCN C EVRE+YRKKGWA ++P+ I+QC REGF + +++++GEGC +YG +
Sbjct: 145 YGSEQEPGQCCNTCAEVRESYRKKGWAFAHPESIEQCAREGFSENLEKQKGEGCQVYGHI 204

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VNKVAGNFHFAPGKSF    +HVHD+  F+  S+NISH+IN+++FG+ FPGV+NPLDGV
Sbjct: 205 LVNKVAGNFHFAPGKSFQAHHMHVHDLQPFRMSSWNISHRINRISFGKEFPGVINPLDGV 264

Query: 181 RWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
             T +  +G  MYQYF+K+VPT+Y  + G+ I +NQFSVTEH R    G    LPG+F  
Sbjct: 265 EKTTDPGAGSAMYQYFVKIVPTIYESLDGNVINTNQFSVTEHTRMLPPGDKSGLPGLFVM 324

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           YDLSPI V FTE   SF HFLT VCAI+GGVFTV+GIID+ IY+  R + KK+E+GK
Sbjct: 325 YDLSPIMVKFTERTKSFAHFLTGVCAIIGGVFTVAGIIDSLIYNSLRTLGKKMELGK 381


>gi|344279907|ref|XP_003411727.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Loxodonta africana]
          Length = 391

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 92  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 144

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 145 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 204

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 205 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 264

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 265 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 323

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 324 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 383

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 384 KIDLGK 389


>gi|194044517|ref|XP_001929458.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Sus scrofa]
 gi|350594870|ref|XP_003483993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Sus scrofa]
          Length = 388

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 216/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-SGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|229368723|gb|ACQ63006.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Dasypus novemcinctus]
          Length = 388

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|41055991|ref|NP_957309.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform 2 [Danio rerio]
 gi|82210123|sp|Q803I2.1|ERGI3_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|28278376|gb|AAH44474.1| ERGIC and golgi 3 [Danio rerio]
 gi|182890166|gb|AAI64701.1| Ergic3 protein [Danio rerio]
          Length = 383

 Score =  340 bits (872), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 163/308 (52%), Positives = 214/308 (69%), Gaps = 24/308 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESR---------QDGIGAPKIDKPLQRHGGRLE 51
           MD++GEQ LDV+H++FK+RLD  G  + +          ++G+  P    P +       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGQPVTTEAEKHDLGKEEEGVFDPSTLDPDR------- 141

Query: 52  HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
                C SCYGAE+ D  CCN C++VREAYR++GWA   PD I+QCKREGF Q+++E++ 
Sbjct: 142 -----CESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTIEQCKREGFSQKMQEQKN 196

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG+ +P
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHFIKHLSFGKDYP 256

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD         S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  
Sbjct: 257 GIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVKTNQFSVTRHEKIA-NGLIGD 315

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V FTE+  SF HFLT VCAI+GGVFTV+G+ID+ IYH  RAI+K
Sbjct: 316 QGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARAIQK 375

Query: 290 KIEIGKFS 297
           KIE+GK S
Sbjct: 376 KIELGKAS 383


>gi|281346059|gb|EFB21643.1| hypothetical protein PANDA_004535 [Ailuropoda melanoleuca]
          Length = 387

 Score =  340 bits (872), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|410953938|ref|XP_003983625.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Felis catus]
          Length = 388

 Score =  340 bits (872), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|109092200|ref|XP_001098885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Macaca mulatta]
          Length = 388

 Score =  340 bits (872), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 165/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLKTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|38327615|ref|NP_938408.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform a [Homo sapiens]
 gi|281182526|ref|NP_001162565.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Papio anubis]
 gi|397523797|ref|XP_003831905.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Pan paniscus]
 gi|410055053|ref|XP_003953764.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Pan troglodytes]
 gi|57208593|emb|CAI42842.1| ERGIC and golgi 3 [Homo sapiens]
 gi|164623746|gb|ABY64672.1| ERGIC and golgi 3, isoform 1 (predicted) [Papio anubis]
 gi|380785589|gb|AFE64670.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform a [Macaca mulatta]
          Length = 388

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 165/302 (54%), Positives = 215/302 (71%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|301762086|ref|XP_002916454.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Ailuropoda melanoleuca]
          Length = 388

 Score =  340 bits (872), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|184185558|gb|ACC68956.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Rhinolophus ferrumequinum]
          Length = 388

 Score =  340 bits (871), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 215/306 (70%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FGE +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIRHLSFGEDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLD    T    S M+QYF+KVVPTVY  + G  +++NQFSVT H + +  G L  
Sbjct: 262 GIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVLRTNQFSVTRHEKVA-NGLLGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQK 380

Query: 290 KIEIGK 295
           KI++GK
Sbjct: 381 KIDLGK 386


>gi|75077200|sp|Q4R8X1.1|ERGI3_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|67967936|dbj|BAE00450.1| unnamed protein product [Macaca fascicularis]
          Length = 382

 Score =  340 bits (871), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 165/297 (55%), Positives = 215/297 (72%), Gaps = 7/297 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +    G    +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGTPVSSEAERHELGKVEVTV---FGPDSLDPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++G A  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRG-AFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 204

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 205 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 264

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 265 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 323

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 324 YELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 380


>gi|22760064|dbj|BAC11054.1| unnamed protein product [Homo sapiens]
          Length = 388

 Score =  340 bits (871), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 165/302 (54%), Positives = 214/302 (70%), Gaps = 11/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D  N++H I  L+FGE +PG+VN
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDDINMTHYIQHLSFGEDYPGIVN 265

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 266 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 324

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 325 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 384

Query: 294 GK 295
           GK
Sbjct: 385 GK 386


>gi|334310895|ref|XP_003339551.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Monodelphis domestica]
          Length = 396

 Score =  340 bits (871), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 166/310 (53%), Positives = 218/310 (70%), Gaps = 19/310 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H+++K+RLD  G  + +  +     ++ K  ++       +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLYKQRLDKDGRPVTTEAE---RHELGKEEEKAFDPSSLDPERCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS--------FNISHKINKLAFG 167
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+         N++H I +L+FG
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVVLCWYLQINMTHYIRRLSFG 265

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           E +PG+VNPLD    T    S M+QYF+KVVPTVY  VSG  ++SNQFSVT H + +  G
Sbjct: 266 EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLRSNQFSVTRHEKVA-NG 324

Query: 228 RL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
            +  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  R
Sbjct: 325 LIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSAR 384

Query: 286 AIKKKIEIGK 295
           AI+KKIE+GK
Sbjct: 385 AIQKKIELGK 394


>gi|432101449|gb|ELK29631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Myotis davidii]
          Length = 391

 Score =  339 bits (869), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 165/305 (54%), Positives = 215/305 (70%), Gaps = 14/305 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +        H    C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEMKVFDPDSLDPHR---CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--------FNISHKINKLAFGEHFPG 172
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+         N++H I  L+FGE +PG
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNVCTRCCLQINMTHYIRHLSFGEDYPG 265

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--Q 230
           +VNPLD    T    S M+QYF+KVVPTVY  + G  +++NQFSVT H + +  G L  Q
Sbjct: 266 IVNPLDRTNVTALQASMMFQYFVKVVPTVYMKLDGQVLRTNQFSVTRHEKVA-NGLLGDQ 324

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KK
Sbjct: 325 GLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKK 384

Query: 291 IEIGK 295
           I++GK
Sbjct: 385 IDLGK 389


>gi|410926566|ref|XP_003976749.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Takifugu rubripes]
          Length = 384

 Score =  338 bits (868), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 163/301 (54%), Positives = 215/301 (71%), Gaps = 9/301 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD++GEQ LDV+H++FK+RLD   Q    E+ +  +G    D P+         +   C 
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELGGED-DVPVFDPSTL---DPERCE 144

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGAE+ D  CCN+C++VREAYR++GWA  N D I+QCKREGF Q+++E++ EGC +YG
Sbjct: 145 SCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQVYG 204

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            LEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FG+ +PG++NPLD
Sbjct: 205 VLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHLIRHLSFGQDYPGLINPLD 264

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVF 236
               T    S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  Q LPGVF
Sbjct: 265 DTNITAPQASMMYQYFVKIVPTIYVKTDGEVLKTNQFSVTRHEKVA-NGLIGDQGLPGVF 323

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
             Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+KKIE+GK 
Sbjct: 324 VLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKA 383

Query: 297 S 297
           S
Sbjct: 384 S 384


>gi|327271491|ref|XP_003220521.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Anolis carolinensis]
          Length = 388

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 167/306 (54%), Positives = 214/306 (69%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  +    E  + G     I  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFP 171
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG  +P
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHIIKHLSFGRDYP 261

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-- 229
           G+VNPLDG   + +  S M+QYF+KVVPT+Y  V G  +++NQFSVT H + +  G +  
Sbjct: 262 GIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA-NGLIGD 320

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+K
Sbjct: 321 QGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQK 380

Query: 290 KIEIGK 295
           KIE+GK
Sbjct: 381 KIELGK 386


>gi|291231388|ref|XP_002735646.1| PREDICTED: serologically defined breast cancer antigen 84-like,
           partial [Saccoglossus kowalevskii]
          Length = 358

 Score =  337 bits (865), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 160/301 (53%), Positives = 212/301 (70%), Gaps = 9/301 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV H+I K R+D  G  + + +      K ++       +L+ +   C SC
Sbjct: 59  MDVAGEQQLDVDHNIMKSRIDKNGKPVATPEKEDIGDKSEEAKDFDVNKLDPDR--CESC 116

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN CE+VREAYR+KGWA +N D I QC REG+  ++K + GEGC +YG L
Sbjct: 117 YGAESKDLKCCNTCEDVREAYRRKGWAFNNADGIAQCSREGWSDKLKSQSGEGCQVYGHL 176

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF Q  VHVHD+ AF  + FN+SH+IN L+FG  +PG+ NPLD  
Sbjct: 177 EVNKVAGNFHFAPGKSFQQHHVHVHDLQAFSGEKFNLSHRINHLSFGHKYPGMENPLDNS 236

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR------SSEQGRLQTLPG 234
           + T +  S MYQYF+K+VPT YT ++G T +SNQ+SVT+H +      +S  G    LPG
Sbjct: 237 KVTSQKASIMYQYFVKIVPTTYTKLNGATTRSNQYSVTKHEKVVSTSLASAAGE-HGLPG 295

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+ +P+ V +TE+H SF+HF+T VCAI+GGVFTV+G+ID+ IYH  +AIKKKI++G
Sbjct: 296 VFILYEFAPLMVKYTEKHRSFMHFMTGVCAIIGGVFTVAGLIDSMIYHSSKAIKKKIDLG 355

Query: 295 K 295
           K
Sbjct: 356 K 356


>gi|34849462|gb|AAH57130.1| Ergic3 protein [Mus musculus]
          Length = 394

 Score =  337 bits (865), Expect = 3e-90,   Method: Compositional matrix adjust.
 Identities = 169/308 (54%), Positives = 218/308 (70%), Gaps = 17/308 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS------FNISHKINKLAFGEH 169
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+       N++H I  L+FGE 
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNPSDCLQINMTHYIKHLSFGED 265

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L
Sbjct: 266 YPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLL 324

Query: 230 --QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI
Sbjct: 325 GDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAI 384

Query: 288 KKKIEIGK 295
           +KKI++GK
Sbjct: 385 QKKIDLGK 392


>gi|348521802|ref|XP_003448415.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Oreochromis niloticus]
          Length = 389

 Score =  337 bits (864), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 163/304 (53%), Positives = 214/304 (70%), Gaps = 10/304 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD +   +    +     K D         L+ +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELGKADDGEVFDPSTLDPDR--CESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN C++VREAYR++GWA  + D I+QCKREGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAETEDLKCCNTCDDVREAYRRRGWAFKSADTIEQCKREGFTQKMQEQKNEGCQVYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           EVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG+ +PG+VN
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIKHLSFGKDYPGLVN 266

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLDG   T    S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  Q LP
Sbjct: 267 PLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVVKTNQFSVTRHEKVA-NGLIGDQGLP 325

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+KKIE+
Sbjct: 326 GVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIEL 385

Query: 294 GKFS 297
           GK S
Sbjct: 386 GKTS 389


>gi|57208594|emb|CAI42843.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 396

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 165/312 (52%), Positives = 215/312 (68%), Gaps = 21/312 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 87  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 143

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 144 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 203

Query: 121 EVNKVAGNFHFAPGKSFHQSGVH---------------VHDILAFQRDSFNISHKINKLA 165
           EVNKVAGNFHFAPGKSF QS VH               VHD+ +F  D+ N++H I  L+
Sbjct: 204 EVNKVAGNFHFAPGKSFQQSHVHGCVCRLKMIARSLACVHDLQSFGLDNINMTHYIQHLS 263

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + + 
Sbjct: 264 FGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA- 322

Query: 226 QGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
            G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH 
Sbjct: 323 NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHS 382

Query: 284 QRAIKKKIEIGK 295
            RAI+KKI++GK
Sbjct: 383 ARAIQKKIDLGK 394


>gi|196008679|ref|XP_002114205.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
 gi|190583224|gb|EDV23295.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
          Length = 369

 Score =  335 bits (860), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 162/303 (53%), Positives = 213/303 (70%), Gaps = 31/303 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++G Q LD+K ++ K+R+D  G                KP    G  ++ N+T CGSC
Sbjct: 92  MDVAGMQQLDIKQNLMKRRIDENG----------------KPT---GDAVQKNKTKCGSC 132

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+++  CCN+CE+VREAYRKKGWAL++P+ I+QC+ EG+ Q +KE+E EGCN++G+L
Sbjct: 133 YGAENAEMKCCNSCEDVREAYRKKGWALTSPEGIEQCQEEGWAQMLKEQEKEGCNVFGYL 192

Query: 121 EVNKV-AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           EVNKV AGNFHFAPGKSF Q  VHVHD+ +F    FN SH I+KL+FGE FPG++NPLDG
Sbjct: 193 EVNKVVAGNFHFAPGKSFQQHRVHVHDLQSFGSRKFNTSHTIHKLSFGEEFPGIINPLDG 252

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPG 234
            R + +  S MYQYFIKVVPTVY  + G  ++SNQ+SVT+H +       EQG    LPG
Sbjct: 253 HRMSSDQDSAMYQYFIKVVPTVYKKLKGEEVKSNQYSVTKHLKYIKLSMGEQG----LPG 308

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ + + E   SF HFLT VCAI+GGVFTV+ +IDA +YH  + +  KIE+G
Sbjct: 309 VFISYELSPMIIRYAERRKSFAHFLTGVCAIIGGVFTVASLIDAMVYHSAKML--KIELG 366

Query: 295 KFS 297
           K S
Sbjct: 367 KAS 369


>gi|74315943|ref|NP_001028277.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform 1 [Danio rerio]
 gi|72679324|gb|AAI00126.1| ERGIC and golgi 3 [Danio rerio]
          Length = 388

 Score =  334 bits (856), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 163/313 (52%), Positives = 214/313 (68%), Gaps = 29/313 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESR---------QDGIGAPKIDKPLQRHGGRLE 51
           MD++GEQ LDV+H++FK+RLD  G  + +          ++G+  P    P +       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGQPVTTEAEKHDLGKEEEGVFDPSTLDPDR------- 141

Query: 52  HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
                C SCYGAE+ D  CCN C++VREAYR++GWA   PD I+QCKREGF Q+++E++ 
Sbjct: 142 -----CESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTIEQCKREGFSQKMQEQKN 196

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAF 166
           EGC +YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+F
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHFIKHLSF 256

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G+ +PG+VNPLD         S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  
Sbjct: 257 GKDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVKTNQFSVTRHEKIA-N 315

Query: 227 GRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
           G +  Q LPGVF  Y+LSP+ V FTE+  SF HFLT VCAI+GGVFTV+G+ID+ IYH  
Sbjct: 316 GLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSA 375

Query: 285 RAIKKKIEIGKFS 297
           RAI+KKIE+GK S
Sbjct: 376 RAIQKKIELGKAS 388


>gi|405966014|gb|EKC31342.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Crassostrea gigas]
          Length = 397

 Score =  332 bits (852), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 158/304 (51%), Positives = 212/304 (69%), Gaps = 9/304 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGIGAPKIDKPLQRHGGRLEH----- 52
           MD+SGEQ LDV H +FK+RL++ G  I   E  ++G     I +   +    +E      
Sbjct: 92  MDVSGEQQLDVDHHLFKQRLNADGEKIKDTEPEKEGTMYEPIFELGDKSKDAVEAVTKKL 151

Query: 53  NETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           +   C SCYGAE+ D  CCN CE+VREAYRKKGWA ++P+ I+QC REG+  ++K ++ E
Sbjct: 152 DPDRCESCYGAETGDLKCCNTCEDVREAYRKKGWAFNSPEGIEQCNREGWTAKMKAQQKE 211

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC +YG+LEVNKV GNFHFAPGKSF Q  VHVHD+ AF    FN+SH I  L+FG+ +PG
Sbjct: 212 GCQVYGYLEVNKVQGNFHFAPGKSFQQHHVHVHDLQAFGGQKFNLSHAIRHLSFGQDYPG 271

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT- 231
           ++NPLD      E    M+QY++KVVPT Y DV G T+ +NQ+SV +H ++   G   + 
Sbjct: 272 IINPLDQTSQISEDEQTMFQYYVKVVPTTYVDVKGKTLYTNQYSVNKHSKTVGNGMGDSG 331

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           LPGVFF Y+LSP+ V +TE+  SF+HFLT VCAI+GG+FTV+G+ID+ IYH  RA++KKI
Sbjct: 332 LPGVFFIYELSPMMVKYTEKQRSFMHFLTGVCAIIGGIFTVAGLIDSMIYHSSRALQKKI 391

Query: 292 EIGK 295
           E+GK
Sbjct: 392 ELGK 395


>gi|302834369|ref|XP_002948747.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
           nagariensis]
 gi|300265938|gb|EFJ50127.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
           nagariensis]
          Length = 392

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 170/301 (56%), Positives = 213/301 (70%), Gaps = 6/301 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGN-VIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGE HLD+ HD++K+RL + G+ V E  +  + A K   P+  +G         CGS
Sbjct: 94  MDISGELHLDLDHDVYKQRLSANGSPVKEVEKHNVEATKKVVPV--NGTENSTATPVCGS 151

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGAE    DCCN C+EVR AYR+KGWAL+N D I+QC  + + + IKE+ GEGC+++G 
Sbjct: 152 CYGAEDRQGDCCNTCDEVRAAYRRKGWALANVDHIEQCAHDLYTESIKEQTGEGCHMWGM 211

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFHFAPG+S+ Q  +HVHDI  F     +  H +NKL+FG  +PG+ NPLD 
Sbjct: 212 LEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAVIDFRHTVNKLSFGAPYPGMKNPLDN 271

Query: 180 VR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGVF 236
            +  +     +GMYQYF+KVVPT YT +   T+ +NQFSVTE+FR S QG   +TLPGVF
Sbjct: 272 AKAGYKSAAATGMYQYFLKVVPTSYTGIDNKTLATNQFSVTENFRESSQGGAGKTLPGVF 331

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           FFYDLSPIKV   E   SFL FLT+VCAIVGGVFTVSGI+DAFIY   R I+KK+E+GKF
Sbjct: 332 FFYDLSPIKVRIVEHSSSFLSFLTSVCAIVGGVFTVSGIVDAFIYTSTRLIRKKMELGKF 391

Query: 297 S 297
           S
Sbjct: 392 S 392


>gi|327271493|ref|XP_003220522.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 3 [Anolis carolinensis]
          Length = 394

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 167/312 (53%), Positives = 214/312 (68%), Gaps = 25/312 (8%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  +    E  + G     I  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGKHVTPEAERHELGKEEETIFDPNSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN C++VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCKV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS------FNISHKINKLA 165
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+       N++H I  L+
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVSILGKINMTHIIKHLS 261

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG  +PG+VNPLDG   + +  S M+QYF+KVVPT+Y  V G  +++NQFSVT H + + 
Sbjct: 262 FGRDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVRTNQFSVTRHEKIA- 320

Query: 226 QGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
            G +  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH 
Sbjct: 321 NGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHS 380

Query: 284 QRAIKKKIEIGK 295
            R I+KKIE+GK
Sbjct: 381 ARVIQKKIELGK 392


>gi|335304738|ref|XP_003360010.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Sus scrofa]
 gi|350594872|ref|XP_003134465.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Sus scrofa]
          Length = 398

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 167/316 (52%), Positives = 216/316 (68%), Gaps = 29/316 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEIKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS----------FNISHKI 161
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+           N++H I
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVSTGHRCCLQINMTHYI 261

Query: 162 NKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
             L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H 
Sbjct: 262 QHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHE 321

Query: 222 RSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
           + +  G +  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ 
Sbjct: 322 KVAS-GLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSL 380

Query: 280 IYHGQRAIKKKIEIGK 295
           IYH  RAI+KKI++GK
Sbjct: 381 IYHSARAIQKKIDLGK 396


>gi|410926568|ref|XP_003976750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Takifugu rubripes]
          Length = 389

 Score =  332 bits (851), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 163/306 (53%), Positives = 215/306 (70%), Gaps = 14/306 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDS--QGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD++GEQ LDV+H++FK+RLD   Q    E+ +  +G    D P+         +   C 
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELGGED-DVPVFDPSTL---DPERCE 144

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGAE+ D  CCN+C++VREAYR++GWA  N D I+QCKREGF Q+++E++ EGC +YG
Sbjct: 145 SCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQVYG 204

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDSFNISHKINKLAFGEHFPGV 173
            LEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+ N++H I  L+FG+ +PG+
Sbjct: 205 VLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHLIRHLSFGQDYPGL 264

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QT 231
           +NPLD    T    S MYQYF+K+VPT+Y    G  +++NQFSVT H + +  G +  Q 
Sbjct: 265 INPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEVLKTNQFSVTRHEKVA-NGLIGDQG 323

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           LPGVF  Y+LSP+ V FTE+H SF HFLT VCAI+GGVFTV+G+ID+ IYH  R I+KKI
Sbjct: 324 LPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGVFTVAGLIDSLIYHSARVIQKKI 383

Query: 292 EIGKFS 297
           E+GK S
Sbjct: 384 ELGKAS 389


>gi|159470839|ref|XP_001693564.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283067|gb|EDP08818.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 388

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 167/302 (55%), Positives = 209/302 (69%), Gaps = 10/302 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE HLD+  +++         + E +  GIG   +     R+   L +    CGSC
Sbjct: 92  MDISGELHLDLVVELYTLWRRGAAGLTEGKGGGIGVLSVSVSRSRNATALANG---CGSC 148

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE    DCCN C+EVR AYR+KGWALSN D I+QC  + + + IKE+ GEGC+I   +
Sbjct: 149 YGAEDKQGDCCNTCDEVRAAYRRKGWALSNVDHIEQCAHDLYTEAIKEQAGEGCHIG--V 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPG+S+ Q  +HVHDI  F     +  H I+KL+FGE +PG+ NPLDG 
Sbjct: 207 EVNKVAGNFHFAPGRSYQQGSMHVHDIAPFGDAVIDFRHVIHKLSFGEPYPGMKNPLDGA 266

Query: 181 RWTQETP-----SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           +  Q        +GM+QYF+KVVPT YTD+S  T+ +NQFSVTE+FR ++ G  +TLPGV
Sbjct: 267 KAGQAAAAAAAATGMFQYFLKVVPTSYTDLSNKTLSTNQFSVTENFREAQGGAGRTLPGV 326

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FFFYDLSPIKV   E   SFL FLT+VCAIVGGVFTVSGI+DAF+Y G R IKKK+E+GK
Sbjct: 327 FFFYDLSPIKVKIVEHGSSFLSFLTSVCAIVGGVFTVSGIVDAFVYTGTRMIKKKMELGK 386

Query: 296 FS 297
           FS
Sbjct: 387 FS 388


>gi|351702542|gb|EHB05461.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Heterocephalus glaber]
          Length = 378

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 166/301 (55%), Positives = 209/301 (69%), Gaps = 19/301 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++     P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAERHELGKVEVTVFDPESLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAES D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS  HVH     Q    N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQS--HVHGWCCLQ---INMTHYIQHLSFGEDYPGIVNP 256

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPG
Sbjct: 257 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPG 315

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++G
Sbjct: 316 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLG 375

Query: 295 K 295
           K
Sbjct: 376 K 376


>gi|410953940|ref|XP_003983626.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Felis catus]
          Length = 399

 Score =  331 bits (848), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 167/317 (52%), Positives = 215/317 (67%), Gaps = 30/317 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHV-----HDILAFQRDS-----------FNISHK 160
           YGFLEVNKVAGNFHFAPGKSF QS VHV     HD+ +F  D+            N++H 
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNRSRLRCWYCLQINMTHY 261

Query: 161 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
           I  L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H
Sbjct: 262 IRHLSFGEDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRH 321

Query: 221 FRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
            + +  G +  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+
Sbjct: 322 EKVA-NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDS 380

Query: 279 FIYHGQRAIKKKIEIGK 295
            IYH  RAI+KKI++GK
Sbjct: 381 LIYHSARAIQKKIDLGK 397


>gi|355563183|gb|EHH19745.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
           mulatta]
 gi|355784539|gb|EHH65390.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
           fascicularis]
          Length = 401

 Score =  330 bits (845), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 164/315 (52%), Positives = 215/315 (68%), Gaps = 24/315 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 205

Query: 121 EVNKVAGNFHFAPGKSFHQS-GVH-----------------VHDILAFQRDSFNISHKIN 162
           EVNKVAGNFHFAPGKSF QS G +                 VHD+ +F  D+ N++H I 
Sbjct: 206 EVNKVAGNFHFAPGKSFQQSHGTYLTGCVCRLKMIARSLACVHDLQSFGLDNINMTHYIQ 265

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
            L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H +
Sbjct: 266 HLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEK 325

Query: 223 SSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            +  G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ I
Sbjct: 326 VA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLI 384

Query: 281 YHGQRAIKKKIEIGK 295
           YH  RAI+KKI++GK
Sbjct: 385 YHSARAIQKKIDLGK 399


>gi|340373749|ref|XP_003385402.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Amphimedon queenslandica]
          Length = 386

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 164/306 (53%), Positives = 211/306 (68%), Gaps = 18/306 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD+SGE  LDV+H ++K+RL   G VI ES    + A       +   G+       CGS
Sbjct: 90  MDVSGEHQLDVEHTMYKQRLTLDGEVINESPTKSVLARD-----ETQDGKAGAANKTCGS 144

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGAE+ +  CCN CE+VREAYRKKGWA S+P  I+QC++EG+  +IKE+  EGC +YG 
Sbjct: 145 CYGAETPELSCCNTCEQVREAYRKKGWAFSDPSSIEQCEKEGWTTQIKEQMNEGCRVYGL 204

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           ++V+KVAGNFHFAPGKSF Q  VHVHD+  F    FN+SH + KL+FG+ +PG++NPLDG
Sbjct: 205 IDVSKVAGNFHFAPGKSFQQHSVHVHDLQPFGVKHFNMSHTVLKLSFGQEYPGIINPLDG 264

Query: 180 VR-WTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
            + +  ET  G  MYQYFIKVVPT+Y  ++  T+ +NQF+VT+H R     S E G    
Sbjct: 265 HKAFDVETTHGGIMYQYFIKVVPTLYRRLNNETMGTNQFAVTKHQRPVRSASGEHG---- 320

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           LPGVFF YD+SPI V  TE   S  HFLT+VCAIVGGVFTV+G+ID  +YH  R +KKK+
Sbjct: 321 LPGVFFIYDISPILVYLTEYRHSLTHFLTSVCAIVGGVFTVAGMIDKLLYHSGRVLKKKM 380

Query: 292 EIGKFS 297
           E+GK S
Sbjct: 381 ELGKLS 386


>gi|440902508|gb|ELR53293.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
           grunniens mutus]
          Length = 395

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 163/313 (52%), Positives = 209/313 (66%), Gaps = 26/313 (8%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVH------------VHDILAFQRDSFNISHKINKL 164
           YGFLEVNKVAGNFHFAPGKSF QS VH              +   +     N++H I  L
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHGCREEVRVTGARCSEAQGWCCLQINMTHYIRHL 261

Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
           +FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +
Sbjct: 262 SFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA 321

Query: 225 EQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
             G +  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH
Sbjct: 322 -NGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYH 380

Query: 283 GQRAIKKKIEIGK 295
             RAI+KKI++GK
Sbjct: 381 SARAIQKKIDLGK 393


>gi|330790779|ref|XP_003283473.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
 gi|325086583|gb|EGC39970.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
          Length = 383

 Score =  322 bits (825), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 165/299 (55%), Positives = 209/299 (69%), Gaps = 10/299 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE   DV H+IFKKRL S G  I   Q  I   +I+K + ++    E++   CGSC
Sbjct: 89  MDVSGEHQFDVAHNIFKKRLSSTGQPI-IEQPPIREEEINKKIVKN----ENDVQGCGSC 143

Query: 61  YGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           YGAE       CCN CEEVR AY KKGW L +P  + QC REGF + I E+ GEGC +YG
Sbjct: 144 YGAEDPARGIPCCNTCEEVRNAYSKKGWGL-DPSTVSQCLREGFTKNIVEQNGEGCQVYG 202

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
           F+ VNKVAGNFHFAPGKSF Q  +HVHD+  F+   FN+SH INKLA G  FPG+ NPLD
Sbjct: 203 FILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKDGQFNMSHTINKLAVGNEFPGIKNPLD 262

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT-LPGVF 236
            V  T+    GM+QYFIK+VPT+Y  ++G+ I +NQ+SVTEH+R  +++G   T LPG+F
Sbjct: 263 EVTKTEVAGVGMFQYFIKIVPTIYEGLNGNRIATNQYSVTEHYRLLAKKGEEPTGLPGLF 322

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           F YDLSPI +  +E+  SF  FLTNVCAI+GGVFTV GI D+FIY+  + +KKKI++GK
Sbjct: 323 FMYDLSPIMMKVSEKGKSFASFLTNVCAIIGGVFTVFGIFDSFIYYSTKNLKKKIDLGK 381


>gi|414586930|tpg|DAA37501.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 268

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 141/179 (78%), Positives = 164/179 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHD+FK+R+D+ GNVI +RQD +G  K++ PLQ HGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDVFKQRIDAHGNVIATRQDVVGGMKMEAPLQHHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+ SD+ CCN CE+VREAYRKKGW +SNPDL+DQCKREGFLQ IK+EEGEGCNIYGF+
Sbjct: 150 YGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLLDQCKREGFLQSIKDEEGEGCNIYGFI 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           EVNKVAGNFHFAPGKSF QS VHVHD+L FQ+DSFN+SHKIN+L+FGE+FPGVVNPLDG
Sbjct: 210 EVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDSFNVSHKINRLSFGEYFPGVVNPLDG 268


>gi|390359988|ref|XP_792057.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Strongylocentrotus purpuratus]
          Length = 400

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 156/311 (50%), Positives = 213/311 (68%), Gaps = 20/311 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGG-------RLE- 51
           MDISGEQ LDV H+I+K+R+D  G  I E  ++ +G  +  +  +           ++E 
Sbjct: 92  MDISGEQQLDVDHNIYKRRIDKTGTPISEPEKEELGKKEDQEKKEEEDSEQEDEKKKMEV 151

Query: 52  HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
            +   C SCYGAE+    CCN+CE V+EAYR+KGWA S+P  I+QCKREGF ++++ ++ 
Sbjct: 152 LDPNRCESCYGAETPGLKCCNDCEGVQEAYRRKGWAFSDPTSIEQCKREGFSEKMQSQKE 211

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGC +YG+LEVNKVAGNFHFAPGKSF Q  VHVHD+ A     FN++H +  L+FG  +P
Sbjct: 212 EGCELYGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQAIAGAKFNMTHHVKTLSFGMEYP 271

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-------FRSS 224
           G+ NPLD ++      S M+QYF+K+VPT YT +     ++NQ+SVT+H       F + 
Sbjct: 272 GMENPLDNMKTIDVKGSSMFQYFVKIVPTTYTKLDKSITRTNQYSVTKHEKQVTTSFSTG 331

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
           E G    LPGVF  Y+LSP+ V FTE+H SF+HFLT VCAI+GGVFTV+G+ID+ IYH  
Sbjct: 332 EHG----LPGVFVLYELSPLMVKFTEKHRSFMHFLTGVCAIIGGVFTVAGLIDSLIYHSA 387

Query: 285 RAIKKKIEIGK 295
           +AI+KKI++GK
Sbjct: 388 KAIQKKIDLGK 398


>gi|328868763|gb|EGG17141.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium fasciculatum]
          Length = 335

 Score =  319 bits (817), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 162/303 (53%), Positives = 208/303 (68%), Gaps = 17/303 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES----RQDGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SG+   DV H+IFKKRL   G  I      R+D I         +R     E+++  
Sbjct: 40  MDVSGDHQFDVAHNIFKKRLSPTGMPIADASPQREDTIN--------KRVPAGNENDKVD 91

Query: 57  CGSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           CGSCYGAE       CC+ CEEVR AY+KKGW++     I QC REGF + I E+ GEGC
Sbjct: 92  CGSCYGAEDPSRGISCCSTCEEVRTAYQKKGWSIQEYSGIAQCVREGFTKNIVEQNGEGC 151

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            +YGF+ VNKVAGNFHFAPGKSF Q  +HVHD+ AF + SFN+SH IN+L+FG  FPG+ 
Sbjct: 152 QVYGFINVNKVAGNFHFAPGKSFQQHHMHVHDLQAF-KGSFNLSHSINRLSFGNDFPGIK 210

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQTL 232
           NPLDGV  T+   SGM+QY+IKVVPT+Y  ++G+ I +NQFSVTEH+R  + +      L
Sbjct: 211 NPLDGVTKTEMVGSGMFQYYIKVVPTLYEGLNGNRISTNQFSVTEHYRLLAKKDEEPSGL 270

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           PG+FF YDLSPI +  +E+  SF  FLT+VCAIVGGVFTV+GI+D+ IY   + +KKKI+
Sbjct: 271 PGLFFMYDLSPIMMKVSEQGKSFASFLTSVCAIVGGVFTVAGILDSMIYKTTKNLKKKID 330

Query: 293 IGK 295
           +GK
Sbjct: 331 LGK 333


>gi|66801671|ref|XP_629760.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium discoideum AX4]
 gi|74851212|sp|Q54DW2.1|ERGI3_DICDI RecName: Full=Probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3
 gi|60463164|gb|EAL61357.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium discoideum AX4]
          Length = 383

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 162/301 (53%), Positives = 210/301 (69%), Gaps = 13/301 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETY-CG 58
           MD+SGE   DV H+IFKKRL   G  I      I AP I ++ + +     ++N+   CG
Sbjct: 88  MDVSGEHQFDVAHNIFKKRLSPTGQPI------IEAPPIREEEINKKESVKDNNDVVGCG 141

Query: 59  SCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           SCYGAE   +   CCN CEEVR AY KKGW L +P  I QC REGF + + E+ GEGC +
Sbjct: 142 SCYGAEDPSKGIGCCNTCEEVRVAYSKKGWGL-DPSGIPQCIREGFTKNLVEQNGEGCQV 200

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGF+ VNKVAGNFHFAPGKSF Q  +HVHD+  F+  SFN+SH IN+L+FG  FPG+ NP
Sbjct: 201 YGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKDGSFNVSHTINRLSFGNDFPGIKNP 260

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT-LPG 234
           LD V  T+    GM+QYF+KVVPT+Y  ++G+ I +NQ+SVTEH+R  +++G   + LPG
Sbjct: 261 LDDVTKTEMVGVGMFQYFVKVVPTIYEGLNGNRIATNQYSVTEHYRLLAKKGEEPSGLPG 320

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF YDLSPI +  +E   SF  FLTNVCAI+GGVFTV GI D+FIY+  + ++KKI++G
Sbjct: 321 LFFMYDLSPIMMKVSERGKSFASFLTNVCAIIGGVFTVFGIFDSFIYYSTKNLQKKIDLG 380

Query: 295 K 295
           K
Sbjct: 381 K 381


>gi|156389237|ref|XP_001634898.1| predicted protein [Nematostella vectensis]
 gi|156221986|gb|EDO42835.1| predicted protein [Nematostella vectensis]
          Length = 386

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 153/302 (50%), Positives = 203/302 (67%), Gaps = 16/302 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR--LEHNETYCG 58
           MD+SGEQ +DV  +I K+R+D  G +I+       A K D   + H  +  L+ +   C 
Sbjct: 92  MDVSGEQQIDVSSNILKRRVDLDGKIIDE-----NAEKGDLGDKSHEAKELLDLDPNRCE 146

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGAE+ D+ CCN C++VREAYR+KGWALSN D + QC REG+  +++E++ EGC + G
Sbjct: 147 SCYGAETPDKKCCNTCDDVREAYRRKGWALSNVDDVKQCMREGWKDKLQEQKNEGCEVTG 206

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
           +LEVNKVAGNFHFAPGKSF Q  VHVHD+  F    FN++H I  L+FG  +PG   PLD
Sbjct: 207 YLEVNKVAGNFHFAPGKSFQQHHVHVHDLQPFGSTQFNLTHNIKHLSFGHDYPGKTYPLD 266

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLP 233
                      MYQYF+K+VPT Y  +SG  + ++QFSVT+H R     S E G    LP
Sbjct: 267 NTFVPAMEAGSMYQYFVKIVPTTYRKLSGEILHTHQFSVTKHKRVIRQMSGEHG----LP 322

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+ SP+ V +TE   SF+HFLT VCAIVGG+FTV+G++D+ IYH  RA++KKI++
Sbjct: 323 GVFVLYEFSPMMVQYTESRRSFMHFLTGVCAIVGGIFTVAGLVDSMIYHSSRALQKKIDL 382

Query: 294 GK 295
           GK
Sbjct: 383 GK 384


>gi|355686517|gb|AER98082.1| ERGIC and golgi 3 [Mustela putorius furo]
          Length = 304

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 152/277 (54%), Positives = 194/277 (70%), Gaps = 14/277 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 36  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 88

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 89  CESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 148

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 149 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 208

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 209 LDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGLPG 267

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           VF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FT
Sbjct: 268 VFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFT 304


>gi|428183328|gb|EKX52186.1| hypothetical protein GUITHDRAFT_65491 [Guillardia theta CCMP2712]
          Length = 425

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 163/298 (54%), Positives = 199/298 (66%), Gaps = 18/298 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI----ESRQDGIGAPKIDKPLQRHGG---RLEHN 53
           MDISGEQH+DV H+++K+RLD  GNVI     +  +          L+ H G    L   
Sbjct: 122 MDISGEQHIDVHHEVYKQRLDVDGNVILLLSRACLNVTNGSGDFTTLRAHAGFDAPLTGG 181

Query: 54  ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           E  CGSCYGAE S ++CCN C+ VREAYR++GWA  N D I QCK EGFL +++EE  EG
Sbjct: 182 E--CGSCYGAEESPDECCNTCDSVREAYRRRGWAFVNSDGIVQCKTEGFLLKMQEERHEG 239

Query: 114 CNIYGFL-------EVNKVAGNFHFAPGKSF-HQSGVHVHDILAFQRDSFNISHKINKLA 165
           C + G L       +VNKVAGNFHF+PGKSF  Q GVH  D+L  ++  +N+SH IN L+
Sbjct: 240 CRVVGTLQARLTREQVNKVAGNFHFSPGKSFSQQVGVHFQDLLVLRKTDYNVSHAINHLS 299

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG  +PG VNPLDGV    E  S MYQYF+KVVPT Y   +G  + +NQFS TE+ R  E
Sbjct: 300 FGRKYPGRVNPLDGVVRICEFRSAMYQYFVKVVPTQYQYRNGTILSTNQFSTTENTRQLE 359

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
            G  + LPGVFFFYDLSPIK T  E + SFLHFLT +CAI+GGVFTV GIID+ IY G
Sbjct: 360 -GFTRGLPGVFFFYDLSPIKATLAERNNSFLHFLTGLCAIIGGVFTVMGIIDSTIYTG 416


>gi|320167013|gb|EFW43912.1| Ergic3 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 392

 Score =  308 bits (788), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 149/301 (49%), Positives = 202/301 (67%), Gaps = 10/301 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIE--SRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD++GE  LDV H + K RL + G V+   +  + +G     +P      R + + + CG
Sbjct: 94  MDVAGEHQLDVLHTLVKTRLSASGEVVREPTPVEALG----QQPPSDAAERRDLDNSKCG 149

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYGA++    CCN+CEEV+ AYR+KGW + +PD I+QC++EGF +R++    EGC + G
Sbjct: 150 DCYGAQTEKRPCCNSCEEVQAAYREKGWGMMDPDSIEQCRQEGFSERMRSIANEGCKVQG 209

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
           F+ VNKVAGNFHFAPGKS     VHVHD+  F+  +F+++H I+ L+FG  +PG VNPLD
Sbjct: 210 FMYVNKVAGNFHFAPGKSSQHQHVHVHDLQQFKTTTFDMTHTIHLLSFGTEYPGQVNPLD 269

Query: 179 GVRWT--QETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
            V     + TP S M+QYFIKVVPT Y  ++G T Q++QFS T H +       +  LPG
Sbjct: 270 AVSKVPPENTPGSAMFQYFIKVVPTEYVKLNGETEQTSQFSATSHVKMINHAAGENGLPG 329

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           VFF Y+ SP+ V  TE   SF+HFLT VCAIVGGVFTV+G++DA IYH  R+IKKK+E+G
Sbjct: 330 VFFMYEPSPMLVKITERRKSFMHFLTGVCAIVGGVFTVAGLVDATIYHSYRSIKKKMELG 389

Query: 295 K 295
           K
Sbjct: 390 K 390


>gi|326434226|gb|EGD79796.1| intermediate compartment protein 3 [Salpingoeca sp. ATCC 50818]
          Length = 396

 Score =  306 bits (783), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 157/312 (50%), Positives = 206/312 (66%), Gaps = 24/312 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-----------ESRQDGIGAPKIDKPLQRHGGR 49
           MD+SGE  LDV+HDIFK+RL   G  I           +     +GA K+ K        
Sbjct: 92  MDVSGENELDVEHDIFKQRLTETGTPIYEEPEEVDDLGDESDSAVGALKMMKE------G 145

Query: 50  LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           L+ N   C SCYGAES    CCN CE VREAYR+KGWAL++   I+QC+REG+ +++K +
Sbjct: 146 LDPNR--CESCYGAESEQNKCCNTCEAVREAYRRKGWALTDIQGIEQCEREGWTEKLKAQ 203

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAF 166
             EGC IYG LEVNKVAGNFH APGKSF Q  +H HD+ +F R++   FN+SH IN L+F
Sbjct: 204 AKEGCRIYGHLEVNKVAGNFHIAPGKSFQQHSIHFHDLNSFGREALGKFNMSHTINHLSF 263

Query: 167 GEHFPGVVNPLDGVRWTQE-TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           G  +PGVVNPLDG   T +   + MYQY++K+VPT Y    G  + +NQ+SVT H R  +
Sbjct: 264 GIEYPGVVNPLDGHSETADKLGATMYQYYVKIVPTRYRKARGQELNTNQYSVTMHQRHID 323

Query: 226 QGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
               QT LPG+F  +++SPI V  +E   SF HFLT V AI+GG+F+V+G+ID+F+YHG 
Sbjct: 324 HKAGQTGLPGMFVMFEISPILVQLSERTHSFFHFLTGVLAIIGGIFSVAGMIDSFVYHGL 383

Query: 285 RAIKKKIEIGKF 296
           R++KKK E+GK 
Sbjct: 384 RSLKKKQELGKL 395


>gi|307105810|gb|EFN54058.1| hypothetical protein CHLNCDRAFT_25376, partial [Chlorella
           variabilis]
          Length = 312

 Score =  305 bits (782), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 155/307 (50%), Positives = 206/307 (67%), Gaps = 15/307 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET----Y 56
           MDISGE  L+V HD++K+RL   G  +    D  G P+        G   E + T    Y
Sbjct: 9   MDISGEVQLEVDHDVYKRRLSPDGTPL----DEGGCPRAGWLKPVPGNDSEADPTKAPGY 64

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           CGSCYG+ES    CCN C EVR+AYR KGWAL + + ++QC  EG+ + I E++GEGC++
Sbjct: 65  CGSCYGSESRAGQCCNTCAEVRDAYRTKGWALLDVEKVEQCHHEGYKEEIDEQKGEGCHV 124

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG---- 172
           +G L++NKVAGNFH APG+S+ Q  +H+HD+  F   +F+ SH I+KLAFG  +PG    
Sbjct: 125 WGELQINKVAGNFHIAPGRSYQQGNMHIHDLSPFAGQAFDFSHTIHKLAFGREYPGTRGQ 184

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGRLQ 230
            ++       T+    G+YQYF+KVVPT Y+D+  +TI +NQFSVTEHFR  +S      
Sbjct: 185 ALSTFCLSVGTRRERMGLYQYFLKVVPTSYSDLRNNTIYTNQFSVTEHFRETASPTAGGG 244

Query: 231 TLPGVFFFYDLSPIKVTFT-EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            LPGVF FYDLSPIK +      +SFL FLT++CAI+GGVFTVSGIIDA +YHGQ+AIKK
Sbjct: 245 QLPGVFLFYDLSPIKASLEGRARLSFLSFLTSLCAIIGGVFTVSGIIDATVYHGQQAIKK 304

Query: 290 KIEIGKF 296
           K+++GK 
Sbjct: 305 KLDLGKL 311


>gi|281211641|gb|EFA85803.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Polysphondylium pallidum PN500]
          Length = 388

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 157/314 (50%), Positives = 211/314 (67%), Gaps = 29/314 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIE---SRQDGIG-APKIDKPLQRHGGRLEHNETY 56
           MD+SGE   DV H+IFK+RL   G  I     R+D +   PK++          E++   
Sbjct: 87  MDVSGEHQFDVAHNIFKRRLSPTGEFIPDAPKREDNVNIKPKVN----------ENDRPE 136

Query: 57  CGSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           CGSC GAE+  +  +CCN CEEVR AY+K GW   +P    QC REGF + + E+ GEGC
Sbjct: 137 CGSCMGAENPSKGINCCNTCEEVRVAYQKMGWGF-DPSDTPQCVREGFTKNVVEQNGEGC 195

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            +YGFL VNKVAGNFHFAPGKSF Q  +HVHD+ +F +  FN+SH I++L+FG  FPG+ 
Sbjct: 196 QVYGFLLVNKVAGNFHFAPGKSFQQHHMHVHDLQSF-KGQFNLSHTISRLSFGNDFPGIK 254

Query: 175 NPLDGVRWTQETP---------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SS 224
           NPLDGV  T+            SGM+QY++K+VPT+Y  ++G+ I +NQ+SVTEH+R  +
Sbjct: 255 NPLDGVSKTEANQYQYHNLVVGSGMFQYYVKIVPTIYEGLNGNLINTNQYSVTEHYRLLA 314

Query: 225 EQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
           ++G   T LPG+FF YDLSPI +   E   SF  F+T+VCAIVGGVFTV+GI D+FIY  
Sbjct: 315 KKGEEMTGLPGLFFMYDLSPIMMKVVERSKSFASFITSVCAIVGGVFTVAGIFDSFIYQT 374

Query: 284 QRAIKKKIEIGKFS 297
            +++K+KI++GK S
Sbjct: 375 TKSLKRKIDLGKAS 388


>gi|332248939|ref|XP_003273622.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Nomascus leucogenys]
          Length = 380

 Score =  301 bits (770), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 154/302 (50%), Positives = 201/302 (66%), Gaps = 19/302 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC   G LQR + E    C++    
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCPARG-LQRTQPENERECSL---- 200

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVH-----DILAFQRDSFNISHKINKLAFGEHFPGVVN 175
              +VAGNFHFAPGKSF QS VHVH     D+ +F  D+ N++H I  L+FGE +PG+VN
Sbjct: 201 ---QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNINMTHYIQHLSFGEDYPGIVN 257

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLP 233
           PLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LP
Sbjct: 258 PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLP 316

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++
Sbjct: 317 GVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDL 376

Query: 294 GK 295
           GK
Sbjct: 377 GK 378


>gi|383864675|ref|XP_003707803.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Megachile rotundata]
          Length = 385

 Score =  293 bits (750), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 144/308 (46%), Positives = 197/308 (63%), Gaps = 21/308 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
           MD +GEQHL ++H+I+K+RLD QG  IE  Q      K D    K L +   +   + T 
Sbjct: 88  MDTTGEQHLQIEHNIYKRRLDLQGKPIEDPQ------KTDITDTKALSKTTAKSVESTTV 141

Query: 57  --CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
             CG CYGA S    CCN CE+VR+AY  K WA  +P  I QC+ +  ++++K    +GC
Sbjct: 142 ETCGDCYGAASEKIKCCNTCEDVRKAYSDKNWAPPDPGSIKQCQNDKSVEKMKTAFTQGC 201

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            IYG++EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  
Sbjct: 202 QIYGYMEVNRVGGSFHIAPGNSFSVNHVHVHDVQPYMSTQFNMTHKIRHLSFGLNIPGKT 261

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRL 229
           NP+D         + M+ ++IK+VPT Y    G T+ +NQFSVT H R     S E G  
Sbjct: 262 NPIDDTTMVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLLSGESG-- 319

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             +PG+FF Y+LSP+ V +TE+  SF HF TN+CAI+GGVFTV+G+ID+F+YH  RAI+K
Sbjct: 320 --MPGIFFSYELSPLMVKYTEKAKSFGHFATNMCAIIGGVFTVAGLIDSFLYHSVRAIQK 377

Query: 290 KIEIGKFS 297
           KIE+GK+S
Sbjct: 378 KIELGKYS 385


>gi|167535515|ref|XP_001749431.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772059|gb|EDQ85716.1| predicted protein [Monosiga brevicollis MX1]
          Length = 394

 Score =  291 bits (745), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 150/306 (49%), Positives = 201/306 (65%), Gaps = 11/306 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQ---DGIGAPKIDKPLQRH-GGRLEHNETY 56
           MDISGE   ++ HD+F++RLD+ GN I + Q   D +G    D    +   G  + +   
Sbjct: 89  MDISGENEQNIDHDVFRQRLDASGNKIYNGQEEIDELGESHADNVADKALDGLKDLDPNR 148

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE ++  CCN C +V+EAYRKKGWA  +   I QC+REG+   ++ +E EGC +
Sbjct: 149 CESCYGAEDTEGQCCNTCAQVQEAYRKKGWAFRSGQGIAQCEREGYDAMMEAQEREGCQL 208

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLAFGEHFPGV 173
           YG LEVNKVAGNFH APG+SF Q  +H+HD+ +F R+    FN++H IN L+FG  +P  
Sbjct: 209 YGHLEVNKVAGNFHIAPGRSFEQHNMHIHDMQSFGREKLAKFNLTHVINHLSFGIDYPDR 268

Query: 174 VNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS--SEQGRLQ 230
           VN LDG V    E  + MYQYF+KVVPT Y  +S   I +NQ+SVT H R    +QG   
Sbjct: 269 VNSLDGHVEVPNEYGAIMYQYFLKVVPTRYRFLSQTEIDTNQYSVTMHQREIRPDQG-TS 327

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            LPG+FF YD+SP+K+  T+   SF HFLT +CAI+GGV+TV+G+ID F+YHG R +K K
Sbjct: 328 GLPGLFFMYDISPMKIQLTQSSRSFFHFLTGLCAIIGGVYTVAGMIDGFLYHGIRTLKAK 387

Query: 291 IEIGKF 296
             +GK 
Sbjct: 388 QNMGKL 393


>gi|441638772|ref|XP_004090166.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Nomascus leucogenys]
          Length = 393

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 154/315 (48%), Positives = 201/315 (63%), Gaps = 32/315 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC   G LQR + E    C++    
Sbjct: 146 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCPARG-LQRTQPENERECSL---- 200

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVH-----DILAFQRDS-------------FNISHKIN 162
              +VAGNFHFAPGKSF QS VHVH     D+ +F  D+              N++H I 
Sbjct: 201 ---QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNVQLWMSSGWCCLQINMTHYIQ 257

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
            L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H +
Sbjct: 258 HLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEK 317

Query: 223 SSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            +  G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ I
Sbjct: 318 VA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLI 376

Query: 281 YHGQRAIKKKIEIGK 295
           YH  RAI+KKI++GK
Sbjct: 377 YHSARAIQKKIDLGK 391


>gi|332024433|gb|EGI64631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Acromyrmex echinatior]
          Length = 386

 Score =  290 bits (743), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 138/302 (45%), Positives = 193/302 (63%), Gaps = 8/302 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD +GEQHL ++H+IFK+RLD  GN IE  Q   I   K           +      CG 
Sbjct: 88  MDTTGEQHLHIEHNIFKRRLDLNGNPIEDPQRTNITDAKAMSKTTEKAVEIGSTTELCGD 147

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA +    CCN CE+V EAYR+K WA  +P  + QC+ +  + ++K    +GC IYG+
Sbjct: 148 CYGATTDTMKCCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMDKLKHAFTQGCQIYGY 207

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           +EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+DG
Sbjct: 208 MEVNRVGGSFHIAPGASFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPGKTNPMDG 267

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGV 235
           +       + M+ ++IK+VPT Y    G T+ +NQFSVT H   S++  L T    +PG+
Sbjct: 268 MTVVDMDAAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRH---SKKVSLLTGESGMPGI 324

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI++KIE+GK
Sbjct: 325 FFNYELSPLMVKYTEKANSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELGK 384

Query: 296 FS 297
           ++
Sbjct: 385 YN 386


>gi|307179776|gb|EFN67966.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Camponotus floridanus]
          Length = 385

 Score =  287 bits (734), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 143/303 (47%), Positives = 197/303 (65%), Gaps = 13/303 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNET-YC 57
           MD +GEQHL ++H+IFK+RLD  G  IE   R +   +  ++K  ++    LE   T  C
Sbjct: 88  MDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTNITDSKAVNKTAEK---ALEIGSTESC 144

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           G CYGA +    CCN CEEVREAY+ K WA  +P  I QCK +  +++IK    +GC IY
Sbjct: 145 GDCYGAATETLRCCNTCEEVREAYKLKKWAPPDPANIKQCKDDKSMEKIKHAFTQGCQIY 204

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G++EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+
Sbjct: 205 GYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTHFNMTHKIRHLSFGLNIPGKTNPM 264

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LP 233
           D         + M+ ++IK+VPT Y    G T+ +NQFSVT H   ++Q  L T    +P
Sbjct: 265 DDTTVIATEGAMMFYHYIKIVPTTYVRTDGSTLFTNQFSVTRH---AKQVSLFTGESGMP 321

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           G+FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKIE+
Sbjct: 322 GIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQKKIEL 381

Query: 294 GKF 296
           GK+
Sbjct: 382 GKY 384


>gi|340721521|ref|XP_003399168.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Bombus terrestris]
          Length = 385

 Score =  286 bits (731), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 140/302 (46%), Positives = 187/302 (61%), Gaps = 9/302 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +GEQHL ++H+IFK+RLD  G  IE  Q         +            E  CG C
Sbjct: 88  MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKARSKTTEKTVESTTEKACGDC 147

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA      CCN CE+VREAYR K WA     +I QCK +  +++IK    +GC IYG++
Sbjct: 148 YGAAGDIIKCCNTCEDVREAYRLKNWAPPALGMIKQCKNDKSVEKIKTAFTQGCQIYGYM 207

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+D  
Sbjct: 208 EVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTSTQFNMTHKIRHLSFGLNIPGKTNPMDDT 267

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGV 235
                  + M+ ++IK+VPT Y    G T+ +NQFSVT H R     S E G    +PG+
Sbjct: 268 TVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----MPGI 323

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKIE+GK
Sbjct: 324 FFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSVRAIQKKIELGK 383

Query: 296 FS 297
           ++
Sbjct: 384 YN 385


>gi|350404831|ref|XP_003487234.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Bombus impatiens]
          Length = 385

 Score =  285 bits (730), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 140/302 (46%), Positives = 188/302 (62%), Gaps = 9/302 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +GEQHL ++H+IFK+RLD  G  IE  Q         +            E  CG C
Sbjct: 88  MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKARSKTTTKTVESTTEKACGDC 147

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA      CCN CE+VREAYR K WAL    +I QCK +  ++++K    +GC IYG++
Sbjct: 148 YGAAGDIIKCCNTCEDVREAYRLKNWALPALGMIKQCKNDKSVEKMKTAFIQGCQIYGYM 207

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+D  
Sbjct: 208 EVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTSTQFNMTHKIRHLSFGLNIPGKTNPMDDT 267

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGV 235
                  + M+ ++IK+VPT Y    G T+ +NQFSVT H R     S E G    +PG+
Sbjct: 268 TVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----MPGI 323

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKIE+GK
Sbjct: 324 FFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSVRAIQKKIELGK 383

Query: 296 FS 297
           ++
Sbjct: 384 YN 385


>gi|380016121|ref|XP_003692037.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Apis florea]
          Length = 385

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 143/305 (46%), Positives = 192/305 (62%), Gaps = 15/305 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHN-ETYC 57
           MD +GEQHL ++H+IFK+RLD  G  IE   R D      + K   +    LE   E  C
Sbjct: 88  MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKALSKTTAK---TLESTTEKIC 144

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           G CYGA S    CCN CE+VREAYR K WA      I QC+ +  ++++K    +GC IY
Sbjct: 145 GDCYGAASEIIKCCNTCEDVREAYRLKNWAPPVLGNIKQCQNDKSVEKMKTAFTQGCQIY 204

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G++EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+
Sbjct: 205 GYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTQFNMTHKIRHLSFGLNIPGKTNPM 264

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTL 232
           D         + M+ ++IK+VPT Y    G T+ +NQFSVT H R     S E G    +
Sbjct: 265 DDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG----M 320

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           PG+FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKIE
Sbjct: 321 PGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSLRAIQKKIE 380

Query: 293 IGKFS 297
           +GK++
Sbjct: 381 LGKYN 385


>gi|328786822|ref|XP_393819.4| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Apis mellifera]
          Length = 383

 Score =  285 bits (729), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 145/306 (47%), Positives = 194/306 (63%), Gaps = 19/306 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHN-ETYC 57
           MD +GEQHL ++H+IFK+RLD  G  IE   R D      + K   +    LE   E  C
Sbjct: 88  MDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDITDTKALSKTTAK---TLESTTEKIC 144

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWA-LSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           G CYGA S    CCN CE+VREAYR K WA L N   I QC+ +  ++++K    +GC I
Sbjct: 145 GDCYGAASEIIKCCNTCEDVREAYRLKNWAVLGN---IKQCQNDKSVEKMKTAFTQGCQI 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YG++EVN+V G+FH APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP
Sbjct: 202 YGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTSTQFNMTHKIRHLSFGLNIPGKTNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
           +D         + M+ ++IK+VPT Y    G T+ +NQFSVT H R     S E G    
Sbjct: 262 MDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHARQVSLFSGESG---- 317

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           +PG+FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKI
Sbjct: 318 MPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGVFTVAGLIDSLLYHSLRAIQKKI 377

Query: 292 EIGKFS 297
           E+GK++
Sbjct: 378 ELGKYN 383


>gi|297602842|ref|NP_001052965.2| Os04g0455900 [Oryza sativa Japonica Group]
 gi|255675519|dbj|BAF14879.2| Os04g0455900 [Oryza sativa Japonica Group]
          Length = 253

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 125/157 (79%), Positives = 144/157 (91%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG++HLDVKHDIFK+R+D  GNVI ++QD +G  K+++PLQRHGGRLEHNETYCGSC
Sbjct: 90  MDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAVGGMKVEQPLQRHGGRLEHNETYCGSC 149

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE SDE CCN+CE+VREAYRKKGW +SNPDLIDQCKREGFLQ IK+EEGEGCNIYGFL
Sbjct: 150 YGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLIDQCKREGFLQSIKDEEGEGCNIYGFL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           EVNKVAGNFHFAPGKSF ++ VHVHD+L FQ+DSFN+
Sbjct: 210 EVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDSFNV 246


>gi|270007946|gb|EFA04394.1| hypothetical protein TcasGA2_TC014693 [Tribolium castaneum]
          Length = 385

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 149/305 (48%), Positives = 193/305 (63%), Gaps = 19/305 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR--LEHNETYCG 58
           MD SGEQHL + H+I+K+RLD QG  IE  +      K D  ++R         N+T CG
Sbjct: 90  MDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK------KEDITIKRKNSTEVATVNKTECG 143

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWAL-SNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           SCYGA    + CCN CE+VREAYR++ WA   NP+ I QCK E F +++K    +GC IY
Sbjct: 144 SCYGASFDPKRCCNTCEDVREAYRERRWAFPENPENITQCKEERFSEKLKTAFAQGCQIY 203

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV-NP 176
           G L VN+V+G+FH APGKSF  + VHVHD+  F    FN +HKI  L+FG        NP
Sbjct: 204 GSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSSTEFNTTHKIRHLSFGASIDSDTHNP 263

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQT 231
           L       E  + M+QY IK+VPT Y  + G  I +NQFSVT+H R     S E G    
Sbjct: 264 LKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFISANQFSVTKHRRVISLMSGESG---- 319

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           +PG+FF Y+LSP+ V +TE+  SF HF TNVCAI+GGV+TV+G+ID  +YH  + I+KKI
Sbjct: 320 MPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGVYTVAGLIDTMLYHSVKLIQKKI 379

Query: 292 EIGKF 296
           E+GKF
Sbjct: 380 ELGKF 384


>gi|189237821|ref|XP_974331.2| PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
          Length = 395

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 149/307 (48%), Positives = 193/307 (62%), Gaps = 21/307 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR----LEHNETY 56
           MD SGEQHL + H+I+K+RLD QG  IE  +      K D  ++R           N+T 
Sbjct: 98  MDSSGEQHLQIDHNIYKRRLDLQGQPIEEPK------KEDITIKRKNSTEVSVATVNKTE 151

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWAL-SNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CGSCYGA    + CCN CE+VREAYR++ WA   NP+ I QCK E F +++K    +GC 
Sbjct: 152 CGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPENITQCKEERFSEKLKTAFAQGCQ 211

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV- 174
           IYG L VN+V+G+FH APGKSF  + VHVHD+  F    FN +HKI  L+FG        
Sbjct: 212 IYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSSTEFNTTHKIRHLSFGASIDSDTH 271

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRL 229
           NPL       E  + M+QY IK+VPT Y  + G  I +NQFSVT+H R     S E G  
Sbjct: 272 NPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFISANQFSVTKHRRVISLMSGESG-- 329

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             +PG+FF Y+LSP+ V +TE+  SF HF TNVCAI+GGV+TV+G+ID  +YH  + I+K
Sbjct: 330 --MPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGVYTVAGLIDTMLYHSVKLIQK 387

Query: 290 KIEIGKF 296
           KIE+GKF
Sbjct: 388 KIELGKF 394


>gi|444729170|gb|ELW69597.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Tupaia chinensis]
          Length = 393

 Score =  278 bits (712), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 148/334 (44%), Positives = 196/334 (58%), Gaps = 70/334 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-------------------- 40
           MD++GEQ LDV+H++FK+RLD  G  + +  +     KI+                    
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSTEAERHELGKIEVKVFDPNSLDPDRCESCYGA 148

Query: 41  -----KPLQRHG----GRLE--------HNETYCGSCYGAESSDEDCCNNCEEVREAYRK 83
                KP         G++E         +   C SCYGAES D  CCN CE+VREAYR+
Sbjct: 149 ESEDIKPCLEAADLELGKIEVKVFDPNSLDPDRCESCYGAESEDIKCCNTCEDVREAYRR 208

Query: 84  KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
           +GWA  NPD I+QC+REGF Q+++E++ EGC +YGFLEVNK+                  
Sbjct: 209 RGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKI------------------ 250

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
                       N++H I  L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY 
Sbjct: 251 ------------NMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 298

Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
            V G  +++NQFSVT H + +  G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT 
Sbjct: 299 KVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 357

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 358 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 391


>gi|242007856|ref|XP_002424735.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
 gi|212508228|gb|EEB11997.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
          Length = 376

 Score =  276 bits (706), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 150/300 (50%), Positives = 195/300 (65%), Gaps = 20/300 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD SGEQHL ++H+I+K  LD  G  I+  +         KP+       E  E  CGSC
Sbjct: 92  MDSSGEQHLQIEHNIYKVSLDKNGIPIKEPEK----ETFVKPVN------ETKEKKCGSC 141

Query: 61  YGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           YGAES   +  CCN C +V++AY K+GW L+N +LI+QCK    L +      EGC IYG
Sbjct: 142 YGAESETLNITCCNTCADVKDAYMKRGWGLNNLELIEQCKN---LSQ-NNIFNEGCFIYG 197

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            +EVN+V G+FH APG+SF  + VHVHD+  F   +FN SHKI+ L+FG + PG  NPLD
Sbjct: 198 TMEVNRVGGSFHIAPGQSFSINHVHVHDVQPFSSKAFNTSHKIDHLSFGYNIPGKTNPLD 257

Query: 179 GVRWTQETPSGMYQYFIKVVPTV--YTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
           G+       + M+QY+IK+VPT+  Y D SG TI +NQFSVT H +S  +  +   PG+F
Sbjct: 258 GIVALTHEGATMFQYYIKIVPTIYYYYDKSG-TILTNQFSVTRHQKSGSE-TIGVPPGIF 315

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           F Y+L+PI V +TE   SF HF TNVCAI+GGVFTV+ +IDAF+Y   +A KKKIEIGKF
Sbjct: 316 FNYELAPIMVKYTERKRSFGHFATNVCAIIGGVFTVASLIDAFLYRSVQAFKKKIEIGKF 375


>gi|307193219|gb|EFN76110.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Harpegnathos saltator]
          Length = 386

 Score =  275 bits (704), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 131/304 (43%), Positives = 190/304 (62%), Gaps = 12/304 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD +G Q+L ++H+IF++RLD  G  IE   R +      + KP      ++      CG
Sbjct: 88  MDTTGVQYLQIEHNIFQRRLDLNGKPIEDPQRTNITKTKAVVKPTDEET-QISSTTKVCG 146

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYGA +   +CCN C++V+ AYR K WA+ +   I QC+ +    + K    +GC IYG
Sbjct: 147 DCYGAATETLECCNTCDDVQMAYRLKKWAMPDLAKIKQCQNDKSADKYKHAFTQGCQIYG 206

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
           ++EVN+V G+FH APG S+  + VHVHD+  +  + FN++HKI  L+FG + PG  NP+D
Sbjct: 207 YMEVNRVGGSFHIAPGDSYSVNHVHVHDVQPYNSNHFNMTHKIRHLSFGLNIPGKTNPMD 266

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-----SEQGRLQTLP 233
                    + M+ Y+IK+VPT Y    G T+ +NQFSVT H +      S+ G    +P
Sbjct: 267 DTTTVATEGAMMFYYYIKIVPTTYVRADGSTLLTNQFSVTRHSKRMPLYMSDSG----MP 322

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           G+FF Y+LSP+ V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI+KKIE+
Sbjct: 323 GIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQKKIEL 382

Query: 294 GKFS 297
           GK++
Sbjct: 383 GKYN 386


>gi|170031960|ref|XP_001843851.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Culex quinquefasciatus]
 gi|167871431|gb|EDS34814.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Culex quinquefasciatus]
          Length = 391

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 144/305 (47%), Positives = 197/305 (64%), Gaps = 12/305 (3%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
           D +GEQHL + H+IFK+RLD +GN IE+ +++ I APK  K   +            CGS
Sbjct: 90  DATGEQHLHIDHNIFKRRLDLKGNPIEAPKKEDIQAPKPRKDATEAPVVNSSTTANPCGS 149

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL--IDQCKREGFLQRIKEEE---GEGC 114
           CYGA+ +   CCN C++V +AYR+K W   NP L   +QCK E  + ++  E     EGC
Sbjct: 150 CYGAQKNSSHCCNTCQDVIDAYREKQW---NPTLEEFEQCKTEVAIGKLSLEAKAFNEGC 206

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GV 173
            IYG++EVN+V G+FH APGKSF  S +HVHD+  F    FN++H IN L+FGE F  G 
Sbjct: 207 QIYGYMEVNRVGGSFHIAPGKSFSISHIHVHDVQPFSSSRFNMTHHINTLSFGEEFGFGQ 266

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTL 232
            +PLDG     E  + M+QY+IK+VPT +  +SG  + +NQFSVT H +S S       +
Sbjct: 267 TSPLDGTDVIAEEGAMMFQYYIKIVPTEFVPLSGPKLHTNQFSVTTHRKSVSLMSGDSGM 326

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           PG+F  Y+LSP+ V FTE+  SF HF TN+CAI+GG+FTVSGI+D  ++    A+K+KIE
Sbjct: 327 PGIFVNYELSPLMVKFTEKRSSFSHFATNLCAIIGGIFTVSGIVDTLLFTSIHALKRKIE 386

Query: 293 IGKFS 297
           +GK S
Sbjct: 387 LGKAS 391


>gi|156552683|ref|XP_001599365.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Nasonia vitripennis]
          Length = 328

 Score =  274 bits (701), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 134/303 (44%), Positives = 192/303 (63%), Gaps = 16/303 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES-RQDGIGAPK--IDKPLQRHGGRLEHNETYC 57
           MD +GE HL+++H+IFK+RLD  G  IE  ++ GI  PK   +KP +    +       C
Sbjct: 36  MDTTGETHLEIQHNIFKRRLDLDGKPIEDPKKTGIADPKKTTEKPAENATAK-------C 88

Query: 58  GSCYGAESSDE--DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           G CYGA S +    CCN CEEV+EAYRK+ WA+ +     QCK +   +   +E   GC 
Sbjct: 89  GDCYGAASEELGIKCCNTCEEVKEAYRKRKWAVHDTSRFAQCKNDKSREMTFKE---GCQ 145

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           IYGF+EVN+V G+FH APG S     +HVHD+  +    FN++H+I  L+FG + PG  N
Sbjct: 146 IYGFMEVNRVGGSFHIAPGDSITIDHLHVHDVQPYSSSQFNLTHRIRHLSFGTNIPGKTN 205

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
           P+D         + M+ ++IK+VPT +  + G  + +NQFS+T+H RS +Q   ++ +PG
Sbjct: 206 PIDNTTVIASEGATMFHHYIKIVPTTFMRLDGSILHTNQFSLTKHSRSIKQYSGESGMPG 265

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF Y+LSP+ V +T+   S  H +TN CAI+GG FTV+ IIDAF+YH  RAI+KK+E+G
Sbjct: 266 LFFSYELSPLMVKYTQTVKSLGHLMTNTCAIIGGTFTVASIIDAFLYHSVRAIQKKMELG 325

Query: 295 KFS 297
           K S
Sbjct: 326 KLS 328


>gi|157118753|ref|XP_001653244.1| ptx1 protein [Aedes aegypti]
 gi|108875623|gb|EAT39848.1| AAEL008391-PA [Aedes aegypti]
          Length = 384

 Score =  273 bits (697), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 145/304 (47%), Positives = 199/304 (65%), Gaps = 17/304 (5%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           D +GEQHL ++H I+K+R+D QGN IE ++++ I APK    L++     E N   C SC
Sbjct: 90  DATGEQHLHIEHTIYKRRMDLQGNPIEEAKKEDISAPK--PRLEKK----EENVKKCRSC 143

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEE---GEGCN 115
           YGAE +   CC  C++V +AYR+K W   NP+L D  QC+ E  L +   E     EGC 
Sbjct: 144 YGAEKNSTHCCETCQDVIDAYREKQW---NPNLDDFEQCQNEVLLGKKSLESKAFSEGCQ 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVV 174
           IYG ++VN+V G+FH APGKSF  S +HVHD+  F    FN SH+IN L+FGE F  G  
Sbjct: 201 IYGSMQVNRVGGSFHIAPGKSFSISHIHVHDVQPFSSSRFNTSHRINTLSFGEEFGYGQT 260

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLP 233
            PLD    T    + M+QY+IK+VPT +  ++G T+ +NQFSVT+H +S S       +P
Sbjct: 261 RPLDFTEKTAHEGAIMFQYYIKIVPTEFVPLNGPTLHTNQFSVTKHQKSVSVMSGESGMP 320

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           G+F  Y+LSP+ V FTE+  SF HF TN+CAI+GG+FTV+GIID+ ++    A+K+KIE+
Sbjct: 321 GIFVNYELSPLMVRFTEKRNSFSHFATNLCAIIGGIFTVAGIIDSLLFTSIHALKRKIEL 380

Query: 294 GKFS 297
           GKFS
Sbjct: 381 GKFS 384


>gi|412994036|emb|CCO14547.1| predicted protein [Bathycoccus prasinos]
          Length = 436

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 152/334 (45%), Positives = 193/334 (57%), Gaps = 40/334 (11%)

Query: 1   MDISGEQHLDV-KHDIFKKRLDSQG-------NVIESRQDGIGAPKIDKPLQRHGGRLEH 52
           MD+SGE HLDV  H++ K R D  G       N    +++ +     D         L  
Sbjct: 100 MDVSGETHLDVVDHEMRKIRYDRYGVKLADALNDEHGKEEVVNEKAFDSNETETASSLRK 159

Query: 53  NET------------------YCGSCYGAESS------DEDCCNNCEEVREAYRKKGWAL 88
           N+T                  YCGSCYGA+ S      ++ CC  CEEVREAY + GWA 
Sbjct: 160 NKTKKTAKELIPRYMEDGKTKYCGSCYGADVSGANRGREQRCCQTCEEVREAYIEVGWAF 219

Query: 89  SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
           +    ++QCKREGF + +     EGC   GFL+VNKV GNFH APGKSF Q   HVHD+ 
Sbjct: 220 TGASSMEQCKREGFSEVLGNVHEEGCEFKGFLDVNKVQGNFHIAPGKSFQQGEQHVHDLS 279

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETP--SGMYQYFIKVVPTVYTDVS 206
            F    FN SH++  L+FGE +PG V+PLDG + T + P  +G+YQYF ++VPT YT ++
Sbjct: 280 PFPDGKFNFSHEVRHLSFGEGYPGKVDPLDGTKRTLKLPAETGVYQYFFRIVPTTYTYLN 339

Query: 207 --GHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
                I +NQ+SV +HF+    +S QG    LPGVFFFYDLSPIKV   E   S   FL 
Sbjct: 340 PFKKDISTNQYSVVDHFKPVDAASIQGGSSDLPGVFFFYDLSPIKVDIAEYRTSVWKFLA 399

Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
            VCA VGGVF VSGI+D  +Y G  AIKKKI++G
Sbjct: 400 EVCASVGGVFAVSGIVDKVVYKGSLAIKKKIQLG 433


>gi|198425065|ref|XP_002127888.1| PREDICTED: similar to ERGIC and golgi 3 [Ciona intestinalis]
          Length = 385

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 134/300 (44%), Positives = 192/300 (64%), Gaps = 16/300 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID----KPLQRHGGRLEHNETY 56
           +D+SG++ +DV+H + K+ L+S G+ +        A K+D    KP+            Y
Sbjct: 95  IDVSGQRDIDVQHTLVKQPLNSDGSWVAE-----AAEKVDLVGTKPVL--NATEPPPADY 147

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           CGSC+GAE+ D  CCN C +++EAYR+KGWA      I  C  E      KE  G GC +
Sbjct: 148 CGSCFGAETKDMTCCNTCSDIKEAYRRKGWAFPRDGSITPCIGE---DDDKEPVGSGCYL 204

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAFGEHFPGVVN 175
           +G LEVN+VAGNFH +PGKS+    +HVHD+    +    N+SH  N L+FG  +PG V+
Sbjct: 205 HGHLEVNRVAGNFHISPGKSYEVGHMHVHDMARMGKYKESNVSHVFNHLSFGSTYPGQVH 264

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           PLD +       S  +QY++K+VPT Y  +SG T  +NQFSVT H + ++  R ++LPG+
Sbjct: 265 PLDNLEVIASESSVAFQYYVKIVPTTYEKLSGDTFHTNQFSVTRHQKRNKDSR-ESLPGM 323

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           F  Y+LSP+ V + E   SF+HFLT+VCAI+GG+FTV+G+ D+FIYHG +A++KKIE+GK
Sbjct: 324 FVSYELSPMMVRYVERRRSFVHFLTSVCAIIGGIFTVAGLFDSFIYHGSKALQKKIELGK 383


>gi|145340712|ref|XP_001415464.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575687|gb|ABO93756.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 379

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 138/304 (45%), Positives = 194/304 (63%), Gaps = 15/304 (4%)

Query: 1   MDISGEQHLDV-KHDIFKKRLDSQGNVIE--SRQDGIGAPKIDKPLQRH--GGRLEHNET 55
           MD++GE  LDV + ++   R+D++G  I   S +  + A       +R   GGR     +
Sbjct: 82  MDVTGETRLDVSRSEVRTTRVDARGRAIAMTSERTAVNAKTEAGEREREATGGR-----S 136

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
            CG CYGA  +   CC++C+ VREAYR KGWAL +   + QC +E  +  ++ E  EGC+
Sbjct: 137 ACGDCYGAAEAGT-CCDDCDSVREAYRVKGWALPDLRRVTQCTKEYDVVAMRNEHKEGCH 195

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISHKINKLAFGEHFPGVV 174
             G  EVNKVAGNFH APGKS++  G HVHD+  F   +SFN SH I+KL+FGE FPGVV
Sbjct: 196 FSGHFEVNKVAGNFHIAPGKSYNNLGQHVHDLSPFAGVESFNFSHIIHKLSFGEEFPGVV 255

Query: 175 NPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVS--GHTIQSNQFSVTEHFRSSEQGRLQT 231
           NPLDGV R   +  +G+YQY + VVP  Y  +      ++SN +SVT+HFR  +  +   
Sbjct: 256 NPLDGVTRTMDDANAGVYQYRLSVVPARYKYLGFRARVVESNDYSVTDHFRGFDVTKNPG 315

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           LPG+FFFYDLSP++V + E  + F  +L+NV AI+GGV  V  I+D  +Y GQRA+++K+
Sbjct: 316 LPGLFFFYDLSPLRVEYEERRIGFFQYLSNVAAIIGGVSAVVNIVDGLVYRGQRALREKV 375

Query: 292 EIGK 295
           ++GK
Sbjct: 376 DLGK 379


>gi|358334909|dbj|GAA53334.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Clonorchis sinensis]
          Length = 323

 Score =  266 bits (680), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 130/298 (43%), Positives = 182/298 (61%), Gaps = 13/298 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +GEQ +DV   I+K R+DS G+ I + +   G P         G  +  +  YCGSC
Sbjct: 28  MDSTGEQKIDVSQQIYKTRIDSTGSPISATRRDDGNPS-------KGQVVTKDPDYCGSC 80

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES    CCN C+E++ AY+++ W + N  + +QC+ E +   +     EGC I G L
Sbjct: 81  YGAESETRKCCNTCKEIQLAYQERHWVVKNLSVFEQCREEQWDDTLANLGSEGCRIQGSL 140

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +VNKVAG+FH  PG S+    VHVH++  F     N+SHKI+KLAFG  +PG  NPLDG 
Sbjct: 141 QVNKVAGSFHITPGNSYASDQVHVHNLQGFDGQKLNMSHKIDKLAFGNMYPGQTNPLDGT 200

Query: 181 RWTQETPSGMYQYFIKVVPTVY-----TDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPG 234
                 P+ M  Y++K+VPT+Y     T  S  T+ +NQ+SVT H + S      + +PG
Sbjct: 201 TMNVVEPAQMVTYYMKLVPTMYVSYNTTTRSLSTVHTNQYSVTWHSKGSPLTSDSSGIPG 260

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           +FF Y+LSP+ V  + EH SFLHFLTN CAI+GGVFTV+ ++DAFIY     ++K++ 
Sbjct: 261 LFFNYELSPLLVKISYEHKSFLHFLTNTCAIIGGVFTVASLLDAFIYQSTCVVRKRLS 318


>gi|150036309|emb|CAO03349.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 325

 Score =  263 bits (673), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 128/240 (53%), Positives = 167/240 (69%), Gaps = 6/240 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 90  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 146

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 147 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 206

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 207 EVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHT 266

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFF 238
             T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  
Sbjct: 267 NVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVL 325


>gi|328770814|gb|EGF80855.1| hypothetical protein BATDEDRAFT_19389 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 409

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 137/301 (45%), Positives = 195/301 (64%), Gaps = 9/301 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE   ++ H + K R+D  GN++E +Q  +G       +++    +  +  YCGSC
Sbjct: 110 MDVSGEHQNNLPHSMHKVRIDQLGNLLE-KQKKLGNTN-SSGVKKEIRDMALDPKYCGSC 167

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG  + +  CCN CE+V+EAY + GW+ ++PD I+QC REG+ +R++ +  E CNIYG +
Sbjct: 168 YGGVAPESKCCNTCEQVQEAYERSGWSFTDPDSIEQCVREGWSKRMETQINEACNIYGHI 227

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHFPGVVNPLD 178
           EVNKV GN HFAPG SF Q+ +HVHD+  +     SFN  H I++L+FGE     VNPLD
Sbjct: 228 EVNKVQGNIHFAPGHSFQQNALHVHDLHDYNAPNGSFNFKHTIHELSFGES-SSFVNPLD 286

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ--GRLQT-LPGV 235
            V  T  T    YQY+IKVV T  + ++G  + +NQFSVTEH +      G L   +PG 
Sbjct: 287 TVTKTPPTKYFSYQYYIKVVGTDISYLNGSQLTTNQFSVTEHEQDVTPLFGALPIGMPGK 346

Query: 236 FFF-YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
            FF +++SP+ V F E    F HFLT++CAI+GGVFTV+G+IDA ++  QR+I+ K+EIG
Sbjct: 347 LFFNFEISPMLVKFKEFRKPFTHFLTDLCAIIGGVFTVAGMIDALLFATQRSIQAKVEIG 406

Query: 295 K 295
           K
Sbjct: 407 K 407


>gi|158300475|ref|XP_320382.3| AGAP012144-PA [Anopheles gambiae str. PEST]
 gi|157013177|gb|EAA00591.3| AGAP012144-PA [Anopheles gambiae str. PEST]
          Length = 386

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 193/308 (62%), Gaps = 23/308 (7%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET------ 55
           D +GEQHL ++H+I+K+RLD QGN IE        PK  + +Q    R+   E       
Sbjct: 90  DSTGEQHLHIEHNIYKRRLDLQGNQIEE-------PK-KEDIQASTKRISSTEAPATTTV 141

Query: 56  --YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEEG 111
              CGSCYGA  +   CCN C+EV +AYR++ W   NP++ D  QCK         +   
Sbjct: 142 KPACGSCYGAAKNASQCCNTCQEVIDAYRERKW---NPNVEDFEQCKNGNGGSVEGKAFS 198

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGC+IYG +EVN+V G FH APGKSF  + +HVHD+  +    FN +H+IN L+FGE F 
Sbjct: 199 EGCHIYGTMEVNRVEGRFHIAPGKSFSINHIHVHDVQPYSSSRFNTTHRINTLSFGEQFG 258

Query: 172 -GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
            G   PLDG+       + M+QY+IK+VPT++  ++G T+ +NQFSVT+H +S      +
Sbjct: 259 FGTTRPLDGLMVEATEGAMMFQYYIKIVPTMFVPLNGPTLYTNQFSVTKHQKSVTAMSGE 318

Query: 231 T-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           T +PG+F  Y+LSP+ V FTE+  S  HF TNVCAI+GG+FTV+GIID+ ++     IK+
Sbjct: 319 TGMPGIFVNYELSPLMVKFTEKRNSLGHFATNVCAIIGGIFTVAGIIDSLLFTSIHVIKR 378

Query: 290 KIEIGKFS 297
           KIE+GK S
Sbjct: 379 KIELGKAS 386


>gi|357612408|gb|EHJ67977.1| hypothetical protein KGM_08440 [Danaus plexippus]
          Length = 385

 Score =  261 bits (667), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 133/298 (44%), Positives = 181/298 (60%), Gaps = 8/298 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SGEQHL + H++ K+RLD  G  I E  ++ I      K         E     CGS
Sbjct: 91  MDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKEDISLSSTVKQ-----NSSEIAIVTCGS 145

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA  +D  CCN CE+V+EAYR + WAL +   ++QCK +  L+R      EGC IYG+
Sbjct: 146 CYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLATVEQCKDDDSLERTNLALKEGCQIYGY 205

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLD 178
           +EVN+V G+FH APGKSF  + VHVHD+  F    FN +H I  L+FG         PLD
Sbjct: 206 MEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSSVFNTTHIIRHLSFGSDIESANTAPLD 265

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFF 237
           G+    +  + M+QY++K+VPT+Y  + G  + +NQFSVT H +S     +++ +PG FF
Sbjct: 266 GITGLAKEGAVMFQYYLKIVPTMYVKLDGTILHTNQFSVTRHQKSVSNINVESGMPGAFF 325

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            Y+LSP+ V +T +  S  HF TNVCAIVGGVFTV+GI D  +YH   A + K+ +GK
Sbjct: 326 SYELSPLMVKYTAKGRSIGHFATNVCAIVGGVFTVAGIFDTLLYHSLNAFQNKVVLGK 383


>gi|296481082|tpg|DAA23197.1| TPA: endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Bos taurus]
          Length = 306

 Score =  260 bits (664), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 124/224 (55%), Positives = 157/224 (70%), Gaps = 11/224 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNP
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIRHLSFGEDYPGIVNP 261

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
           LD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H
Sbjct: 262 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRH 305


>gi|291000812|ref|XP_002682973.1| predicted protein [Naegleria gruberi]
 gi|284096601|gb|EFC50229.1| predicted protein [Naegleria gruberi]
          Length = 416

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/326 (42%), Positives = 196/326 (60%), Gaps = 40/326 (12%)

Query: 1   MDISGEQHLDVK-HDIFKKRLDSQGN-VIESRQDGIGAPKIDKP----LQRHGGRLEHN- 53
           MD+SGE H+ +  H ++K RL   G  +IE + + +     DKP    L+   G ++H+ 
Sbjct: 85  MDVSGEHHVHLDYHTVYKMRLTLDGKPIIEQQAEQVSD---DKPTLDILKPPPGAVKHDL 141

Query: 54  -------------------ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
                                YCGSCYG+      CCN C++VRE+YR+ GWA S  + I
Sbjct: 142 VNNAELDKIRAERAKKVKDPKYCGSCYGSNRDANQCCNTCDDVRESYRRVGWAFSPNEDI 201

Query: 95  DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
           +QC  E   +++K  + EGCN++G+  VNKVAGNFHFAPGKSF ++  H+HD   ++ D 
Sbjct: 202 EQCYEEILERKMKYSKQEGCNLHGYFLVNKVAGNFHFAPGKSFVRAQQHMHDYTNYEVDH 261

Query: 155 FNISHKINKLAFGEHFPGVVNPLDG----VRWTQET------PSGMYQYFIKVVPTVYTD 204
           FN SH IN L FGE  PG++NPLDG    + +  ET       S ++QYF+KVVPT+Y  
Sbjct: 262 FNTSHIINYLGFGEKIPGLINPLDGTSKIIGYNAETGQRVEGESALFQYFVKVVPTIYEK 321

Query: 205 V-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
             S ++I +NQ+SVT+H R   +     +PGVFF YDLSPI V  TE   SF+ FLT++C
Sbjct: 322 YGSSNSIITNQYSVTQHSRPKNRLHPNVVPGVFFIYDLSPIMVHITENKKSFVQFLTSLC 381

Query: 264 AIVGGVFTVSGIIDAFIYHGQRAIKK 289
           AI+GGVFTVS ++D  IY  ++ + +
Sbjct: 382 AIIGGVFTVSALLDRVIYGVEKKMNR 407


>gi|345319994|ref|XP_001507420.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Ornithorhynchus anatinus]
          Length = 203

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 121/204 (59%), Positives = 150/204 (73%), Gaps = 3/204 (1%)

Query: 70  CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
           CCN CE+VREAYR++GWA  NPD I+QCKREGF Q+++E++ EGC +YGFLEVNKVAGNF
Sbjct: 1   CCNTCEDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 60

Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG 189
           HFAPGKSF QS VH  + L       N++H I  L+FGE +PG+VNPLDG   +    S 
Sbjct: 61  HFAPGKSFQQSHVHGKERLRIHPRPINMTHYIEHLSFGEDYPGIVNPLDGTDVSAPQASM 120

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 247
           M+QYF+KVVPTVY    G  +++NQFSVT H + +  G +  Q LPGVF  Y+LSP+ V 
Sbjct: 121 MFQYFVKVVPTVYVKADGEVVRTNQFSVTRHEKVA-NGLIGDQGLPGVFVLYELSPMMVK 179

Query: 248 FTEEHVSFLHFLTNVCAIVGGVFT 271
            TE+H SF HFLT VCAI+GGVFT
Sbjct: 180 LTEKHRSFTHFLTGVCAIIGGVFT 203


>gi|321463520|gb|EFX74535.1| hypothetical protein DAPPUDRAFT_226626 [Daphnia pulex]
          Length = 381

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 129/297 (43%), Positives = 189/297 (63%), Gaps = 10/297 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           +D SGEQ   V+H+IFK+RL+  G  +++ +      +I+K   +     E + +  C S
Sbjct: 91  VDSSGEQQFGVEHNIFKQRLNLLGEPLQAAE----LEEINKTHNKTETSTEESASKPCNS 146

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA+   E CC  C EVREAYR+K WA   P+  +QC+ E  L R      EGC +YG+
Sbjct: 147 CYGAK---EGCCETCAEVREAYRQKNWAF-RPEEFEQCRNEKNLTRDYSAFKEGCKLYGY 202

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVN+V+G+FH APGKS+  + VHVHD+  +  + FN++H IN L+FG    G  NPLDG
Sbjct: 203 LEVNRVSGSFHIAPGKSYAINHVHVHDVQPYSSEDFNVTHHINSLSFGTSLIGKENPLDG 262

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFF 238
              T +  + M+QY+IKVVPT Y  + G    +NQ+SVT H +  S  G    +PGVFF 
Sbjct: 263 FLTTADKGAMMFQYYIKVVPTWYVKLDGEEFHTNQYSVTRHQKVVSSYGGESGVPGVFFT 322

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           Y++SP+++++ E   S  HF T+VC I+GGVFTV+GIID+ +Y   + +++K+++GK
Sbjct: 323 YEMSPLQISYKESKRSIGHFATDVCTIIGGVFTVAGIIDSLLYRSSKLLQQKLQLGK 379


>gi|226486462|emb|CAX74360.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
          Length = 379

 Score =  250 bits (639), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 125/293 (42%), Positives = 175/293 (59%), Gaps = 10/293 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +G Q L+V H+++K  +   GN + +          D  L         +  YCGSC
Sbjct: 90  MDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRH--TVNDDSALTT-----TRDPNYCGSC 142

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+S    CCN CEEV+ AY +  W   N    +QC+ E +    +    EGC I+G L
Sbjct: 143 YGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSL 202

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VN+V G FH APG S+ ++  HVH I +     FN+SH I +L FG+ +PG +N LDG 
Sbjct: 203 TVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQFNVSHSITELRFGDAYPGQINSLDGT 262

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFF 237
           + T + PS M+ Y++K+VPT+YT VS +  T+ +NQ+S T H R S   G  Q LPGVFF
Sbjct: 263 KMTVDKPSQMFNYYLKLVPTMYTSVSNNESTLITNQYSATWHSRGSPLSGDGQGLPGVFF 322

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            Y+++P+ V  TEE  SF+HFLTN CAI+GGVFTV+ ++DAFIY     ++ +
Sbjct: 323 NYEIAPLLVKITEERKSFVHFLTNTCAIIGGVFTVASLLDAFIYQSSCVLRNR 375


>gi|56753075|gb|AAW24747.1| SJCHGC09363 protein [Schistosoma japonicum]
 gi|226486460|emb|CAX74359.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
 gi|226486464|emb|CAX74361.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
          Length = 379

 Score =  250 bits (639), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 125/293 (42%), Positives = 175/293 (59%), Gaps = 10/293 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +G Q L+V H+++K  +   GN + +          D  L         +  YCGSC
Sbjct: 90  MDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRH--TVNDDSALTT-----TRDPNYCGSC 142

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+S    CCN CEEV+ AY +  W   N    +QC+ E +    +    EGC I+G L
Sbjct: 143 YGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFEQCRNENWDGMKRNIGNEGCRIHGSL 202

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VN+V G FH APG S+ ++  HVH I +     FN+SH I +L FG+ +PG +N LDG 
Sbjct: 203 TVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQFNVSHSITELRFGDAYPGQINSLDGT 262

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVFF 237
           + T + PS M+ Y++K+VPT+YT VS +  T+ +NQ+S T H R S   G  Q LPGVFF
Sbjct: 263 KMTVDKPSQMFNYYLKLVPTMYTSVSNNESTLITNQYSATWHSRGSPLSGDGQGLPGVFF 322

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            Y+++P+ V  TEE  SF+HFLTN CAI+GGVFTV+ ++DAFIY     ++ +
Sbjct: 323 NYEIAPLLVKITEERKSFVHFLTNTCAIIGGVFTVASLLDAFIYQSSCVLRNR 375


>gi|358054679|dbj|GAA99605.1| hypothetical protein E5Q_06306 [Mixia osmundae IAM 14324]
          Length = 424

 Score =  249 bits (637), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 129/318 (40%), Positives = 185/318 (58%), Gaps = 35/318 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   D+ HDI K RLD  G ++++ +D      +   L+R  G ++    YCGSC
Sbjct: 93  MDISGEHQNDIHHDILKNRLDKSGALVQATRD----STLKGELERAVG-VKREPGYCGSC 147

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG    D  CCN C+EVRE+Y ++GW+  NPD IDQC REGF ++IKE+  EGCN+ G +
Sbjct: 148 YGGAPGDSGCCNTCDEVRESYVRRGWSFVNPDGIDQCVREGFSEKIKEQSEEGCNVAGQV 207

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFG----------- 167
           +VNKV GNFH +PGKSF  +  HVHD++ +       +  H IN+ +F            
Sbjct: 208 KVNKVIGNFHLSPGKSFQSNMHHVHDLVPYLAAGQQHDFGHIINRFSFAAEGDDGFNRET 267

Query: 168 ---EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
              +    + +PL GVR   E  + M+QYF+KVV T +  + G T+ S+Q+SVT++ R  
Sbjct: 268 ARLKQSLNIEDPLTGVRAHTEQSNYMFQYFVKVVSTKFKTLDGRTLSSHQYSVTQYERDL 327

Query: 225 EQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
            +G                  +PG+FF Y++SP+ V   EE  SF HF+T+ CAIVGG+ 
Sbjct: 328 SKGNKPGKDEDGHQTSHGYAGVPGLFFNYEISPMLVVHREERQSFAHFITSTCAIVGGIL 387

Query: 271 TVSGIIDAFIYHGQRAIK 288
           TV+G+ID  +Y  Q  ++
Sbjct: 388 TVAGLIDTLVYSSQTRLQ 405


>gi|389612123|dbj|BAM19583.1| ptx1 protein [Papilio xuthus]
          Length = 285

 Score =  249 bits (636), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 129/292 (44%), Positives = 179/292 (61%), Gaps = 16/292 (5%)

Query: 11  VKHDIFKKRLDSQGNVIES-RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDED 69
           + H+I K+RLD  GN IE  +++ I    I   ++++   L      CGSCYGA  +D  
Sbjct: 1   MDHNIHKRRLDLDGNPIEEPKKEEIA---ISSTVKQNTSELA--TVTCGSCYGAAFNDSQ 55

Query: 70  CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
           CCN CE+V+EAYR + WAL +   I QCK +  L++      EGC IYG++EVN+V G+F
Sbjct: 56  CCNTCEDVKEAYRIRRWALPDLATIVQCKDDESLEKANLALKEGCQIYGYMEVNRVGGSF 115

Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLDGVRWTQETPS 188
           H APGKSF  + VHVHD+  +   +FN +H I  L+FG         PLDGV+   +  +
Sbjct: 116 HIAPGKSFTINHVHVHDVQPYSSSAFNTTHXIQHLSFGSDIKSANTAPLDGVKGIAQEGA 175

Query: 189 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-----SEQGRLQTLPGVFFFYDLSP 243
            M+QY+IK+ PT+Y  +    + +NQFSVT H +S     SE G    +PG FF Y+LSP
Sbjct: 176 VMFQYYIKIGPTMYVKLDKTVLHTNQFSVTRHQKSVSNINSESG----MPGAFFSYELSP 231

Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           + V +TE+  S  HF TN+CAI+GGVFTV+GI+D  +YH   A   KI +GK
Sbjct: 232 LMVKYTEKERSIGHFATNICAIIGGVFTVAGILDTLLYHSLNAFHNKIVLGK 283


>gi|349804919|gb|AEQ17932.1| putative ergic and golgi 3 [Hymenochirus curtipes]
          Length = 228

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 120/231 (51%), Positives = 162/231 (70%), Gaps = 5/231 (2%)

Query: 6   EQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAES 65
           EQ LDV+H++FK RLD     + S  +     K ++P+      L+ +   C SCYGAE+
Sbjct: 1   EQQLDVEHNLFKLRLDKDRQPVSSEAERHDLGKAEEPVIFDPKSLDPDR--CESCYGAET 58

Query: 66  SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKV 125
            D  CCN+C++VREAYR++GWA   PD I+QCKREGF Q+++E++ EGC +YGFLEVNKV
Sbjct: 59  DDFRCCNSCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCRVYGFLEVNKV 118

Query: 126 AGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 185
           AGNFHFAPGKSF QS VHVHD+ +F  D+ N++H+I  L+FG  +PG+VNPLDG   +  
Sbjct: 119 AGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGMDYPGLVNPLDGTSVSAV 178

Query: 186 TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPG 234
             S M+QYF+K+VPTVY  V G  +++NQFSVT H + +  G +  Q LPG
Sbjct: 179 QSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKVT-NGLIGDQGLPG 228


>gi|195378906|ref|XP_002048222.1| GJ11466 [Drosophila virilis]
 gi|194155380|gb|EDW70564.1| GJ11466 [Drosophila virilis]
          Length = 372

 Score =  246 bits (629), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
           MD SG+ HL V HD+FK RLD +G            P  + P++        N+ + CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLEGQ-----------PLKETPIKEIVAVSPPNKNSTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRVRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+R   QE+ S M+ Y++K+VPT+Y   S G  I +NQFSVT H R     R + +PG+
Sbjct: 252 DGLRVEVQESKSEMFNYYLKIVPTLYERHSDGQPIYTNQFSVTRH-RKDLTDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E HVSF HF TN C+IVGGVFTV+GI+   + +   A+++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHVSFGHFATNCCSIVGGVFTVAGILAVLLNNSWEALQRKLEVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|331241265|ref|XP_003333281.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309312271|gb|EFP88862.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 421

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 136/323 (42%), Positives = 187/323 (57%), Gaps = 46/323 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-----HNET 55
           MDISGE   DV HD+ K RL   G            P      Q+  G LE       + 
Sbjct: 93  MDISGEHQNDVAHDLAKTRLGLDG-----------VPLSTNTTQKLQGELETIIASRAKD 141

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           YCGSCYG E     CCN+CEEVRE+Y ++GW+ +NPD I+QC +E + +RIKE+  EGCN
Sbjct: 142 YCGSCYGGEPGPSGCCNSCEEVRESYVRRGWSFNNPDGIEQCVQEHWSERIKEQSKEGCN 201

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGE----- 168
           I G L+VNKV GNFH +PG+SF    VHVHD++ + +DS   +  H I+  AF +     
Sbjct: 202 INGVLKVNKVIGNFHLSPGRSFQTHQVHVHDLVPYLQDSNLHDFGHVIHNFAFMDANQPT 261

Query: 169 ---------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                       G+VNPLDGV+   E  + M+QYF+KVV T +  + G   +++Q+SVT+
Sbjct: 262 ETAHTLRLKKTLGIVNPLDGVKAHTEASNYMFQYFLKVVGTQFQLLDGQVAKTHQYSVTQ 321

Query: 220 HFR---------SSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
           + R         + E G L +     +PGVFF Y++SP++V   E   SF HF T+ CAI
Sbjct: 322 YERDLDNSDKSDADELGHLTSHGHSGVPGVFFNYEISPMQVVHQEYRQSFAHFATSTCAI 381

Query: 266 VGGVFTVSGIIDAFIYHGQRAIK 288
           VGGV TV+G++D+F+Y  Q  +K
Sbjct: 382 VGGVLTVAGLLDSFVYGAQNRMK 404


>gi|195126511|ref|XP_002007714.1| GI12235 [Drosophila mojavensis]
 gi|193919323|gb|EDW18190.1| GI12235 [Drosophila mojavensis]
          Length = 372

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 187/302 (61%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
           MD SG+ HL V HD+FK RLD  GN           P  + P++        N+ + CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLDGN-----------PLKETPIKEIVAVSPPNKNSTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+R   +E+ S M+ Y++K+VPT+Y   S G  I +NQFSVT H R     R + +PG+
Sbjct: 252 DGLRVDVEESKSEMFNYYLKIVPTLYERHSDGKPIYTNQFSVTRH-RKDLTDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E HVSF HF TN C+I+GGVFTV+GI+   + +   AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHVSFGHFATNCCSIIGGVFTVAGILAVVLNNSLEAIQRKLEVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|195021391|ref|XP_001985385.1| GH17030 [Drosophila grimshawi]
 gi|193898867|gb|EDV97733.1| GH17030 [Drosophila grimshawi]
          Length = 372

 Score =  244 bits (623), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 139/302 (46%), Positives = 186/302 (61%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
           MD SG+ HL V HD+FK RLD QG            P  + P++        N+ + CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLQGE-----------PLKETPIKEIVAVSPPNKNSTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEDVLDAYRIRKWNM-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFTNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+R   +E+ S M+ Y++K+VPT+Y   S G  I +NQFSVT H R     R + +PG+
Sbjct: 252 DGIRVDVEESKSEMFNYYLKIVPTLYERHSDGEPIYTNQFSVTRH-RKDLTDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E H SF HF TN C+IVGGVFTV+GI+   + +   AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHNSFGHFATNCCSIVGGVFTVAGILAVLLNNSWEAIQRKLEVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|58264656|ref|XP_569484.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134109945|ref|XP_776358.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50259032|gb|EAL21711.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57225716|gb|AAW42177.1| ER to Golgi transport-related protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 422

 Score =  244 bits (622), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 126/326 (38%), Positives = 188/326 (57%), Gaps = 40/326 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   + +H + K R++  GNVI   Q G     +++        L  +  YCGSC
Sbjct: 92  MDISGEHQTEFEHQVTKTRMNKDGNVISKVQGGQLKGDVER------ANLNQDPNYCGSC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA   +  CCN+CEEVR+AY +KGW+ S+P+ I+QC  EG++ ++KE+  EGC I G +
Sbjct: 146 YGALPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIDGHI 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
            VNKV GN HF+PG+SF  + + + +++ + RD    +  H ++K  FG           
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGADMTKAEELTV 265

Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
                      G+ +PL G++   E  + M+QYF+KVV T +  +SG  I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLRDPLQGIKAHTEVSNYMFQYFLKVVSTNFISLSGEEISSHQYSVTQY 325

Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
            R    G               +  +PGVFF Y++SP+KV  TEE  SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKKIE 292
           GGV TV+ ++D+ I++  + +KKK E
Sbjct: 386 GGVLTVASLVDSLIFNSSKRLKKKSE 411


>gi|443732120|gb|ELU16969.1| hypothetical protein CAPTEDRAFT_192533 [Capitella teleta]
          Length = 304

 Score =  243 bits (621), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 124/255 (48%), Positives = 164/255 (64%), Gaps = 6/255 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQ +DV HDIFK+RLD  G  +++     G       L         +     SC
Sbjct: 52  MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEGQSSESCALNHALSSFLFSRF---SC 108

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES    CCN C EVREAYR+KGWA  +   I+QC REG++ +++E + EGC IYGFL
Sbjct: 109 YGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGCRIYGFL 168

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKVAGNFH APG+SF Q   H+HD+ A Q   FN+SH+I  L+FG+ +PG VNPLD  
Sbjct: 169 EVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQVNPLDAS 228

Query: 181 -RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFF 237
            + T++    M+ Y++KVVPT Y   +G  + SNQ+SVT+H +    G L  Q LPGVF 
Sbjct: 229 EQVTEQADFVMFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFV 288

Query: 238 FYDLSPIKVTFTEEH 252
            Y+LSP+ V +TE++
Sbjct: 289 TYELSPMMVKYTEKN 303


>gi|194751543|ref|XP_001958085.1| GF10736 [Drosophila ananassae]
 gi|190625367|gb|EDV40891.1| GF10736 [Drosophila ananassae]
          Length = 372

 Score =  243 bits (621), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/302 (45%), Positives = 184/302 (60%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HDIFK RLD +G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CEEV +AYR + W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNSTHCCNTCEEVLDAYRLRKWNV-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+    +E  S M+ Y++K+VPT+Y  D  G  I +NQFSVT H R     R + +PG+
Sbjct: 252 DGMHVEVEEKKSEMFNYYLKIVPTLYMRDSDGKPIYTNQFSVTRH-RKDLSDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+   + +   AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSLEAIQRKLEVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|321253192|ref|XP_003192660.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
 gi|317459129|gb|ADV20873.1| ER to Golgi transport-related protein, putative [Cryptococcus
           gattii WM276]
          Length = 435

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 126/327 (38%), Positives = 190/327 (58%), Gaps = 40/327 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   + +H + K R+D  G +I   Q G    ++   L+R    L  +  YCGSC
Sbjct: 92  MDISGEHQTEFEHQVTKTRIDKNGKIISKVQGG----QLKGDLER--ANLNQDPNYCGSC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA   +  CCN+CEEVR+AY +KGW+ S+P+ I+QC  EG++ ++KE+  EGC I G +
Sbjct: 146 YGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIGGHI 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
            VNKV GN HF+PG+SF  + + + +++ + RD    +  H ++K  FG           
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGGDMTKAEELTV 265

Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
                      G+ +PL G++   E  + M+QYF+KVV T +  ++G  I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLKDPLQGIKVHTEVSNYMFQYFLKVVSTNFISLNGEEIPSHQYSVTQY 325

Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
            R    G               +  +PGVFF Y++SP+KV  TEE  SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GGV TV+ ++D+FI++  + +KK  E+
Sbjct: 386 GGVLTVASLLDSFIFNSSKRLKKTSEV 412


>gi|194374867|dbj|BAG62548.1| unnamed protein product [Homo sapiens]
          Length = 321

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 119/224 (53%), Positives = 153/224 (68%), Gaps = 21/224 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLE-------- 51
           MD++GEQ LDV+H++FK+RLD  G  + S     GA       +RH  G++E        
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSS-----GA-------ERHELGKVEVTVFDPDS 136

Query: 52  HNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
            +   C SCYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ 
Sbjct: 137 LDPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKN 196

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGC +YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +P
Sbjct: 197 EGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYP 256

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           G+VNPLD    T    S M+QYF+KVVPTVY  V G   Q   +
Sbjct: 257 GIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVSQGAPY 300


>gi|195327731|ref|XP_002030571.1| GM24497 [Drosophila sechellia]
 gi|195590409|ref|XP_002084938.1| GD12569 [Drosophila simulans]
 gi|194119514|gb|EDW41557.1| GM24497 [Drosophila sechellia]
 gi|194196947|gb|EDX10523.1| GD12569 [Drosophila simulans]
          Length = 373

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/303 (44%), Positives = 185/303 (61%), Gaps = 24/303 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HD+FK RLD  G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W ++  D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           DG+R    ET S M+ Y++K+VPT+Y   +  G  I +NQFSVT  +R     R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+   + +   AI++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370

Query: 295 KFS 297
           K S
Sbjct: 371 KLS 373


>gi|21357439|ref|NP_648758.1| CG7011 [Drosophila melanogaster]
 gi|7294304|gb|AAF49653.1| CG7011 [Drosophila melanogaster]
 gi|16768234|gb|AAL28336.1| GH25868p [Drosophila melanogaster]
 gi|220946650|gb|ACL85868.1| CG7011-PA [synthetic construct]
          Length = 373

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/303 (44%), Positives = 184/303 (60%), Gaps = 24/303 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HD+FK RLD  G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W ++  D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWTVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           DG+R    ET S M+ Y++K+VPT+Y   +  G  I +NQFSVT  +R     R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV+GI+   + +   AI++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAERHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370

Query: 295 KFS 297
           K S
Sbjct: 371 KLS 373


>gi|195441336|ref|XP_002068468.1| GK20487 [Drosophila willistoni]
 gi|194164553|gb|EDW79454.1| GK20487 [Drosophila willistoni]
          Length = 372

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 136/302 (45%), Positives = 186/302 (61%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE-TYCGS 59
           MD SG+ HL V HD+FK RLD +G            P  + P++        N+ + CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLKGE-----------PLKETPIKEIVAVSPANKNSTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AY  K W++   D ++QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYHLKKWSV-QVDKLEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+R   +E+ S M+ Y+IK+VPT+Y   S G  I +NQFSVT  +R     R + +PG+
Sbjct: 252 DGLRVNVEESKSEMFNYYIKIVPTLYERNSDGQPIYTNQFSVTR-YRKDLTDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E H SF HF TN C+I+GGVFTV+GI+   + +   AI++K+E+GK
Sbjct: 311 FFSYELSPLMVKYAERHNSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLEVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|195495133|ref|XP_002095138.1| GE19855 [Drosophila yakuba]
 gi|194181239|gb|EDW94850.1| GE19855 [Drosophila yakuba]
          Length = 373

 Score =  240 bits (613), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 135/303 (44%), Positives = 185/303 (61%), Gaps = 24/303 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HD+FK RLD  G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR + W ++  D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLRKWNVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           DG+R    ET S M+ Y++K+VPT+Y   +  G  I +NQFSVT  +R     R + +PG
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+   + +   A+++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKHSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370

Query: 295 KFS 297
           K S
Sbjct: 371 KLS 373


>gi|125978263|ref|XP_001353164.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
 gi|54641917|gb|EAL30666.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
          Length = 372

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 137/302 (45%), Positives = 184/302 (60%), Gaps = 23/302 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HDIFK RLD +G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR   W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLHKWNV-QVDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG+R    ET S M+ Y++K+VPT+Y   S G  I +NQFSVT  +R     R + +PG+
Sbjct: 252 DGLRVDVAETKSEMFNYYLKIVPTLYMRQSDGQPIYTNQFSVTR-YRKDLTDRERGMPGI 310

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           FF Y+LSP+ V + E+H SF HF TN C+I+GGVFTV+GI+   + +   AI++K+++GK
Sbjct: 311 FFSYELSPLMVKYAEKHNSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEAIQRKLDVGK 370

Query: 296 FS 297
            S
Sbjct: 371 LS 372


>gi|194872681|ref|XP_001973062.1| GG13555 [Drosophila erecta]
 gi|190654845|gb|EDV52088.1| GG13555 [Drosophila erecta]
          Length = 373

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/303 (44%), Positives = 184/303 (60%), Gaps = 24/303 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HD+FK RLD  G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDVFKHRLDLNGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CEEV +AYR + W ++  D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEEVLDAYRLRKWNVA-VDKIEQCK--GKYKRSDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKVVPTVYT--DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           DG+R    ET S M+ Y++K+VPT+Y   +  G  I +NQFSVT  +R     R + +PG
Sbjct: 252 DGLRVEVAETKSEMFNYYLKIVPTLYMRGNSDGEPIYTNQFSVTR-YRKDLSDRERGMPG 310

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           +FF Y+LSP+ V + E+  SF HF TN C+I+GGVFTV+GI+   + +   A+++K+E+G
Sbjct: 311 IFFSYELSPLMVKYAEKRSSFGHFATNCCSIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370

Query: 295 KFS 297
           K S
Sbjct: 371 KLS 373


>gi|301106576|ref|XP_002902371.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262098991|gb|EEY57043.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 393

 Score =  236 bits (602), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 121/295 (41%), Positives = 173/295 (58%), Gaps = 18/295 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GE  +++   + K RLD+ G  I +  D +   K D P             YCGSC
Sbjct: 112 MDVAGELQVNMHQTVVKTRLDANGRSISTTADELA--KTDLP-----------AGYCGSC 158

Query: 61  YGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           YG    + ++CCN CEEV+EA+     +L   +  +QC RE        ++GEGC   G 
Sbjct: 159 YGTRHPAGKECCNTCEEVKEAFIHSDLSLEEAEQKEQCVRESIDTEKLAQDGEGCRFTGK 218

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           + VN+VAGNFH A G++FH+ G  VH     Q  +FN SH I+ L+FGE  PG  +PLDG
Sbjct: 219 MFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTFNSSHIIHSLSFGEPIPGATSPLDG 278

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFF 238
           V    E   G++QY+IK+VPT+Y+D+    I S QFSVT+     + +G++ +LPG FF 
Sbjct: 279 VSKIAEQSGGVFQYYIKIVPTIYSDIDESAIHSYQFSVTQQSNYLNPRGQMTSLPGTFFV 338

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY---HGQRAIKKK 290
           +DLSP  V    + V F HFLT +CAIVGGV +++G +D+F+Y   H +R +  K
Sbjct: 339 FDLSPFMVKVENDRVPFTHFLTKICAIVGGVISIAGFVDSFMYNSLHVRRRVSSK 393


>gi|3860008|gb|AAC72954.1| unknown [Homo sapiens]
          Length = 198

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 111/185 (60%), Positives = 138/185 (74%), Gaps = 3/185 (1%)

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGF
Sbjct: 8   CYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGF 67

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+ N++H I  L+FGE +PG+VNPLD 
Sbjct: 68  LEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDH 127

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFF 237
              T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF 
Sbjct: 128 TNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFA 186

Query: 238 FYDLS 242
              LS
Sbjct: 187 HLPLS 191


>gi|328858670|gb|EGG07782.1| hypothetical protein MELLADRAFT_105603 [Melampsora larici-populina
           98AG31]
          Length = 422

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 132/327 (40%), Positives = 190/327 (58%), Gaps = 38/327 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIE-SRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MDISGE   DV HD+ K RL+  G ++  S   G+          R  G       YCGS
Sbjct: 93  MDISGEHQNDVNHDMTKTRLNPDGTLVSASVSKGLKGELDTIAATRAPG-------YCGS 145

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYG    +  CCN CEEVRE+Y ++GW+ SNPD I+QC +E +  +IKE+E EGCN+ G 
Sbjct: 146 CYGGTPPESGCCNTCEEVRESYVRRGWSFSNPDGIEQCVQEHWSDKIKEQEKEGCNMNGQ 205

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF-GEHFP----- 171
           ++VNKV GNFH +PG+SF  + +HVHD++ + +  +S +  H I+K AF  EH       
Sbjct: 206 VKVNKVIGNFHMSPGRSFQTNAMHVHDLVPYLQTGNSHDFGHIIHKFAFLAEHQSPDDDE 265

Query: 172 --------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR- 222
                   G+VNPLDG++   E  + M+QYF+KVV T +  +    ++++Q+SVT++ R 
Sbjct: 266 TRRIKTSLGIVNPLDGIKAHTEESNYMFQYFLKVVGTEFHLLDQRVVKTHQYSVTQYERD 325

Query: 223 --SSEQGRLQTL-----------PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
              S +G    L           PG+FF Y++SP++V   E   SF HF T+ CAI+GGV
Sbjct: 326 LTKSSRGGTDELGHQTSHGYAGVPGLFFNYEISPMQVIHKEYRQSFAHFATSTCAIIGGV 385

Query: 270 FTVSGIIDAFIYHGQRAIKKKIEIGKF 296
            TV+G+ID+ +Y  +  IK +   G F
Sbjct: 386 LTVAGLIDSAVYGARNRIKLQSSDGGF 412


>gi|348680250|gb|EGZ20066.1| CopII vesicle protein [Phytophthora sojae]
          Length = 409

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 122/301 (40%), Positives = 179/301 (59%), Gaps = 10/301 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GE  +++   + K RLD+ GN I     G     I         +    E YCGSC
Sbjct: 113 MDVAGELQVNMHQTVVKTRLDADGNTI-----GRPISMITDEGAEEQAKTALPEGYCGSC 167

Query: 61  YGAE-SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           +GA+  + ++CCN CE+V+EA+    ++L + +  +QC RE        ++GEGC   G 
Sbjct: 168 HGAQHPAGKECCNTCEDVKEAFIYSDFSLEDAEQKEQCVREIMEAEKLAQDGEGCRFTGK 227

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           + VN+VAGNFH A G++FH+ G  VH     Q  ++N SH I+ L+FGE  PGV  PLDG
Sbjct: 228 MFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEHTYNSSHIIHSLSFGEPMPGVAGPLDG 287

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFF 238
           V    E   G++QY+IK+VPT+Y+D+  +TI S QFSVT+     + +G++ +LPG FF 
Sbjct: 288 VSKIAEQSGGVFQYYIKIVPTIYSDIDENTIHSYQFSVTQQGNYLNPRGQMTSLPGTFFV 347

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY---HGQRAIKKKIEIGK 295
           +DLSP  V    + + F HFLT VCAIVGGV +++G +D+F+Y   H +R +       K
Sbjct: 348 FDLSPFMVKVENDRMPFTHFLTKVCAIVGGVISIAGFVDSFMYNSLHVRRRVSTNSGATK 407

Query: 296 F 296
           F
Sbjct: 408 F 408


>gi|409079094|gb|EKM79456.1| hypothetical protein AGABI1DRAFT_120853 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1000

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 123/323 (38%), Positives = 182/323 (56%), Gaps = 42/323 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETYCG 58
           MDISGE   D+ H+I K RL++ G ++ +        ++DK   +Q+ G        YCG
Sbjct: 673 MDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG--------YCG 724

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYG       CCN C+EVR+AY  +GW+ S+PD I+QCKREG+ +++K++  EGCN+ G
Sbjct: 725 SCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMKDQADEGCNVSG 784

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAF---------- 166
            L VNKV GN H +PG+SF  +  ++++++ + RD    + SH+I+  AF          
Sbjct: 785 RLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFAFEGDDEYVYWK 844

Query: 167 ---GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
              G          +NPLDG ++       M+QYF+KVV T +  + G  + ++Q+SVT 
Sbjct: 845 ASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGKIVNTHQYSVTH 904

Query: 220 HFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             R  E             Q   Q LPG FF Y++SPI V   +   SF HFLT+ CAIV
Sbjct: 905 FERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSFAHFLTSTCAIV 964

Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
           GGV TV+ ++D+ ++   RA+KK
Sbjct: 965 GGVLTVASLVDSLLFATTRALKK 987


>gi|426196003|gb|EKV45932.1| hypothetical protein AGABI2DRAFT_207344 [Agaricus bisporus var.
           bisporus H97]
          Length = 1000

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 123/323 (38%), Positives = 182/323 (56%), Gaps = 42/323 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETYCG 58
           MDISGE   D+ H+I K RL++ G ++ +        ++DK   +Q+ G        YCG
Sbjct: 673 MDISGEVQRDISHNILKTRLENNGTIVPASYSAQLQNELDKMNEVQQSG--------YCG 724

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYG       CCN C+EVR+AY  +GW+ S+PD I+QCKREG+ +++K++  EGCN+ G
Sbjct: 725 SCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKMKDQADEGCNVSG 784

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAF---------- 166
            L VNKV GN H +PG+SF  +  ++++++ + RD    + SH+I+  AF          
Sbjct: 785 RLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHFAFEGDDEYVYWK 844

Query: 167 ---GEHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
              G          +NPLDG ++       M+QYF+KVV T +  + G  + ++Q+SVT 
Sbjct: 845 ASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFRTLDGKIVNTHQYSVTH 904

Query: 220 HFRSSE-------------QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             R  E             Q   Q LPG FF Y++SPI V   +   SF HFLT+ CAIV
Sbjct: 905 FERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHADSRQSFAHFLTSTCAIV 964

Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
           GGV TV+ ++D+ ++   RA+KK
Sbjct: 965 GGVLTVASLVDSLLFATTRALKK 987


>gi|302688477|ref|XP_003033918.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
 gi|300107613|gb|EFI99015.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
          Length = 415

 Score =  233 bits (595), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 177/321 (55%), Gaps = 39/321 (12%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISG+   DV H++ K RLD  G  I          +IDK  ++ G        YCGSCY
Sbjct: 92  DISGDVQRDVSHNMLKTRLDKDGKAIRGAHTAELRNEIDKQNEQRGA------DYCGSCY 145

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           G       CCN CEEVR AY  +GW+ +NPD I+QCK EG+  +++E+  EGCNI G L 
Sbjct: 146 GGLPPASGCCNTCEEVRTAYVNRGWSFNNPDSIEQCKNEGWADKLREQANEGCNIAGRLR 205

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF------------ 166
           +NKVAGN H +PG+SF   G +V++++ + RD  N    SH I+ L+F            
Sbjct: 206 INKVAGNIHLSPGRSFQTGGRNVYELVPYLRDDGNRHDFSHTIHSLSFEGDDAYDNRKRE 265

Query: 167 -----GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
                 +      NPLDG          M+QYF+KVV T +  ++G T+ S+ +SVT   
Sbjct: 266 TSKEMRQRMGLSSNPLDGTVRVTNKAQYMFQYFVKVVSTKFRPLNGRTVNSHSYSVTHFE 325

Query: 222 RS-SEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           R  ++ G+ QT            LPG F  +D+SPI++  TE   SF HF+T+ CAIVGG
Sbjct: 326 RDLTDGGQAQTGQNVQVQHGVTGLPGAFINFDVSPIQLVHTEWRQSFAHFVTSTCAIVGG 385

Query: 269 VFTVSGIIDAFIYHGQRAIKK 289
           V TV+ ++D+ ++   +A+KK
Sbjct: 386 VLTVASLLDSVLFATSKALKK 406


>gi|405123077|gb|AFR97842.1| COPII-coated vesicle component Erv46 [Cryptococcus neoformans var.
           grubii H99]
          Length = 422

 Score =  233 bits (595), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 183/315 (58%), Gaps = 40/315 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   + +H + K R++  GNVI   Q      ++   ++R    L  +  YCGSC
Sbjct: 92  MDISGEHQTEFEHQVTKTRMNKDGNVISKVQ----GSQLKGDVER--ANLNQDPNYCGSC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA   +  CCN+CEEVR+AY +KGW+ S+P+ I+QC  EG++ ++KE+  EGC I G +
Sbjct: 146 YGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGIEQCVEEGWMDKMKEQNEEGCRIDGHI 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFP------- 171
            VNKV GN HF+PG+SF  + + + +++ + RD    +  H ++K  FG           
Sbjct: 206 RVNKVIGNLHFSPGRSFQNNMMQMLELVPYLRDKNHHDFGHIVHKFRFGGDMTKAEELTV 265

Query: 172 -----------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
                      G+ +PL G++   E  + M+QYF+KVV T +  ++G  I S+Q+SVT++
Sbjct: 266 LPKEQRWRDKLGLRDPLQGMKAHTEVSNYMFQYFLKVVSTNFISLNGEEIPSHQYSVTQY 325

Query: 221 FRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
            R    G               +  +PGVFF Y++SP+KV  TEE  SF HFLT+ CAIV
Sbjct: 326 ERDLRTGNAPGKDAHGHMTSHGMMGVPGVFFNYEISPMKVIHTEERQSFAHFLTSTCAIV 385

Query: 267 GGVFTVSGIIDAFIY 281
           GGV TV+ ++D+FI+
Sbjct: 386 GGVLTVASLVDSFIF 400


>gi|225708964|gb|ACO10328.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Caligus rogercresseyi]
          Length = 385

 Score =  233 bits (595), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 126/303 (41%), Positives = 180/303 (59%), Gaps = 17/303 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKID-KPLQRHGGRLEHNETYC 57
           MD+SGE H+D+ H+I+K+RL  +G+ +E   R+  +G  K    P  ++    E +   C
Sbjct: 90  MDVSGESHVDIVHNIYKRRLSLEGSPMEEPRRETEVGQKKTTHAPSPKN----ETSTPPC 145

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           GSCYGAE+    CCN+C EV+EAYR+KGW +      +QC+ +   + I+    EGC IY
Sbjct: 146 GSCYGAETPGSPCCNSCGEVKEAYRRKGWTIVAAKF-EQCEMD--TEGIERVYKEGCQIY 202

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF---PGVV 174
           G L VN+V G+FH  PGKSF  + +H+HD+  F    FN SH+I  L+FG      PG  
Sbjct: 203 GSLLVNRVGGSFHIVPGKSFTLNHLHIHDLQPFSSGEFNTSHRIRHLSFGSKTALDPGG- 261

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--L 232
           N LD V         MYQY++K+VPT Y+   G T   NQ+SVT          L +  +
Sbjct: 262 NALDAVSALSPKGGLMYQYYLKIVPTTYSRSDGGTFTGNQYSVT-RLEKDVSSSLDSGGM 320

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           PGVFF Y+L+P+ V ++E+  SF HF T +CAI+GGVFT++   D FIY   + +++K  
Sbjct: 321 PGVFFNYELAPLMVKYSEKEKSFGHFATGLCAIIGGVFTLASAFDKFIYSSSKILEEKFG 380

Query: 293 IGK 295
           +GK
Sbjct: 381 LGK 383


>gi|71021625|ref|XP_761043.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
 gi|46100607|gb|EAK85840.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
          Length = 435

 Score =  231 bits (590), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 187/329 (56%), Gaps = 51/329 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
           MDISGE   D++HD+ + R++  G +IE  +         K L+    R+ + +   YCG
Sbjct: 92  MDISGEHVNDIQHDVERTRINHDGKIIEQGK---------KSLKGDAARIANTKGKDYCG 142

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYG +     CCN C+EVREAY +KGW+ ++PD +DQC  EG+ ++IKE+  EGC I G
Sbjct: 143 DCYGGQPPASKCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGWSEKIKEQNKEGCRISG 202

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
            L VNKV G+FH +PGK+F ++ +H+HD++ +   +     +  H I++ +FG       
Sbjct: 203 KLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGTGSEHHDFGHIIHEFSFGSEQEYHG 262

Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
                        GV +PL+GVR   +    M+QYF+KVV T +  +SG T+++ Q+SVT
Sbjct: 263 LTSAKERAVKAKLGVKDPLEGVRAQTQQSQFMFQYFVKVVSTEFRPLSGETLKTQQYSVT 322

Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            + R              S +G           +PGVFF Y++SP+K   +E   S  HF
Sbjct: 323 TYERDLSPGANAAALAGLSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHSEYRQSLSHF 382

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
           LT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLVYNSRRRL 411


>gi|390603136|gb|EIN12528.1| endoplasmic reticulum-derived transport vesicle ERV46 [Punctularia
           strigosozonata HHB-11173 SS5]
          Length = 419

 Score =  229 bits (585), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 121/322 (37%), Positives = 180/322 (55%), Gaps = 39/322 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGEQ  D+ H+I K RLDS G +I   Q      +++    R    +   + YCGSC
Sbjct: 92  MDISGEQQRDISHNILKTRLDSTGKLIPGSQRS----ELESEFDRQNKPMP--DGYCGSC 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE S+  CCN+C+ VR+AY  +GW+  NPD I+QC +E + +++K++  EGCNI G +
Sbjct: 146 YGAEPSEGACCNSCDAVRQAYVNRGWSFGNPDSIEQCVKENWSEKLKDQASEGCNIAGRV 205

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---GEHFPGV- 173
            VNKV GN H +PG+SF   G  +++++ + R+  N    SH I++ AF    E+ P   
Sbjct: 206 RVNKVIGNIHLSPGRSFQSQGRSMYELVPYLREDGNRHDFSHTIHEFAFEGDDEYLPDKY 265

Query: 174 -------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
                          PLDG          M+QYF+KVV T +  + G T+ S+Q+S T  
Sbjct: 266 KVSKEMRAKMGLEAGPLDGAVGRTIKAQYMFQYFLKVVSTQFRTLDGQTVNSHQYSATHF 325

Query: 221 FRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            R  ++G                 +PG FF +++SPI +  +E   SF HFLT+ CAIVG
Sbjct: 326 ERDLDKGSEDNTAEGVHISHTTYGVPGAFFNFEISPILIVHSETRQSFAHFLTSTCAIVG 385

Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
           GV T++ I+D+ ++   +A+KK
Sbjct: 386 GVLTIASIVDSVLFATTKALKK 407


>gi|392591676|gb|EIW81003.1| ER-derived vesicles protein ERV46 [Coniophora puteana RWD-64-598
           SS2]
          Length = 419

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 124/324 (38%), Positives = 181/324 (55%), Gaps = 41/324 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   DV H++ K+RLD  G  I   + G    +IDK  +  G        YCGSC
Sbjct: 91  MDISGETQRDVSHNVVKQRLDKTGKGIAGSRSGDLRNEIDKLAELRG------PDYCGSC 144

Query: 61  YGA-ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           YG   S+D  CCN+CEEVR+AY  KGW+  NP+ I+QC +EG+  ++K++  EGCNI G 
Sbjct: 145 YGGYTSTDNGCCNSCEEVRQAYVNKGWSFGNPEGIEQCTQEGWTDKVKDQADEGCNISGR 204

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAF---GEHFPGV 173
           + VNKV GN + +PG+SF     + +D + + ++     + +H I++L F    E+ P  
Sbjct: 205 IRVNKVVGNINISPGRSFQTGSRNFYDFVPYLKEDGGQHDFTHYIDELTFLADDEYNPNK 264

Query: 174 V--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
           +              NPLDG + +      MYQYF+KVV T +  ++G TI ++Q+S T 
Sbjct: 265 MKHGKELKQRMGLDSNPLDGFKASTTKKMFMYQYFLKVVSTQFRTLNGRTINTHQYSATH 324

Query: 220 HFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
             R   +G                   PG +F +++SPI+V   E   SF HFLT+ CAI
Sbjct: 325 FERDLSRGMGGGENNQGVYVQHGAGGAPGAYFNFEISPIQVVHAETRQSFAHFLTSTCAI 384

Query: 266 VGGVFTVSGIIDAFIYHGQRAIKK 289
           VGGV TV+ ++D+F++   RA+KK
Sbjct: 385 VGGVLTVAALLDSFLFATSRALKK 408


>gi|389744843|gb|EIM86025.1| ER-derived vesicles protein ERV46 [Stereum hirsutum FP-91666 SS1]
          Length = 419

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 121/322 (37%), Positives = 177/322 (54%), Gaps = 40/322 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
           MDISGE   D+ H+I K RL+S G  + +  +     ++DK   QR  G       YCGS
Sbjct: 91  MDISGETQRDISHNIVKTRLNSDGTQVPNSANMQLRNELDKLNAQRQDG-------YCGS 143

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYG    +  CCN C++VREAY ++GW+  NPD I+QC +E + +++ E+  EGCNI G 
Sbjct: 144 CYGGTPPEGGCCNTCDQVREAYVQRGWSFGNPDSIEQCVQEHWSEKLHEQSSEGCNISGR 203

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
           + VNKV GN H +PGKSF  S   +++++ + +D  N    SH ++ L FG         
Sbjct: 204 VRVNKVIGNIHLSPGKSFQNSASSIYELVPYLKDDKNRHDFSHIVHSLTFGADDEYDSRK 263

Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                   +      NPLDG       PS M+QYF+K V T +  + G  + ++Q+ VT 
Sbjct: 264 TKIANEMKQRMGLDSNPLDGYHARTSQPSTMFQYFLKAVSTQFRTIDGKVVNTHQYQVTH 323

Query: 220 HFRSSEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
           + R +   + +T            +PG FF Y++SPIKV   E   SF HFLT+ CAIVG
Sbjct: 324 YNRDAGNPQDKTNQGVNVMHGITGVPGAFFNYEISPIKVIHEETRQSFAHFLTSTCAIVG 383

Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
           GV TV+ I+D+ ++   + +KK
Sbjct: 384 GVLTVTSILDSVLFAANQRLKK 405


>gi|443894052|dbj|GAC71402.1| hypothetical protein PANT_3d00017 [Pseudozyma antarctica T-34]
          Length = 461

 Score =  227 bits (579), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 123/331 (37%), Positives = 184/331 (55%), Gaps = 51/331 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
           MDISGE   D++HDI + R+   G  I   +         K L+    R+   +   YCG
Sbjct: 119 MDISGEHVNDIQHDIERTRVTHDGKPITQGK---------KNLKGDAARIAATKGKDYCG 169

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYG +     CCN C+EVREAY +KGW+ ++PD +DQC  EG+  +IKE+  EGC I G
Sbjct: 170 DCYGGQPPASGCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGWSDKIKEQNKEGCRISG 229

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFG------- 167
            L VNKV G+FH +PGK+F ++ VH+HD++ +   +     +  H I+  +FG       
Sbjct: 230 KLHVNKVVGSFHLSPGKAFQRNSVHIHDLVPYLSGTGAEHHDFGHIIHDFSFGSEQQYHG 289

Query: 168 ---------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
                    +   GV +PL+GVR   +    M+QYF+KVV T +  +SG T+++ Q+SVT
Sbjct: 290 LTTAKEREVKQKLGVKDPLEGVRAQTQQSQFMFQYFLKVVSTEFRPLSGDTLKTQQYSVT 349

Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            + R              S +G           +PGVFF Y++SP+K   +E   S  HF
Sbjct: 350 TYERDLSPGANAAAMAGMSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHSEHRQSLSHF 409

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           LT+ CAIVGG+ TV+GI+D+ +Y+ +R +++
Sbjct: 410 LTSTCAIVGGILTVAGIVDSLVYNSRRRLRR 440


>gi|324511490|gb|ADY44781.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Ascaris suum]
          Length = 382

 Score =  227 bits (579), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 127/301 (42%), Positives = 184/301 (61%), Gaps = 12/301 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-HNETYCGS 59
           MD+SG+   DV+ D++K+RLD QGN I     G  A ++   +       +   E  CGS
Sbjct: 90  MDVSGDNQDDVQDDVYKQRLDQQGNNIT----GQAAVRLGVNVNTSTPASQLTTEPKCGS 145

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA    + CCN CE+V+EAY  +GW + + + ++QCK + +++ I + +GEGC +YG 
Sbjct: 146 CYGAS---DRCCNTCEDVKEAYSARGWQMLDIESVEQCKSDAWVRTINDFKGEGCRVYGK 202

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           ++V KVAGNFH APG        H HD+ +     F+ +H IN L+FG  FPG   PLDG
Sbjct: 203 VQVAKVAGNFHIAPGDPLRSLRSHFHDLHSIAPAKFDTAHIINHLSFGTPFPGKNYPLDG 262

Query: 180 VRW-TQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
             + T +  SG M+QY++KVVPT+Y  + S + I S+QFSVT H +    G    LPG F
Sbjct: 263 KSFGTNKDSSGIMFQYYMKVVPTMYEFLDSSNNIFSHQFSVTTHQKDIGMGA-SGLPGFF 321

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
             Y+ SP+ V + E       FL ++CAI+GGVFTV+ +ID+ IYH  RAI+ K+E+ K+
Sbjct: 322 VQYEFSPLMVKYEERRQPLSTFLVSLCAIIGGVFTVASLIDSLIYHSSRAIQHKVEMNKY 381

Query: 297 S 297
           +
Sbjct: 382 N 382


>gi|343425773|emb|CBQ69306.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 435

 Score =  227 bits (578), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 119/329 (36%), Positives = 181/329 (55%), Gaps = 51/329 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
           MDISGE   D++HDI + R+   G V+E  +         K L+    R+ + +   YCG
Sbjct: 92  MDISGEHVNDIQHDIERTRISHDGKVVEQGK---------KHLKGDAARIANTKGKDYCG 142

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYG +     CCN C+EVREAY ++GW+ ++PD +DQC  EG+  +IK++  EGC I G
Sbjct: 143 DCYGGQPPASGCCNTCDEVREAYVRRGWSFADPDHVDQCVAEGWSDKIKQQNKEGCRISG 202

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
            L VNKV G+FH +PGK+F ++ +H+HD++ +   +     +  H I++ +FG       
Sbjct: 203 KLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGTGAEHHDFGHIIHEFSFGSEQEYHG 262

Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
                        GV +PL GVR   +    M+QYF+KVV T +  ++G T+++ Q+SVT
Sbjct: 263 LTTAKERAVKAKLGVKDPLAGVRAQTQQSQFMFQYFVKVVATEFRPLAGETLKTQQYSVT 322

Query: 219 EHFRSSEQGR--------------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            + R    G                        +PGVFF Y++SP+K    E   S  HF
Sbjct: 323 TYERDLSPGASAAALAGMSNEGSGAHISHGFAGVPGVFFNYEISPLKTIHAEYRQSLAHF 382

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
           LT+ CAIVGG+ TV+GI+D+ +Y+ +R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLVYNSRRRL 411


>gi|312075860|ref|XP_003140604.1| hypothetical protein LOAG_05019 [Loa loa]
          Length = 365

 Score =  226 bits (576), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 119/300 (39%), Positives = 174/300 (58%), Gaps = 27/300 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SG+   D++ D++K ++    N+  S    + A ++                 CGSC
Sbjct: 90  MDLSGDNQDDIRDDVYKIKV----NINTSTASSVPASQV----------------LCGSC 129

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+   E CCN CEEV+EAY +KGW L N + ++QCK + +++++ E + EGC +YG +
Sbjct: 130 YGAK---EGCCNTCEEVKEAYMRKGWELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKV 186

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V KVAGNFH APG        H HD+ +     F+ SH +N  +FG  FPG V PLDG 
Sbjct: 187 QVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSPSKFDTSHTVNHFSFGNSFPGKVYPLDGK 246

Query: 181 RW--TQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            +   + +   MYQY +K+VPT Y  + S   I S+ FSVT + +   QG    LPG F 
Sbjct: 247 FFGSARNSDGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFFV 305

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            Y+ SP+ V + E   S   FL ++CAI+GG+FTV+ +IDAFIY   R I +KI + K++
Sbjct: 306 QYEFSPLMVKYEERQQSLSTFLVSICAIIGGIFTVASLIDAFIYRSGRIISQKIALNKYT 365


>gi|393907059|gb|EFO23462.2| hypothetical protein LOAG_05019 [Loa loa]
          Length = 378

 Score =  226 bits (575), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 119/300 (39%), Positives = 174/300 (58%), Gaps = 14/300 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SG+   D++ D++K  L          ++G G  +           +  ++  CGSC
Sbjct: 90  MDLSGDNQDDIRDDVYKISL-------LDGKEGNGVRQEVNINTSTASSVPASQVLCGSC 142

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+   E CCN CEEV+EAY +KGW L N + ++QCK + +++++ E + EGC +YG +
Sbjct: 143 YGAK---EGCCNTCEEVKEAYMRKGWELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKV 199

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V KVAGNFH APG        H HD+ +     F+ SH +N  +FG  FPG V PLDG 
Sbjct: 200 QVAKVAGNFHIAPGDPLRAHRSHFHDLHSLSPSKFDTSHTVNHFSFGNSFPGKVYPLDGK 259

Query: 181 RW--TQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
            +   + +   MYQY +K+VPT Y  + S   I S+ FSVT + +   QG    LPG F 
Sbjct: 260 FFGSARNSDGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFFV 318

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            Y+ SP+ V + E   S   FL ++CAI+GG+FTV+ +IDAFIY   R I +KI + K++
Sbjct: 319 QYEFSPLMVKYEERQQSLSTFLVSICAIIGGIFTVASLIDAFIYRSGRIISQKIALNKYT 378


>gi|341884797|gb|EGT40732.1| CBN-ERV-46 protein [Caenorhabditis brenneri]
          Length = 379

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 125/299 (41%), Positives = 174/299 (58%), Gaps = 11/299 (3%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCG 58
           MD+S E   ++  DI++ RLD+ G NV E+ Q      KI+    +     E   E  CG
Sbjct: 90  MDVSSEAQDNINDDIYRLRLDADGKNVSETAQ------KIEINQNKTVDATELIQEVKCG 143

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGA ++D  CCN CE+V+ AY  KGW + N + ++QCK + +++   E + EGC +YG
Sbjct: 144 SCYGA-AADGICCNTCEDVKNAYAIKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCRVYG 201

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            ++V KVAGNFH APG        HVHD+       F+ SH +N ++FG+ FPG   PLD
Sbjct: 202 TVKVAKVAGNFHLAPGDPHQSMRSHVHDLHNLDPVKFDASHTVNHISFGKSFPGKNYPLD 261

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
           G   T+     MYQY++KVVPT Y  + G   QS+QFSVT H +     R   LPG F  
Sbjct: 262 GKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGFFLQ 320

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           Y+ SP+ V + E   S   FL ++CAIVGGVF ++ ++D  IYH  R +K +I  GK +
Sbjct: 321 YEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLVDITIYHSSRYMKNRIAGGKLT 379


>gi|388856238|emb|CCF50047.1| uncharacterized protein [Ustilago hordei]
          Length = 435

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 123/329 (37%), Positives = 185/329 (56%), Gaps = 51/329 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNE--TYCG 58
           MDISGE   D++HDI + R+          QDG  + +  K L+    R+ + +   YCG
Sbjct: 92  MDISGEHVNDIQHDIERTRIS---------QDGKVSIQGTKSLKGDAARIANTKGKDYCG 142

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYG +     CCN C+EVREAY +KGW+ S+PD ++QC  EG+ ++IKE+  EGC I G
Sbjct: 143 DCYGGQPPASGCCNTCDEVREAYVRKGWSFSDPDHVEQCVAEGWSEKIKEQNKEGCRISG 202

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHFP--- 171
            L VNKV G+FH +PG++F ++ +H+HD++ +   S     +  H I++ +FG       
Sbjct: 203 KLHVNKVVGSFHLSPGRAFQRNSMHIHDLVPYLSGSGAEHHDFGHIIHEFSFGSEQEYHG 262

Query: 172 -------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
                        GV +PL+GVR   +    M+QYF+KVV T +  ++G T+++ Q+SVT
Sbjct: 263 LTTAKERAVKDKLGVKDPLEGVRARTKESQYMFQYFLKVVSTEFRPLAGETLKTQQYSVT 322

Query: 219 EHFRS-------------SEQGR-------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            + R              S +G           +PGVFF Y++SP+K   +E   S  HF
Sbjct: 323 TYERDLSPGANAAALAGLSNEGSGARISHGFAGVPGVFFNYEISPLKTIHSEYRQSLSHF 382

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
           LT+ CAIVGG+ TV+GI+D+ IY+  R +
Sbjct: 383 LTSTCAIVGGILTVAGILDSLIYNSGRRL 411


>gi|393212588|gb|EJC98088.1| endoplasmic reticulum-derived transport vesicle ERV46 [Fomitiporia
           mediterranea MF3/22]
          Length = 421

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 121/322 (37%), Positives = 181/322 (56%), Gaps = 39/322 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   D+ H+I K RLD+ G V+ +        K+D          +  + YCGSC
Sbjct: 91  MDISGEAQRDISHNIVKARLDANGAVVPNSHSAELRNKLDVMND------QTQDNYCGSC 144

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YG  + +  CCN CEEVR+AY  KGW+ SNPD I+QC RE + +++ E+  EGCNI G L
Sbjct: 145 YGGVAPEGGCCNTCEEVRQAYVNKGWSFSNPDSIEQCVREHWSEKLHEQSTEGCNISGRL 204

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF----------G 167
            VNKV GN H +PG+SF  + +++H+++ + ++  N     H +++L+F           
Sbjct: 205 RVNKVIGNIHLSPGRSFQTNYMNIHELVPYLKEDKNRHDFGHIVHELSFEGDDEYNFRKK 264

Query: 168 EHFPGV-------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
           E   G+        NPLDG      +   M+QYF+KVV T +  + G T++++Q+S T  
Sbjct: 265 ERSKGIKKKLGIEANPLDGAVGKAASLQYMFQYFVKVVSTKFELMDGQTVKTHQYSATHF 324

Query: 221 FRSSEQGRL-QT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            R    G + QT            +PGVF  Y++SP+ V  +E   SF HFLT+ CAI+G
Sbjct: 325 ERDLTTGAIGQTKEGVHIAHTNVGMPGVFINYEISPLLVVHSETRQSFAHFLTSTCAIIG 384

Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
           GV T++ I+D+ ++   R +KK
Sbjct: 385 GVLTIATIVDSVVFATGRRLKK 406


>gi|17568835|ref|NP_510575.1| Protein ERV-46 [Caenorhabditis elegans]
 gi|3878494|emb|CAB01889.1| Protein ERV-46 [Caenorhabditis elegans]
          Length = 380

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 125/302 (41%), Positives = 177/302 (58%), Gaps = 16/302 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEHN----ET 55
           MD+S E   ++  DI++ RLD +G N+ ES Q      KI+  + ++   +E      E 
Sbjct: 90  MDVSSEAQENINDDIYRLRLDPEGRNISESAQ------KIE--INQNKTSVETTDVIQEV 141

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
            CGSCYGA ++D  CCN C++V+ AY  KGW + N + ++QCK + +++   E + EGC 
Sbjct: 142 KCGSCYGA-AADGICCNTCDDVKSAYAVKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCR 199

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           +YG ++V KVAGNFH APG        HVHD+       F+ SH +N ++FG+ FPG   
Sbjct: 200 VYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHVSFGKSFPGKNY 259

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           PLDG   T      MYQY++KVVPT Y  + G   QS+QFSVT H +     R   LPG 
Sbjct: 260 PLDGKVNTDNRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGF 318

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           F  Y+ SP+ V + E   SF  FL ++CAIVGGVF ++ ++D  IYH  R +K +I  GK
Sbjct: 319 FLQYEFSPLMVQYEEFRQSFASFLVSLCAIVGGVFAMAQLVDITIYHSSRYMKSRIAGGK 378

Query: 296 FS 297
            +
Sbjct: 379 LT 380


>gi|268581953|ref|XP_002645960.1| C. briggsae CBR-ERV-46 protein [Caenorhabditis briggsae]
          Length = 380

 Score =  224 bits (572), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 126/300 (42%), Positives = 173/300 (57%), Gaps = 12/300 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQDGIGAPKIDKPLQRHGGRLEH--NETYC 57
           MD+S E   ++  DI++ RLD+ G NV ES Q      KI+    +  G       E  C
Sbjct: 90  MDVSSEAQENINDDIYRLRLDADGRNVSESAQ------KIEINQNKTIGEPTELVQEVKC 143

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           GSCYGA  +D  CCN CE+V+ AY  KGW + N + ++QCK + +++   E + EGC +Y
Sbjct: 144 GSCYGA-VADGICCNTCEDVKNAYAVKGWQV-NIEEVEQCKNDKWVKEFNEHKNEGCRVY 201

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G ++V KVAGNFH APG        HVHD+       F+ SH +N ++FG+ FPG   PL
Sbjct: 202 GTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHISFGKSFPGKNYPL 261

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
           DG   T+     MYQY++KVVPT Y  + G   QS+QFSVT H +     R   LPG F 
Sbjct: 262 DGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQAGLPGFFL 320

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            Y+ SP+ V + E   S   FL ++CAIVGGVF ++ ++D  IYH  R +K +I  GK +
Sbjct: 321 QYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLVDITIYHTSRYMKSRIAGGKLT 380


>gi|170089933|ref|XP_001876189.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
 gi|164649449|gb|EDR13691.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
          Length = 421

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 119/322 (36%), Positives = 171/322 (53%), Gaps = 39/322 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE   D+ H++ K RLD+ G  + +         +DK            E YCGSC
Sbjct: 91  MDISGELQRDISHNVMKVRLDTHGKEVPNSHSAELRNDLDKMND------AKRENYCGSC 144

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +G    +  CCN CE+VR AY  +GW+ SNP+ I+QCK EG+  ++KE+  EGCNI G +
Sbjct: 145 FGGLEPEGGCCNTCEDVRLAYVNRGWSFSNPEAIEQCKNEGWADKLKEQADEGCNISGRI 204

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF----------- 166
            VNKV GN H +PG+SF  +  ++++++ + RD  N    SH I+ LAF           
Sbjct: 205 RVNKVIGNIHLSPGRSFQTNARNLYELVPYLRDDGNRHDFSHTIHHLAFEGDDEYDYWKA 264

Query: 167 ------GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
                  +      NPLDG          M+QYF+KVV T +  + G  + ++Q+S T+ 
Sbjct: 265 AAGSAMRQRMGLTENPLDGAIARTAKAQYMFQYFLKVVSTQFRTLDGRKVNTHQYSTTQF 324

Query: 221 FRSSEQGR-------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            R   +G              +  LPG FF +++SPI V   E   SF HFLT+ CAI+G
Sbjct: 325 ERDLTEGAAGETAGGIHVQHGVSGLPGAFFNFEISPILVVHAETRQSFAHFLTSTCAIIG 384

Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
           GV TV+ IID+ ++   R +KK
Sbjct: 385 GVLTVASIIDSILFATNRRLKK 406


>gi|336369994|gb|EGN98335.1| hypothetical protein SERLA73DRAFT_109778 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336382751|gb|EGO23901.1| hypothetical protein SERLADRAFT_450196 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 988

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 123/324 (37%), Positives = 182/324 (56%), Gaps = 42/324 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
           MDISGEQ  DV H+I K R+  +G  +   ++G    +IDK   QR  G       YCGS
Sbjct: 662 MDISGEQQRDVSHNIHKTRITPEGGPVPGARNGELRNEIDKLNDQRSNG-------YCGS 714

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYG    +  CCN+CE+VR+AY  +GW+ +NPD I+QC  EG+ +++K++  EGCNI G 
Sbjct: 715 CYGGVEPEGGCCNSCEDVRQAYVNRGWSFNNPDNIEQCVAEGWSEKLKDQAEEGCNISGR 774

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---------- 166
           L VNKV GN + +PG+SF  S  + ++++ + R+  N    SH I++ +F          
Sbjct: 775 LRVNKVIGNINVSPGRSFQSSSRNFYELVPYLREDNNRHDFSHVIHEFSFMTDDEYNLHK 834

Query: 167 ------GEHFPGVV-NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                  +   G+  NPLDG+         M+QYF+KVV T +  + G TI ++Q+S T 
Sbjct: 835 AKLGKDMKQRMGIAENPLDGLNAKTNKAQYMFQYFLKVVSTQFRTIDGKTINTHQYSATH 894

Query: 220 HFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
             R   +G               +  +PG FF +++SPI V  +E   SF HFLT+ CAI
Sbjct: 895 FERDLSKGSQGGDNGEGVVTQHGVSGVPGAFFNFEISPILVVHSEGRQSFAHFLTSTCAI 954

Query: 266 VGGVFTVSGIIDAFIYHGQRAIKK 289
           VGGV TV+ ++D+F++   R +KK
Sbjct: 955 VGGVLTVAALLDSFLFATGRRLKK 978


>gi|308483051|ref|XP_003103728.1| CRE-ERV-46 protein [Caenorhabditis remanei]
 gi|308259746|gb|EFP03699.1| CRE-ERV-46 protein [Caenorhabditis remanei]
          Length = 380

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 127/300 (42%), Positives = 172/300 (57%), Gaps = 12/300 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQG-NVIESRQD-GIGAPK-IDKPLQRHGGRLEHNETYC 57
           MD+S E   ++  DI++ RLD+ G N+ ES Q   I   K I  P +         E  C
Sbjct: 90  MDVSSEAQDNINDDIYRLRLDADGRNISESAQKIEINQNKTIADPTELT------QEVKC 143

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           GSCYGA ++D  CCN CE+V+ AY  KGW + N + ++QCK + +++   E + EGC +Y
Sbjct: 144 GSCYGA-AADGICCNTCEDVKSAYAIKGWQV-NIEEVEQCKNDKWVKEFTEHKNEGCRVY 201

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G ++V KVAGNFH APG        HVHD+       F+ SH +N L FG+ FPG   PL
Sbjct: 202 GTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVKFDASHTVNHLTFGKSFPGKHYPL 261

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFF 237
           DG   T+     MYQY++KVVPT Y  + G   QS+QFSVT H +     R   LPG F 
Sbjct: 262 DGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQSHQFSVTTH-KKDLGFRQSGLPGFFV 320

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            Y+ SP+ V + E   S   FL ++CAIVGGVF ++ +ID  IY   R +K +I  GK +
Sbjct: 321 QYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAMAQLIDITIYQTHRYMKNRIAGGKLT 380


>gi|384483831|gb|EIE76011.1| hypothetical protein RO3G_00715 [Rhizopus delemar RA 99-880]
          Length = 408

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 118/297 (39%), Positives = 179/297 (60%), Gaps = 23/297 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD SGE   +  HD++K+RLD  G VI + +    +    K  + H   +   + YCGSC
Sbjct: 112 MDESGEHISNYDHDVYKERLDPNGEVITAEKSNDLSNSQAKNAREHSMNVP--DDYCGSC 169

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+ S+E CCN CEE++ AY + GW + +PD  +QC REG+ ++I+ +  EGC ++G L
Sbjct: 170 YGAKGSNE-CCNTCEEIQNAYSELGWNV-DPDNFEQCIREGWKEKIESQSREGCRMHGTL 227

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFGEH--------- 169
            VNK+ GNFHF+ GK+F QSG H+HD+  F  +  + N  H I  L FG H         
Sbjct: 228 LVNKIRGNFHFSAGKAFKQSGSHIHDMSTFLHNDKNQNFMHTIQHLQFGNHDYNSEKQKR 287

Query: 170 --FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSE 225
                +++PL+ ++      + MYQYF+K+VPT +  ++G  I++ Q+SV+  +H  S  
Sbjct: 288 TKSRELIHPLENIKSGNSETAIMYQYFLKIVPTEFNFLNGKRIRTFQYSVSKQDHIVSYL 347

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
            G    LPGVFF  D SP+++ ++E   S   +LT++CAI+GG+FTV+ +ID  I H
Sbjct: 348 GG----LPGVFFMLDHSPMRIIYSETKTSLASYLTSLCAIIGGIFTVASVIDGSIQH 400


>gi|402218655|gb|EJT98731.1| ER to Golgi transport-related protein [Dacryopinax sp. DJM-731 SS1]
          Length = 455

 Score =  219 bits (559), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 131/350 (37%), Positives = 186/350 (53%), Gaps = 68/350 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIG-APKIDKPLQ-RHGGRLEHNETYCG 58
           MDISGE+  DV H++ + RL  QG  I       G + +I+K ++ R GG        CG
Sbjct: 92  MDISGERQHDVTHNMQRVRLSPQGIPIPDVLPESGLSNEIEKVIEAREGGE-------CG 144

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYG +     CCN CE+VREAY ++GW+ S+P+ I QC  EG+ +++K +  EGCNI G
Sbjct: 145 SCYGGDPPASGCCNTCEDVREAYMRRGWSFSSPEDIKQCVNEGWTEKVKSQSEEGCNISG 204

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF---GEHFPGV 173
            + VNKV GNFHF+PGKSF  + +HVHD++ + +D+   +  H+I+   F   GE    V
Sbjct: 205 RVRVNKVIGNFHFSPGKSFQTNAMHVHDLVPYLKDANRHDFGHEIHYFGFESDGEQQAEV 264

Query: 174 --------------VNPLDGVRW---------TQETPSG-----------------MYQY 193
                          NPLDG+R          T+  P                   M+QY
Sbjct: 265 GRLSKSIKTKLGIDKNPLDGLRAHVRSLSRRETRRVPGMSSNRRSYRPEQTEKSNYMFQY 324

Query: 194 FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR--------------LQTLPGVFFFY 239
           F+KVV T Y  + G  + S+Q+SVT + R   QG               +  +PG FF +
Sbjct: 325 FLKVVSTKYEMLRGTVVNSHQYSVTSYERDLSQGDKAQRDEHGTMTSHGVSGIPGAFFNF 384

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+ V   E   SF HFLT+ CAIVGGV TV+ I D+ ++  +R +KK
Sbjct: 385 EISPMVVVHQETRQSFAHFLTSTCAIVGGVLTVAAIFDSMLFSAERKLKK 434


>gi|345569114|gb|EGX51983.1| hypothetical protein AOL_s00043g717 [Arthrobotrys oligospora ATCC
           24927]
          Length = 397

 Score =  219 bits (559), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 129/316 (40%), Positives = 174/316 (55%), Gaps = 38/316 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SG+    V H I K RLD  G +IES           K L+ H    +H + +YCG 
Sbjct: 89  MDVSGDLQPSVSHGIGKHRLDKSGGIIES-----------KFLELHPEHPKHLDPSYCGE 137

Query: 60  CYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CYGA + D      CC  C++VREAY  KGWA  +   + QC+ EG+ + +KE+ GEGC 
Sbjct: 138 CYGAVAPDTSKKAGCCQTCDDVREAYAAKGWAFGDGTGVHQCEEEGYKEMLKEQAGEGCR 197

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
           I G L VNKV GNFH APGKSF  + +HVHD+  + +     + +H IN L+FG   P  
Sbjct: 198 IDGHLWVNKVVGNFHIAPGKSFSNAQMHVHDLANYLQGDVHHDFTHTINALSFGPPLPTD 257

Query: 174 V--------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSS 224
           +        NPLD         +  Y YF+K+V T Y  +  G+TI ++Q+SVT H RS 
Sbjct: 258 LLHENHHQQNPLDATSKKTSDRNYNYLYFLKIVSTSYEHLDHGYTIHTHQYSVTSHERSL 317

Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
           E G+             +PG+FF YD+SP+KV   E    SF  FLT++CAI+GG  TV+
Sbjct: 318 EGGKDDVHPGTVHARGGIPGIFFSYDISPMKVVNREIRTKSFSGFLTSICAIIGGTLTVA 377

Query: 274 GIIDAFIYHGQRAIKK 289
             +D  +Y G R I K
Sbjct: 378 AALDRGLYEGARRIGK 393


>gi|299743758|ref|XP_002910702.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
           okayama7#130]
 gi|298405804|gb|EFI27208.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
           okayama7#130]
          Length = 416

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 116/321 (36%), Positives = 173/321 (53%), Gaps = 42/321 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN--ETYCG 58
           MDISGE   D+ H++ K RLD  G  +        +  ++K        L H   E YCG
Sbjct: 91  MDISGEVQRDISHNVLKVRLDRSGKEVPGSHTADLSADVEK--------LSHTKKEGYCG 142

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYG    +  CCN CE+VR AY  +GW+ +NPD I+QC+ EG+  +++++  EGCNI G
Sbjct: 143 SCYGGLEPESGCCNTCEDVRMAYVNRGWSFTNPDAIEQCRNEGWADKLRDQADEGCNISG 202

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF--------- 166
            + VNKV GN H +PG+SF  +  ++++++ + RD  N    SH I+   F         
Sbjct: 203 RIRVNKVIGNIHMSPGRSFQSNSRNIYELVPYLRDDQNRHDFSHIIHHFGFEGDDEYDYW 262

Query: 167 ----GEHFPGVV----NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
               G+     +    NPLDG+         M+QYF+KVV T +  + G T+ ++Q+S T
Sbjct: 263 KAEAGQKMRRRMGLTENPLDGIEARTWKSQYMFQYFLKVVSTRFRTLDGQTVNTHQYSTT 322

Query: 219 EHFRSSEQGRLQT------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
              R   +G  Q             LPG FF Y++SPI+V   E   SF HFLT+ CA++
Sbjct: 323 SFERDLGEGMNQDDGGIRVQHGVSGLPGAFFNYEISPIQVVHAESRQSFAHFLTSTCAVI 382

Query: 267 GGVFTVSGIIDAFIYHGQRAI 287
           GGV TV+ ++D+ ++   +AI
Sbjct: 383 GGVLTVAALVDSALFVTAKAI 403


>gi|313231322|emb|CBY08437.1| unnamed protein product [Oikopleura dioica]
          Length = 386

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 122/305 (40%), Positives = 180/305 (59%), Gaps = 19/305 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRL-DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET---- 55
           MD++G++  D +H +FK R+ D Q   +  + + I A K+      H  + E  ET    
Sbjct: 89  MDLTGDR-ADAEHQLFKVRMKDGQEVALSEKVEEINAEKL------HDEKQEEEETGLAV 141

Query: 56  --YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS-NPDLIDQCKREGF--LQRIKEEE 110
              C SCYGAE+ ++ CCN+CEEV++AYR KGWA   +     QC  E F   + +++ E
Sbjct: 142 KDECQSCYGAETEEQPCCNSCEEVQQAYRNKGWAFDHSAQQFSQCVNEHFDLNEELQKTE 201

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           GE C ++G LEVN+V+G+   +PGK+    G  VHDI   +  SF+ SH I+ L+FGE F
Sbjct: 202 GESCRVHGHLEVNRVSGSLQISPGKTLVLDGSVVHDIRGMKHMSFDTSHTIHHLSFGEVF 261

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           PG  NPLD      E+ +  + Y  KV+PT +  + G    +NQFSVT H ++  Q   +
Sbjct: 262 PGQENPLDNTEHEAESMNMAWHYNFKVIPTEFRKLDGSRTATNQFSVTRHEKALSQMSSR 321

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            LPG+ F ++++PI V   E   S +HF T+VCAI+GGV+T+S I+D+FI H    +  K
Sbjct: 322 -LPGINFHFEIAPIAVIKMETRRSAVHFATSVCAIIGGVWTISSILDSFI-HKTNKLLIK 379

Query: 291 IEIGK 295
            E+GK
Sbjct: 380 TELGK 384


>gi|449684240|ref|XP_002157414.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Hydra magnipapillata]
          Length = 311

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 109/224 (48%), Positives = 147/224 (65%), Gaps = 19/224 (8%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD+SGEQ  D++H+IFKKR D +GN I++  +++ +G  K ++ ++     L+ ++  C 
Sbjct: 89  MDVSGEQQTDLEHNIFKKRYDEKGNPIDTVEKKEELGD-KSEEAVKVLNSTLD-DKPKCE 146

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGAE++D  CCN CE+VR AYRKKGW   +PD I+QCKRE +    +++  EGC IYG
Sbjct: 147 SCYGAETTDHPCCNTCEDVRVAYRKKGWGFHDPDSIEQCKREHWKDTFQQQSNEGCQIYG 206

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHV---------------HDILAFQRDSFNISHKINK 163
           ++EV+KVAGNFH APGKSF Q  +HV               HD+  F    FN+SH I  
Sbjct: 207 YIEVSKVAGNFHIAPGKSFQQQHIHVQTIRFGKDGTISLNMHDLQPFGAKQFNVSHNIWS 266

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
           L+FGE  PGV NPLDG   + E  S MYQYF+K+VPTVY  +SG
Sbjct: 267 LSFGEPIPGVENPLDGTNVSAEAGSLMYQYFVKIVPTVYKKLSG 310


>gi|393233667|gb|EJD41236.1| endoplasmic reticulum-derived transport vesicle ERV46 [Auricularia
           delicata TFB-10046 SS5]
          Length = 419

 Score =  217 bits (553), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 117/323 (36%), Positives = 175/323 (54%), Gaps = 41/323 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
           MDISGE+  DV H+I K R+D+      +RQ  I        LQ    ++       YCG
Sbjct: 91  MDISGERQADVTHNILKTRIDA------NRQR-IADQTTTYDLQNEAEKVVAARGANYCG 143

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYG    +  CC  CE VR+AY  +GWA S+PD I+QCK+EG+ ++I+ +  EGCN+ G
Sbjct: 144 SCYGGLEPEGGCCQTCEAVRQAYINRGWAFSDPDAIEQCKQEGWKEKIQAQMNEGCNVEG 203

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-------------------DSFNISH 159
            + VNKV G+  F+ G+SF  + + +HD++ + R                   D FNI  
Sbjct: 204 RVRVNKVVGSIQFSFGRSFQMNQMSLHDLVPYLRDENVHDWRHRVQHFYFSSDDEFNIYK 263

Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                +  +      NPLDG     E+   M+QYF+KVV T +  + G  I ++Q+S T 
Sbjct: 264 AGISSSMKQRLGIAANPLDGNYGHTESTEYMFQYFLKVVSTQFRTIGGEVINTHQYSATH 323

Query: 220 HFRSSEQGR-------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             R   +G              +Q LPGVFF +++SP+++  +E   SF HF+T+ CAIV
Sbjct: 324 FDRDLAEGVRGKTEDGVVVTHGVQGLPGVFFNFEISPMRIIHSETRQSFAHFITSTCAIV 383

Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
           GGV T++ I+D+ ++  Q+A+KK
Sbjct: 384 GGVLTIASIVDSLLFTTQQALKK 406


>gi|409042254|gb|EKM51738.1| hypothetical protein PHACADRAFT_150385 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 422

 Score =  217 bits (552), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 179/329 (54%), Gaps = 53/329 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGN------VIESRQDGIGAPKIDK-PLQRHGGRLEHN 53
           MDISGE   D+ H++ K RL+ QGN      ++E R D      IDK   QR  G     
Sbjct: 91  MDISGETQTDIVHNVIKTRLNEQGNPVPANKIVELRND------IDKLNEQRQDG----- 139

Query: 54  ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
             YCGSCYG       CCN CE+VR+AY  +GW+ + PD I+QC +EG+  +++++  EG
Sbjct: 140 --YCGSCYGGVEPAGGCCNTCEDVRQAYVNRGWSFTAPDSIEQCAQEGWADKLRDQANEG 197

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--- 167
           CN  G L VNKV GN H +PG+SF     +++DI+ + ++  N    SH ++  AF    
Sbjct: 198 CNAAGKLRVNKVVGNIHLSPGRSFRSGSHNIYDIVPYLKEDGNRHDFSHTVHAFAFAGDD 257

Query: 168 -------------EHFPGVVN-PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
                        +   G+ + PLDG        + M+QYF+KVV T +  + G +I+++
Sbjct: 258 EFNFQKADHGNSLKRRLGIADGPLDGTTQKTSKQAYMFQYFLKVVSTQFITLDGKSIKTH 317

Query: 214 QFSVTEHFR--------SSEQGR-----LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
           Q S T   R        +S+QG      +  +PG FF Y++SPI V   E   SF HFLT
Sbjct: 318 QHSATHFERDLSKGIAENSQQGMHVMHGMTGIPGAFFNYEISPILVVHRETRQSFAHFLT 377

Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           + CA+VGGV TV+ +ID+ ++   + +KK
Sbjct: 378 STCAVVGGVLTVASLIDSMLFATSKKLKK 406


>gi|395324643|gb|EJF57079.1| endoplasmic reticulum-derived transport vesicle ERV46 [Dichomitus
           squalens LYAD-421 SS1]
          Length = 423

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 121/323 (37%), Positives = 177/323 (54%), Gaps = 41/323 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
           MDISGE   D+ H+I K RLD +G  +           +DK   QR  G       YCGS
Sbjct: 91  MDISGETQSDITHNILKTRLDEKGKPVSHSLIAELQNDLDKLNEQRQSG-------YCGS 143

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYG    +  CCN CEEVR+AY  +GW+ + PD I+QC +EG+  ++KE+  EGCNI G 
Sbjct: 144 CYGGIEPEGGCCNTCEEVRQAYVNRGWSFNRPDSIEQCVKEGWSDKLKEQAHEGCNIAGR 203

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF---GEHFP-- 171
           + VNKV GN H +PG+SF  S  ++++++ + R   N    +H+I+  AF    E+ P  
Sbjct: 204 VRVNKVVGNIHLSPGRSFRTSAHNLYELVPYLRTDGNRHDFTHQIHHFAFEGDDEYDPRN 263

Query: 172 -----------GV-VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                      G+  NPLDG +        M+QYF+KVV T +  + G  + ++Q+S T 
Sbjct: 264 AKLGKELKNRLGIDANPLDGTQGRTIKQQYMFQYFLKVVSTQFQTIDGKKVGTHQYSATH 323

Query: 220 HFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             R  ++G  +              +PG FF Y++SP+ +   E   SF HFLT+ CAIV
Sbjct: 324 FERDLDKGPSEDSPAGLHVAHGNGGIPGAFFNYEISPLLIRHVETRQSFAHFLTSTCAIV 383

Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
           GGV TV+ +ID+ ++  ++A KK
Sbjct: 384 GGVLTVASLIDSLLFATRKAFKK 406


>gi|401888400|gb|EJT52358.1| ER to golgi family transport-related protein [Trichosporon asahii
           var. asahii CBS 2479]
 gi|406696432|gb|EKC99721.1| ER to transport-related protein [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 378

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 177/337 (52%), Gaps = 49/337 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE+  D+ HD+ K RL + G  +E  + G    + ++  Q        +  YCGSC
Sbjct: 47  MDISGERQNDITHDMAKHRLSASGEELEVTRSGQLKGEAERAAQ------NRDPNYCGSC 100

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA++ +  CCN+C++VR+AY + GW   NP  I+QC  E + + + ++  EGC I G +
Sbjct: 101 YGAQAPESGCCNSCDDVRKAYSESGWQFPNPSTIEQCVEENWAENMAQQNTEGCRIVGQV 160

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAFGEHFP------ 171
           +VNKV GN  F  G  F +      D+L + RD     +  H INK  F    P      
Sbjct: 161 KVNKVVGNLQFTHGNVFTRGHT---DLLPYLRDGNVHHDFGHIINKFRFTGEMPGQLYHR 217

Query: 172 --------------GVVNPLDGVRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQF 215
                         G+ +PL GVR   E      MYQYF+KVV T +  ++G  I +NQ+
Sbjct: 218 SQIQKKEDETRKELGIHDPLQGVRSHAENDGSNIMYQYFVKVVSTAFVYLNGQNINTNQY 277

Query: 216 SVTEHFRSSEQGRLQT--------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
           S TE+ R  + G L T              +PGVF  Y++SP+KV  TE   SF HF+T+
Sbjct: 278 SATEYERDLKHGNLPTKDQHGHVTTHYTNAIPGVFINYEISPMKVVHTETRQSFAHFVTS 337

Query: 262 VCAIVGGVFTVSGIIDAFIYHG-QRAIKKKIEIGKFS 297
            CAIVGGV TV+ +IDA I++  +R + +K   G  S
Sbjct: 338 TCAIVGGVLTVASLIDAAIFNSRKRLMGEKESYGALS 374


>gi|403417426|emb|CCM04126.1| predicted protein [Fibroporia radiculosa]
          Length = 419

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 113/319 (35%), Positives = 177/319 (55%), Gaps = 36/319 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK-IDKPLQRHGGRLEHNETYCGS 59
           MDISGE   D+ H+I K RL+ +G  ++S          +DK  ++ G      + YCGS
Sbjct: 92  MDISGESQADITHNILKTRLNEKGIPLQSLAKSAELRNDLDKINEQRG------DNYCGS 145

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYG ++    CCN C++VR+AY  +GW+ + PD I+QC  EG+ +++KE+  EGCNI G 
Sbjct: 146 CYGGQAPPGGCCNTCDQVRQAYIDRGWSFTRPDSIEQCTNEGWSEKLKEQASEGCNIAGK 205

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
           + VNKV GN   +PG+SF  +  +++D++ + ++  N    SH I++ AF          
Sbjct: 206 VRVNKVIGNIQLSPGRSFRTAAQNMYDLVPYLKEDKNRHDFSHTIHQFAFESDQEKERHR 265

Query: 168 ----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
               +   G+ +PLD           M+QYF+KVV T +  +     +++Q+S T   R 
Sbjct: 266 ARDFQKRVGIESPLDNTERKTSKQQYMFQYFLKVVSTHFAMLDNKVYKTHQYSATHFERD 325

Query: 224 SEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
             +G+ +              +PGVF  YD+SP+ +  +E   SF HFLT+ CAIVGGV 
Sbjct: 326 LTKGQQEDNKEGVHIAHTATGIPGVFINYDISPMLILHSETRQSFAHFLTSTCAIVGGVL 385

Query: 271 TVSGIIDAFIYHGQRAIKK 289
           TV+ +ID+ ++   RA+KK
Sbjct: 386 TVASLIDSVLFATTRALKK 404


>gi|388581981|gb|EIM22287.1| endoplasmic reticulum-derived transport vesicle ERV46 [Wallemia
           sebi CBS 633.66]
          Length = 407

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 117/319 (36%), Positives = 181/319 (56%), Gaps = 38/319 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQ  D++H I + RL  +G  I    DG+    +   L       E     CGSC
Sbjct: 88  MDVSGEQVRDLRHAIVRTRLSEKGETI----DGMKTAGMSGYLNEVAKPRE-----CGSC 138

Query: 61  YG-AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           YG    ++E CC  C++VRE+Y K+GW+  NPD + QC  E + +R+KE+  EGCN+ G 
Sbjct: 139 YGGVPPNEEKCCYTCDDVRESYVKQGWSFVNPDGVKQCLDEHWAERVKEQSSEGCNVAGL 198

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAFG--------- 167
           ++VNKV GNFH +PG+SF  +  H+HD++ + +++ N     H ++  +F          
Sbjct: 199 VDVNKVVGNFHISPGRSFQSNAHHIHDLVPYLKNANNHHDFGHILHHFSFKSSNEPADTD 258

Query: 168 --EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-- 223
             +    + +PL   +   E  + M+QYF+KVV T +  ++G  + S+Q+S T + R+  
Sbjct: 259 NLKEMLNINDPLSNTKAHTEVSNYMFQYFLKVVSTDFDFLNGEKLNSHQYSATAYERNLD 318

Query: 224 -----SEQGRLQTL-------PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
                ++ G  QT+       PGVFF YD+SP++V +TE   SF  FLT+ CAIVGGV T
Sbjct: 319 EKGIYAQDGHGQTILHGVEGFPGVFFNYDISPLRVIYTESRRSFASFLTSTCAIVGGVLT 378

Query: 272 VSGIIDAFIYHGQRAIKKK 290
           V+ IIDA ++  ++ +  K
Sbjct: 379 VASIIDAGVFGARQKLTGK 397


>gi|407418919|gb|EKF38246.1| hypothetical protein MOQ_001547 [Trypanosoma cruzi marinkellei]
          Length = 406

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 173/312 (55%), Gaps = 30/312 (9%)

Query: 11  VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
           V+ D  K R+ +     + E+R       KI K L  +G   E+    C SCYGAE    
Sbjct: 100 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPNGAEKEN----CPSCYGAEPEPG 155

Query: 69  DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
            CC+ C++VR AY  + W  +  D+ ++QC  E   +       EGCN++   +V +V G
Sbjct: 156 ACCHTCDDVRRAYSLRRWVFNEDDISVEQCAGERLRKAAILISQEGCNLFVKYKVARVTG 215

Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------V 180
           N HF PG+ F+  G H+HD         N+SH ++ L FGE FPG VNP+DG       V
Sbjct: 216 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLCFGERFPGQVNPMDGLVNSRGAV 275

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQT----- 231
             T+E  +G + YF+KVVPT Y   S    G  ++SNQ+SVT HF +S    L T     
Sbjct: 276 DATEEV-NGRFSYFVKVVPTQYQAASILGVGSVVESNQYSVTHHFTASPSAELSTTTPES 334

Query: 232 ----LPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
               +PGVF  YDLSPIKV   E+H   S LH +  +CA+ GGVFTV+G++D+ I+HG R
Sbjct: 335 TPVIVPGVFITYDLSPIKVFVMEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVR 394

Query: 286 AIKKKIEIGKFS 297
            +++K++ GK S
Sbjct: 395 RVQRKMQQGKQS 406


>gi|164655211|ref|XP_001728736.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
 gi|159102620|gb|EDP41522.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
          Length = 427

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 109/323 (33%), Positives = 181/323 (56%), Gaps = 45/323 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
           +D+ GE  +DV HD+ ++RLD  G  +        + ++ + L+    R+  E    YCG
Sbjct: 92  VDVVGETQMDVHHDVERRRLDETGKPV--------SEEVIRELESEAKRVIAERGPDYCG 143

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
            CYGA+  +  CCN+C+ VREAY    W+ ++PD I+QC +E + + ++E+  EGCNI G
Sbjct: 144 DCYGADPPEGGCCNSCDAVREAYMLHNWSFTSPDDIEQCAQEHWSEHVREQNHEGCNIAG 203

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFG------- 167
            + VNKV GN HF PG++FH++ +H HD++ +      D  +  HKI++ +FG       
Sbjct: 204 EVRVNKVVGNLHFIPGRTFHRNDIHTHDLVPYLHGTGDDVHHFGHKIHRFSFGMEDEFAI 263

Query: 168 ------------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
                       ++  G+ N L+G      + + M+QYF+KVVP     ++GH + + Q+
Sbjct: 264 ERTSRGRRQGPLKNRMGIKNALEGRSAKTLSSNYMFQYFLKVVPVEVHKLNGHEMSTYQY 323

Query: 216 SVTEHFRSSEQ------------GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
           S T + R+ E               ++ +PGV+F Y++SP++V  TE H S  H ++N+ 
Sbjct: 324 SATSYERNLEDFDRGGQMSGHIVRMIEGIPGVYFNYEISPLRVIQTEWHHSIWHLVSNLF 383

Query: 264 AIVGGVFTVSGIIDAFIYHGQRA 286
           A++GG+ TV+G+ID  IY  +R 
Sbjct: 384 ALIGGIVTVAGLIDGAIYRSRRT 406


>gi|358391585|gb|EHK40989.1| ER-derived vesicle Erv46-like protein [Trichoderma atroviride IMI
           206040]
          Length = 422

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/339 (38%), Positives = 175/339 (51%), Gaps = 61/339 (17%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGEQ   V H I K RL      G VIES             L +   + EH N  Y
Sbjct: 89  MDVSGEQQHGVAHGITKLRLQPPSRGGGVIESNS-----------LAQLHEKAEHLNPDY 137

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CCN C+EVREAY +  WA    + ++QC+RE + +R+ ++  E
Sbjct: 138 CGGCYGATAPANAEKPGCCNTCDEVREAYAQASWAFGRGEGVEQCEREHYSERLDQQREE 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-----LAFQRDSFNISHKINKLAFG 167
           GC I G L+VNKV GNFH APG+SF    +HVHD+     L     + + +H I+ L FG
Sbjct: 198 GCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDLPNGMKAHDFTHVIHSLRFG 257

Query: 168 EHFPGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------T 203
              P  V               NPLDG+      P+  Y YF+K+VPT Y          
Sbjct: 258 PQLPPEVIARMGRRTAWTNHHLNPLDGIHQETSDPNFNYMYFVKIVPTSYLPLGWEQKSA 317

Query: 204 DVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEE 251
             S  +++++Q+SVT H RS   G         RL +   +PGVFF YD+SP+KV   EE
Sbjct: 318 SASDGSVETHQYSVTSHKRSLMGGDDAKEGHAERLHSKGGIPGVFFSYDISPMKVINREE 377

Query: 252 HV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
              +FL FL+ +CAIVGG  TV+  ID  ++ G   +KK
Sbjct: 378 RAKTFLGFLSGLCAIVGGTLTVAAAIDRGLFEGATRLKK 416


>gi|387219467|gb|AFJ69442.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Nannochloropsis gaditana CCMP526]
          Length = 432

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 114/306 (37%), Positives = 169/306 (55%), Gaps = 31/306 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-----NET 55
           MD++G+  + V+H++ K+RL SQG       + IG P ++ P      + +         
Sbjct: 117 MDVAGDNQMQVEHNMLKQRLSSQG-------ERIGFPFLEDPTDFDSKKADALLGAAPWD 169

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE---EEGE 112
           YCGSC+ A +    CCN+C+++ +AY  +G  +            GF         ++GE
Sbjct: 170 YCGSCFQARTHTGACCNSCQDLEQAYLTQGLPMGKIKTTAPQCLPGFQAPAPSGPMQKGE 229

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GCN+ GF+ VNKVAGNFH A G S  + G H+H  +  +   FN+SH I  ++FG+ +PG
Sbjct: 230 GCNLKGFMSVNKVAGNFHIAFGDSVVKDGRHIHQFIPSEAPFFNVSHTIQHVSFGDEYPG 289

Query: 173 VVNPLDG-VRWTQETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS------- 223
            VNPLDG V++   T  +G++QYFIKV+PT Y   +G  I++N+ SVTE F+        
Sbjct: 290 RVNPLDGKVKYVSSTVGTGLFQYFIKVIPTHYKGRAGEAIRTNRISVTERFKPLHKEGEA 349

Query: 224 -------SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                  +   +   LPGVFF YDLSP  V  +   V F HFL  +CAI GGVF++S ++
Sbjct: 350 RLTGDSHAHNDQTSVLPGVFFIYDLSPFNVEVSTVSVPFSHFLVKLCAIAGGVFSISRLL 409

Query: 277 DAFIYH 282
           D   Y+
Sbjct: 410 DNVFYY 415


>gi|322693278|gb|EFY85144.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Metarhizium acridum CQMa 102]
          Length = 356

 Score =  210 bits (535), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/344 (37%), Positives = 174/344 (50%), Gaps = 64/344 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H +   RL         R +  G   ID K ++ H    EH + +YCG
Sbjct: 16  MDVSGEQQHGVSHGVKNVRL---------RPESQGGGVIDIKSMKVHDDPAEHLDPSYCG 66

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA     +    CCN C+EVREAY  +GWA    + ++QC RE + +R+ E+  EGC
Sbjct: 67  ECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCTREHYAERLDEQREEGC 126

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFGEHF 170
            + G LEVNKV GNFH APG+SF    +HVHD+  +         + +H I++L FG   
Sbjct: 127 RVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQHDFTHTIHQLRFGPQL 186

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------TDV 205
           P  V                NPLDG R     P+  Y YF+K+VPT Y          + 
Sbjct: 187 PAAVSDRLGKGSMPWTNHHINPLDGTRQETGDPAFNYMYFVKIVPTSYLPLGWEKRFKNA 246

Query: 206 SGHT-------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFYDLSPIKV 246
           +G T       ++++Q+SVT H RS E G                +PGVFF YD+SP+KV
Sbjct: 247 AGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQGGIPGVFFSYDISPMKV 306

Query: 247 TFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
              EE   +F  FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 307 INREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 350


>gi|51214107|emb|CAH17876.1| hypothetical protein (22C8.0001), conserved [Pneumocystis carinii]
          Length = 388

 Score =  209 bits (533), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 123/310 (39%), Positives = 167/310 (53%), Gaps = 28/310 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE   DV H++ K RLDS G  I S    +      +P +           YCGSC
Sbjct: 92  MDVSGELETDVSHNVVKNRLDSNGIFINST--SLNTLNFQQPAKTRP------PDYCGSC 143

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+   E CCN C++V +AY    W + +    +QCK +        E  EGCN  G +
Sbjct: 144 YGAK---EGCCNTCQQVIDAYASNNWPVPDTKAFEQCKEK---YNNLNEFDEGCNFVGRI 197

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEHFPG--VVNP 176
           EVNKV GNFHFAPG S      H+HDI  +  DS   + SH INKL+FG    G  + NP
Sbjct: 198 EVNKVVGNFHFAPGHSSQIMRNHIHDIYDYMTDSSPHDFSHTINKLSFGPEVEGRSLQNP 257

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS----------SEQ 226
           LD V+   + P+  Y YFIK V   +  +S  ++ +N++SVT H RS          +  
Sbjct: 258 LDNVKKETDNPTLRYSYFIKCVAYRFEYLSKPSLDTNKYSVTVHERSISGDSDPNYPTHI 317

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
                +PGVFF YD+SPIK+   E   +F  FLT+   I+ GV T++GI+D  +Y  +R 
Sbjct: 318 SPKDGIPGVFFSYDISPIKIIERETRGNFSTFLTSTVIIISGVLTIAGIVDRILYETERQ 377

Query: 287 IKKKIEIGKF 296
           I+KK+  GKF
Sbjct: 378 IEKKLREGKF 387


>gi|322708973|gb|EFZ00550.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Metarhizium anisopliae ARSEF 23]
          Length = 429

 Score =  208 bits (529), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/344 (36%), Positives = 174/344 (50%), Gaps = 64/344 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H +   RL         R +  G   ID K ++ H    +H + +YCG
Sbjct: 89  MDVSGEQQHGVSHGVKNVRL---------RPESQGGGVIDIKSMKVHDDPADHLDPSYCG 139

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA     +    CCN C+EVREAY  +GWA    + ++QC RE + +R+ E+  EGC
Sbjct: 140 ECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCTREHYAERLDEQREEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR----DSFNISHKINKLAFGEHF 170
            + G LEVNKV GNFH APG+SF    +HVHD+  +         + +H I++L FG   
Sbjct: 200 RVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQHDFTHTIHQLRFGPQL 259

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------TDV 205
           P  V                NPLDG R     P+  Y YF+K+VPT Y          + 
Sbjct: 260 PAAVSDRLGKGSMPWTNHHLNPLDGTRQEIGDPAFNYMYFVKIVPTSYLPLGWEKRFKNA 319

Query: 206 SGHT-------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFYDLSPIKV 246
           +G T       ++++Q+SVT H RS E G                +PGVFF YD+SP+KV
Sbjct: 320 AGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQGGIPGVFFSYDISPMKV 379

Query: 247 TFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
              EE   +F  FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 380 INREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 423


>gi|224000966|ref|XP_002290155.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973577|gb|EED91907.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 396

 Score =  208 bits (529), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 115/304 (37%), Positives = 172/304 (56%), Gaps = 50/304 (16%)

Query: 8   HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRH-----GGRL---EHNETYCGS 59
           H+D KH I+K RL+  G                KP+ R      GG L   +H+E  CGS
Sbjct: 101 HIDKKHRIWKHRLNKDG----------------KPIGRKSRFELGGTLTSSDHDEEECGS 144

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           CYGA    E CCN C++V+ AYR K W +++   I QC     L R+K+E+GEGCNI+G+
Sbjct: 145 CYGAGGEGE-CCNTCDDVKRAYRTKQWHITDMTKITQCAH---LVRVKDEDGEGCNIHGY 200

Query: 120 LEVNKVAGNFHFAPGKSFHQSG------------VHVHDILAFQRDS---FNISHKINKL 164
           + ++   GN HFAP + + + G            +++  I+    D+   FN++H +NKL
Sbjct: 201 VALSTGGGNLHFAPDRQWEKEGDKQNGLMIMGGFINLDSIVEMFNDAYEQFNVTHTVNKL 260

Query: 165 AFGEHFP-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           +FG + P        + + LDG   T     GM+Q+++++VPTVY  ++G TI++ Q+SV
Sbjct: 261 SFGPYMPKHVKNSLNLTSQLDGATRTVTDGYGMFQFYLQIVPTVYRFLNGTTIETFQYSV 320

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           TEH R  + G  + +PGVFFFY++S + V F E    + HF T VCA VGG FTV G++D
Sbjct: 321 TEHVRHVDPGSNRGMPGVFFFYEVSALHVEFEEYRRGWTHFFTGVCAAVGGAFTVMGMLD 380

Query: 278 AFIY 281
             ++
Sbjct: 381 RLVF 384


>gi|407852879|gb|EKG06122.1| hypothetical protein TCSYLVIO_002790, partial [Trypanosoma cruzi]
          Length = 472

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 120/309 (38%), Positives = 169/309 (54%), Gaps = 28/309 (9%)

Query: 11  VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
           V+ D  K R+ +     + E+R       KI K L   G   E+    C SCYGAE    
Sbjct: 166 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPG 221

Query: 69  DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
            CC+ CE+VR AY  + W  +  D+ ++QC  E   +       EGCN++   +V +V G
Sbjct: 222 ACCHTCEDVRRAYSLRRWVFNEDDISVEQCAEERLRKAATLSSQEGCNLFVNYKVARVTG 281

Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQ--- 184
           N HF PG+ F+  G H+HD         N+SH ++ L FGE FPG VNP+DG+  ++   
Sbjct: 282 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNSRGAV 341

Query: 185 ---ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQ------- 230
              E  +G + YF+KVVPT Y   S    G  ++SNQ+SVT HF  S    L        
Sbjct: 342 DATEEVNGRFSYFVKVVPTQYQSASVLGVGSVVESNQYSVTRHFTPSPSAELSAAAAESS 401

Query: 231 --TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
              +PGVF  YDLSPIKV   E+H   S LH +  +CA+ GGVFTV+G++D+ I+HG R 
Sbjct: 402 PVVVPGVFITYDLSPIKVFVIEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRR 461

Query: 287 IKKKIEIGK 295
           +++K++ GK
Sbjct: 462 VQRKMQQGK 470


>gi|340053482|emb|CCC47775.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 404

 Score =  207 bits (526), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 172/316 (54%), Gaps = 35/316 (11%)

Query: 3   ISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
           I   + + V  D  +   +++G V+E RQ    A          GG        C SCYG
Sbjct: 101 IRSTRKMRVHADTLQPISEARGLVVEKRQSSTNADS--------GG-----AEGCPSCYG 147

Query: 63  AESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           AE +  DCCN C++VR A++ KGW+ +  D+ I QC  E           EGCNIY    
Sbjct: 148 AEKNPGDCCNTCDDVRNAFKDKGWSFNEDDIGIAQCAEERLRHAESSSSREGCNIYAKFS 207

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
            ++V GN HF PG  F   G H+H +        N+SH I++L FGE FPG  NPLDG+ 
Sbjct: 208 ASRVKGNIHFVPGSMFDYYGQHMHVLKGEIIRKMNLSHIIHQLDFGERFPGQKNPLDGMV 267

Query: 182 WTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS--EQGRL 229
            ++      E+ +G + YF++VVPT Y  VS    G  +++NQ+SVT +F  S    GR 
Sbjct: 268 NSRGVVDKSESTNGRFSYFVQVVPTQYQHVSIFGTGRLLETNQYSVTHYFTESWNATGRD 327

Query: 230 QT-------LPGVFFFYDLSPIK--VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           ++       +PG+F  YD+SPIK  V  T  + S +H +  +CA+ GGVF V+ +ID+F+
Sbjct: 328 KSANDAPSVVPGIFILYDISPIKTSVKATHPYPSVVHLVLQLCAVGGGVFNVASLIDSFL 387

Query: 281 YHGQRAIKKKIEIGKF 296
           +HG R ++KKI  GK+
Sbjct: 388 FHGTRQVQKKIRQGKY 403


>gi|193627365|ref|XP_001948436.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Acyrthosiphon pisum]
          Length = 404

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 119/308 (38%), Positives = 165/308 (53%), Gaps = 23/308 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIES--RQDGIGAPKIDKPLQRHGGRLEHNET--- 55
           +D SGE HL V H+I+K+RL+ +G  I    + D +G+ K   P       L+ NET   
Sbjct: 95  VDNSGETHLQVDHNIYKRRLNLEGQPISDPEKSDDVGSKKTLNP----PSMLKSNETDDA 150

Query: 56  -----YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
                 CGSCYGAESS   CCN C++V+ AY+ K W    P  I+QCK +     + ++ 
Sbjct: 151 NNTEDICGSCYGAESSTIPCCNTCDDVKRAYKMKNWDF-RPSSIEQCKNQSSQNEMYDKA 209

Query: 111 -GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
             EGC +YG L VN+V+G+FH APG SF  + +HVHD+  F   SFN +H I  L+FG+ 
Sbjct: 210 FKEGCQLYGTLLVNRVSGSFHIAPGMSFSFNHMHVHDVHPFSSSSFNTTHTIRHLSFGQK 269

Query: 170 FPGVV-----NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
              +      NPLD         + M+QY+IK+VPT+Y         +NQFSVT+H   +
Sbjct: 270 LESINTSHGGNPLDSTESIAGEGATMFQYYIKIVPTLYQRRDLSIFSTNQFSVTKHKVQA 329

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                   PG+FF Y+ SPI +  TE+     H  T     + GVF    IID F+Y   
Sbjct: 330 FDKGPSGAPGIFFSYEFSPIMIKLTEKPRLLGHLFTQFLCNISGVFICFWIIDIFMYKVS 389

Query: 285 RA--IKKK 290
           +   I+KK
Sbjct: 390 KVYNIRKK 397


>gi|50548631|ref|XP_501785.1| YALI0C13112p [Yarrowia lipolytica]
 gi|49647652|emb|CAG82095.1| YALI0C13112p [Yarrowia lipolytica CLIB122]
          Length = 401

 Score =  206 bits (525), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 172/314 (54%), Gaps = 26/314 (8%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D SGE    V HD+ K  LD +GN++ S    +G     K + +       +  YCGSC
Sbjct: 87  IDSSGEVQQSVDHDMTKVTLDERGNILSSEALTLGENPDSKAVAKR--TFLDDPNYCGSC 144

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES  + CCN CE+VR AY  KGWA ++   ++QC+  GF +++K +  +GCNI G  
Sbjct: 145 YGAESEPDQCCNTCEQVRAAYATKGWAFTDGSGVEQCEVIGFKEQLKAQYNQGCNIAGKF 204

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF-------- 170
            V KVAGNFHFAPG S H+   H+HD+  F+     F  SH I+ L+FGE          
Sbjct: 205 TVQKVAGNFHFAPGVSSHRDEQHLHDLSHFKDPEAPFTFSHIIHDLSFGEQVDVSGLDWD 264

Query: 171 PGV---VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            GV    +PL+      +     + YF KVV T +  + G  I++NQ++ T H R  + G
Sbjct: 265 KGVAMETSPLENTPHHTDNKWFRFNYFTKVVSTRFEFLDGKKIETNQYAATAHERPLQGG 324

Query: 228 RLQT----------LPGVFFFYDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGII 276
           R +           LPGVFF YD+SP+++   +E+ S F  F+  V A +GGV TV+ ++
Sbjct: 325 RDEDHQNTRHMRGGLPGVFFSYDISPMRIVNKQEYRSHFGAFVMQVVATIGGVLTVAAVL 384

Query: 277 DAFIYHGQRAIKKK 290
           D  IY   + +K+K
Sbjct: 385 DRGIYEVDQVLKRK 398


>gi|71407913|ref|XP_806393.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870127|gb|EAN84542.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 406

 Score =  206 bits (524), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 170/312 (54%), Gaps = 30/312 (9%)

Query: 11  VKHDIFKKRLDSQG--NVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDE 68
           V+ D  K R+ +     + E+R       KI K L   G   E+    C SCYGAE    
Sbjct: 100 VERDTVKSRVAASTLEKISEARPLVDEKKKITKALDPSGAEKEN----CPSCYGAEPEPG 155

Query: 69  DCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
            CC+ CE+VR AY  + W  +  D+ ++QC  E   +       EGCN++   +V +V G
Sbjct: 156 ACCHTCEDVRRAYSLRRWVFNEDDVSVEQCAEERLRKAAILSSQEGCNLFVNYKVARVTG 215

Query: 128 NFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG-------V 180
           N HF PG+ F+  G H+HD         N+SH ++ L FGE FPG VNP+DG       V
Sbjct: 216 NIHFVPGRMFNLMGQHLHDFRGKTVRQLNLSHIVHTLGFGERFPGQVNPMDGLVNLRGAV 275

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSSEQGRLQ------ 230
             T+E  +G + YF+KVVPT Y   S    G  ++SNQ+SVT HF  S    L       
Sbjct: 276 DATEEV-NGRFSYFVKVVPTQYQSASILGVGSVVESNQYSVTHHFTPSPSAELSAAAAES 334

Query: 231 ---TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
               +PGVF  YDLSPIKV   E+H   S LH +  +CA+ GGVFTV+G++D+ I+HG R
Sbjct: 335 SPVMVPGVFITYDLSPIKVFVFEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVR 394

Query: 286 AIKKKIEIGKFS 297
            +++K++ GK S
Sbjct: 395 RVQRKMQQGKQS 406


>gi|296417040|ref|XP_002838173.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634087|emb|CAZ82364.1| unnamed protein product [Tuber melanosporum]
          Length = 399

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 127/316 (40%), Positives = 164/316 (51%), Gaps = 36/316 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGEQ   + H I   RL       ES+      P     L  H     H +  YCG 
Sbjct: 89  MDVSGEQQSSITHGIHLTRLTP---FPESK------PVSTTSLNVHEDTASHLDPAYCGK 139

Query: 60  CYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           CYGA   ++D  CC  CE+VREAY   GWA    + ++QC+RE + +R+ E   EGCNI 
Sbjct: 140 CYGAPGPEKDKGCCQTCEDVREAYASIGWAFGKGEGVEQCEREHYAERLDEMREEGCNIA 199

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFPGVV- 174
           G L VNKV GNFH APGKSF  + +HVHD+  +         +H I+ L+FG   P  V 
Sbjct: 200 GHLSVNKVIGNFHIAPGKSFSSAQMHVHDLNQYFASTKEHTFTHTIHHLSFGPDLPANVK 259

Query: 175 ---NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-------TIQSNQFSVTEHFRSS 224
              NPLD  R   +  S  + YFIKVV T Y  +           I+++Q+SVT H RS 
Sbjct: 260 VQRNPLDDSRQVTQERSFNFMYFIKVVSTSYLPLGTSENSYIPGAIETHQYSVTSHKRSL 319

Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
             G  +           +PGVFF YD+SP+KV   E    SF  FLT VCA++GG  TV+
Sbjct: 320 MGGADKEHASTIHARGGIPGVFFSYDISPMKVINREVRAKSFAGFLTGVCAVIGGTLTVA 379

Query: 274 GIIDAFIYHGQRAIKK 289
             ID  +Y G   +KK
Sbjct: 380 AAIDRGLYEGGMRVKK 395


>gi|340520521|gb|EGR50757.1| predicted protein [Trichoderma reesei QM6a]
          Length = 430

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 176/345 (51%), Gaps = 65/345 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H I K RL             +G  +I+ K L +   + EH +  YCG
Sbjct: 89  MDVSGEQQHGVAHGITKIRLQPAA---------LGGGEIESKSLSQLHEKAEHLDPNYCG 139

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA     +    CCN C+EVREAY    WA    + ++QC+RE + +R+ ++  EGC
Sbjct: 140 GCYGAIAPSTAQKPGCCNTCDEVREAYALASWAFGRGEGVEQCEREHYAERLDQQREEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
            I G L+VNKV GNFH APG+SF    +HVHD+  +    +  S + +H I+ L FG   
Sbjct: 200 RIEGLLQVNKVIGNFHLAPGRSFSNGNMHVHDLKNYWDLPEGKSHDFTHIIHSLRFGPQL 259

Query: 171 PGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV---------- 205
           P  V               NPLD  R   + P+  Y YF+K+VPT Y  +          
Sbjct: 260 PDTVIERLGGKNTWSNHHLNPLDNTRQDTKDPNFNYMYFVKIVPTSYLPLGWEKRKPSTT 319

Query: 206 --------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIK 245
                   S  +I+++Q+SVT H RS   G         RL     +PGVFF YD+SP+K
Sbjct: 320 NGGVTTFYSDGSIETHQYSVTSHKRSLMGGDDAKEGHPERLHARNGIPGVFFSYDISPMK 379

Query: 246 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           V   EE   +FL FL+ +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 380 VINREERAKTFLGFLSGLCAIVGGTLTVAAAVDRGLFEGATRLKK 424


>gi|358378080|gb|EHK15763.1| hypothetical protein TRIVIDRAFT_86970 [Trichoderma virens Gv29-8]
          Length = 420

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 123/333 (36%), Positives = 170/333 (51%), Gaps = 51/333 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGEQ   V H I K RL            G G  + +   Q H         YCG C
Sbjct: 89  MDVSGEQQHGVAHGISKIRLRPAAQ-------GGGEIESNTLTQLHEKAEHLAPDYCGGC 141

Query: 61  YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           YGA     +    CCN C+EVREAY +  WA    + ++QC+RE + +R+ ++  EGC I
Sbjct: 142 YGATAPANAEKPGCCNTCDEVREAYAQMSWAFGRGEGVEQCEREHYAERLDQQREEGCRI 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHFPG 172
            G L+VNKV GNFH APG+SF    +HVHD+  +    +    + +H I+ L FG   P 
Sbjct: 202 EGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKTYWDFPEGKPHDFTHIIHSLRFGPQLPD 261

Query: 173 VV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------T 209
            V               NPLD      + P+  Y YF+K+VPT Y  +           +
Sbjct: 262 TVIERMGGKNTWTNHHLNPLDATHQETKDPNFNYMYFVKIVPTSYLPLGWEKRTPGYDGS 321

Query: 210 IQSNQFSVTEHFRS------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFL 256
           I+++Q+SVT H RS      S++G  + L      PGVFF YD+SP+KV   EE   +FL
Sbjct: 322 IETHQYSVTSHKRSLMGGDDSQEGHPERLHARNGIPGVFFSYDISPMKVINREERAKTFL 381

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            FL+ +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 382 GFLSGLCAIVGGTLTVAAAVDRGLFEGASRLKK 414


>gi|134054958|emb|CAK36967.1| unnamed protein product [Aspergillus niger]
          Length = 406

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/328 (38%), Positives = 173/328 (52%), Gaps = 53/328 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V H I K RL S    G VI+     + A ++ K L         +  YC
Sbjct: 89  MDVSGEQQTGVVHGINKVRLTSAAEGGRVID-----VKALELAKHL---------DPDYC 134

Query: 58  GSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA +    S   CCN C+EVREAY ++ WA    + ++QC+ EG+ +RI  +  EG
Sbjct: 135 GECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRREG 194

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFG 167
           C + G L VNKV GNFH APG+SF    +HVHD+  F        +   ++H+I++L FG
Sbjct: 195 CRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLANFFDADLPDAEKHTMTHEIHQLRFG 254

Query: 168 EHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQ 214
              P  +            NPLDG +     P   Y YF+KVV T Y  +     I+++Q
Sbjct: 255 PQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLIETHQ 314

Query: 215 FSVTEHFRS------SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTN 261
           +SVT H RS      S++G  + L      PGVF  YD+SP+KV   E    +F  FLT 
Sbjct: 315 YSVTSHKRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTG 374

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 375 VCAIIGGTLTVAAALDRGLYEGVSRMKK 402


>gi|348667280|gb|EGZ07106.1| hypothetical protein PHYSODRAFT_319656 [Phytophthora sojae]
          Length = 398

 Score =  204 bits (518), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 118/290 (40%), Positives = 156/290 (53%), Gaps = 8/290 (2%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVI-ESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           D++G    D++H+I K  LD  G  + E   D IG   +    + HG   E ++  CGSC
Sbjct: 95  DMAGNVQHDIEHNIRKIPLDHTGQALAEGMHDVIGG-ALTNNTELHG---ETDKPACGSC 150

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           Y A    E CC+ CE V+ AY +K W + +   I QC+     + ++ E  EGC I G L
Sbjct: 151 YSAGEPGE-CCDTCESVKAAYARKSWMMPSLHTIAQCQEVEIEKVLRGEVNEGCRIQGSL 209

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            V+KVAG  +FAP K F    +   D++      F+ SH I  L+FGE +P + NPLD  
Sbjct: 210 VVSKVAGKLYFAPSKFFRSGYLSSKDLVDATFKVFDTSHTIRSLSFGEAYPDMKNPLDNR 269

Query: 181 R--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
           +     E   G +QYF+KVVPT YT +S   I +NQFS TEHFR       + LP V F 
Sbjct: 270 KKELPDEKTRGSFQYFLKVVPTEYTFLSASRIITNQFSATEHFRQLTPVSDKGLPMVTFS 329

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIK 288
           Y  SPI     +  V FL FLT+VCAIVGGVFT +   D  +Y GQ   K
Sbjct: 330 YTFSPIMFRIEQYRVGFLQFLTSVCAIVGGVFTRTATADESVYRGQVGAK 379


>gi|346979363|gb|EGY22815.1| ER-derived vesicles protein ERV46 [Verticillium dahliae VdLs.17]
          Length = 435

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 130/350 (37%), Positives = 174/350 (49%), Gaps = 70/350 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEHNET---Y 56
           MD+SGEQ   +   I K RL SQ       +DG G   ID K L  H            Y
Sbjct: 89  MDVSGEQQHGIVSGISKVRLRSQ-------KDGGGV--IDTKALSLHAADEAATHLAPDY 139

Query: 57  CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA++      + CCN CEEVREAY +  WA    + ++QC RE + +R+ E+  E
Sbjct: 140 CGDCYGAKAPANAVKQGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLDEQRAE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    +HVHD+  +     + + +H+I+ L FG   
Sbjct: 200 GCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDGDITHDFTHQIHALRFGPQL 259

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           P  +                NPLDG       PS  + YF+K+VPT Y  +         
Sbjct: 260 PESITKNLGNKATPWTNHHLNPLDGTSQITTDPSFNFMYFVKIVPTSYLPLGWDSKRSPQ 319

Query: 206 -------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYD 240
                        S  +I+++Q+SVT H RS   G         RL T   +PGVFF YD
Sbjct: 320 DHDGGLLGSFGQGSDGSIETHQYSVTSHKRSLSGGDDSAEGHAERLHTRGGIPGVFFSYD 379

Query: 241 LSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +SP+KV   EE   SF  FLT +CA++GG  TV+  +D  ++ G   +KK
Sbjct: 380 ISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAAAVDRGMFEGSLRLKK 429


>gi|219111025|ref|XP_002177264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411799|gb|EEC51727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 404

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 116/307 (37%), Positives = 179/307 (58%), Gaps = 29/307 (9%)

Query: 8   HLDVKHDIFKKRLDSQGN-----VIESRQDGIGAPKI-DKPLQRHGGRLEHNE------- 54
           HLD  H ++K R+    N     + E  +  +G+  + +K L+     L++ +       
Sbjct: 99  HLDTDHHVWKHRITLLPNGHRQLLGERSKLELGSTLLTEKDLEVKAEELQNAKDNSESRT 158

Query: 55  --TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
             T CG CYGA    E CC +CE+V+ AY+++GW+L +   + QC+RE     I E EGE
Sbjct: 159 EMTPCGDCYGAGEEGE-CCKSCEDVKRAYKRRGWSLRDTSGVSQCRRE---SGIAEAEGE 214

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQS---GVHVHDILAFQRDSFNISHKINKLAFGEH 169
           GCN++G + ++   GN H APG+    +   G+++ D L      +N+SH+I+KL FG+ 
Sbjct: 215 GCNVHGVVALSSGGGNLHIAPGRDTEANFPGGMNIFDALLQSFHQWNVSHQIHKLRFGKD 274

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           +P  V  LDG   T     GMYQY+ +VVPT YT ++G TIQ++Q+SVTEH R    G  
Sbjct: 275 YPAGVYQLDGETRTITDGYGMYQYYFQVVPTRYTFLNGTTIQTHQYSVTEHLRHVSPGSN 334

Query: 230 Q------TLPGVFFFYDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           +       +PG+FFFY++SP+ V   E +   ++ FLT+VCAIVGGV T++G+ID  I+ 
Sbjct: 335 RGYSLNSRMPGIFFFYEVSPLHVDIMEVYQKGWIAFLTSVCAIVGGVVTIAGLIDHVIFS 394

Query: 283 GQRAIKK 289
            Q + ++
Sbjct: 395 RQHSSRE 401


>gi|346324387|gb|EGX93984.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Cordyceps militaris CM01]
          Length = 423

 Score =  202 bits (514), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 123/338 (36%), Positives = 174/338 (51%), Gaps = 58/338 (17%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H + K RL         R +G G   ID   L  H    EH + +YCG
Sbjct: 89  MDVSGEQQHGVAHGVHKVRL---------RPEGEGGGVIDVSSLNLHNDAAEHLDPSYCG 139

Query: 59  SCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            C GA +    +   CCN CEE+REAY +  WA  +    +QC+RE + +R++E+  EGC
Sbjct: 140 DCGGAPAPTTVTKAGCCNTCEEIREAYAQVSWAFGDGKAFEQCEREHYAERLEEQRHEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHF 170
            I G L+VNKV GNFH APG+SF    +HVHD+  +   +     + +H I+ L FG   
Sbjct: 200 RIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETTDDKKHDFTHHIHHLRFGPQL 259

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH------ 208
           P  V                NPLD  +     P+  + YF+K+VPT +  +         
Sbjct: 260 PETVVQKLGKGATPWTNHHGNPLDSTKQLTNDPNFNFMYFVKIVPTSFLPLGWEKMARTM 319

Query: 209 ----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEH 252
               +++++Q+SVT H RS   G         RL +   +PGVFF YD+SP+KV   EE 
Sbjct: 320 NVDASVETHQYSVTSHKRSLTGGDDSAEGHAERLHSRGGIPGVFFSYDISPMKVINREEK 379

Query: 253 -VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             SFL F+  +CA+VGG  TV+  +D  ++ G   +KK
Sbjct: 380 GKSFLGFVAGLCAVVGGTLTVAAAVDRGLFEGTTRLKK 417


>gi|400602673|gb|EJP70275.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Beauveria bassiana ARSEF 2860]
          Length = 423

 Score =  201 bits (511), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 122/338 (36%), Positives = 174/338 (51%), Gaps = 58/338 (17%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H + K RL         R +  G   ID   L  H    EH + +YCG
Sbjct: 89  MDVSGEQQHGVAHGVHKVRL---------RPEAEGGGVIDVSSLDLHNDAAEHLDPSYCG 139

Query: 59  SCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            C GA +        CCN CEE+REAY +  WA  +    +QC+RE + +R++E+  EGC
Sbjct: 140 DCGGAPAPSNVKKAGCCNTCEEIREAYAQVSWAFGDGKAFEQCEREHYAERLEEQRHEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS----FNISHKINKLAFGEHF 170
            I G L+VNKV GNFH APG+SF    +HVHD+  +   +     + +H I+ L FG   
Sbjct: 200 RIDGLLQVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETTDDKKHDFTHYIHHLRFGPQL 259

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           P  V                NPLD  +   + P+  + YF+K+VPT +  +         
Sbjct: 260 PEAVVKKMGKGATPWTNHHANPLDNTKQLTDDPNYNFMYFVKIVPTSFLPLGWEKMSRAM 319

Query: 206 -SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEH 252
            +  +++++Q+SVT H RS   G         RL +   +PGVFF YD+SP+KV   EE 
Sbjct: 320 NTDGSVETHQYSVTSHKRSLTGGDDAAEGHAERLHSRGGIPGVFFSYDISPMKVINREEQ 379

Query: 253 -VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             SFL F+  +CA+VGG  TV+  +D  ++ G   +KK
Sbjct: 380 GKSFLGFIAGLCAVVGGTLTVAAAVDRGLFEGTTRLKK 417


>gi|157873507|ref|XP_001685262.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128333|emb|CAJ08503.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 467

 Score =  199 bits (507), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
           +DI G    DV+ +  K+R+D+  G VI + +  +   K+  K +   G   E+    C 
Sbjct: 151 VDIFGVFANDVEGNTVKQRIDAATGQVISAARAMVDEKKVMTKAIDADGAEKEN----CP 206

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
           SCYGAE +  DCC+ CE+VR+AY ++GW L   ++ ++QC  +           EGCN+Y
Sbjct: 207 SCYGAERNPGDCCHTCEDVRQAYARRGWKLDIDEISVEQCAEDRINMAAAASGKEGCNLY 266

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
                ++  G+  F PG+ +   G  +HD++       ++SH ++ L FG+ FPG  NPL
Sbjct: 267 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSTTRKLDLSHTVHTLEFGDPFPGQQNPL 326

Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVSGHT-----IQSNQFSVTEHFRSSE 225
           DG           ++  +G + YF+K+VPT Y   S  T     ++SNQ+S T HF  SE
Sbjct: 327 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDVVESNQYSATHHFTPSE 386

Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
             +         + +PGVF  YDLSP+++   E H   S  HF+  +CA+ GGV TV+G+
Sbjct: 387 AAKAASQAPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLAHFVLQLCAVCGGVLTVAGL 446

Query: 276 IDAFIYHGQRAIKK 289
           +D+  +H  R I+K
Sbjct: 447 VDSLCFHSARKIRK 460


>gi|392566201|gb|EIW59377.1| endoplasmic reticulum-derived transport vesicle ERV46 [Trametes
           versicolor FP-101664 SS1]
          Length = 423

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 113/327 (34%), Positives = 170/327 (51%), Gaps = 49/327 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQG-----NVIESRQDGIGAPKIDKPLQRHGGRLEHNET 55
           MDISGE   D+ H+I K R+D +G      VI   Q+ +      +     G      E 
Sbjct: 91  MDISGETQSDITHNILKTRMDERGFPVPTTVITELQNDLDKINSQREGGYCGSCYGGVEP 150

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
             G           CCN CE+VR+AY  +GW+ + PD I+QC +EG+ +++KE+  EGCN
Sbjct: 151 EGG-----------CCNTCEDVRQAYVNRGWSFNRPDSIEQCVQEGWSEKLKEQATEGCN 199

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF------ 166
           I G + VNKV GN H +PG+SF  S   +++++ + +   N    +H I+ LAF      
Sbjct: 200 IAGRVRVNKVVGNIHLSPGRSFRTSSHSLYELVPYLKTDGNRHDFTHTIHHLAFEGDDEW 259

Query: 167 -----------GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
                       +      NPLDG          M+QYF+KVV T +  +SG TI ++Q+
Sbjct: 260 DLAKAKLGKELKQRLGIAANPLDGTTGRTIKQQYMFQYFLKVVATQFRTLSGKTINTHQY 319

Query: 216 SVTEHFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
           S T   R  ++G  +              +PG FF Y++SP+++   E   SF HFLT+ 
Sbjct: 320 SATHFERDLDKGSQENTPTGVHVAHGNGGIPGAFFNYEISPLRIVHAETRQSFAHFLTST 379

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           CAIVGGV TV+ +ID+ ++  ++A+KK
Sbjct: 380 CAIVGGVLTVASLIDSALFATRKALKK 406


>gi|389602486|ref|XP_001567299.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505471|emb|CAM42729.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 541

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPK-IDKPLQRHGGRLEHNETYCG 58
           +D+ G    DV+ +  K+R+D+  G VI + +  +   K I K +   G   E+    C 
Sbjct: 225 VDVFGVFANDVEDNTVKQRIDAATGQVISAARAVVDEKKVITKAIDADGVEKEN----CP 280

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
           SCYGAE S  DCC+ CE+VR+AY +KGW L+  D+ ++QC  +           EGCN+Y
Sbjct: 281 SCYGAERSPGDCCHTCEDVRQAYAQKGWRLNVDDISVEQCAEDRIKMATAAFGKEGCNLY 340

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
                ++  G+  F PG+ +   G  +HD++       ++SH ++ L FGE FPG  NPL
Sbjct: 341 ATFAASRATGSLQFIPGRMYQMLGRRMHDLMGSAARKLDLSHTVHTLEFGERFPGQQNPL 400

Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVS-----GHTIQSNQFSVTEHFRSSE 225
           DG           ++  +G + YF+KV+PT Y   S       T++SNQ++ T HF  S 
Sbjct: 401 DGTAQGSALSGDAKDAMNGRFSYFVKVIPTTYQRYSLITGLQDTVESNQYTATHHFTPSA 460

Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
             +         + +PGVF  YDLSP+++   E H   S +HF+  +CA+ GGV TV G+
Sbjct: 461 ATKAASQTPTMQEIVPGVFMTYDLSPVRILAQERHPYPSVIHFVLQLCAVCGGVLTVVGL 520

Query: 276 IDAFIYHGQRAIKK 289
           +D+  +H  R ++K
Sbjct: 521 VDSMCFHSVRKVRK 534


>gi|255941116|ref|XP_002561327.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585950|emb|CAP93687.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 412

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 124/329 (37%), Positives = 171/329 (51%), Gaps = 49/329 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SGEQ + V H + K RL      G VI+ +  D   + +  K L            Y
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLSPHNEGGKVIDVQALDLHSSSEAAKHLA---------PDY 139

Query: 57  CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG C GA          CC  CEEVREAY +K WA  +   I+QCKREG+ +++ E+  E
Sbjct: 140 CGECGGATPPANVIKPGCCTTCEEVREAYAEKQWAFGDGSNIEQCKREGYAEKLAEQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC I G L+VNKV GNFH APG+SF    +HVHD+ A+        +   +SH +++L F
Sbjct: 200 GCRIEGVLKVNKVVGNFHIAPGRSFTTGNMHVHDLDAYVVPNAGPAEQHTMSHLVHELRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSN 213
           G   P  +            NPLD  +   + P+  + YF+KVV T Y  +     I+++
Sbjct: 260 GPQLPTELAGRWGWTDHHHTNPLDDTKQETDEPAYNFMYFVKVVSTSYLPLGWDPHIEAH 319

Query: 214 QFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLT 260
           Q+SVT H R    G         R+     +PGVFF YD+SP+KV   E    +F +FLT
Sbjct: 320 QYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPGVFFNYDISPMKVINREARPKTFTNFLT 379

Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 380 GVCAIIGGTLTVAAALDRGLYEGAMRVKK 408


>gi|146095510|ref|XP_001467598.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020411|ref|XP_003863369.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134071963|emb|CAM70660.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501601|emb|CBZ36681.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 467

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 107/314 (34%), Positives = 168/314 (53%), Gaps = 29/314 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
           +DI G    DV+ +  K+R+D+  G VI + +  +   K+  K +   G   E+    C 
Sbjct: 151 VDIFGVFANDVEGNTVKQRIDAATGQVISAARAMVDEKKVMTKAIDADGAEKEN----CP 206

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
           SCYGAE +  DCC+ CE+VR+AY ++GW L   ++ ++QC  +           EGCN+Y
Sbjct: 207 SCYGAERNPGDCCHTCEDVRQAYARRGWKLDIDEISVEQCAEDRIKMAAAASGKEGCNLY 266

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
                ++  G+  F PG+ +   G  +HD++       ++SH ++ L FG+ FPG  NPL
Sbjct: 267 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSTTRKLDLSHTVHTLEFGDPFPGQQNPL 326

Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVSGHT-----IQSNQFSVTEHFRSSE 225
           DG           ++  +G + YF+K+VPT Y   S  T     ++SNQ+S T HF  SE
Sbjct: 327 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDAVESNQYSATHHFTPSE 386

Query: 226 QGRL--------QTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
             +         + +PGVF  YDLSP+++   E H   S +HF+  +CA+ GGV TV G+
Sbjct: 387 AAKAVSQTPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLVHFVLQLCAVCGGVLTVVGL 446

Query: 276 IDAFIYHGQRAIKK 289
           +D+  +H  R I+K
Sbjct: 447 VDSMCFHSVRKIRK 460


>gi|320592791|gb|EFX05200.1| copii-coated vesicle membrane protein [Grosmannia clavigera kw1407]
          Length = 440

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/355 (35%), Positives = 172/355 (48%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V+H +   RL+ Q           G  +I+ K L  H     H +  YCG
Sbjct: 89  MDVSGEQQHGVQHGVRMVRLEPQSR---------GGSEIEVKTLDLHADAASHLDPEYCG 139

Query: 59  SCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA          CCN C+EVREAY    WA    + ++QC+RE + +RI E+  EGC
Sbjct: 140 PCYGATPPQHAIKTGCCNTCDEVREAYASSSWAFGKGENVEQCQREHYAERIDEQRHEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
            I G L VNKV GNFH APG+SF    +HVHD+  +      +  + +H ++ L FG   
Sbjct: 200 RIEGGLRVNKVVGNFHIAPGRSFSNGNMHVHDLKNYWDMPTPNLHSFTHTVHSLRFGPQL 259

Query: 171 PGV-------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY--------- 202
           P                     +NPLDGV      P+  Y YFIK+VPT Y         
Sbjct: 260 PESLQKTLAGGGAKGQPWTNHHINPLDGVMQQTSDPNFNYMYFIKIVPTSYLALGWEKTF 319

Query: 203 ---------TDVSGH------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
                     DV  +      +++++Q+SVT H RS + G         RL     +PGV
Sbjct: 320 RGFVDDHDSADVGSYGLLADGSVETHQYSVTSHKRSLQGGDDAAEGHQERLHARGGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   EE   +F  FL  +CAI+GG  TV+  +D  ++ G   +KK
Sbjct: 380 FFSYDISPMKVVNREERAKTFAGFLAGLCAIIGGTLTVAAAVDRTVFEGTIRLKK 434


>gi|380489161|emb|CCF36889.1| hypothetical protein CH063_08353 [Colletotrichum higginsianum]
          Length = 437

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 175/352 (49%), Gaps = 72/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGEQ   V H + K RL SQ   G VI+ +        +D  L       EH +  Y
Sbjct: 89  MDVSGEQQHGVIHGVNKVRLRSQKEGGGVIDMK-------ALD--LHSREATAEHLDPNY 139

Query: 57  CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG+CYGA++        CCN CEEVREAY +  WA    + ++QC RE + +R++E+  E
Sbjct: 140 CGACYGAQAPANAQKAGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLEEQRQE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
           GC + G L VNKV GNFH APG+SF    +HVHD+  +         + +H I+ L FG 
Sbjct: 200 GCRLEGNLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPDDAQHDFTHTIHSLRFGP 259

Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH---- 208
             P  V                NPLD        P+  + YF+K+VPT Y  ++      
Sbjct: 260 QLPDQVTKKMGKRAYAWTNHHGNPLDNTHQETTDPNYNFMYFVKIVPTSYLALNWQKSSS 319

Query: 209 ------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                             +++++Q+SVT H RS   G         RL +   +PGVFF 
Sbjct: 320 YQDEENSGLGLLGQGNDGSVETHQYSVTSHKRSLAGGDDAAEGHKERLHSRGGIPGVFFS 379

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   EE   +F  FLT +CAI+GG  TV+  +D  ++ G   +KK
Sbjct: 380 YDISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGLRLKK 431


>gi|119496763|ref|XP_001265155.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
           fischeri NRRL 181]
 gi|119413317|gb|EAW23258.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
           fischeri NRRL 181]
          Length = 438

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 132/355 (37%), Positives = 173/355 (48%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SGEQ + V H + K RL S    G V++ +  D     +I K L         +  Y
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALDLHSKEEIAKHL---------DPNY 139

Query: 57  CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG C GA+    S  E CCN C+EVREAY  K WA      I+QC+REG+  RI  +  E
Sbjct: 140 CGDCGGADPLPGSMKEGCCNTCDEVREAYAAKNWAFGKGSNIEQCEREGYAARIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC + G L VNKV GNFH APG+SF    VH HD+  +        +   ++H I++L F
Sbjct: 200 GCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQNYLDLELPDNEKHTMTHHIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G   P  V            NPLD        P+  + YF+KVV T Y  +         
Sbjct: 260 GPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
                             SG +I+++Q+SVT H RS      S++G  + L      PGV
Sbjct: 320 AHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   E    SF  FLT VCAI+GG  TV+  ID  +Y G   +KK
Sbjct: 380 FFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434


>gi|401426616|ref|XP_003877792.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494038|emb|CBZ29334.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 406

 Score =  197 bits (502), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 167/314 (53%), Gaps = 29/314 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ-GNVIESRQDGIGAPKI-DKPLQRHGGRLEHNETYCG 58
           +DI G    DV+ +  K+R+D+  G VI + +  +   K+  K +   G   E+    C 
Sbjct: 90  VDIFGVFANDVEGNTVKQRIDTATGQVISAARAIVDEKKVVTKAIDADGAEKEN----CP 145

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIY 117
           SCYGAE    DCC+ CE+VR+AY ++GW L   ++ ++QC  +           EGCN+Y
Sbjct: 146 SCYGAERHPGDCCHTCEDVRQAYVRRGWKLDIDEISVEQCAEDRIKMATAAFGKEGCNLY 205

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
                ++  G+  F PG+ +   G  +HD++       ++SH ++ L FG+ FPG  NPL
Sbjct: 206 ATFAASRATGSLQFIPGRIYETLGRRMHDLMGSATRKLDLSHTVHTLEFGDPFPGQQNPL 265

Query: 178 DGVRW-------TQETPSGMYQYFIKVVPTVYTDVS-----GHTIQSNQFSVTEHFRSSE 225
           DG           ++  +G + YF+K+VPT Y   S       T++SNQ+S T HF  SE
Sbjct: 266 DGTAQGSALSGDAKDAMNGRFSYFVKLVPTTYQRYSLITGLQDTVESNQYSATHHFTPSE 325

Query: 226 QGRLQT--------LPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
             + ++        +PGVF  YDLSP+++   E H   S  HF+  VCA+ GGV TV G+
Sbjct: 326 AAKAESQAPKKQEIVPGVFMTYDLSPVRILVQERHPYPSLAHFVLQVCAVCGGVLTVVGL 385

Query: 276 IDAFIYHGQRAIKK 289
           +D+  +H  R I+K
Sbjct: 386 VDSLCFHSVRKIRK 399


>gi|194224360|ref|XP_001916465.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Equus caballus]
          Length = 342

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 120/303 (39%), Positives = 163/303 (53%), Gaps = 59/303 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FK+RLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNC--EEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           C SCYGAE+ D      C  + +  +   KG                   R +EE     
Sbjct: 142 CESCYGAETEDIKPPYFCLQDHLHSSLAGKGLPWG---------------RDQEE----- 181

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
                                + H   V +HD+ +F  D+ N++H I  L+FGE +PG+V
Sbjct: 182 ---------------------ALH--AVEIHDLQSFGLDNINMTHYIRHLSFGEDYPGIV 218

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTL 232
           NPLD    T    S M+QYF+KVVPTVY  V G  +++NQFSVT H + +  G +  Q L
Sbjct: 219 NPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVA-NGLMGDQGL 277

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           PGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI+
Sbjct: 278 PGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKID 337

Query: 293 IGK 295
           +GK
Sbjct: 338 LGK 340


>gi|70990824|ref|XP_750261.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus fumigatus
           Af293]
 gi|66847893|gb|EAL88223.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           fumigatus Af293]
 gi|159130735|gb|EDP55848.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           fumigatus A1163]
          Length = 438

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 132/355 (37%), Positives = 173/355 (48%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SGEQ + V H + K RL S    G V++ +  D     +I K L         +  Y
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDVQALDLHSKEEIAKHL---------DPNY 139

Query: 57  CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG C GA+    S  E CCN C+EVREAY  K WA      I+QC+REG+  RI  +  E
Sbjct: 140 CGDCGGADPLPGSIKEGCCNTCDEVREAYAAKNWAFGKGTNIEQCEREGYAARIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC + G L VNKV GNFH APG+SF    VH HD+  +        +   ++H I++L F
Sbjct: 200 GCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAHDLQNYLDSELPDNEKHTMTHHIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G   P  V            NPLD        P+  + YF+KVV T Y  +         
Sbjct: 260 GPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
                             SG +I+++Q+SVT H RS      S++G  + L      PGV
Sbjct: 320 AHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   E    SF  FLT VCAI+GG  TV+  ID  +Y G   +KK
Sbjct: 380 FFNYDISPMKVINREARPKSFSGFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434


>gi|429853391|gb|ELA28466.1| copii-coated vesicle membrane protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 437

 Score =  196 bits (499), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 125/351 (35%), Positives = 174/351 (49%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGG--RLEH-NETYC 57
           MD+SGEQ   V H + K RL  Q       ++G G   + K L  H      EH +  YC
Sbjct: 89  MDVSGEQQHGVMHGVNKVRLRPQ-------KEGGGVIDV-KALSLHSSDEAAEHLDPNYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     +    CCN CEEVREAY +  WA    + ++QC RE + ++++E+  EG
Sbjct: 141 GPCYGAPAPPNAQKAGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAEKLEEQRREG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD----SFNISHKINKLAFGEH 169
           C I G L VNKV GNFH APG+SF    +HVHD+  +         + +H I+ L FG  
Sbjct: 201 CRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPDDAQHDFTHVIHTLRFGPQ 260

Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
            P  +                NPLD        P+  + YF+K+VPT Y  ++       
Sbjct: 261 LPDTITKKMTKRAYAWTNHHGNPLDSTHQETNDPNYNFMYFVKIVPTSYLALNWQKSASI 320

Query: 207 -----------GH----TIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFFFY 239
                      GH    +++++Q+SVT H RS           Q RL +   +PGVFF Y
Sbjct: 321 QDEESSGLGLLGHLSDGSVETHQYSVTSHKRSLAGGDDSAEGHQERLHSRGGIPGVFFSY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   EE   +F  FLT +CAI+GG  TV+  +D  ++ G   +KK
Sbjct: 381 DISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGLRLKK 431


>gi|389632999|ref|XP_003714152.1| hypothetical protein MGG_01245 [Magnaporthe oryzae 70-15]
 gi|351646485|gb|EHA54345.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae 70-15]
          Length = 439

 Score =  196 bits (499), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 127/353 (35%), Positives = 172/353 (48%), Gaps = 72/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V+H + K RL  Q   G VI+++   + A             L+ N  YC
Sbjct: 89  MDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAKTLALHAE------DEAATHLDPN--YC 140

Query: 58  GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA +        CCN C+EVREAY +  WA    + ++QC RE + +R+ E+  EG
Sbjct: 141 GGCYGAPAPANAKKAGCCNTCDEVREAYAQASWAFGRGENVEQCTREHYAERLDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGEH 169
           C I G L VNKV GNFH APG+SF    +HVHD+  +         + SH I+ L FG  
Sbjct: 201 CQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPVEGGHSFSHTIHSLRFGPQ 260

Query: 170 FPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--- 208
            P                    +NPLDGV  T   P+  Y YF+K+VPT Y  +      
Sbjct: 261 LPPSALEKLGNKDKNMPWTNHHINPLDGVIQTTVDPNFNYMYFVKIVPTSYLPLGWEKRT 320

Query: 209 -------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
                              +++++Q+SVT H RS   G         R+ +   +PGVFF
Sbjct: 321 HLATMHDHGVGTYGYSGDGSVETHQYSVTSHKRSLAGGDDGEDGHKERMHSRGGIPGVFF 380

Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   E    +F  FLT +CAI+GG  TV+  ID   + G   IKK
Sbjct: 381 SYDISPMKVINREVRTKTFAGFLTGLCAILGGTLTVAAAIDRMTFEGVTRIKK 433


>gi|310800359|gb|EFQ35252.1| hypothetical protein GLRG_10396 [Glomerella graminicola M1.001]
          Length = 437

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHG--GRLEH-NETYC 57
           MD+SGEQ   V H + K RL         R++G G   I K L  H      EH +  YC
Sbjct: 89  MDVSGEQQHGVMHGVNKVRL-------RPRKEGGGVIDI-KALDLHSRDDSAEHLDPNYC 140

Query: 58  GSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA++        CCN C+EVREAY +  WA    + ++QC RE + +R++E+  EG
Sbjct: 141 GPCYGAQAPPNAQKPGCCNTCDEVREAYAQASWAFGKGEGVEQCTREHYAERLEEQRQEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEH 169
           C I G L VN+V GNFH APG+SF    +HVHD+  +         + +H I+ L FG  
Sbjct: 201 CRIEGNLRVNRVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPADAQHDFTHTIHSLRFGPQ 260

Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----- 208
            P  V                NPLD        P+  + YF+K+VPT Y  ++       
Sbjct: 261 LPDQVTKKMGKRAYAWTNHHGNPLDNTHQDTNDPNYNFMYFVKIVPTSYLALNWQKSTAY 320

Query: 209 -----------------TIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFFFY 239
                            +++++Q+SVT H RS           Q RL +   +PGVFF Y
Sbjct: 321 QDDDSSSLGLLGQGNDGSVETHQYSVTSHKRSLAGGDDAAEGHQERLHSRGGIPGVFFSY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   EE   +F  FLT +CAI+GG  TV+  +D  ++ G   +KK
Sbjct: 381 DISPMKVINREERAKTFTGFLTGLCAIIGGTLTVAAAVDRGVFEGGMRLKK 431


>gi|72393511|ref|XP_847556.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175086|gb|AAX69235.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70803586|gb|AAZ13490.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|261330829|emb|CBH13814.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 405

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 117/319 (36%), Positives = 169/319 (52%), Gaps = 29/319 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  GE   +V  D  K R+DS       +  G     +D   Q   G    NE  C +C
Sbjct: 90  IDAFGEYVENVVTDTAKVRVDSS----TLKPLGKARQLVDLKKQPTNGNETGNEN-CPTC 144

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
           YGAE +  +CC+ C++VR A+ ++ W     D+ I QC  E           EGCN++  
Sbjct: 145 YGAEKNPGECCHTCDDVRRAFAERQWEFHEDDVSIAQCAHERLKVAADSASAEGCNLHAS 204

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD- 178
             V +V GN HF PG+ F+  G H+H          N+SH ++ L FGE FPG  NP+D 
Sbjct: 205 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIRKLNLSHIVHALEFGERFPGQNNPMDG 264

Query: 179 -----GVRWTQETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS----E 225
                GV+   E   G + YF+KVVPT+Y  VS    G+ ++SNQ+SVT HF  S    +
Sbjct: 265 MVNARGVKDPSEPLIGRFTYFVKVVPTLYQVVSMANTGNLVESNQYSVTHHFTPSWAAPK 324

Query: 226 QGRLQ-------TLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
           +G           +PGVF  YD+SPI+V+ T  H   S +H +  +CA+ GGV+TV+G+I
Sbjct: 325 EGETDNPNSDPLVVPGVFISYDISPIRVSVTRTHPYPSIVHLVLQLCAVGGGVYTVTGLI 384

Query: 277 DAFIYHGQRAIKKKIEIGK 295
           D+  +HG + +++KI  GK
Sbjct: 385 DSLFFHGIKRVQEKINRGK 403


>gi|212540034|ref|XP_002150172.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210067471|gb|EEA21563.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 440

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 127/355 (35%), Positives = 170/355 (47%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGR---LEHNETY 56
           MD+SGEQ + V H + K RL S  +         G   ID   L+ H      +  +  Y
Sbjct: 89  MDVSGEQQMGVVHGLNKVRLSSVAD---------GGRVIDVSKLELHSQNEVAIHLDPEY 139

Query: 57  CGSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG C GA   +      CCN CEEVREAY  K WA    + I+QC+REG+  RI  +  E
Sbjct: 140 CGECGGASPPENAKKPGCCNTCEEVREAYALKSWAFGKGENIEQCQREGYADRIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC I G + VNKV GNFH APG+SF    +HVHD+  +        +   +SH I++L F
Sbjct: 200 GCRIEGDIRVNKVIGNFHIAPGRSFSSGNMHVHDLDTYLDRELADYEKHTMSHIIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G      V            NPLD  +     P+  Y Y+IKVV T Y  +         
Sbjct: 260 GPQLSDEVSQRWQWTDHHHTNPLDSTQQLTNEPAYNYNYYIKVVSTSYLPLGWDSARSDQ 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRS---------SEQGRLQT---LPGV 235
                             +  +I+++Q+SVT H RS           Q R+     +PGV
Sbjct: 320 LHGDDQFTPLGLHGAAHGTAGSIETHQYSVTSHKRSLHGGNDAAEGHQERIHAEGGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   E    +F  FLT VCA++GG  TV+  +D F+Y G R I+K
Sbjct: 380 FFNYDISPMKVVNREARAKTFTGFLTGVCAVIGGTLTVAAAVDRFLYEGSRRIRK 434


>gi|449549110|gb|EMD40076.1| hypothetical protein CERSUDRAFT_132878 [Ceriporiopsis subvermispora
           B]
          Length = 1001

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 114/323 (35%), Positives = 170/323 (52%), Gaps = 41/323 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNETYCGS 59
           MDISGE   D+ H+I K RL  +G  + +         IDK   QR GG           
Sbjct: 669 MDISGETQTDISHNIIKTRLTEKGLPVPNAASSELRNDIDKLNEQRQGGYCGSCYGGVEP 728

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
             G       CCN+CE+VR+AY  +GW+ + P+ I+QC  EG+ +++K++  EGCNI G 
Sbjct: 729 AGG-------CCNSCEDVRQAYVNRGWSFNRPEGIEQCVDEGWSEKLKDQANEGCNIAGR 781

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN---ISHKINKLAF-GEHFPGVV- 174
           + VNKV GN H +PG+SF     +++D++ + +D  N    SH I++ AF G+    ++ 
Sbjct: 782 VRVNKVVGNIHLSPGRSFRSGSQNLYDLVPYLKDDGNRHDFSHTIHEFAFEGDDEYDILK 841

Query: 175 ---------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                          NPLDG          M+QYF+KVV T +  + G ++ +NQ+S T 
Sbjct: 842 AKSGKEMRRRMGIEGNPLDGAIGRTSKQQYMFQYFLKVVSTQFRTLDGMSVNTNQYSATH 901

Query: 220 HFRSSEQGRLQT-------------LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             R    G+ +              +PG FF Y++SPI ++  E   SF HFLT+ CAIV
Sbjct: 902 FERDLTAGQQEKDQAGLHVAHTSVGIPGAFFNYEISPILISHAESRQSFAHFLTSTCAIV 961

Query: 267 GGVFTVSGIIDAFIYHGQRAIKK 289
           GGV TV+ +ID+ ++   R +KK
Sbjct: 962 GGVLTVASLIDSVLFVAGRTLKK 984


>gi|317025332|ref|XP_001388859.2| COPII-coated vesicle membrane protein Erv46 [Aspergillus niger CBS
           513.88]
 gi|350638031|gb|EHA26387.1| hypothetical protein ASPNIDRAFT_196625 [Aspergillus niger ATCC
           1015]
          Length = 438

 Score =  194 bits (493), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 128/355 (36%), Positives = 174/355 (49%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG--RLEH-NETY 56
           MD+SGEQ   V H I K RL S            G   ID K L+ H      +H +  Y
Sbjct: 89  MDVSGEQQTGVVHGINKVRLTSAAE---------GGRVIDVKALELHSKDESAKHLDPDY 139

Query: 57  CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA +    S   CCN C+EVREAY ++ WA    + ++QC+ EG+ +RI  +  E
Sbjct: 140 CGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC + G L VNKV GNFH APG+SF    +HVHD+  F        +   ++H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLANFFDADLPDAEKHTMTHEIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G   P  +            NPLDG +     P   Y YF+KVV T Y  +         
Sbjct: 260 GPQLPDELSDRWQWTDHHHTNPLDGTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLFSSS 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
                             +  +I+++Q+SVT H RS      S++G  + L      PGV
Sbjct: 320 IHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F  YD+SP+KV   E    +F  FLT VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 380 FVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKK 434


>gi|302923326|ref|XP_003053651.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734592|gb|EEU47938.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 437

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 126/352 (35%), Positives = 174/352 (49%), Gaps = 72/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H + K RL  Q           G   ID K L  H     H + +YCG
Sbjct: 89  MDVSGEQQHGVMHGVNKVRLQPQSK---------GGADIDSKSLSLHDDAAAHLDPSYCG 139

Query: 59  SCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA+    +    CC  C+EVREAY +  WA    + ++QC+RE + +++  +  EGC
Sbjct: 140 GCYGAQPPANARKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYAEKLDAQREEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL----AFQRDSFNISHKINKLAFGEHF 170
            I G L VNKV GNFHFAPG+SF    +HVHD+     A +  + + +H I+ L FG   
Sbjct: 200 RIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDAPKGKAHDFTHIIHSLRFGPQL 259

Query: 171 PGVV---------------NPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
           P  V               NPLDG R   + P+  + YF+K+VPT Y             
Sbjct: 260 PDEVARKVGKGTPWTNHHQNPLDGTRQDIKDPNFNFMYFVKIVPTSYLPLGWDSKGLKIA 319

Query: 203 ------TDVSGH------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                 T +  +      +++++Q+SVT H RS   G         R  T   +PGVFF 
Sbjct: 320 GLLQDDTSLGAYGYAEDGSVETHQYSVTSHKRSLAGGNDAAEGHAERQHTSGGIPGVFFS 379

Query: 239 YDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   EE   +F  FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 380 YDISPMKVVNREEKGKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 431


>gi|367052857|ref|XP_003656807.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
 gi|347004072|gb|AEO70471.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
          Length = 436

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 128/352 (36%), Positives = 172/352 (48%), Gaps = 73/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
           MD+SGEQ   V+H + K RL         R    G   ID K L  H      +  + +Y
Sbjct: 89  MDVSGEQQHGVQHGVTKTRL---------RPLSEGGGDIDSKALALHAADEAAIHLDPSY 139

Query: 57  CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA+    +    CCN C+EV+EAY ++ WA    D I+QC+RE + +R+ E+  E
Sbjct: 140 CGPCYGAKPPTTAKKPGCCNTCDEVKEAYAQQAWAFGRGDGIEQCEREHYGERLDEQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    VHVHD+  +         +H I+ L FG   
Sbjct: 200 GCRIEGGLRVNKVVGNFHIAPGRSFSNGNVHVHDLKNYWDTPTKHTFTHIIHHLRFGPQL 259

Query: 171 PGV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY------------ 202
           P                  +NPLDG     +  +  Y YFIK+VPT Y            
Sbjct: 260 PDSLHKKLGTKHLPWTNHHLNPLDGTSQETDDVNFNYMYFIKIVPTSYLPLGWEKTWAGF 319

Query: 203 ------------TDVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                       T   G +++++Q+SVT H RS   G         RL     +PGVFF 
Sbjct: 320 REEHQAELGSFGTSADG-SVETHQYSVTSHKRSLAGGDDAAEGHRERLHAKGGIPGVFFS 378

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   EE   +FL F+  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 379 YDISPMKVINREERSKTFLGFIAGLCAIVGGTLTVAAAVDRALFEGTVRLKK 430


>gi|325191973|emb|CCA26442.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 401

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 17/284 (5%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GE  +D+   I   RLD++GN I +         +D   +     L  N  YCGSC
Sbjct: 115 MDVTGELQMDLHRSIGMTRLDAKGNPINT---------LDSAKEE---VLPAN--YCGSC 160

Query: 61  Y-GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           Y       + CCN C+EV+EA+      L + D  +QC RE   ++ + + GEGC + G+
Sbjct: 161 YETVHPLGKTCCNTCDEVKEAFVANDLRLFDADQKEQCVREMTEEQRQAQAGEGCRLKGY 220

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           + VN+VAGNFH   G++FH+ G  +H  L  Q   FN S  ++ L+FG  +  V N LDG
Sbjct: 221 MMVNRVAGNFHVGLGRTFHRKGKLIHQFLPGQESVFNASFLLHSLSFGTPYANVKNGLDG 280

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFF 238
            ++  +   G+ +YF+K+VPT+Y+D+S  ++ S Q+S T+  +  +  G++  LPG +F 
Sbjct: 281 TQYITKKKGGVMKYFLKIVPTIYSDISS-SVHSYQYSHTKQEKYMNAMGQISGLPGAYFM 339

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           ++ SP  V    E + F HF+  + AI+GG+ +++G +D+ I+H
Sbjct: 340 FEFSPFMVKIDSEQIPFTHFVIRIFAILGGMISIAGFVDSVIFH 383


>gi|340923948|gb|EGS18851.1| hypothetical protein CTHT_0054620 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 436

 Score =  194 bits (493), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 173/351 (49%), Gaps = 71/351 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V+H + K RL   +  G  I+ ++  + +      ++     L+ N  YC
Sbjct: 89  MDVSGEQQHGVQHGVTKTRLRPWEEGGGDIDKKELALHS------IEESATHLDPN--YC 140

Query: 58  GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           GSCYGA     +    CC  C+EVREAY +  WA    + I+QC+RE + +R+ ++  EG
Sbjct: 141 GSCYGANPPPNAVKPGCCQTCDEVREAYAQAAWAFGRGENIEQCQREHYAERLDQQRREG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKV GNFH APGKSF    +HVHD+  +         +H I+ L FG   P
Sbjct: 201 CRIEGGLRVNKVVGNFHIAPGKSFSNGNMHVHDLKNYWESPVRHTFTHIIHHLRFGPQLP 260

Query: 172 GV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
                             VNPLD      +  +  Y YFIK+VPT Y             
Sbjct: 261 ESLHQKLGNKALPWSNHHVNPLDNTHQETDEVNFSYMYFIKIVPTSYLPLGWEKTWDQFR 320

Query: 203 -----------TDVSGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
                      T   G +++++Q+SVT H RS   G         RL +   +PGVFF Y
Sbjct: 321 EQHHAELGSFGTSADG-SVETHQYSVTSHRRSLSGGDDAAEGHSERLHSKGGIPGVFFSY 379

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   EE   SFL FL  +CAIVGG  TV+  ID  ++ G   +KK
Sbjct: 380 DISPMKVINREERAKSFLGFLAGLCAIVGGTLTVAAAIDRALFEGTVRLKK 430


>gi|443734706|gb|ELU18587.1| hypothetical protein CAPTEDRAFT_139951 [Capitella teleta]
          Length = 285

 Score =  194 bits (492), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 97/191 (50%), Positives = 123/191 (64%), Gaps = 15/191 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPKID----KPLQRHGGRLEHNE 54
           MD+SGEQ +DV HDIFK+RLD  G  +  E  ++ +G    D     PL+         +
Sbjct: 92  MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGDKSKDFAVKNPLK---------D 142

Query: 55  TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
             C SCYGAES    CCN C EVREAYR+KGWA  +   I+QC REG++ +++E + EGC
Sbjct: 143 DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGC 202

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            IYGFLEVNKVAGNFH APG+SF Q   H+HD+ A Q   FN+SH+I  L+FG+ +PG V
Sbjct: 203 RIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQV 262

Query: 175 NPLDGVRWTQE 185
           NPLD      E
Sbjct: 263 NPLDASEQVTE 273


>gi|169770949|ref|XP_001819944.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus oryzae
           RIB40]
 gi|238486566|ref|XP_002374521.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           flavus NRRL3357]
 gi|83767803|dbj|BAE57942.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699400|gb|EED55739.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           flavus NRRL3357]
 gi|391874294|gb|EIT83200.1| COPII vesicle protein [Aspergillus oryzae 3.042]
          Length = 436

 Score =  194 bits (492), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 127/352 (36%), Positives = 175/352 (49%), Gaps = 71/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V H I K RL S    G+VI+ +   + +       Q     L+ N  YC
Sbjct: 89  MDVSGEQQTGVVHGINKVRLSSPAEGGHVIDVKALELHSE------QEAAKHLDPN--YC 140

Query: 58  GSCYGAESS--DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           G C G      ++ CCN CEEVREAY ++ WA    + I+QC+REG+ QR+  +  EGC 
Sbjct: 141 GDCGGVPQPGGEKRCCNTCEEVREAYAQQQWAFGKGENIEQCEREGYAQRLDAQRREGCR 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFGEH 169
           + G L VNKV GNFH APG+SF    VHVHD+  +        +   ++H I++L FG  
Sbjct: 201 LEGVLRVNKVVGNFHIAPGRSFTSGNVHVHDLENYFEGDLPDAEKHTMTHIIHQLRFGPQ 260

Query: 170 FPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------ 205
            P  +            NPLD  +     P+  + YF+KVV T Y  +            
Sbjct: 261 LPDELSDRWQWTDHHHTNPLDSTQQETSDPAYNFMYFVKVVSTSYLPLGWDPLFSSAVHS 320

Query: 206 ---------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGVFFF 238
                          S  +I+++Q+SVT H RS      S++G  + L      PGVFF 
Sbjct: 321 AYEDSPLGSHGIAYGSQSSIETHQYSVTSHKRSLRGGDASDEGHKERLHAANGIPGVFFN 380

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   E    +F  FLT VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 381 YDISPMKVINKEARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGALRVKK 432


>gi|443734710|gb|ELU18591.1| hypothetical protein CAPTEDRAFT_139954 [Capitella teleta]
          Length = 285

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 97/191 (50%), Positives = 123/191 (64%), Gaps = 15/191 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGIGAPKID----KPLQRHGGRLEHNE 54
           MD+SGEQ +DV HDIFK+RLD  G  +  E  ++ +G    D     PL+         +
Sbjct: 92  MDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGDKSKDFAVKNPLK---------D 142

Query: 55  TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
             C SCYGAES    CCN C EVREAYR+KGWA  +   I+QC REG++ +++E + EGC
Sbjct: 143 DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGC 202

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            IYGFLEVNKVAGNFH APG+SF Q   H+HD+ A Q   FN+SH+I  L+FG+ +PG V
Sbjct: 203 RIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMKFNMSHRIQHLSFGDDYPGQV 262

Query: 175 NPLDGVRWTQE 185
           NPLD      E
Sbjct: 263 NPLDASEQVTE 273


>gi|402083890|gb|EJT78908.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 444

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 124/358 (34%), Positives = 172/358 (48%), Gaps = 77/358 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V+H + K RL  Q   G VI+ +   + A   D+    H      +  YC
Sbjct: 89  MDVSGEQQHGVQHGVVKVRLQPQSEGGGVIDVKALSLHA---DEDSATH-----LDPKYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     ++   CC+ C+EVREAY +  WA    + ++QC RE + +R+ E+  EG
Sbjct: 141 GPCYGAPAPSNAAKAGCCSTCDEVREAYAQASWAFGRGENVEQCLREHYAERLDEQRQEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEH 169
           C I G L VNKV GNFH APG+SF    +HVHD+  +     +     SH ++ L+FG  
Sbjct: 201 CQIAGSLRVNKVIGNFHLAPGRSFSNGNMHVHDLKNYWDTPVDGGHSFSHVVHSLSFGPQ 260

Query: 170 FPGVV-------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-- 208
            P  V                   NPLDG       P+  + YF+K+VPT Y  +     
Sbjct: 261 LPLEVQKRLDRGRSLPWADHSHQLNPLDGTSQETADPNFSFMYFLKIVPTSYLPLGWEGR 320

Query: 209 ------------------------TIQSNQFSVTEHFRS---------SEQGRLQT---L 232
                                    ++++Q+SVT H RS           Q RL +   +
Sbjct: 321 RAKIATGNHDKDSWVGTYGYSPDGAVETHQYSVTSHKRSLAGGDDAAEGHQERLHSKGGI 380

Query: 233 PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           PGVFF YD+SP+KV   EE   +F  FLT +CAI+GG  TV+  +D   Y G   +KK
Sbjct: 381 PGVFFSYDISPMKVINREERPKTFAGFLTGLCAILGGTLTVAAAVDRTFYEGATRLKK 438


>gi|358372047|dbj|GAA88652.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus kawachii
           IFO 4308]
          Length = 438

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 127/355 (35%), Positives = 174/355 (49%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG--RLEH-NETY 56
           MD+SGEQ   V H I K RL S            G   ID K L+ H      +H +  Y
Sbjct: 89  MDVSGEQQTGVVHGINKVRLTSAAE---------GGRVIDVKALELHSKDESAKHLDPDY 139

Query: 57  CGSCYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA +    S   CCN C+EVREAY ++ WA    + ++QC+ EG+ +RI  +  E
Sbjct: 140 CGECYGATAPAGASKPGCCNTCDEVREAYAQQQWAFGKGENVEQCELEGYAERIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC + G L VNKV GNFH APG+SF    +HVHD+  F      + +   ++H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVHDLATFFDAELPESERHTMTHEIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G   P  +            NPLD  +     P   Y YF+KVV T Y  +         
Sbjct: 260 GPQLPDELSDRWQWTDHHHTNPLDNTKQETNEPGYNYMYFVKVVSTSYLPLGWDPLFSSS 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRS------SEQGRLQTL------PGV 235
                             +  +I+++Q+SVT H RS      S++G  + L      PGV
Sbjct: 320 IHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F  YD+SP+KV   E    +F  FLT VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 380 FVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKK 434


>gi|440299607|gb|ELP92159.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba invadens IP1]
          Length = 361

 Score =  193 bits (491), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 104/287 (36%), Positives = 168/287 (58%), Gaps = 29/287 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---C 57
           +D +GE  +D++ ++ KKRL+                     +     +   ++ Y   C
Sbjct: 86  LDTTGEVSIDIESNVNKKRLNPHS------------------MTESSNKATAHKVYGIEC 127

Query: 58  GSCYGAESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
            +C   ES D++ CC  C+E++E+Y+K G  +  P+ + QC+ +   +     +GEGC++
Sbjct: 128 PAC--EESVDKNKCCFTCDELKESYKKAGKEVP-PNAV-QCQLKNIQKMALALDGEGCHM 183

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YG + VN+V+GNFH APG S  Q   H H   A    S N++H  N L+FG++FPG++ P
Sbjct: 184 YGSVFVNRVSGNFHIAPGMSEQQGEGHRHS--AEWIGSLNLTHTWNSLSFGDNFPGMIKP 241

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-QTLPGV 235
           +D ++    T + MYQYF++VVP  Y  +    +++N +SVTEH+RS     + Q +PGV
Sbjct: 242 MDSIQKVDVTNNSMYQYFVQVVPMTYFGLDKKVVKTNGYSVTEHYRSGNLKTMEQGVPGV 301

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           F  Y++S ++V +TEE  SF H LT +C IVGG+FT+  ++DAFI+H
Sbjct: 302 FVLYEISSMEVLYTEETGSFGHLLTGICGIVGGIFTIFSLLDAFIFH 348


>gi|189203047|ref|XP_001937859.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984958|gb|EDU50446.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 437

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 125/356 (35%), Positives = 175/356 (49%), Gaps = 78/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGN---VIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGE  + V H I K RL  + +   VIE+           K L  H     H    Y
Sbjct: 89  MDVSGELQMGVTHGINKVRLSPEADGSKVIET-----------KALDLHADEASHLAPDY 137

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +   +CCN C+EVR+AY    W+    + ++QC+RE + + + ++  E
Sbjct: 138 CGQCYGAPPPTNAKKPNCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQE 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
           GC + G ++VNKV GNFHFAPGKSF    +HVHD+  + +D +    +H+I++L FG   
Sbjct: 198 GCRLEGSIKVNKVVGNFHFAPGKSFSNGNLHVHDLENYFKDDYAHTFTHRIHQLRFGPQL 257

Query: 171 PGV---------------------VNPLDGVRWTQETPSGMYQYFIKVV----------- 198
             V                     VNPLD      +  +  Y YFIKVV           
Sbjct: 258 SDVVVRDMQKKHLDSGHNGWSNHHVNPLDNTVQHTDEKAYNYMYFIKVVSTAYLPLGWEQ 317

Query: 199 ----PTVYTDVSGHT--------IQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
               P+ Y+D+ G T        I+++Q+SVT H RS + G         R+     +PG
Sbjct: 318 EFPHPSKYSDILGTTIDESYKGSIETHQYSVTSHKRSLQGGTDEKDGHKERIHARGGIPG 377

Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VFF YD+SP+KV   E    SF  FL  +CA++GG  TV+  ID  +Y G   IKK
Sbjct: 378 VFFSYDISPMKVVNREVREKSFSGFLVGLCAVIGGTLTVAAAIDRALYEGVNRIKK 433


>gi|325189930|emb|CCA24410.1| hypothetical protein BRAFLDRAFT_63528 [Albugo laibachii Nc14]
          Length = 699

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 104/282 (36%), Positives = 157/282 (55%), Gaps = 8/282 (2%)

Query: 4   SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---CGSC 60
           SGE H D++H + K+ +D  G ++ +   G+    I K        +   +T    CGSC
Sbjct: 408 SGEIHHDIQHSVHKQAIDLNGKILSA---GMKLDSIGKAWTNQSDTVAEEKTVKVECGSC 464

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA +S E CCN CE+V++AY  + W + +   I+QC++    + +     EGC IYG +
Sbjct: 465 YGAGASGE-CCNTCEDVQQAYASRRWNIPSLHTIEQCQKSEIEKLLHSTVEEGCRIYGSI 523

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG- 179
            V KV G   FAP K+     +   +IL      F+ SHKIN L FGE +P + +PL+G 
Sbjct: 524 AVTKVHGKVLFAPAKALLSGYISTEEILDKTIKIFDTSHKINYLDFGERYPEMKSPLNGH 583

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
                +   G YQYF++VVPT Y  ++G  I +NQ+SVT+H++       Q LP + F Y
Sbjct: 584 NTILPKGTRGTYQYFLQVVPTAYYYLNGGIIDTNQYSVTQHYQELTPLGEQQLPMITFQY 643

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
             SPI     +    +L FLT++CAI+GGVFT+ G +D+ ++
Sbjct: 644 KFSPIMFQIEQRRRGYLQFLTSLCAILGGVFTMVGAVDSILF 685


>gi|336465550|gb|EGO53790.1| hypothetical protein NEUTE1DRAFT_151014 [Neurospora tetrasperma
           FGSC 2508]
 gi|350295150|gb|EGZ76127.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
           2509]
          Length = 444

 Score =  193 bits (490), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 126/359 (35%), Positives = 176/359 (49%), Gaps = 79/359 (22%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
           MD+SGEQ   V+H + K RL  Q           G  +ID K L  H         + +Y
Sbjct: 89  MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKILSLHAADESATHLDPSY 139

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CC+ CEEVREAY +  WA  +   ++QC+RE + +R+ E+  E
Sbjct: 140 CGPCYGAPAPYNAKKPGCCSTCEEVREAYAQASWAFGDGATMEQCQREHYTERLAEQRHE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGE 168
           GC I G L VNKV GNFH APG+SF    +HVHD+  +         + SH I+ L FG 
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWSTPVPGGHSFSHIIHSLRFGP 259

Query: 169 HFPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----- 205
             P                    +NPLD  +   + P+  + YF+K+VPT Y  +     
Sbjct: 260 QLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQETDDPNYNFMYFVKIVPTSYLPLGWEKQ 319

Query: 206 ----------------------SGHTIQSNQFSVTEHFRS------SEQG---RLQT--- 231
                                 S  +++++Q+SVT H RS      S++G   RL +   
Sbjct: 320 AAQNKATWEQDHSVGLGAYGYGSDGSMETHQYSVTSHKRSLTGGDDSKEGHGERLHSRGG 379

Query: 232 LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +PGVFF YD+SP+KV   EE   SFL FL  +CA+VGG  TV+  +D  ++ G   +KK
Sbjct: 380 IPGVFFSYDISPMKVVNREERAKSFLGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 438


>gi|213409826|ref|XP_002175683.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
           yFS275]
 gi|212003730|gb|EEB09390.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
           yFS275]
          Length = 394

 Score =  193 bits (490), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 116/311 (37%), Positives = 171/311 (54%), Gaps = 28/311 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG+   DV+H + K RLD  GN+I      IG+   +  + + G      E  CG C
Sbjct: 89  MDISGDFQQDVQHSVTKTRLDKYGNIIAVIDSDIGSATDESAMDKDG------EVTCGDC 142

Query: 61  YGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           YGA  +       CCNNC+ VR+AY +K WA+ + D   QC+ E +      ++GEGCNI
Sbjct: 143 YGAGDAAPPETPGCCNNCKAVRDAYARKQWAIGDYDAFQQCRDENYKAEHASQKGEGCNI 202

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHF-PGV 173
            G L VN+VAGNFHFAPG+SF     H+HD+  +  ++++ +++H I++L+FG    P  
Sbjct: 203 AGHLFVNRVAGNFHFAPGRSFQTQQGHLHDLRGYEEEQEAHDMTHMIHQLSFGPPIKPSA 262

Query: 174 --VNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
              +PLDG     +     Y YFIK V    V  D +  TI +N+FSVT+H RS   GR 
Sbjct: 263 EHTDPLDGHFKNTDDALHNYAYFIKCVAHKFVPLDPADPTINTNEFSVTQHERSVTGGRE 322

Query: 230 QT----------LPGVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
                       +PGVFF  D+SP+ V   +    +F  F++NV + +GG  T++ ++D 
Sbjct: 323 NDNPSHLNRRGGIPGVFFNIDISPMLVIQRQIRGNTFGGFISNVLSFLGGFITLTTLVDR 382

Query: 279 FIYHGQRAIKK 289
            +Y  +  +KK
Sbjct: 383 GLYAAELKMKK 393


>gi|85115136|ref|XP_964815.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
 gi|28926610|gb|EAA35579.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
          Length = 444

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/359 (35%), Positives = 175/359 (48%), Gaps = 79/359 (22%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
           MD+SGEQ   V+H + K RL  Q           G  +ID K L  H         + +Y
Sbjct: 89  MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKVLSLHAADESATHLDPSY 139

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CC+ CEEVREAY +  WA  +   ++QC+RE + +R+ E+  E
Sbjct: 140 CGPCYGAPAPYNAKKPGCCSTCEEVREAYAQASWAFGDGATMEQCQREHYTERLAEQRHE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGE 168
           GC I G L VNKV GNFH APG+SF    +HVHD+  +         + SH I+ L FG 
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWSTPVPGGHSFSHIIHSLRFGP 259

Query: 169 HFPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----- 205
             P                    +NPLD  +     P+  + YF+K+VPT Y  +     
Sbjct: 260 QLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQETNDPNYNFMYFVKIVPTSYLPLGWEKQ 319

Query: 206 ----------------------SGHTIQSNQFSVTEHFRS------SEQG---RLQT--- 231
                                 S  +++++Q+SVT H RS      S++G   RL +   
Sbjct: 320 AAQNKAAWEQDHSVGLGAYGYGSDGSMETHQYSVTSHKRSLTGGDDSKEGHGERLHSRGG 379

Query: 232 LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +PGVFF YD+SP+KV   EE   SFL FL  +CA+VGG  TV+  +D  ++ G   +KK
Sbjct: 380 IPGVFFSYDISPMKVVNREERAKSFLGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 438


>gi|323449476|gb|EGB05364.1| hypothetical protein AURANDRAFT_30967 [Aureococcus anophagefferens]
          Length = 368

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 113/295 (38%), Positives = 164/295 (55%), Gaps = 20/295 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++G+ H  ++  + K+RLD +G+ I  R     A + +     HG   E     C SC
Sbjct: 81  MDVAGDYHPYMEQHMTKQRLDGRGSPIPHRAIPERANEYE-----HGP--EDTGAGCQSC 133

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSN-----PDLIDQCKREGFLQRIKEEEGEGCN 115
           +GAE++++ CCN C+E+  AY  KGW+        P  +D   R+  ++ IK+  GEGCN
Sbjct: 134 FGAETAEQPCCNTCDELLRAYGNKGWSAQEIKKEAPQCVDD-TRDDSIRAIKK--GEGCN 190

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           + G+LEVNKVAGN H A G+S  Q+G  VH     +   FN+SH I+ LAFGE + G+  
Sbjct: 191 LAGWLEVNKVAGNVHVAMGESAIQNGRFVHQFDPTRAPEFNVSHVIHDLAFGETYDGMAL 250

Query: 176 PLDGVRWTQE--TPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRS--SEQGRLQ 230
           PL G     +  T +G++QYFIK+VPT+Y        +++ ++S T+ FR   ++     
Sbjct: 251 PLSGTSRIVDAATGTGLFQYFIKLVPTIYRAAPDAAPVRTVRYSYTQRFRPLHNQPPPTA 310

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
            LPG+F  YD S   V  T    S  HFL  VCAIVGGV TV   +D  +   +R
Sbjct: 311 MLPGIFLVYDFSAFMVEVTRHRSSLAHFLVRVCAIVGGVSTVVAFVDWAVVRAKR 365


>gi|258565913|ref|XP_002583701.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907402|gb|EEP81803.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 435

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 171/351 (48%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   + H I K RL      G+V++++   +   K D+        +  +  YC
Sbjct: 89  MDVSGEQQSGLIHGIKKVRLGPASEGGHVLDAQT--LDLHKKDEVA------VHLDPEYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           GSCY       +  + CCN C+EVREAY  +GWA    + + QC+REG+  RI  +  EG
Sbjct: 141 GSCYDGVPPPNAQKQGCCNTCDEVREAYASRGWAFGRGEGVAQCEREGYGARIDAQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C + G L VNKV GNFH APG+SF    +H HD+  +        ++H I++L FG   P
Sbjct: 201 CRLEGILRVNKVIGNFHIAPGRSFTNGYMHAHDLKIYHETPVKHTMAHIIHQLRFGPQLP 260

Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
             +            NPLD    T E P   + YF+KVV T Y  +              
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKYNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320

Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
                            +I+++Q+SVT H RS E G         R+ T   +PGVFF Y
Sbjct: 321 ASDAPLGKQGIQLGRHGSIETHQYSVTSHKRSVEGGDDSAEGHKERIHTAGGIPGVFFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   E    SF  FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 381 DISPMKVINREARTKSFSGFLTGVCAVIGGTLTVAAAIDRMLYEGAVRVKK 431


>gi|363752862|ref|XP_003646647.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356890283|gb|AET39830.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 399

 Score =  191 bits (486), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 113/314 (35%), Positives = 168/314 (53%), Gaps = 35/314 (11%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SGE  LD++   F K RLD  G  I + +  +G+ K           L  +  YCGS
Sbjct: 88  MDTSGEVQLDLQDAGFTKTRLDHSGTPIRTEKLEVGSNK--------AVHLPDDPNYCGS 139

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYG++S D +         CC  CEEVREAY +KGWA  +   I+QC REG++++I  + 
Sbjct: 140 CYGSKSQDNNDALPKEQKVCCQTCEEVREAYSEKGWAFFDGQKIEQCIREGYVEKINSQL 199

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDS-FNISHKINKLAFGE 168
            EGC + G  ++N++ GN HFAPG++ +     H HD+  +   S  N +H I+KL+FG 
Sbjct: 200 HEGCRVKGSAKLNRIQGNIHFAPGRTTNSGKRTHTHDVSLYDTHSHLNFNHIIHKLSFGS 259

Query: 169 HFPGVV-NPLDG---VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
              G + NPLDG   +    +     + YF K+VPT Y  + G  +++ QFSVT H R  
Sbjct: 260 DADGALSNPLDGHKNIIQGDDAHFSTFSYFTKIVPTRYEYLDGRKLETTQFSVTTHSRPL 319

Query: 225 EQGRLQTLP----------GVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVS 273
           + G+    P          GV  F+++SP+KV  +E+H +++  F+ N    +G V  V 
Sbjct: 320 KGGKDDDHPNTIHHRGGIAGVTIFFEMSPLKVINSEKHAITWSGFVLNCITSIGSVLAVG 379

Query: 274 GIIDAFIYHGQRAI 287
            +ID   Y  QR+I
Sbjct: 380 TVIDKITYRAQRSI 393


>gi|440636941|gb|ELR06860.1| hypothetical protein GMDG_08151 [Geomyces destructans 20631-21]
          Length = 441

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 127/355 (35%), Positives = 175/355 (49%), Gaps = 74/355 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGG--RLEH-NETYC 57
           MD+SGE    VKH + K RL+S         D  G     K L  H    +  H + +YC
Sbjct: 89  MDVSGEMQTGVKHGVSKVRLNSP--------DAGGGAIDVKALDLHSTEEKAAHLDPSYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     +    CCN C+EVR+AY    WA    + ++QC+RE + +R+ E+  EG
Sbjct: 141 GQCYGATPPPNAQKAGCCNTCDEVRDAYASASWAFGRGENVEQCEREHYSERLDEQRKEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-----RDSFNISHKINKLAFGE 168
           C I G + VNKV GNFH APG+S+    +HVHD+  +          + +H I+ + FG 
Sbjct: 201 CRIEGGVRVNKVIGNFHIAPGRSYSNGNMHVHDLANYWDTPSLERGHSFAHTIHHVRFGP 260

Query: 169 HFP-GV---------------VNPLDGVRWTQETPSGMYQYFIKVVPTVY---------- 202
             P G+               +NPLDG +     P+  Y YF+KVV T Y          
Sbjct: 261 QLPEGLSKKFGGKNQPWTNHHLNPLDGTQQHTRDPAFNYMYFVKVVSTSYLPLGWNSKSA 320

Query: 203 --TDVS---------GH----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
             T +S         GH    +++++Q+SVT H RS   G         RL +   +PGV
Sbjct: 321 AKTQISEENIGLGAYGHAVDGSVETHQYSVTSHKRSLSGGDDGAEGHKERLHSRTGIPGV 380

Query: 236 FFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   EE    L  F+T +CAIVGG  TV+  +D  +Y G   IKK
Sbjct: 381 FFSYDISPMKVINREERTKTLSGFITGLCAIVGGTLTVAAAVDRGLYEGVSRIKK 435


>gi|378732932|gb|EHY59391.1| hypothetical protein HMPREF1120_07381 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 437

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 170/356 (47%), Gaps = 78/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRH-----GGRLEHNE 54
           MD+SGEQ   V H + K RL S            G+  ID + LQ H        L+ + 
Sbjct: 89  MDVSGEQQSGVVHGVNKVRLTSVAE---------GSRVIDTQALQLHQQAEVSSHLDPD- 138

Query: 55  TYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
            YCGSCY A     +    CCN C+EVREAY    WA    + ++QC+REG+  R+ E+ 
Sbjct: 139 -YCGSCYSAPAPPNAKKPGCCNTCDEVREAYAANSWAFGRGEGVEQCEREGYGARLDEQR 197

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAF 166
            EGC I G + VNKV GNFH APG+SF    +HVHD+  F           +H+I+ L F
Sbjct: 198 HEGCRIEGVIRVNKVVGNFHIAPGRSFSNGNMHVHDLNNFFDTPIEGGHTFTHEIHSLRF 257

Query: 167 GEHFPGV------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G                   NPLDG+R   + P   + YFIKVV T Y  +         
Sbjct: 258 GPQLSDQEAKWTGADHHLNANPLDGLRQETDEPGYNFMYFIKVVSTSYLPLGWDEDKSIQ 317

Query: 206 -------------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
                              S  +I+++Q+SVT H RS   G         RL     +PG
Sbjct: 318 QHSSLSDLIPLGMHGKGAGSQGSIETHQYSVTSHKRSLAGGNDAAEGHKERLHAHGGIPG 377

Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VFF YD+SP+KV   E    SF +FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 378 VFFSYDISPMKVINREVRPKSFANFLTGVCAVIGGTLTVAAAIDRGLYEGATRLKK 433


>gi|171696240|ref|XP_001913044.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948362|emb|CAP60526.1| unnamed protein product [Podospora anserina S mat+]
          Length = 437

 Score =  191 bits (484), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 173/351 (49%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V+H + K RL      G VIE++   + A             L+ N  YC
Sbjct: 89  MDVSGEQQHGVQHGVVKTRLRPLSEGGGVIEAKALALHA------RDEEAAHLDPN--YC 140

Query: 58  GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     +   +CC  C+EV+EAY  + WA    + I+QC+RE + +++ E+  EG
Sbjct: 141 GPCYGAAPPVHAQKPNCCQTCDEVKEAYAAQAWAFGRGEGIEQCEREHYAEKLDEQRNEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G + VNKV GNFH APGKSF    +HVHD+  +         +H+I+ L FG   P
Sbjct: 201 CRIEGNVRVNKVIGNFHIAPGKSFSNGNMHVHDLKNYWDTPVKHTFTHEIHHLRFGPQLP 260

Query: 172 -GV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
            G+                VNPLD      +  +  + YFIK+VPT Y  +         
Sbjct: 261 DGLAKKLGKNKALPWTNHHVNPLDNTHQETDDVNYNFMYFIKIVPTSYLPLGWEKTWQGF 320

Query: 206 --------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
                         +  +++++Q+SVT H RS   G         RL     +PGVFF Y
Sbjct: 321 KDQHHKELGSFGQSADGSLETHQYSVTSHRRSLSGGDDGSEGHKERLHAKGGIPGVFFSY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   EE   SFL FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 381 DISPMKVINREERPKSFLGFLAGLCAIVGGTLTVAAAVDRALFEGGMKLKK 431


>gi|406606433|emb|CCH42207.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Wickerhamomyces ciferrii]
          Length = 405

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 112/322 (34%), Positives = 173/322 (53%), Gaps = 40/322 (12%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLEHNETYCG 58
           MD+SG+  LDV +  F K RL   G  I   +  IG          HG    +    YCG
Sbjct: 89  MDVSGDLQLDVTNYGFTKIRLTETGEEIGEEEMKIG--------DDHGHADADIPADYCG 140

Query: 59  SCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
            CYGA++ D++         CCN+C+ VR+AY   GWA  +   ++QC+REG++++I + 
Sbjct: 141 PCYGAKNQDKNENKPQEEKVCCNDCDSVRKAYASVGWAFFDGKNVEQCEREGYVKKINDR 200

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFG- 167
            GEGC + G  ++N++ GN HFAPG S+     HVHD+  + ++  FN  H IN  +FG 
Sbjct: 201 LGEGCRVKGTAKLNRINGNIHFAPGASYSAPNRHVHDLSLYGKNKDFNFRHVINHFSFGP 260

Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                   E      +PLDG    Q +   +Y YF+KVVPT Y  ++G  +++NQFS T 
Sbjct: 261 DVNSKYTAETLELSSHPLDGTNAIQGSRDHLYSYFLKVVPTRYEYLNGTKVETNQFSSTY 320

Query: 220 HFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGG 268
           H R    GR +           +PG+FF +++SP+K+   E +  S+  FL NV + +GG
Sbjct: 321 HDRPLTGGRDEDHPNTFHARGGIPGLFFHFEMSPLKIINKETYGTSWSGFLLNVISAIGG 380

Query: 269 VFTVSGIIDAFIYHGQRAIKKK 290
           + TV  ++D  ++   + I++K
Sbjct: 381 ILTVGAVVDRTVFVADKVIRRK 402


>gi|396471326|ref|XP_003838845.1| similar to endoplasmic reticulum-golgi intermediate compartment
           protein 3 [Leptosphaeria maculans JN3]
 gi|312215414|emb|CBX95366.1| similar to endoplasmic reticulum-golgi intermediate compartment
           protein 3 [Leptosphaeria maculans JN3]
          Length = 439

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 174/356 (48%), Gaps = 76/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGE  + + H I K RL  + +         G+  ID KPL  H     H + +YCG
Sbjct: 89  MDVSGELQMGITHGINKVRLSPEVD---------GSKVIDAKPLDLHQDEASHLDPSYCG 139

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           +CYGA     +    CCN C+EVR+AY    W+    + ++QC+RE + + + E+  EGC
Sbjct: 140 NCYGAPPPTNAIKHGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF-- 170
            + G ++VNKV GNFH APGKSF    +HVHD+  + RD +    +HKI+ L FG     
Sbjct: 200 RLEGSIKVNKVVGNFHIAPGKSFSNGNLHVHDLENYFRDEYAHTFTHKIHHLRFGPQLSQ 259

Query: 171 --------------PGV-----VNPLDGVRWTQETPSGMYQYFIKVVPTVYT-------- 203
                         PG      VNPLD      +  +  Y YFIKVV T Y         
Sbjct: 260 AVVQDMAKKHMATGPGGWTNHHVNPLDHTEQRTDEKAFNYMYFIKVVSTAYLPLGWEKSA 319

Query: 204 ---------DVSGHTIQS--------NQFSVTEHFRSSEQG---------RLQT---LPG 234
                    D+ G TI S        +Q+SVT H RS + G         R+     +PG
Sbjct: 320 DGSSSGGYDDLLGTTIHSVNKGSIETHQYSVTSHKRSLQGGSDEKEGHKERIHARGGIPG 379

Query: 235 VFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VFF YD+SP+KV   E    +F  FL  +CA++GG  TV+  +D  +Y G   IKK
Sbjct: 380 VFFSYDISPMKVINREMREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKK 435


>gi|242803029|ref|XP_002484091.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218717436|gb|EED16857.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 440

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 122/353 (34%), Positives = 174/353 (49%), Gaps = 71/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLD--SQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD+SGEQ + V H + K RL   ++G  +      I   K++   Q     +  N  YCG
Sbjct: 89  MDVSGEQQMGVVHGLNKVRLSPVAEGGKV------IDVAKLELHAQNEVA-VHLNPEYCG 141

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            C GA     ++   CCN CEEVREAY  K WA    + I+QC+REG+ ++I  +  EGC
Sbjct: 142 QCGGAPPPPNTNKPGCCNTCEEVREAYALKSWAFGKGENIEQCQREGYAEKINAQRREGC 201

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ------RDSFNISHKINKLAFGE 168
            I G + VNKV GNFH APG+SF    +HVHD+  +        +   +SH I++L FG 
Sbjct: 202 RIEGDIRVNKVIGNFHIAPGRSFSTGNMHVHDLDTYMDRELSDNEKHTMSHIIHQLRFGP 261

Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
                +            NPLD  +   + P+  Y Y+IKVV T Y  +           
Sbjct: 262 QLSDELSRRWQWTDHHHTNPLDDTQQFTDEPAYNYNYYIKVVSTSYLPLGWDSSQSDQLH 321

Query: 206 ----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
                           +  +++++Q+SVT H RS   G         R+     +PGVFF
Sbjct: 322 GDDQSTPLGLHGAVHGAAGSLETHQYSVTSHKRSLHGGNDAAEGHKERVHAEGGIPGVFF 381

Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   E    +F  FLT VCA++GG  TV+  +D F+Y G R ++K
Sbjct: 382 NYDISPMKVVNREVRPKTFTGFLTGVCAVIGGTLTVAAAVDRFLYEGSRRMRK 434


>gi|121702771|ref|XP_001269650.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           clavatus NRRL 1]
 gi|119397793|gb|EAW08224.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           clavatus NRRL 1]
          Length = 438

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 127/355 (35%), Positives = 169/355 (47%), Gaps = 75/355 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SGEQ + V H + K RL S    G+V++ R  D     ++ K L         +  Y
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLSSPAEGGHVLDIRSLDLHSKDEVAKHL---------DPNY 139

Query: 57  CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG C GA+    +    CCN C+EVREAY  K WA      I+QC+REG+  RI  +  E
Sbjct: 140 CGDCGGADPLPGAIKPGCCNTCDEVREAYAAKNWAFGKGANIEQCEREGYTARIDAQRRE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAF 166
           GC + G L VNKV GNFH APG+SF    +HVHD  A+            + H+I++L F
Sbjct: 200 GCRLEGVLRVNKVVGNFHIAPGRSFTNGNIHVHDTQAYFDLDLPDDAKHTMEHEIHQLRF 259

Query: 167 GEHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------- 205
           G   P  +            NPLD        P+  + YF+KVV T Y  +         
Sbjct: 260 GPQLPDELSARWQWTDHHHTNPLDNTHQETNDPAYNFVYFVKVVSTSYLPLGWDPLFSSA 319

Query: 206 ------------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGV 235
                             +  +I+++Q+SVT H RS   G         RL     +PGV
Sbjct: 320 LHSTYEKAPLGAHGIGYGASGSIETHQYSVTSHKRSLRGGDAEDEGHKERLHAANGIPGV 379

Query: 236 FFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           FF YD+SP+KV   E     L  FLT VCAI+GG  TV+  ID  +Y G   +KK
Sbjct: 380 FFNYDISPMKVINREARPKTLSSFLTGVCAIIGGTLTVAAAIDRGLYEGALRVKK 434


>gi|440473660|gb|ELQ42442.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae Y34]
 gi|440486294|gb|ELQ66175.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae P131]
          Length = 444

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 127/358 (35%), Positives = 172/358 (48%), Gaps = 77/358 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V+H + K RL  Q   G VI+++   + A             L+ N  YC
Sbjct: 89  MDVSGEQQHGVQHGVIKVRLRPQSEGGGVIDAKTLALHAE------DEAATHLDPN--YC 140

Query: 58  GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA +        CCN C+EVREAY +  WA    + ++QC RE + +R+ E+  EG
Sbjct: 141 GGCYGAPAPANAKKAGCCNTCDEVREAYAQASWAFGRGENVEQCTREHYAERLDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----NISHKINKLAFGEH 169
           C I G L VNKV GNFH APG+SF    +HVHD+  +         + SH I+ L FG  
Sbjct: 201 CQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDTPVEGGHSFSHTIHSLRFGPQ 260

Query: 170 FPGV------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--- 208
            P                    +NPLDGV  T   P+  Y YF+K+VPT Y  +      
Sbjct: 261 LPPSALEKLGNKDKNMPWTNHHINPLDGVIQTTVDPNFNYMYFVKIVPTSYLPLGWEKRT 320

Query: 209 -------------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
                              +++++Q+SVT H RS   G         R+ +   +PGVFF
Sbjct: 321 HLATMHDHGVGTYGYSGDGSVETHQYSVTSHKRSLAGGDDGEDGHKERMHSRGGIPGVFF 380

Query: 238 FY-----DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            Y     D+SP+KV   E    +F  FLT +CAI+GG  TV+  ID   + G   IKK
Sbjct: 381 SYPFCPQDISPMKVINREVRTKTFAGFLTGLCAILGGTLTVAAAIDRMTFEGVTRIKK 438


>gi|256078219|ref|XP_002575394.1| serologically defined breast cancer antigen ny-br-84-related
           [Schistosoma mansoni]
 gi|353230384|emb|CCD76555.1| serologically defined breast cancer antigen ny-br-84-related
           [Schistosoma mansoni]
          Length = 338

 Score =  190 bits (483), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 102/255 (40%), Positives = 145/255 (56%), Gaps = 12/255 (4%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD +G Q L+V H+++K  +   G  +    D +     D            +  YCGSC
Sbjct: 90  MDTTGAQQLNVMHEVYKTSVSVDGTPVS---DSVRHAVNDAS----ALTTTRDPNYCGSC 142

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG-EGCNIYGF 119
           YGAES    CCN CEEV+ AY +  W   N    +QC++E +   IK++ G EGC I+G 
Sbjct: 143 YGAESPSRKCCNTCEEVQMAYNEMRWIFVNISAFEQCRKENW-NEIKQKIGNEGCRIHGN 201

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           L VN+V G FH APG S+ ++  H H   +     FN+SH I +L FGE +PG VNPLDG
Sbjct: 202 LTVNRVGGAFHIAPGHSYTENHAHFHSFQSLGPVQFNVSHSIGELRFGESYPGQVNPLDG 261

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGH--TIQSNQFSVTEHFRSSE-QGRLQTLPGVF 236
            +   +T S M  Y++K+VPT+Y  +  +  T+ +NQ+S T H + +   G  Q LPGVF
Sbjct: 262 TKLAVQTHSQMVIYYLKLVPTMYISLRRNESTVITNQYSATWHSKGTPLTGDGQGLPGVF 321

Query: 237 FFYDLSPIKVTFTEE 251
           F Y+++P+ V  TEE
Sbjct: 322 FNYEIAPLLVKITEE 336


>gi|336265645|ref|XP_003347593.1| hypothetical protein SMAC_04901 [Sordaria macrospora k-hell]
 gi|380096460|emb|CCC06508.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 428

 Score =  190 bits (482), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 170/347 (48%), Gaps = 71/347 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG---RLEHNETY 56
           MD+SGEQ   V+H + K RL  Q           G  +ID K L  H         + +Y
Sbjct: 89  MDVSGEQQHGVQHGVKKIRLRPQSE---------GGGEIDAKVLALHAADESATHLDPSY 139

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CC+ CEE+REAY +  WA  +   ++QC+RE + +R+ E+  E
Sbjct: 140 CGPCYGAPAPYNAKKAGCCSTCEEIREAYAQASWAFGDGSTMEQCQREHYTERLAEQRHE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKI--------N 162
           GC I G L VNKV GNFH APG+SF    +HVHD+  +       ++  K+        N
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLAQWWNSPLPDDLVRKLGGGKDGKRN 259

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------------- 205
            L    H     NPLD  R   + P+  + YF+K+VPT Y  +                 
Sbjct: 260 TLWTNHHL----NPLDNTRQETDDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKASWDQDH 315

Query: 206 ----------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSP 243
                     S  +++++Q+SVT H RS   G         RL +   +PGVFF YD+SP
Sbjct: 316 SVGLGVFGQGSDGSMETHQYSVTSHKRSLAGGDDAKEGHGERLHSRGGIPGVFFSYDISP 375

Query: 244 IKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +KV   EE   SF+ FL  +CA+VGG  TV+  +D  ++ G   +KK
Sbjct: 376 MKVVNREERAKSFIGFLAGLCAVVGGTLTVAAAVDRGLFEGTVRLKK 422


>gi|425772976|gb|EKV11354.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
           digitatum PHI26]
 gi|425782132|gb|EKV20058.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
           digitatum Pd1]
          Length = 438

 Score =  190 bits (482), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 121/354 (34%), Positives = 173/354 (48%), Gaps = 73/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ + V H + K RL  +   G VI+ +   + +P       +H      +  YC
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLSPRNEGGKVIDVQALDLHSPS---EAAKH-----LDPEYC 140

Query: 58  GSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G C GA          CC  CEEVR+AY +K WA  +   I+QC REG+ +R+ E+  EG
Sbjct: 141 GECGGATPPPNVIKPGCCTTCEEVRQAYAEKQWAFGDGSNIEQCTREGYAERLAEQRREG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFG 167
           C I G L+VNKV GNFH APG+SF    +HVHD+  +        +   +SH +++L FG
Sbjct: 201 CRIEGVLKVNKVIGNFHIAPGRSFTTGNMHVHDLDTYIDPNAGPAEQHTMSHLVHELRFG 260

Query: 168 EHFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV---------- 205
              P  +            NPLD  +   + P+  + YF+KVV T Y  +          
Sbjct: 261 PQLPAELAGRWGWTDHHHTNPLDDTKQETDEPAYNFLYFVKVVSTSYLPLGWDPQFSTAI 320

Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
                            +  +I+++Q+SVT H R    G         R+     +PGVF
Sbjct: 321 HNAYDKAPLGYHGLAYGTQGSIEAHQYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPGVF 380

Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   E    +F +FLT VCAI+GG  TV+  +D  +Y G   +KK
Sbjct: 381 FNYDISPMKVVNREARPKTFTNFLTGVCAIIGGTLTVAAALDRGVYEGAMRVKK 434


>gi|342874382|gb|EGU76396.1| hypothetical protein FOXB_13074 [Fusarium oxysporum Fo5176]
          Length = 439

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 172/354 (48%), Gaps = 74/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ   V H + K RL              G   ID K L  H    +H + +YCG
Sbjct: 89  MDVSGEQQHGVMHGVNKVRLQPANQ---------GGAVIDIKSLALHDESADHLDPSYCG 139

Query: 59  SCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            CYGA+    +    CC  C+EVREAY +  WA    + ++QC+RE + +++  +  EGC
Sbjct: 140 GCYGAQPPANARKAGCCQTCDEVREAYAQSSWAFGRGEGVEQCEREHYGEKLDAQREEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGEHF 170
            I G L VNKV GNFHFAPG+SF    +HVHD+  +    +  S + +H I+ L FG   
Sbjct: 200 RIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGKSHDFTHYIHSLRFGPQL 259

Query: 171 PGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----------- 203
           P  +                NPLD  R     P+  + YF+K+VPT Y            
Sbjct: 260 PDNIAKKVGTKSSLWTNHHQNPLDNTRQEIHDPNFNFMYFVKIVPTSYLPLGWDSKGIKI 319

Query: 204 ------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
                 D +G          +++++Q+SVT H RS   G         R  T   +PGVF
Sbjct: 320 AGLLQDDNAGLGAYGYSEDGSVETHQYSVTSHKRSLAGGNDAAEGHAERQHTSGGIPGVF 379

Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   EE   +F  FL  +CAIVGG  TV+  +D  ++ G   IKK
Sbjct: 380 FSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARIKK 433


>gi|119189667|ref|XP_001245440.1| hypothetical protein CIMG_04881 [Coccidioides immitis RS]
 gi|392868334|gb|EAS34105.2| COPII-coated vesicle membrane protein Erv46 [Coccidioides immitis
           RS]
          Length = 435

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/351 (34%), Positives = 168/351 (47%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V H + K RL +    G+ ++     +    +DK   R    L  +  YC
Sbjct: 89  MDVSGEQQSGVIHGVNKVRLSAASEGGHALD-----VETLDLDK---RDQAPLHLDPAYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           GSCY       +    CCN C+EVREAY  + WA    + ++QC++EG+  +I  +  EG
Sbjct: 141 GSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWAFGRGEGVEQCEQEGYGSKIDSQRNEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C + G L VNKV GNFH APG+SF    +H HD+  +        +SH I++L FG   P
Sbjct: 201 CRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDLKTYYETPVKHTMSHIIHQLRFGPQLP 260

Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
             +            NPLD    T E P   + YF+KVV T Y  +              
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320

Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
                            +I+++Q+SVT H RS E G         R+ T   +PGVFF Y
Sbjct: 321 SSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIEGGDDSAEGHKERVHTAGGIPGVFFNY 380

Query: 240 DLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   E     L  FLT VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 DISPMKVINREARTKSLSGFLTGVCAVIGGTLTVAAAVDRALYEGSVRVKK 431


>gi|67524561|ref|XP_660342.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
 gi|40743850|gb|EAA63036.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
 gi|259486349|tpe|CBF84116.1| TPA: COPII-coated vesicle membrane protein Erv46, putative
           (AFU_orthologue; AFUA_1G05120) [Aspergillus nidulans
           FGSC A4]
          Length = 437

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/354 (35%), Positives = 167/354 (47%), Gaps = 74/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGEQ + V H + K RL              G   +D + LQ H    +H +  YCG
Sbjct: 89  MDVSGEQQVGVAHGVNKVRLAPAAE---------GGRVLDVQALQLHAEEAKHLDPDYCG 139

Query: 59  SCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            C GA          CC+ C+EVREAY +K W       I+QC+RE + +RI  +  EGC
Sbjct: 140 ECGGAPPPPNAIKPGCCSTCDEVREAYAQKQWGFGKGTNIEQCEREHYSERIDAQRREGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN------ISHKINKLAFGE 168
            + G + VNKV GNFH APG+SF  + VH+HDI  ++    +      +SH I+ L FG 
Sbjct: 200 RLEGVIRVNKVVGNFHIAPGRSFSSNNVHIHDIANYEERGLSPAEQHTMSHIIHSLRFGP 259

Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
             P  +            NPLD        P+  + YFIKVV T Y  +           
Sbjct: 260 QLPDELSDRWQWTDHHHTNPLDSTSQEAPEPAYSFMYFIKVVSTSYLPLGWDPLYSASLH 319

Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
                            S  +I+++Q+SVT H RS   G         R+     +PGVF
Sbjct: 320 AAADTNTPLGAQGLSAGSQGSIETHQYSVTSHKRSLRGGDASDEAHKERIHAAGGIPGVF 379

Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   E    +F  FLT VCAIVGG  TV+  ID  +Y G   ++K
Sbjct: 380 FNYDISPMKVINREARPKTFTGFLTGVCAIVGGTLTVAAAIDRTLYEGVSRVRK 433


>gi|452842116|gb|EME44052.1| hypothetical protein DOTSEDRAFT_71753 [Dothistroma septosporum
           NZE10]
          Length = 436

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/353 (34%), Positives = 169/353 (47%), Gaps = 73/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE    V H + K RL  +   G  IE +   +G  +  + L         +  YC
Sbjct: 89  MDVSGEVQTGVMHGVNKVRLRPEAEGGGEIEKKALDLGVEEAAQHL---------DPDYC 139

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     ++   CCN C EVREAY    W+    + ++QC+RE + + +  +  EG
Sbjct: 140 GECYGAPAPSNAAKPGCCNTCAEVREAYAGVSWSFGRGENVEQCEREHYSEHLDAQRKEG 199

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI----SHKINKLAFGEH 169
           C I G + VNKV GNFHFAPGKSF    +HVHD+  F      I    +HKI+ L FG  
Sbjct: 200 CRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENFFNSPEGIQHTFTHKIHSLRFGPQ 259

Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
            P  V                NPLDG     E  S  + YF+KVV T Y  ++       
Sbjct: 260 LPDDVVNKVGKRGIAWSEHHLNPLDGTSQVTEEKSYNFMYFVKVVSTAYLPLAWKPSGSL 319

Query: 207 -----------------GHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFF 237
                            G +I+++Q+SVT H RS + G         RL     +PGVFF
Sbjct: 320 LDLPHELVELGGYGKGEGGSIETHQYSVTSHKRSLQGGDANEEGHKERLHARGGIPGVFF 379

Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   E    +F  FLT V A++GG  TV+  +D  +Y G + ++K
Sbjct: 380 SYDISPMKVVNREARTKTFTGFLTGVAAVIGGTLTVAAAVDRLMYEGGQRVRK 432


>gi|408400673|gb|EKJ79750.1| hypothetical protein FPSE_00030 [Fusarium pseudograminearum CS3096]
          Length = 439

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 78/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETY 56
           MD+SGEQ   V H + K RL  +SQG  +           ID K L  H     H + +Y
Sbjct: 89  MDVSGEQQHGVMHGVNKVRLQPESQGGAV-----------IDTKSLSLHDDAAHHLDPSY 137

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CC  C+EVREAY +  WA    + ++QC+RE + +++  +  E
Sbjct: 138 CGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYGEKLDAQRSE 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
           GC I G L VNKV GNFHFAPG+SF    +HVHD+  +    +  S + +H ++ L FG 
Sbjct: 198 GCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGFSHDFTHIVHSLRFGP 257

Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
             P  +                NPLD  R     P+  + YF+K+VPT Y          
Sbjct: 258 QLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPNYNFMYFVKIVPTSYLPLGWDKKGI 317

Query: 204 --------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
                   D +G          +++++Q+SVT H RS   G         R  T   +PG
Sbjct: 318 KIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRRSLAGGNDAAEGHAERQHTSGGIPG 377

Query: 235 VFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VFF YD+SP+KV   EE   +F  FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 378 VFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 433


>gi|330919615|ref|XP_003298687.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
 gi|311327999|gb|EFQ93219.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
          Length = 437

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/353 (34%), Positives = 172/353 (48%), Gaps = 72/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGE  + V H I K RL  +        DG  A +I K +  H     H    YCG 
Sbjct: 89  MDVSGELQMGVTHGINKVRLSPEA-------DGSKAIEI-KAVDLHTDEASHLAPDYCGQ 140

Query: 60  CYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CYGA +        CCN C+EVR+AY    W+    + ++QC+RE + + + ++  EGC 
Sbjct: 141 CYGAPAPSNAKKPTCCNTCDEVRDAYASVSWSFGRGEGVEQCEREHYAEHLDQQRQEGCR 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
           + G ++VNKV GNFHFAPGKSF    +HVHD+  + +D +    +H I++L FG     V
Sbjct: 201 LEGNIKVNKVVGNFHFAPGKSFSNGNLHVHDLENYFKDEYTHTFTHHIHQLRFGPQLSDV 260

Query: 174 V---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVY---------- 202
           V                     NPLD      +  +  Y YFIKVV TVY          
Sbjct: 261 VVQNMQKKHQESGIGGWSNHHINPLDETMQHTDEKAYNYMYFIKVVTTVYLPLGWEKVFP 320

Query: 203 -----TDVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
                +D+ G T        I+++Q+SVT H RS + G  +             +PGVFF
Sbjct: 321 HPSKFSDILGATIDESYKGSIETHQYSVTSHKRSLQGGNDEKDGHKERIHARGGIPGVFF 380

Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP++V   E    +F  FL  +CA++GG  TV+  ID  +Y G   IKK
Sbjct: 381 SYDISPMEVINREVREKTFSGFLVGLCAVIGGTLTVAAAIDRALYEGVNRIKK 433


>gi|46105482|ref|XP_380545.1| hypothetical protein FG00369.1 [Gibberella zeae PH-1]
          Length = 444

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 125/356 (35%), Positives = 174/356 (48%), Gaps = 78/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETY 56
           MD+SGEQ   V H + K RL  +SQG  +           ID K L  H     H + +Y
Sbjct: 89  MDVSGEQQHGVMHGVNKVRLQPESQGGAV-----------IDTKSLSLHDDAAHHLDPSY 137

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +    CC  C+EVREAY +  WA    + ++QC+RE + +++  +  E
Sbjct: 138 CGGCYGATPPANAQKAGCCQTCDEVREAYAQASWAFGRGEGVEQCEREHYGEKLDAQRSE 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF----QRDSFNISHKINKLAFGE 168
           GC I G L VNKV GNFHFAPG+SF    +HVHD+  +    +  S + +H ++ L FG 
Sbjct: 198 GCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVHDLKNYWDVPKGFSHDFTHIVHSLRFGP 257

Query: 169 HFPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
             P  +                NPLD  R     P+  + YF+K+VPT Y          
Sbjct: 258 QLPDHIARKVGHKNTLWTNHHQNPLDDTRQETHDPNYNFMYFVKIVPTSYLPLGWDKKGI 317

Query: 204 --------DVSG---------HTIQSNQFSVTEHFRSSEQG---------RLQT---LPG 234
                   D +G          +++++Q+SVT H RS   G         R  T   +PG
Sbjct: 318 KIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTSHRRSLAGGNDAAEGHAERQHTSGGIPG 377

Query: 235 VFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           VFF YD+SP+KV   EE   +F  FL  +CAIVGG  TV+  +D  ++ G   +KK
Sbjct: 378 VFFSYDISPMKVVNREEKAKTFSGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 433


>gi|261327856|emb|CBH10834.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 405

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 116/320 (36%), Positives = 163/320 (50%), Gaps = 31/320 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-LEHNETYCGS 59
           +D  GE   +V  D  + R++    V      G   P +D   Q   G   EH +  C S
Sbjct: 90  IDAFGEHVENVLTDTARVRVNPDTLV----PLGEARPLMDMKKQPADGNGAEHGK--CPS 143

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYG 118
           CYGAES+  DCC+ C++VR A+ ++ W     D  I QC  E           EGCN++ 
Sbjct: 144 CYGAESNPGDCCHTCDDVRRAFAERQWEFHEDDASIVQCVHERLKMAAASASTEGCNLHA 203

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
              V +V GN HF PG+ F+  G H+H          N+SH ++ L FGE FPG  NP+D
Sbjct: 204 SFSVPRVTGNIHFIPGRMFNFFGQHLHSFKGETIQKLNLSHIVHSLEFGERFPGQSNPMD 263

Query: 179 GVRWTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS---- 224
           G+   +      E   G + YF+KVVPTVY   S    G  ++SNQ+SVT HF  S    
Sbjct: 264 GMANVRGATDPSEPLIGRFSYFVKVVPTVYRIESLVGGGRVVESNQYSVTHHFTPSWETP 323

Query: 225 -------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
                   +     +PGVF  YDLSPI+V+    H   S +H +  +CA+ GGV+TV+G+
Sbjct: 324 KGGENNNAKHDPSVVPGVFISYDLSPIRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGL 383

Query: 276 IDAFIYHGQRAIKKKIEIGK 295
           ID+  +H  R ++ K+  GK
Sbjct: 384 IDSLFFHSIRRMQIKMNRGK 403


>gi|72388468|ref|XP_844658.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62360135|gb|AAX80555.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801191|gb|AAZ11099.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 405

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 116/320 (36%), Positives = 163/320 (50%), Gaps = 31/320 (9%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-LEHNETYCGS 59
           +D  GE   +V  D  + R++    V      G   P +D   Q   G   EH +  C S
Sbjct: 90  IDAFGEHVENVLTDTARVRVNPDTLV----PLGEARPLMDMKKQPADGNGAEHGK--CPS 143

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYG 118
           CYGAES+  DCC+ C++VR A+ ++ W     D  I QC  E           EGCN++ 
Sbjct: 144 CYGAESNPGDCCHTCDDVRRAFAERQWEFHEDDASIVQCVHERLKMAAASASTEGCNLHA 203

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
              V +V GN HF PG+ F+  G H+H          N+SH ++ L FGE FPG  NP+D
Sbjct: 204 SFSVPRVTGNIHFIPGRMFNFFGQHLHSFKGETIQKLNLSHIVHSLEFGERFPGQSNPMD 263

Query: 179 GVRWTQ------ETPSGMYQYFIKVVPTVYTDVS----GHTIQSNQFSVTEHFRSS---- 224
           G+   +      E   G + YF+KVVPTVY   S    G  ++SNQ+SVT HF  S    
Sbjct: 264 GMANVRGATDPSEPLIGRFSYFVKVVPTVYRIESLVGGGRVVESNQYSVTHHFTPSWETP 323

Query: 225 -------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGI 275
                   +     +PGVF  YDLSPI+V+    H   S +H +  +CA+ GGV+TV+G+
Sbjct: 324 KGGENNNAKHDPSVVPGVFISYDLSPIRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGL 383

Query: 276 IDAFIYHGQRAIKKKIEIGK 295
           ID+  +H  R ++ K+  GK
Sbjct: 384 IDSLFFHSIRRMQIKMNRGK 403


>gi|353237029|emb|CCA69011.1| related to ERV46-component of copii vesicles [Piriformospora indica
           DSM 11827]
          Length = 428

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 109/319 (34%), Positives = 171/319 (53%), Gaps = 35/319 (10%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISG+   ++ H + K RLD   +  +   DGI    +   L +       ++ YCGSCY
Sbjct: 92  DISGDVVREITHHVVKTRLDPAAH--QPIPDGIYRTDLKSDLSKQ--LTATSKGYCGSCY 147

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           G +  +  CCN C++VR AY  +GWA  NPD IDQC  E + ++I   + EGCNI G + 
Sbjct: 148 GGQPPEGGCCNTCDDVRRAYTDRGWAFGNPDQIDQCVSENWTEKIMAMQREGCNIEGRVR 207

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN-ISHKINKLAFGEH---------FP 171
           VNKV GN  F+PG+SF  +   V+ ++ + +DS +   H I+ L   ++          P
Sbjct: 208 VNKVTGNMQFSPGRSFVVNRPEVYALVPYLKDSNHFFGHHIHSLEIYDYEEDTWTRRNLP 267

Query: 172 GVVN--------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR- 222
             +         PL+ V    E+   M+QYF+KVV + Y  + G    ++Q+S +   R 
Sbjct: 268 EQIKERLGITKPPLEDVYAHTESADYMFQYFLKVVKSSYKGLDGKAYSTHQYSTSSFERD 327

Query: 223 -------SSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
                   +E G       Q +PGVFF +++SP++V   E+  S+ HF+T++ AI+GGV 
Sbjct: 328 LATMSHGKNEDGIEIVHERQGVPGVFFNFEISPMEVIHIEQRQSWAHFITSMAAIIGGVL 387

Query: 271 TVSGIIDAFIYHGQRAIKK 289
           TV+ ++DA +++ Q  IKK
Sbjct: 388 TVATLVDALLFNTQGLIKK 406


>gi|67479077|ref|XP_654920.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56472012|gb|EAL49533.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449701866|gb|EMD42605.1| endoplasmic reticulumgolgi intermediate compartment protein,
           putative [Entamoeba histolytica KU27]
          Length = 354

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 108/283 (38%), Positives = 160/283 (56%), Gaps = 20/283 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D +GE  +D+  +I K+RL    N++   +D I   K  K +  +G       T C  C
Sbjct: 86  LDTTGEVIIDISKNIKKERL----NLV--NEDEISKKKFAKTV--YG-------TECPPC 130

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              ES  + CC  CEE+ E+Y+K    +  P    QC+     +      GEGC I G +
Sbjct: 131 -NNESDKDKCCFTCEELTESYQKLNKEV--PKGSPQCEIRNIHKMTTFYNGEGCRISGTV 187

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VN+ +GNFH APG S   +  H+H +  +     N++H  N L+FG+ FPG++NP+DG+
Sbjct: 188 FVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGINLTHTWNFLSFGDSFPGMINPMDGI 246

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFY 239
                T + MYQYF++VVP  YT +    I +N +SVTEH+R  S +   Q +PGVF  Y
Sbjct: 247 VKVDRTNNSMYQYFVQVVPMTYTSLDNKVIHTNGYSVTEHYRPGSLKSPEQGIPGVFVIY 306

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           D+S I+V + EE  SF H LT++C I+GGVF +  ++D FI+H
Sbjct: 307 DISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 349


>gi|303322923|ref|XP_003071453.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240111155|gb|EER29308.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320033474|gb|EFW15422.1| COPII-coated vesicle membrane protein Erv46 [Coccidioides posadasii
           str. Silveira]
          Length = 435

 Score =  187 bits (475), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 121/351 (34%), Positives = 168/351 (47%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ   V H + K RL +    G+ ++     +    +DK  Q     L  +  YC
Sbjct: 89  MDVSGEQQSGVIHGVNKVRLSAASEGGHALD-----VETVDLDKKDQ---APLHLDPGYC 140

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           GSCY       +    CCN C+EVREAY  + WA    + ++QC++EG+  +I  +  EG
Sbjct: 141 GSCYDGIPPPNAKKPGCCNTCDEVREAYALRNWAFGRGEGVEQCEQEGYGSKIDSQRNEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C + G L VNKV GNFH APG+SF    +H HD+  +        +SH I++L FG   P
Sbjct: 201 CRLEGILRVNKVVGNFHVAPGRSFTNGYMHAHDLKTYYETPVKHTMSHIIHQLRFGPQLP 260

Query: 172 GVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------- 208
             +            NPLD    T E P   + YF+KVV T Y  +              
Sbjct: 261 DELSQKWKWTDHHHTNPLDSTSQTTEDPKFNFMYFVKVVSTSYLPLGWDASLSSEVHSRL 320

Query: 209 -----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFY 239
                            +I+++Q+SVT H RS E G         R+ T   +PGVFF Y
Sbjct: 321 SSDAPLGKQGIQLGQYGSIETHQYSVTSHKRSIEGGDDSAEGHKERVHTAGGIPGVFFNY 380

Query: 240 DLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   E     L  FLT VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 DISPMKVINREARTKSLSGFLTGVCAVIGGTLTVAAAVDRALYEGSVRVKK 431


>gi|300123299|emb|CBK24572.2| unnamed protein product [Blastocystis hominis]
          Length = 376

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 103/276 (37%), Positives = 148/276 (53%), Gaps = 20/276 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISG+Q + V   I +  LD     +      +   K   P              CGSC
Sbjct: 111 MDISGQQQMGVTSRIVQLDLDENHKPVNMALSSVLYEKNIDPA-------------CGSC 157

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGW-ALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           +GA  S+  CCN C++V  AY ++GW          QC++     +      +GC ++G 
Sbjct: 158 FGASLSNV-CCNTCDDVLSAYERRGWDTWFVSKYSPQCRKNNDEVKKPRVNSQGCMMWGV 216

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVNKVAGNFH A G + ++   H+H         FN++H I KL+FGEH PG+ NPLDG
Sbjct: 217 LEVNKVAGNFHIAVGHAANRDSHHIHSFNPLMISKFNVTHHIEKLSFGEHIPGIQNPLDG 276

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ---GRLQTLPGVF 236
                E+ +    Y++KV+PTVY++ +  T+ SN+ SV E  R  E    G++ +LPG+F
Sbjct: 277 HDMVAESLTSQ-NYYLKVMPTVYSNRTS-TVVSNELSVNEVSRRVEMTPFGQITSLPGIF 334

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           F YD++P     TE  ++F HFL  VCA++GGV  V
Sbjct: 335 FIYDITPFMHVVTESRIAFAHFLVRVCAVIGGVAAV 370


>gi|453082617|gb|EMF10664.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 432

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 125/350 (35%), Positives = 169/350 (48%), Gaps = 71/350 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGE    V H + K RLD+ G  I     G  A  ++   Q     + H +  YCG 
Sbjct: 89  MDVSGEVQSGVMHGVNKVRLDANGKEI-----GKEALTVNSEEQ-----VPHLDPDYCGD 138

Query: 60  CYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CYGA     ++   CCNNC EVREAY    W+    + ++QC RE + + + E+  EGC 
Sbjct: 139 CYGAPAPETATKAGCCNNCAEVREAYAGVSWSFGRGEGVEQCTREHYAEHLDEQRKEGCR 198

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINKLAFGEHFPG 172
           I G + VNKV GNFHFAPGKSF    +HVHD+  + +      + +HKI+ L FG   P 
Sbjct: 199 IEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYFQSGEVQHSFTHKIHHLRFGPELPD 258

Query: 173 VV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGH---- 208
            V                NPLD      +  +  + YF+KVV T Y     D SG     
Sbjct: 259 DVVKAVGKKGMAWSNHHLNPLDDTEQVTDEVAYNFMYFVKVVSTAYLPLGWDGSGSLLDI 318

Query: 209 ----------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYD 240
                           +I+++Q+SVT H RS   G         RL     +PGVFF YD
Sbjct: 319 PHELIALGGYGKGEQGSIETHQYSVTSHKRSLTGGDAKAEGHEERLHAKGGIPGVFFSYD 378

Query: 241 LSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +SP+KV   E    SF  FL  VCA++GG  TV+  +D  +Y G   ++K
Sbjct: 379 ISPMKVINREARAKSFSGFLVGVCAVIGGTLTVAAAVDRLLYEGGSKLRK 428


>gi|261188384|ref|XP_002620607.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis SLH14081]
 gi|239593207|gb|EEQ75788.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis SLH14081]
 gi|239609349|gb|EEQ86336.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis ER-3]
 gi|327354450|gb|EGE83307.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 435

 Score =  187 bits (475), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 124/354 (35%), Positives = 169/354 (47%), Gaps = 76/354 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRL---DSQGNVIESRQDGIGAPKIDKPLQRHG---GRLEHNE 54
           MDISGE   +V H + K RL   +  G V++     I A      LQ H       + + 
Sbjct: 89  MDISGEYQTEVVHGVNKLRLSPAEEGGQVLD-----ITA------LQLHSKTDNAKDLDP 137

Query: 55  TYCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
            YCGSCYGA     +    CCN C+EVREAY  K W+    + ++QC++EG+   +  + 
Sbjct: 138 NYCGSCYGAPAPPNAQKPGCCNTCDEVREAYAAKRWSFGRGENVEQCEKEGYSANLDAQR 197

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGE 168
            EGC + G + VNKV GNFH APG+SF    +H HD+  +       N+ HKI+ L FG 
Sbjct: 198 KEGCRVEGVIRVNKVIGNFHIAPGRSFTNGNMHAHDLNNYYNTPIPHNVGHKIHYLRFGP 257

Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV----------- 205
             P  V            NPLD        P   + YF+KVV T Y  +           
Sbjct: 258 QLPDEVSRRWKWTDHHHTNPLDNTEQHTTNPRLNFAYFVKVVATSYLPLGWDDDWSSTVH 317

Query: 206 -----------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVF 236
                            SG +I+++Q+SVT H RS + G         RL +   +PGVF
Sbjct: 318 SKVSNNVPLGKQGVSLGSGGSIETHQYSVTSHKRSVDGGNDAEEGHKERLHSQGGIPGVF 377

Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             YD+SP+KV   E    +F  FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 378 VNYDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAIDRALYEGSVRVKK 431


>gi|407044387|gb|EKE42566.1| hypothetical protein ENU1_017250 [Entamoeba nuttalli P19]
          Length = 354

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 108/283 (38%), Positives = 160/283 (56%), Gaps = 20/283 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D +GE  +D+  +I K+RL    N++   +D I   K  K +  +G       T C  C
Sbjct: 86  LDTTGEVIIDISKNIKKERL----NLV--NEDEISKKKFAKTV--YG-------TECPPC 130

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              E   + CC  CEE+ E+Y+K    +  P    QC+ +   +      GEGC I G +
Sbjct: 131 -NNEIDKDKCCFTCEELTESYQKLNKEV--PKGSPQCEIKNIHKMTTFYNGEGCRISGTV 187

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
            VN+ +GNFH APG S   +  H+H +  +     N++H  N L+FG+ FPG++NPLDG+
Sbjct: 188 FVNRASGNFHIAPGSSQQLTQEHIHSV-DWISGGINLTHTWNFLSFGDSFPGMINPLDGI 246

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFY 239
                T + MYQYF++VVP  YT +    I +N +SVTEH+R  S +   Q +PGVF  Y
Sbjct: 247 VKVDRTNNSMYQYFVQVVPMTYTSLDNKVINTNGYSVTEHYRPGSLKSPEQGIPGVFVIY 306

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           D+S I+V + EE  SF H LT++C I+GGVF +  ++D FI+H
Sbjct: 307 DISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 349


>gi|167376738|ref|XP_001734125.1| endoplasmic reticulum-golgi intermediate compartment protein
           [Entamoeba dispar SAW760]
 gi|165904489|gb|EDR29705.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba dispar SAW760]
          Length = 361

 Score =  186 bits (473), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 98/292 (33%), Positives = 164/292 (56%), Gaps = 18/292 (6%)

Query: 4   SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
           SGE  +D++ ++ K R+   G+++   +         K +Q       H+   C SCYGA
Sbjct: 86  SGESMIDIEQNVTKIRIHHDGSLVTESEM--------KAIQSKLSTETHDPKECRSCYGA 137

Query: 64  ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
           E+ ++ CC  C++V+EAY+KKGW L + +++ QC+    +Q  +  + EGC + G   +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQNHEKIQMARLTKDEGCRVIGDFLLN 196

Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
           K+ GNFH APG S    G H H++    +   ++SHK N+L+FGEH           +  
Sbjct: 197 KIGGNFHIAPGSSEQSWGRHSHNLEWTGKTQIDLSHKWNELSFGEHSKKFTTEKKDTQM- 255

Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
               + M+QY++ ++P     ++G T     +S+ E+ RS   G  +  PGVF +YD+SP
Sbjct: 256 ----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRS---GEGEGSPGVFVYYDVSP 307

Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           + +  TE +  FLHFL  +C+IVGG+FT   + DA ++    +++KK+E+GK
Sbjct: 308 MVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHSLEKKVELGK 359


>gi|298708525|emb|CBJ49158.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 467

 Score =  186 bits (471), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 118/351 (33%), Positives = 176/351 (50%), Gaps = 66/351 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++G+  +D+ H ++K+RLD  G+ I      +     D P Q         E YCGSC
Sbjct: 127 MDVAGDNQIDIDHGMWKQRLDPDGSAIGEAFMEVPGEVDDDPAQ------SLPEDYCGSC 180

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           +GA+   + CCN C +V +AY  KGW++ +     +QC R+  ++      GEGCN+ GF
Sbjct: 181 FGAK---KGCCNMCRDVVDAYTAKGWSVQDIRRTAEQCIRDNHIE-TPIVNGEGCNLSGF 236

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV-VNPLD 178
           + VNKV+GNFH A G+   + G HVH     Q   FN SH IN L+F E +PG+  NPLD
Sbjct: 237 MSVNKVSGNFHVATGEGVMREGRHVHLYTLEQAVGFNTSHSINLLSFWEPYPGMKPNPLD 296

Query: 179 GVRWT--QETPSGMYQYFIKVVPTVY-----TDVSGHTIQ---------------SNQFS 216
                  ++  +G +QY+IK+VPT++     ++ SG  +                ++QF+
Sbjct: 297 RTSRIIDEDVGTGAFQYYIKLVPTMHSLSPQSEASGSPLPKGKGEEAERQQQSSLTSQFT 356

Query: 217 VTEHFRS--------------------SEQGRLQ-----------TLPGVFFFYDLSPIK 245
            T  FRS                    +E+G  Q            LPGVFF YD+SP  
Sbjct: 357 YTYKFRSLKGLTEYHTDHEEGEEQAKEAEKGLTQDGGVNSIVNSALLPGVFFVYDVSPFM 416

Query: 246 V-TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           V     E   F H L  +CA+ GG F +SGI+D+ ++H    +++   +GK
Sbjct: 417 VEVVPAEQPPFSHLLIRLCAVAGGAFAISGIVDSAVFHLSNRLRRHGVLGK 467


>gi|407034208|gb|EKE37117.1| hypothetical protein ENU1_208770 [Entamoeba nuttalli P19]
          Length = 361

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 99/292 (33%), Positives = 164/292 (56%), Gaps = 18/292 (6%)

Query: 4   SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
           SGE  +D++ ++ K R+   G+++   +         K +Q       H+   C SCYGA
Sbjct: 86  SGESMIDIEQNVTKIRIHHDGSLVTENEM--------KAIQSKLSTETHDPKECRSCYGA 137

Query: 64  ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
           E+ ++ CC  C++V+EAY+KKGW L + +++ QC+    +Q  K  + EGC + G   +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLN 196

Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
           K+ GNFH APG S    G H H++    +   ++SHK N+L+FGE+           +  
Sbjct: 197 KIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGENSKKFTTEKKDTQM- 255

Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
               + M+QY++ ++P     ++G T     +S+ E+ RS   G+ +  PGVF +YD+SP
Sbjct: 256 ----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENTRS---GKGEGQPGVFVYYDVSP 307

Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           + +  TE +  FLHFL  +C+IVGG+FT   + DA ++     +KKK+E+GK
Sbjct: 308 MVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 359


>gi|115388503|ref|XP_001211757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195841|gb|EAU37541.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 438

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 177/354 (50%), Gaps = 73/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGEQ + V H I K RL S    G+V++ +   + +   ++ + +H   L+ N  YC
Sbjct: 89  MDVSGEQQVGVAHGINKVRLASPAEGGHVLDVQALELHS---EQEVAKH---LDPN--YC 140

Query: 58  GSCYGAESSD---EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           G C G        + CCN CEEVREAY +  WA    + I+QC+REG+  RI  +  EGC
Sbjct: 141 GECGGIPQQPGEPKRCCNTCEEVREAYAEHQWAFGKGENIEQCEREGYAARIDAQRREGC 200

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF------QRDSFNISHKINKLAFGE 168
            + G L VNKV GNFH APG+SF    +HVHD+  +        +   ++H I++L FG 
Sbjct: 201 RLEGVLRVNKVVGNFHIAPGRSFSSGNIHVHDLENYFELDQPASEKHTMTHHIHQLRFGP 260

Query: 169 HFPGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---------- 206
             P  +            NPLD      +  +  Y YF+KVV T Y  +           
Sbjct: 261 QLPDELSDRWQWTDHHHTNPLDDTVQETDLAAFNYMYFVKVVSTAYLPLGWDPRVSSYIH 320

Query: 207 ----------------GH--TIQSNQFSVTEHFR------SSEQGRLQTL------PGVF 236
                           GH  +I+++Q+SVT H R      ++++G  + L      PGVF
Sbjct: 321 SASSHNVPLGRHGIGYGHDGSIETHQYSVTSHKRPLMGGNAADEGHKERLHAAAGIPGVF 380

Query: 237 FFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   E    +F  FLT VCAI+GG  TV+  ID  +Y G   +KK
Sbjct: 381 FNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAAIDRGLYEGAIRVKK 434


>gi|298706631|emb|CBJ29569.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 453

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 118/332 (35%), Positives = 166/332 (50%), Gaps = 56/332 (16%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVI-------------------ESRQDGIGAPKIDKP 42
           D  G    D++HD+ + RLDS G  +                   E +Q    A   D+ 
Sbjct: 112 DALGIPQEDLRHDVTRTRLDSIGRALDDGEKHEMGNTLKAVIAKEEEKQAEADASPGDED 171

Query: 43  L---QRHG----GRLEH----------NETYCGSCYGAESSDEDCCNNCEEVREAYRKKG 85
           L    R G    G +E            E  C +CYGA +  E CC  CE+VR+AYR+KG
Sbjct: 172 LDSKSRAGDGGDGDVEQRALEDTATTGQEDEC-NCYGAGAEGE-CCRTCEDVRKAYRRKG 229

Query: 86  WALSNPDLIDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
           W L NP  I  C  E               E EGC + G LEV++  GNFHFAPG   H+
Sbjct: 230 WRL-NPAEIPACAGEALSANSANTMESPPVENEGCRLAGHLEVSRTEGNFHFAPGHRLHR 288

Query: 140 SGVHVH--DILAFQRDSFNISHKINKLAFGEHFP-GVVNP--------LDGVRWTQETPS 188
               +   D +    +SFN +H IN L FG+  P G  +P        L+G + T +   
Sbjct: 289 HANELSFVDRIQVALESFNTTHTINTLTFGDQPPPGHASPKHAVASTVLEGHQKTVQDTH 348

Query: 189 GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 248
            M+QYF+++VPTVY   +G T+ SNQ+S TEH +    G  + LPGV+F+Y++SP++   
Sbjct: 349 AMHQYFLQLVPTVYRLDNGETVHSNQYSATEHLKHVHDGTSRGLPGVYFYYEVSPVQALV 408

Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            E+   FL FLT  C +VGGV+T+ G+++  I
Sbjct: 409 EEKRKGFLAFLTGACGVVGGVYTILGLVNTGI 440


>gi|342183032|emb|CCC92512.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 401

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 117/319 (36%), Positives = 166/319 (52%), Gaps = 33/319 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  GE   D+  D  K R+DS         D +      +PL     +   +   C SC
Sbjct: 90  IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
           YGAE +  DCC+ C++VR A+ ++ W     D+ I QC +E           EGCN++  
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
             V +V GN HF PG+ F+  G H+H          N+SH I+ L FGE FPG  NPLDG
Sbjct: 201 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIQRLNLSHIIHTLEFGERFPGQKNPLDG 260

Query: 180 VRWTQ--ETPS----GMYQYFIKVVPTVY----TDVSGHTIQSNQFSVTEHFRSS----- 224
           +  T+  E PS    G + YF+KVVPT+Y       SG  ++SNQ+SVT HF +S     
Sbjct: 261 MVNTRGVENPSEDLIGRFAYFVKVVPTLYQVRTLMSSGRVVESNQYSVTHHFTASWDAAD 320

Query: 225 ------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
                      + +PGVF  YD+SPI+V+    H   S +H +  +CA+ GGV+TV G+I
Sbjct: 321 QNNQTNRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLI 380

Query: 277 DAFIYHGQRAIKKKIEIGK 295
           D+  +H  R +++KI  GK
Sbjct: 381 DSMFFHSIRRVQEKINRGK 399


>gi|342183042|emb|CCC92522.1| unnamed protein product [Trypanosoma congolense IL3000]
 gi|343474271|emb|CCD14057.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 401

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 117/319 (36%), Positives = 166/319 (52%), Gaps = 33/319 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  GE   D+  D  K R+DS         D +      +PL     +   +   C SC
Sbjct: 90  IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
           YGAE +  DCC+ C++VR A+ ++ W     D+ I QC +E           EGCN++  
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
             V +V GN HF PG+ F+  G H+H          N+SH I+ L FGE FPG  NPLDG
Sbjct: 201 FSVPRVTGNIHFVPGRMFNFFGQHLHSFKGETIQRLNLSHIIHTLEFGERFPGQKNPLDG 260

Query: 180 VRWTQ--ETPS----GMYQYFIKVVPTVY----TDVSGHTIQSNQFSVTEHFRSS----- 224
           +  T+  E PS    G + YF+KVVPT+Y       SG  ++SNQ+SVT HF +S     
Sbjct: 261 MVNTRGVENPSEDLIGRFAYFVKVVPTLYQVKTLMSSGRVVESNQYSVTHHFTASWDAAD 320

Query: 225 ------EQGRLQTLPGVFFFYDLSPIKVTFTEEH--VSFLHFLTNVCAIVGGVFTVSGII 276
                      + +PGVF  YD+SPI+V+    H   S +H +  +CA+ GGV+TV G+I
Sbjct: 321 QNNQTNRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLI 380

Query: 277 DAFIYHGQRAIKKKIEIGK 295
           D+  +H  R +++KI  GK
Sbjct: 381 DSMFFHSIRRVQEKINRGK 399


>gi|367019108|ref|XP_003658839.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
           42464]
 gi|347006106|gb|AEO53594.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
           42464]
          Length = 436

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 121/357 (33%), Positives = 164/357 (45%), Gaps = 83/357 (23%)

Query: 1   MDISGEQHLDVKHDIFKKRL----------DSQGNVIESRQDGIGAPKIDKPLQRHGGRL 50
           MD+SGEQ   V+H I K RL          DS+  V+ SR +                 +
Sbjct: 89  MDVSGEQQHGVQHGITKTRLRPLSEGGGDIDSKEIVLHSRDEAA---------------V 133

Query: 51  EHNETYCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI 106
             +  YCG CYGA   +      CCN C+EVR+AY +  WA    + I QC+RE + +++
Sbjct: 134 HLDPNYCGECYGAPPPNNAKKPGCCNTCDEVRDAYAQASWAFGRGEGIVQCEREHYSEKL 193

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKL 164
             +  EGC I G L VNKV GNFH APG+SF    +HVHD+  +         +H I+ L
Sbjct: 194 DAQRNEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVHDLKNYWDSPTKHTFTHTIHHL 253

Query: 165 AFGEHFPGV----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
            FG   P                  VNPLD      +  +  Y YF+K+VPT Y  +   
Sbjct: 254 RFGPQLPESLTQKLGTKNLPWTNHHVNPLDDTHQQTDDVNYNYMYFLKIVPTSYLPLGWE 313

Query: 209 -----------------------TIQSNQFSVTEHFRSSEQGRLQT------------LP 233
                                  +++++Q+SVT H RS   G                +P
Sbjct: 314 KTWAGFRERHSAELGSFGTSPDGSVETHQYSVTSHKRSLAGGNDAAEGHQERQHARGGIP 373

Query: 234 GVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           GVFF YD+SP+KV   EE   SFL FL  +CAIVGG  TV+  ID  ++ G   +KK
Sbjct: 374 GVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLTVAAAIDRALFEGTVRLKK 430


>gi|449299159|gb|EMC95173.1| hypothetical protein BAUCODRAFT_529716 [Baudoinia compniacensis
           UAMH 10762]
          Length = 435

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 171/351 (48%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGE    V H + K RL   G     R+ G  A ++ K ++     ++H +  YCG 
Sbjct: 89  MDVSGEVQTGVMHGVNKVRLGEDG-----REVGREALELGKEVEE---SMKHMDPEYCGE 140

Query: 60  CYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CYGA +        CCN C EVREAY    W+    + ++QC+RE + + + E+  EGC 
Sbjct: 141 CYGAPAPGNAIRAGCCNTCAEVREAYASVSWSFGRGENVEQCEREHYSEHLDEQRREGCR 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI----SHKINKLAFGEHFP 171
           I G + VNKV GNFHFAPGKSF    +HVHD+  +      I    SH I+ L FG   P
Sbjct: 201 IEGGIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYFAGGEGIDHTFSHTIHHLRFGPQLP 260

Query: 172 GVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVY------------- 202
             V                NPLD      +  +  Y YF+KVV T Y             
Sbjct: 261 EDVVRRIGRRGMAWSNHHLNPLDETEQKTDEKAYNYMYFVKVVSTAYLPLGWERTGSILD 320

Query: 203 -----TDVSGH------TIQSNQFSVTEHFRS------SEQGRLQTL------PGVFFFY 239
                 ++ G+      +++++Q+SVT H RS       E+G  + L      PGVFF Y
Sbjct: 321 IPHELVELGGYGKGEAGSVETHQYSVTSHKRSLAGGDGGEEGHKERLHARGGIPGVFFSY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   E    SF  FL  VCA++GG  TV+  ID  +Y G + +KK
Sbjct: 381 DISPMKVINREARSKSFSGFLVGVCAVIGGTLTVAAAIDRALYEGGQRVKK 431


>gi|156844136|ref|XP_001645132.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156115789|gb|EDO17274.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 405

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 112/324 (34%), Positives = 166/324 (51%), Gaps = 44/324 (13%)

Query: 1   MDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SG+  LDV ++   K RLD  G V+E+          D  + +  G    +  YCG 
Sbjct: 89  MDDSGDLQLDVLEYGFTKTRLDPDGKVLETD---------DFDMYKQDGAPSTDPNYCGP 139

Query: 60  CYGA---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYG+         E+S+  CC  CE+VR+AY K GWA  +   I+QC++EG++++I    
Sbjct: 140 CYGSIDQSKNDEVEASERVCCQTCEDVRKAYVKAGWAFYDGKGIEQCEQEGYVKKINSHL 199

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
            EGC + G   +N++ GN HFAPGKSF     H HD   ++R+   N +H I+  +FG+ 
Sbjct: 200 NEGCRVAGSASLNRIQGNIHFAPGKSFQTVRGHFHDQSLYERNPQLNFNHIIHHFSFGKE 259

Query: 170 FP---------GVVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
            P          +VNPLDG     E  + ++Q  Y+ K+VPT +  ++   + + QFS T
Sbjct: 260 IPTKLASRHSKNIVNPLDGRSVAPERDTHLHQFSYYTKIVPTRFEYLNKAVVDTAQFSAT 319

Query: 219 EHFRSSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEHV--SFLHFLTNVCAIV 266
            H R    G              +PGVFFF+D SPIKV   +E++  S+  F  N    +
Sbjct: 320 YHDRPLRGGADDDHPNTFHFRSGIPGVFFFFDASPIKV-INKEYISGSWSSFFLNCITSI 378

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
           GGV  V  ++D  +Y  QR+   K
Sbjct: 379 GGVLAVGSMLDRLMYKAQRSFLGK 402


>gi|347842451|emb|CCD57023.1| similar to endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Botryotinia fuckeliana]
          Length = 439

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 127/354 (35%), Positives = 165/354 (46%), Gaps = 74/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGR---LEHNETY 56
           MD+SGEQ + V H + K RL  Q           G   ID K L  H         +  Y
Sbjct: 89  MDVSGEQQVGVMHGVKKVRLGPQEE---------GGKVIDIKALDLHNAEDSATHLDPNY 139

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG+CYGA     +    CCN C+EVREAY    WA    + ++QC+RE + +R+  +  E
Sbjct: 140 CGACYGATPPPNAQKPGCCNTCDEVREAYASVSWAFGRGENVEQCEREHYGERLDSQRKE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGE 168
           GC I G L VNKV GNFH APG+SF    +HVHD+  F           SH I+ L FG 
Sbjct: 200 GCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLNNFFDTPVPGGHVFSHHIHSLRFGP 259

Query: 169 HFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS----- 206
             P  V                 NPLD         +  + YF+KVV T Y  +      
Sbjct: 260 ELPEEVFKKLGSDSIIPWTNHHLNPLDNTEQITHEAAYNFMYFVKVVSTSYLPLGWETNY 319

Query: 207 --------------GH----TIQSNQFSVTEHFRS------SEQGRLQTL------PGVF 236
                         GH    +I+++Q+SVT H RS      S +G  + L      PGVF
Sbjct: 320 NSRPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSLNGGDDSAEGHKEKLHARGGIPGVF 379

Query: 237 FFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   EE    L  FLT +CAIVGG  TV+  +D  +Y G   ++K
Sbjct: 380 FSYDISPMKVINKEERTKTLAGFLTGLCAIVGGTLTVAAAVDRGVYEGATRLRK 433


>gi|225562998|gb|EEH11277.1| COPII coated vesicle component Erv46 [Ajellomyces capsulatus
           G186AR]
 gi|240279818|gb|EER43323.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H143]
 gi|325092948|gb|EGC46258.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H88]
          Length = 435

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 166/348 (47%), Gaps = 64/348 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE    V H + K RL S    +E     +    +    Q + G  + +  YCG C
Sbjct: 89  MDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDITALQLHSQTNKG-TDVDPDYCGQC 143

Query: 61  YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           YGA     +    CCN CEEVR+AY  KGWA    + ++QC++EG+   +  +  EGC +
Sbjct: 144 YGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGENVEQCEKEGYSANLDAQRKEGCRV 203

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV 174
            G + VNKV GNFH APG+SF    +H HD+  +       N+ H+I+ L FG   P  +
Sbjct: 204 EGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYHTPVQHNMGHRIHYLRFGPQLPEQL 263

Query: 175 ------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------SGH------ 208
                       NPLD        P   + YF+KVV T Y  +        S H      
Sbjct: 264 SSRWKWTDNHHTNPLDNTEQHTTNPRFNFMYFVKVVSTSYLPLGWDPDASSSAHSQYSKN 323

Query: 209 --------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLS 242
                         +I+++Q+SVT H RS + G         RL +   +PGVF  YD+S
Sbjct: 324 APLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDSAEGHKERLHSQGGIPGVFVNYDIS 383

Query: 243 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           P+KV   E    +F  FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 384 PMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAIDRVLYEGAVRVKK 431


>gi|225680824|gb|EEH19108.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb03]
          Length = 413

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 168/352 (47%), Gaps = 72/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGE    V H I K RL  +   G+VI++             L       +H +  Y
Sbjct: 67  MDVSGEMQSGVIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 117

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     ++   CC+ CEEVREAY  + WA    + ++QC+REG+ + +  +  E
Sbjct: 118 CGPCYGAPPPSHATKPGCCSTCEEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 177

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    +H HD+  +       ++SHKI++L FG   
Sbjct: 178 GCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAHDLDTYYHTPVPHHMSHKIHQLRFGPQL 237

Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
              +            NPLD        P   + YF+KVV T Y  +             
Sbjct: 238 SDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 297

Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                          S  +I+++Q+SVT H RS + G         RL +   +PGVF  
Sbjct: 298 TLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSHGGIPGVFVN 357

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   E    +F  FLT VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 358 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGAARVKK 409


>gi|295672798|ref|XP_002796945.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226282317|gb|EEH37883.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 435

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 168/352 (47%), Gaps = 72/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGE    V H I K RL  +   G+VI++             L       +H +  Y
Sbjct: 89  MDVSGEMQSGVIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 139

Query: 57  CGSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     ++   CC+ CEEVREAY  + WA    + ++QC+REG+ + +  +  E
Sbjct: 140 CGPCYGAPPPPHATKPGCCSTCEEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN--ISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    +H HD+  +        ++HKI++L FG   
Sbjct: 200 GCRIEGVLRVNKVIGNFHIAPGRSFSNGNLHAHDLDTYYHTPVPHYMAHKIHQLRFGPQL 259

Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
           P  +            NPLD        P   + YF+KVV T Y  +             
Sbjct: 260 PDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 319

Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                          S  +I+++Q+SVT H RS + G         RL +   +PGVF  
Sbjct: 320 TLRDTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSQGGIPGVFVN 379

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   E    +F  FLT VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 380 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGAVRVKK 431


>gi|154280410|ref|XP_001541018.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150412961|gb|EDN08348.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 435

 Score =  183 bits (465), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 166/348 (47%), Gaps = 64/348 (18%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MDISGE    V H + K RL S    +E     +    +    Q + G  + +  YCG C
Sbjct: 89  MDISGEYQTGVIHGVNKVRLSS----VEEGGRVLDITALQLHSQTNKG-TDVDPDYCGQC 143

Query: 61  YGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           YGA     +    CCN CEEVR+AY  KGWA    + ++QC++EG+   +  +  EGC +
Sbjct: 144 YGATPPSNAKKPGCCNTCEEVRDAYAAKGWAFGRGENVEQCEKEGYSANLDAQRKEGCRV 203

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV 174
            G + VNKV GNFH APG+SF    +H HD+  +       N+ H+++ L FG   P  +
Sbjct: 204 EGVIRVNKVVGNFHIAPGRSFTNGNLHAHDLDNYYHTPVQHNMGHRVHYLRFGPQLPEEL 263

Query: 175 ------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV--------SGH------ 208
                       NPLD        P   + YF+KVV T Y  +        S H      
Sbjct: 264 SSRWKWTDNHHTNPLDNTEQHTTNPRFNFIYFVKVVSTSYLPLGWDPDASSSAHSKYSKN 323

Query: 209 --------------TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLS 242
                         +I+++Q+SVT H RS + G         RL +   +PGVF  YD+S
Sbjct: 324 APLGKQGLSFGSYGSIETHQYSVTSHKRSVDGGDDSAEGHKERLHSQGGIPGVFVNYDIS 383

Query: 243 PIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           P+KV   E    SF  FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 384 PMKVINREARTKSFSGFLTGVCAVIGGTLTVAAAIDRVLYEGAVRVKK 431


>gi|452980033|gb|EME79795.1| hypothetical protein MYCFIDRAFT_64499 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 436

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 171/353 (48%), Gaps = 73/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGN---VIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE    V H I K RL S  +   VIE ++  + A +    L            YC
Sbjct: 89  MDVSGEVQTGVLHGINKVRLSSVADGSKVIEKQKLDLDAAENSVHLA---------PDYC 139

Query: 58  GSCYGAESSDED----CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA + D      CCN C EVR+AY    W+    + ++QC+RE + +++  +  EG
Sbjct: 140 GECYGAPAPDNAKKAGCCNTCAEVRDAYASVSWSFGRGENVEQCEREHYSEQLDAQRKEG 199

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLAFGEHF 170
           C I G L VNKV GNFHFAPGKSF    +HVHD+  +        + +H I++L FG   
Sbjct: 200 CRIEGALRVNKVVGNFHFAPGKSFSNGNLHVHDLDNYFNSGEVEHSFTHHIHRLRFGPPL 259

Query: 171 P----------GV------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-------- 206
           P          G+      +NPLD      +  +  + YF+KVV T Y  +         
Sbjct: 260 PHDFDKRVGKKGMAWSNHHLNPLDDTHQETDDSAFNFMYFVKVVSTAYLPLGWEKTNSFS 319

Query: 207 -------------GH----TIQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
                        GH    +I+++Q+SVT H RS + G  +             +PGVFF
Sbjct: 320 RSLPHELIDLGDYGHGEQGSIETHQYSVTSHKRSLQGGDAKDEGHKERVHARGGIPGVFF 379

Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   E    SF  FL  VCA++GG  TV+  +D  +Y G++ ++K
Sbjct: 380 SYDISPMKVINRETRAKSFSGFLVGVCAVIGGTLTVAAAVDRMLYEGEQRVRK 432


>gi|254569250|ref|XP_002491735.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv41p [Komagataella pastoris GS115]
 gi|238031532|emb|CAY69455.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv41p [Komagataella pastoris GS115]
 gi|328351763|emb|CCA38162.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Komagataella pastoris CBS 7435]
          Length = 401

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 106/320 (33%), Positives = 165/320 (51%), Gaps = 33/320 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYCG 58
           MDI+G+  +D+    F+K     G   E+ +  +   K      +   +L   +N  YCG
Sbjct: 88  MDITGDLQIDLLMSGFQKTRVVDGLAKETTELRVNEYK------QENNKLTNSNNPYYCG 141

Query: 59  SCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           SCYGA +  ++         CCN CE V++AY K GWA  +   I+QC+ EG++Q +   
Sbjct: 142 SCYGALNQKDNENKPFDEKLCCNTCESVKKAYAKAGWAFYDGRNIEQCENEGYVQLVTSM 201

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
             EGC + G  ++N+V+GN HFAPG S      H+HD+  F++  D FN  H +N L+FG
Sbjct: 202 VDEGCQVSGTAQINRVSGNLHFAPGSSLTSGSRHIHDLSLFEKYPDKFNFDHTVNHLSFG 261

Query: 168 EHFPG---VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
           +         +PLDG        + +Y YF+KVV T Y  +SG    +NQFS T H R  
Sbjct: 262 KTIDNQEMSTHPLDGYEAATGNKNHLYSYFLKVVATRYESMSGLKWDTNQFSATYHDRPL 321

Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
           E GR             +PG FF +++SP+K+   E++  +   F   V A V GV T+ 
Sbjct: 322 EGGRDSDHPNTLHASGGIPGAFFHFEISPLKIINREQYSKTRSAFALGVSASVAGVLTLG 381

Query: 274 GIIDAFIYHGQRAIKKKIEI 293
            ++D  I+   + +++K ++
Sbjct: 382 SVLDKTIWTADQILRQKKDL 401


>gi|320583549|gb|EFW97762.1| COPII-coated vesicle membrane protein Erv46, putative [Ogataea
           parapolymorpha DL-1]
          Length = 400

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 110/318 (34%), Positives = 160/318 (50%), Gaps = 37/318 (11%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SG+  LD+    F K RLD QGN I     G    ++++           + TYCGS
Sbjct: 89  MDQSGDMQLDLLSSGFSKIRLDRQGNEI-----GQENMRVNQEF----ALTSSDPTYCGS 139

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYGA     +         CCN+CE V++AY +  W   +   I+QC++EG++ RI    
Sbjct: 140 CYGAADQSRNDELPQDQKVCCNSCESVKQAYARNAWKFYDGKDIEQCEKEGYVDRINARL 199

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGE 168
            EGC + G  E+ ++ GN HFAPG S + +  HVHD+  +   S  FN  H IN  +FG 
Sbjct: 200 DEGCRVRGTAEIARIGGNLHFAPGSSMNFNEKHVHDLSLYDMHSNKFNFDHTINHFSFGL 259

Query: 169 HFPGVVN-----PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
               V +     PLD           +Y YF+KVV T Y  + G  +++NQFS T+H R 
Sbjct: 260 DDHSVADYKTTHPLDATTHRDGRKYHVYSYFLKVVNTRYEFLDGRKVETNQFSATQHDRP 319

Query: 224 SEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTV 272
              GR +           LPGVFF +++SP+K+   E++  ++  F    CA + GV TV
Sbjct: 320 FRGGRDEDHPNTIHAQGGLPGVFFHFEISPLKIINREQYNKTWSAFALGACAAISGVLTV 379

Query: 273 SGIIDAFIYHGQRAIKKK 290
             ++D  I+   R +K K
Sbjct: 380 FTLLDRTIWAANRMLKDK 397


>gi|170586880|ref|XP_001898207.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
           putative [Brugia malayi]
 gi|158594602|gb|EDP33186.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
           putative [Brugia malayi]
          Length = 341

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 106/266 (39%), Positives = 149/266 (56%), Gaps = 19/266 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQD-GIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SG+   D+K D++K  L    +GN I  RQ   I    +          +  ++  C
Sbjct: 90  MDLSGDNQDDIKDDVYKISLLNGKEGNGI--RQGVNINTTTV--------SSVPASQILC 139

Query: 58  GSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIY 117
           GSCYGA+   + CCN CEEV+EAY KKGW L N + ++QCK + +++++ E + EGC +Y
Sbjct: 140 GSCYGAK---DGCCNTCEEVKEAYIKKGWELVNIETVEQCKSDLWVKKMNEHKNEGCRVY 196

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G ++V KVAGNFH APG        H HD+ +     F+ SH +N L+FG  FPG V PL
Sbjct: 197 GKVQVAKVAGNFHIAPGDPLKAHRSHFHDLHSLSPSKFDTSHTVNHLSFGNSFPGKVYPL 256

Query: 178 DGVRWTQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           DG  +     SG MYQY +K+VPT Y  + S   I S+ FSVT + +   QG    LPG 
Sbjct: 257 DGKFFGSAKDSGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGF 315

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTN 261
           F  Y+ SP+ V + E     +  + N
Sbjct: 316 FIQYEFSPLMVKYEERRQYVVTIILN 341


>gi|448081831|ref|XP_004194985.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
 gi|359376407|emb|CCE86989.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
          Length = 405

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 178/322 (55%), Gaps = 35/322 (10%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGN--VIESR---QDGIGAPKIDKPLQRHGGRLEHNE 54
           +D+SG+   DV    F+K RL    N  V+++    ++ +    I +   + GG      
Sbjct: 90  LDVSGDTQADVLKSGFEKYRLIPSSNEEVLDNAPVLRNDLSLEDIARNPNKEGG------ 143

Query: 55  TYCGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EE 110
            +CGSCYGA  +  +E CCN+CE VR AY ++ WA  +   I+QC+ EG++ R+ +  E+
Sbjct: 144 GFCGSCYGALPQGDNEYCCNDCETVRLAYAERMWAFYDGANIEQCENEGYVTRLNQRIEQ 203

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG- 167
            EGC I G  ++N+V+GN HFAPG +    G H+HD+  +++  D FN  H IN L+FG 
Sbjct: 204 KEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDLSLYEKHFDKFNFDHVINHLSFGL 263

Query: 168 ---EHFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
              +  P     +PLDG R      S +  Y++KVV T +  +SG  +++NQFS   H R
Sbjct: 264 DPVKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVATRFEFLSGLAMETNQFSAIPHHR 323

Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 271
               G+ +           +PGVFF +D+SP+K+   E++  ++  F+  V + + GV T
Sbjct: 324 PYRGGKDEDHRHTMHAKGGIPGVFFHFDISPMKIINKEQYAKTWSGFVLGVVSSIAGVLT 383

Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
           V  ++D  ++  ++AIK K +I
Sbjct: 384 VGAVLDRSVWAAEKAIKSKKDI 405


>gi|402590490|gb|EJW84420.1| hypothetical protein WUBG_04668 [Wuchereria bancrofti]
          Length = 341

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 104/265 (39%), Positives = 146/265 (55%), Gaps = 17/265 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRL--DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           MD+SG+   D+K D++K  L    +GN I    +         P          ++  CG
Sbjct: 90  MDLSGDNQDDIKDDVYKISLLNGKEGNGIRQGVNINTTTVSSAP---------ASQILCG 140

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGA+   + CCN CEEV+EAY KKGW L N + ++QCK + +++++ E + EGC +YG
Sbjct: 141 SCYGAK---DGCCNTCEEVKEAYIKKGWELVNIETVEQCKSDLWVKKMNEHKNEGCRVYG 197

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            ++V KVAGNFH APG        H HD+ +     F+ SH +N L+FG  FPG V PLD
Sbjct: 198 KVQVAKVAGNFHIAPGDPLKAHRSHFHDLHSLSPSKFDTSHTVNHLSFGNSFPGKVYPLD 257

Query: 179 GVRWTQETPSG-MYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
           G  +     SG MYQY +K+VPT Y  + S   I S+ FSVT + +   QG    LPG F
Sbjct: 258 GKFFGSAKDSGIMYQYHLKLVPTSYVFLDSTRNIFSHLFSVTTYQKDISQGA-SGLPGFF 316

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTN 261
             Y+ SP+ V + E     +  + N
Sbjct: 317 IQYEFSPLMVKYEERRQYVVTIILN 341


>gi|398398231|ref|XP_003852573.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
 gi|339472454|gb|EGP87549.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
          Length = 435

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 165/359 (45%), Gaps = 85/359 (23%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET----- 55
           MD+SG+    V H I K RL  +           G   IDK      GRL+ NE      
Sbjct: 89  MDVSGDVQTGVLHGIVKTRLKPESE---------GGGDIDK------GRLQVNEVEEAAK 133

Query: 56  -----YCGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI 106
                YCG CYGA     +    CCN C EVREAY    W+    + ++QC RE + + +
Sbjct: 134 HLARDYCGDCYGAPPPANAIKSGCCNTCAEVREAYASVSWSFGRGENVEQCTREHYSEHL 193

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKL 164
            E+  EGC + G + VNKV GNFHFAPGKSF    +HVHD+  +         SH I+ L
Sbjct: 194 DEQRKEGCRVDGVIRVNKVVGNFHFAPGKSFSNGNMHVHDLENYLTGGGDHTPSHIIHHL 253

Query: 165 AFGEHFPGV-----------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS- 206
            FG   P                   ++PLDG R      +  Y YF+KVVPT Y  +  
Sbjct: 254 RFGPLLPESYKHRVRDTERHWSNNHHLSPLDGFRQETNEKAYNYMYFVKVVPTAYLPLGY 313

Query: 207 -----------------------GHTIQSNQFSVTEHFR------SSEQGRLQTL----- 232
                                  G +I+++Q+SVT H R      ++++G  + L     
Sbjct: 314 ENLPSVGDYPHEHAHVGEYGISHGSSIETHQYSVTSHKRHLGGGDANDEGHKERLHARGG 373

Query: 233 -PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            PGVFF YD+SP+KV   E    SF  FL  +C ++GG  TV+  +D   + G + +KK
Sbjct: 374 IPGVFFSYDISPMKVIDREVRAKSFSSFLVGICGVLGGTLTVAAAVDRIWFEGTQRVKK 432


>gi|61555552|gb|AAX46728.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
          Length = 283

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 89/164 (54%), Positives = 112/164 (68%), Gaps = 11/164 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
           YGFLEVNKVAGNFHFAPGKSF QS VHVHD+ +F  D+     K
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNVRTRWK 245


>gi|407929248|gb|EKG22082.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
          Length = 442

 Score =  181 bits (458), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 120/358 (33%), Positives = 169/358 (47%), Gaps = 77/358 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGE    V H + K RL  +     SR   + A      L  H     H +  YCG 
Sbjct: 89  MDVSGEIQTGVMHGVNKVRLTPENE--GSRPIEVNA------LNLHADEASHMDPDYCGE 140

Query: 60  CYGAES----SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CYGA +        CCN C++VR+AY    W+ +  D ++QC+RE + +++  +  EGC 
Sbjct: 141 CYGAPAPTTAKKPGCCNTCDDVRDAYAAISWSFTRGDGVEQCEREHYGEKLDAQRREGCR 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
           + G + VNKV GNFHFAPGKSF    +HVHD+  + +D    + +H+++ L FG   P  
Sbjct: 201 VEGGIRVNKVIGNFHFAPGKSFSNGNMHVHDLENYFKDGAPHSFTHQVHSLRFGPQLPDD 260

Query: 174 V--------------------NPLDGVRWTQETPSGMYQYFIKVVPTVY----------T 203
           V                    NPLD      +  +  + YF+KVV T Y          +
Sbjct: 261 VIAKLEASGMSASSLWTNHHINPLDNTEQRTDEKAFNFMYFVKVVSTAYLPLGWENKGSS 320

Query: 204 DVSG-------------------HTIQSNQFSVTEHFRSSEQG---------RLQT---L 232
            +SG                    +I+++Q+SVT H RS   G         RL     +
Sbjct: 321 SLSGLLPDADRAPLGSYGLASGEGSIETHQYSVTSHKRSLAGGNDEKDGHKERLHARGGI 380

Query: 233 PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           PGVFF YD+SP+KV   E    SF  FL  VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 381 PGVFFSYDISPMKVINRESRAKSFSGFLVGVCAVIGGTLTVAAAIDRALYEGSTKLKK 438


>gi|148674215|gb|EDL06162.1| ERGIC and golgi 3, isoform CRA_b [Mus musculus]
          Length = 269

 Score =  180 bits (456), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 88/164 (53%), Positives = 115/164 (70%), Gaps = 3/164 (1%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 100 MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 156

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 157 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 216

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           EVNKVAGNFHFAPGKSF QS VHVH +      SF + +  + L
Sbjct: 217 EVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNPSDCL 260


>gi|451849936|gb|EMD63239.1| hypothetical protein COCSADRAFT_38106 [Cochliobolus sativus ND90Pr]
          Length = 437

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 118/353 (33%), Positives = 169/353 (47%), Gaps = 72/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETYCGS 59
           MD+SGE  + V H I K RL  +       ++G    +I K L  H     H    YCG 
Sbjct: 89  MDVSGELQMGVTHGINKVRLSPE-------REGSKTIEI-KALDLHADEASHLAPDYCGE 140

Query: 60  CYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           C+GA     +    CCN C+EVR+AY    W+    + ++QC+RE + + + E+  EGC 
Sbjct: 141 CFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGCR 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGV 173
           + G + VNKV GNFH APGKSF    +HVHD+  + +D +    +HKI++L FG     V
Sbjct: 201 LEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENYFKDEYAHTFTHKIHQLRFGPQLSDV 260

Query: 174 V---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT--------- 203
           V                     NPLD      +  +  + YFIKVV T Y          
Sbjct: 261 VIQGIQDKHRGSGPGSWSNHHINPLDNTEQHTDEKAFNFMYFIKVVSTAYLPLGWEDAAP 320

Query: 204 ------DVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVFF 237
                 ++ G T        I+++Q+SVT H R+ + G  +             +PGVFF
Sbjct: 321 RLTKHDELLGSTIDATHKGSIETHQYSVTSHKRNLKGGNDEKDGHKERVHARGGIPGVFF 380

Query: 238 FYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   E    +F  FL  +CA++GG  TV+  +D  +Y G   IKK
Sbjct: 381 SYDISPMKVINREVREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNRIKK 433


>gi|444314203|ref|XP_004177759.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
 gi|387510798|emb|CCH58240.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
          Length = 406

 Score =  179 bits (455), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 111/322 (34%), Positives = 168/322 (52%), Gaps = 42/322 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD +GE  LD+    F K RLDS+GN + +    +     + P          ++ YCG 
Sbjct: 88  MDDAGEIQLDILSSGFTKTRLDSRGNELGTFDFDLSKDISEYP--------PDDDKYCGP 139

Query: 60  CYGA--ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           CYGA  +S+++D        CC  C +VR+AY   GWA  +   I+QC+REG++QRI + 
Sbjct: 140 CYGALDQSNNKDDMPMDEKVCCQTCADVRQAYLNAGWAFFDGKDIEQCEREGYVQRINDH 199

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGE 168
             EGC I G   +N++ GN HFAPG +F     H HD   + + +    +H IN L+FG+
Sbjct: 200 LNEGCRIQGNARLNRIHGNVHFAPGLAFQNRRGHYHDTSLYDKKTELTFNHIINHLSFGK 259

Query: 169 HF-PGV--------VNPLDGVRWT-QETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           H  PG+        V+PLDG +    + P  + + YF K+VPT Y  +    I++ QFS 
Sbjct: 260 HVKPGIGSKFSAASVSPLDGHQMILNDDPHNVQFIYFAKIVPTRYEYLDKDVIETAQFST 319

Query: 218 TEHFR----------SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
           T H +          + +  R    PG++  Y++SP+KV   E+HV +++ F+ N    +
Sbjct: 320 TTHSKALNNLADDKTTPKPSRRSGTPGLYINYEMSPLKVINREQHVQTWVSFILNCLTSI 379

Query: 267 GGVFTVSGIIDAFIYHGQRAIK 288
           GGV  V  +ID   Y  QR I+
Sbjct: 380 GGVLAVGTVIDKIFYRAQRTIQ 401


>gi|401839164|gb|EJT42494.1| ERV46-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 415

 Score =  179 bits (455), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 117/337 (34%), Positives = 166/337 (49%), Gaps = 59/337 (17%)

Query: 1   MDISGEQHLDVKHDIFK-KRLDSQGNVIESRQ------DGIG-APKIDKPLQRHGGRLEH 52
           MD SGE  LD+    F   RLD +G  +          DG G AP  D P          
Sbjct: 88  MDDSGEMQLDILDAGFTMTRLDKEGRPVGDAAELQVGGDGDGVAPVNDDP---------- 137

Query: 53  NETYCGSCYGAES---------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
              YCG CYGA           +D+ CC +C+ VR AY   GWA  +   I+QC+REG++
Sbjct: 138 --NYCGPCYGARDQTQNENLAQADKVCCQDCDAVRSAYLDAGWAFFDGKNIEQCEREGYV 195

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKIN 162
            +I E   EGC I G  ++N++ GN HFAPG+ F  +  H HD+  +++    N +H IN
Sbjct: 196 SKINEHLHEGCRIEGSAQINRIQGNIHFAPGRPFQNANGHFHDVSLYEKTPDLNFNHMIN 255

Query: 163 KLAFGE--------------HFPGVV--NPLDGVRWTQE--TPSGMYQYFIKVVPTVYTD 204
            L+FG+              H   V+  +PLDG +   E  T S ++ YF K+VPT Y  
Sbjct: 256 HLSFGKPIESRNKLLENDDRHGGAVIATSPLDGRKVFPERTTHSHLFSYFAKIVPTRYEY 315

Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-V 253
           +    I++ QFS T H R    GR Q           +PG+F F+++SP+KV   E+H  
Sbjct: 316 LDDVVIETAQFSATYHSRPLRGGRDQDHPNTFHARGGIPGLFVFFEMSPLKVINKEQHGQ 375

Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           ++  F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 376 TWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|291001965|ref|XP_002683549.1| predicted protein [Naegleria gruberi]
 gi|284097178|gb|EFC50805.1| predicted protein [Naegleria gruberi]
          Length = 391

 Score =  179 bits (455), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 103/302 (34%), Positives = 154/302 (50%), Gaps = 34/302 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPK------IDKPLQRHGGRLE-HN 53
           +D SG+  +DV H I K  +DS G +       + +PK       + P  ++    + H+
Sbjct: 91  VDASGDAAIDVAHHIHKVPVDSSGRITH-----LESPKHKTKLGTEMPQDKYDPTKDPHS 145

Query: 54  ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
             YCG+CY  E    +CCN C++V E Y++ G      + ++QC  +        +   G
Sbjct: 146 IMYCGTCY-VEQRRGECCNTCQDVMEVYKRNGLPAPRVEDVEQCLFDA------SKNHPG 198

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGV----HVHDILAFQRDSFNISHKINKLAFGEH 169
           CNIYG L+V KV GNFHF PG+SF Q       H+H+      D +N +H I+ L+FG  
Sbjct: 199 CNIYGTLDVQKVNGNFHFLPGRSFSQEYETRVHHIHEFNPILVDRYNSTHIIHSLSFGLR 258

Query: 170 FPGVVNPLDGV---------RWTQETPSGMYQYFIKVVPTVYTDVS--GHTIQSNQFSVT 218
            P V  PLD              Q   + +++YFIK VPT Y   S    TI + QFS T
Sbjct: 259 IPHVTYPLDETVGIIPKIEESDAQAPKTALFKYFIKAVPTTYIGSSYFSSTINTYQFSFT 318

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
           +H    +  ++  LPGVFF Y+  PI++T+ E  + F HF+ ++ A+  G+F V   IDA
Sbjct: 319 KHVMPFDSSKMMMLPGVFFVYNFEPIRITYEENGMPFTHFIVDLMAVCAGIFVVLNYIDA 378

Query: 279 FI 280
            +
Sbjct: 379 LL 380


>gi|323454843|gb|EGB10712.1| hypothetical protein AURANDRAFT_2571, partial [Aureococcus
           anophagefferens]
          Length = 380

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 150/301 (49%), Gaps = 45/301 (14%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVI----------------ESRQDGIGAPKIDKPLQR 45
           D SG+    V+  + K RLD+ G  +                 + ++ + AP   KP   
Sbjct: 91  DESGQPLEGVQQHVIKTRLDTNGRRVLVNRKAANSVHKVGDTATSEEHLAAPDEAKP--- 147

Query: 46  HGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
                   E  CG CYGA+  +  CC  C++VR AYRK+GW   +   + QC  E     
Sbjct: 148 --------EVACGDCYGAQDDERPCCATCDDVRSAYRKRGWTF-HEHTVAQCAGELAEAA 198

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV-HVHDILAFQRDSFNISHKINKL 164
           +  +  EGC+I G LE+  V+GNFH APG+    SG+    D++    D FN+SH + +L
Sbjct: 199 LDLDSDEGCSIKGTLELPAVSGNFHVAPGRHLQTSGLFKGMDLVQLTFDKFNVSHTVKQL 258

Query: 165 AFG---------EHFPGVVNP-------LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
            FG              VV P       LDG   T     GM+QY++KVVPTVY ++ G 
Sbjct: 259 RFGPDERSLEPARASRKVVGPDVDLSSQLDGESRTLGDGYGMHQYYLKVVPTVYKNLGGK 318

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           T +  Q+SVTEH R    G  + LPGVFFFY++SP+   F E    +L  LT + AIVGG
Sbjct: 319 TRELWQYSVTEHVRHVAPGSGKGLPGVFFFYEVSPLCAEFVERRNGWLALLTGLAAIVGG 378

Query: 269 V 269
           V
Sbjct: 379 V 379


>gi|315044047|ref|XP_003171399.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma gypseum CBS 118893]
 gi|311343742|gb|EFR02945.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma gypseum CBS 118893]
          Length = 435

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 163/351 (46%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALALHK-KEDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYG    + +    CCN CEEVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GDCYGVPAPSNAKKPGCCNTCEEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        +SH I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMSHTIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD      +     + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSDHKTDEARYNFMYFVKVVSTSYLPLGWDPTWSSEVHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
                         +  +I+++Q+SVT H RS      S +G  +       +P V F Y
Sbjct: 321 HKDIPLGNHGVYFGTQGSIETHQYSVTSHQRSLDAEDASAEGHKERQHTRGGIPSVIFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 EISPMKVINREARPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGGLRVKK 431


>gi|452001785|gb|EMD94244.1| hypothetical protein COCHEDRAFT_1202021 [Cochliobolus
           heterostrophus C5]
          Length = 437

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 168/354 (47%), Gaps = 74/354 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEH-NETYCG 58
           MD+SGE  + V H I K RL  +           G+  I+ K L  H     H    YCG
Sbjct: 89  MDVSGELQMGVTHGINKVRLGPEKE---------GSKTIEIKALDLHADEASHLAPDYCG 139

Query: 59  SCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
            C+GA     +    CCN C+EVR+AY    W+    + ++QC+RE + + + E+  EGC
Sbjct: 140 ECFGAPPPANAKKPGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGC 199

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFPG 172
            + G + VNKV GNFH APGKSF    +HVHD+  + +D +    +HKI++L FG     
Sbjct: 200 RLEGSIRVNKVVGNFHIAPGKSFSNGNMHVHDLENYFKDEYAHTFTHKIHQLRFGPQLSD 259

Query: 173 VV---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT-------- 203
           VV                     NPLD      +  +  + YFIKVV T Y         
Sbjct: 260 VVIQGIQDKHKGSGPGSWSNHHINPLDNTEQHTDEKAFNFMYFIKVVSTAYLPLGWEDAA 319

Query: 204 -------DVSGHT--------IQSNQFSVTEHFRSSEQGRLQT------------LPGVF 236
                  ++ G T        I+++Q+SVT H R+ + G  +             +PGVF
Sbjct: 320 PRLTKHDELLGSTIDASHKGSIETHQYSVTSHKRNLKGGNDEKDGHKERIHARGGIPGVF 379

Query: 237 FFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           F YD+SP+KV   E    +F  FL  +CA++GG  TV+  +D  +Y G   IKK
Sbjct: 380 FSYDISPMKVINREVREKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNRIKK 433


>gi|326476034|gb|EGE00044.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
 gi|326481270|gb|EGE05280.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Trichophyton equinum CBS 127.97]
          Length = 435

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 122/351 (34%), Positives = 159/351 (45%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALALHK-KEDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYG    + +    CCN C+EVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        +SH I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMSHIIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD            + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEARYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRS------------SEQGRLQTLPGVFFFY 239
                         S  +I+++Q+SVT H RS              Q     +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           D+SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 DISPMKVINRESRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431


>gi|366996541|ref|XP_003678033.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
 gi|342303904|emb|CCC71687.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
          Length = 409

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 113/329 (34%), Positives = 170/329 (51%), Gaps = 58/329 (17%)

Query: 4   SGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           SGE  LD+  +    K R+DS GN ++S +  +     + P Q        ++ YCGSCY
Sbjct: 93  SGELQLDLLQEGSFTKTRVDSNGNALDSMKFKLDDEVGEYPPQ--------DDNYCGSCY 144

Query: 62  GA-ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           GA + S+ D        CC +CE+VR AY   GWA  +   I+QC+REG++ RI     E
Sbjct: 145 GALDQSNNDNLPKDEKVCCQDCEQVRNAYLTAGWAFFDGKKIEQCEREGYVARINSHLNE 204

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
           GC + G + +N++ GN HFAPG++F  +  H HD   +++  S N +H IN L+FG+   
Sbjct: 205 GCRVKGDVLLNRIHGNIHFAPGRAFQNTKGHFHDTSLYEQTLSLNFNHIINHLSFGKSVE 264

Query: 172 GV---------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT-- 218
            +          +PLDG + +    S +Y+  YF K+VPT Y  + G   ++ QFS T  
Sbjct: 265 QLAEVRGASVSTSPLDGQQVSPSFDSHLYRYSYFTKIVPTRYEWLDGVVAETAQFSATFH 324

Query: 219 ------------EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS-----FLHFLTN 261
                        H R S  G    LPGVF ++++SP+KV   E+H       FLH +T+
Sbjct: 325 ESPVNGAMDPEHPHIRHSRTG----LPGVFIYFEMSPLKVINQEQHFKSWSGVFLHGITS 380

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
               +GG+  V  ++D   Y  QR I+K+
Sbjct: 381 ----MGGILAVGTVLDKIFYRAQRTIQKR 405


>gi|119596606|gb|EAW76200.1| ERGIC and golgi 3, isoform CRA_b [Homo sapiens]
          Length = 239

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 85/154 (55%), Positives = 112/154 (72%), Gaps = 3/154 (1%)

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
           +HD+ +F  D+ N++H I  L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY 
Sbjct: 85  IHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 144

Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
            V G  +++NQFSVT H + +  G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT 
Sbjct: 145 KVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 203

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 204 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 237


>gi|224011116|ref|XP_002294515.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970010|gb|EED88349.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 454

 Score =  178 bits (451), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 157/301 (52%), Gaps = 24/301 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAP-KIDKPLQRHGGRLEH-NETYCG 58
           +D++G+  LD+   +FK RL+  G +    +    A  K D+  ++     +     YCG
Sbjct: 158 IDVAGDSQLDLSDTLFKHRLNLDGTLRSKAKIATEANIKADEDKKKQEALSKDIPADYCG 217

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQR--IKEEEGEGCN 115
            CYGA+  + DCCN C++V E Y+KK W  +    L +QC REG  +    +   GEGCN
Sbjct: 218 PCYGADEKEGDCCNTCDDVMERYKKKRWNENAVQPLAEQCIREGKGKNEPKRMSNGEGCN 277

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH------ 169
           + G   VN+VAGNFH A G+   + G H+H  L   R +FN SH +++L F +       
Sbjct: 278 LSGHFTVNRVAGNFHIAMGEGVDRDGRHIHQFLPEDRMNFNASHVVHELIFMDEEYGDMV 337

Query: 170 ---FPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
               PG   +N +  V       +G++QYFIKVVPT Y   SG T+        EH  + 
Sbjct: 338 IAGVPGETSMNSVSKVVTEDTGTTGLFQYFIKVVPTKYKGKSGGTLHEK----VEHHDTQ 393

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                  LPGVFF Y++ P  V  T+  V F+H L  + A VGGVFT+ G ID+ +Y  +
Sbjct: 394 N----AVLPGVFFVYEIYPFAVEVTKNKVPFMHLLIRIMATVGGVFTIMGWIDSALYSRE 449

Query: 285 R 285
           +
Sbjct: 450 K 450


>gi|67479189|ref|XP_654976.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56472072|gb|EAL49587.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 361

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 100/296 (33%), Positives = 165/296 (55%), Gaps = 26/296 (8%)

Query: 4   SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGA 63
           SGE  + ++ ++ K R+   G+++   +         K +Q        +   C SCYGA
Sbjct: 86  SGESMIGIEQNVTKIRIHHDGSLVTENEM--------KAIQSKLSIETPDPKECRSCYGA 137

Query: 64  ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVN 123
           E+ ++ CC  C++V+EAY+K+GW L + +++ QC+    +Q  K  + EGC + G   +N
Sbjct: 138 ETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLN 196

Query: 124 KVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWT 183
           K+ GNFH APG S    G H H++    +   ++SHK N+L+FGE         +  ++T
Sbjct: 197 KIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGE---------NSKKFT 247

Query: 184 QETP----SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
            E      + M+QY++ ++P     ++G T     +S+ E+ RS E G  Q  PGVF +Y
Sbjct: 248 TEKKDTQMNSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRSGE-GEGQ--PGVFIYY 303

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           D+SP+ +  TE +  FLHFL  +C+IVGG+FT   + DA ++     +KKK+E+GK
Sbjct: 304 DVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 359


>gi|148674214|gb|EDL06161.1| ERGIC and golgi 3, isoform CRA_a [Mus musculus]
          Length = 238

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 92/189 (48%), Positives = 123/189 (65%), Gaps = 14/189 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FKKRLD  G  + S  +     K++  +      L+ N   C SC
Sbjct: 61  MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAERHELGKVEVTV-FDPNSLDPNR--CESC 117

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAES D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 118 YGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 177

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           EVNKV G      G    Q    VHD+ +F  D+ N++H I  L+FGE +PG+VNPLD  
Sbjct: 178 EVNKVPG------GSKARQL---VHDLQSFGLDNINMTHYIKHLSFGEDYPGIVNPLDHT 228

Query: 181 RWTQETPSG 189
             T   P G
Sbjct: 229 NVT--APQG 235


>gi|296811622|ref|XP_002846149.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma otae CBS 113480]
 gi|238843537|gb|EEQ33199.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma otae CBS 113480]
          Length = 435

 Score =  177 bits (448), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGKVIDVTALDLHK-KDDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G+CYG    + +    CCN C EVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GNCYGVPAPSTAKKPGCCNTCAEVRDAYAEKNWAFGRGEGVTQCMDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        ++H I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD      +     + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHRTDEVRYNFLYFVKVVSTSYLPLGWDATWSSEVHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFY 239
                         S  +I+++Q+SVT H RS + G                +P V F Y
Sbjct: 321 HKDIPLGNHGVYFGSQGSIETHQYSVTSHKRSLDGGDDSAEGHKERQYARGGIPSVMFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 EISPMKVINRETRPKSLSTFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431


>gi|440301578|gb|ELP93964.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba invadens IP1]
          Length = 363

 Score =  176 bits (447), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 107/297 (36%), Positives = 156/297 (52%), Gaps = 26/297 (8%)

Query: 4   SGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN-----ETYCG 58
           SGE  +D++ +I K RL+  G            P  +  L+    +L  N     +  C 
Sbjct: 86  SGESMIDIEKNITKTRLNKNG-----------VPLTESELKATQQKLNANIKTVDQKTCR 134

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCYGAE+    CC  C++V EAY+++GW L N   I QC     L+  K    EGC + G
Sbjct: 135 SCYGAETPSRKCCYTCDDVIEAYKERGWNL-NIRTIAQCDNSEKLEMAKLTLEEGCRVEG 193

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            L +NK+ GNFH APG S +    H H+I    R   +++H  N L+FGE          
Sbjct: 194 NLLLNKIGGNFHIAPGTSDNTWTGHHHNIEWTGRTKIDLTHTWNDLSFGEGSKTYSGSKK 253

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
             +      +GM+QYF+ ++P     ++G     + F + E  RS   G+ +  PGVF +
Sbjct: 254 DAKM-----NGMFQYFLTLIPKKNNFINGTKFVYD-FVINEQTRS---GQGEGEPGVFVY 304

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           YD+SP+ +   E +  FLHFL  VCAI+GGVFTV  +IDAF++     ++KKIE+GK
Sbjct: 305 YDVSPMLLEVNEFNHGFLHFLIGVCAIIGGVFTVFQLIDAFVFDSIHTLQKKIELGK 361


>gi|354544621|emb|CCE41346.1| hypothetical protein CPAR2_303350 [Candida parapsilosis]
          Length = 412

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 170/322 (52%), Gaps = 30/322 (9%)

Query: 2   DISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETY-- 56
           D SG+  LD+ +   +K R+  QG+  +  +     P + +  PL++    L   +T   
Sbjct: 91  DESGDLKLDIINSQLEKFRIIKQGHSSKPVEIKDEQPALQREVPLEQIAPGLPEGQTEGE 150

Query: 57  CGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
           CGSCYGA   D+   CCN C  VR AY +  W   + + I QC++EG++QR+K+   E E
Sbjct: 151 CGSCYGAVPQDKKQYCCNTCAAVRRAYAEANWQFFDGENIAQCEQEGYVQRLKQRIGENE 210

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF 170
           GC + G  ++N+++G   FAPG S  + G HVHD+  +Q  +D FN  H IN L+FG + 
Sbjct: 211 GCRVKGTAKINRISGTMDFAPGASMTKDGRHVHDLSLYQKYKDKFNFDHVINHLSFGNNP 270

Query: 171 P-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-HTIQSNQFSVTEHFR 222
           P       G + PLDG ++ Q        YF+K+V T +  + G H   +NQFSV  H R
Sbjct: 271 PASKLVDTGSITPLDGHKFLQHKKYHSINYFLKIVATRFESLDGKHKFDTNQFSVITHDR 330

Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFT 271
               G+ +           +PGV F +D+SP+K+   EE+      F+  V + + GV  
Sbjct: 331 PLAGGKDEDHQHTLHARGGVPGVAFNFDISPLKIINREEYAKTRSGFILGVVSSIAGVLM 390

Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
           V  ++D  ++  Q+AIK K ++
Sbjct: 391 VGSLMDRSVFAAQQAIKGKKDL 412


>gi|344230638|gb|EGV62523.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
          Length = 409

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 108/326 (33%), Positives = 172/326 (52%), Gaps = 37/326 (11%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNE---- 54
           +D+SG   LD+  + F+K R+ S G  +  +     AP ID  PL+     L+  E    
Sbjct: 88  LDVSGNVELDILQNGFQKYRILSSGEEVLMKN----APLIDSTPLEVMAKGLDKPEDAEH 143

Query: 55  TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--E 110
           T CG CYG+   D    CCNNCE +R AY  K WA  + + I  C+ EG+++ I+ E   
Sbjct: 144 TPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDGENIKPCEDEGYVKAIQSEIFN 203

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE 168
            EGC + G  ++N+++GN HFAPG SF +   HVHD+  + +  D FN  H IN L+FG+
Sbjct: 204 NEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSLYNKFPDRFNFDHTINHLSFGK 263

Query: 169 HFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---GHTIQSNQFSVT 218
                 N       PLDG     +    +Y YF+KVV T Y  +       +++NQFS  
Sbjct: 264 DPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVSTRYEYLQEKLKAPLETNQFSAI 323

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R  + G+ +           LPG++F++D+SP+K+   E++  ++  F+  V + + 
Sbjct: 324 YHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKIINKEQYSKTWSGFVLGVISSIA 383

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  +  ++D  ++  ++AI+ K +I
Sbjct: 384 GVLMIGSLLDRSVWAAEKAIRAKKDI 409


>gi|302666755|ref|XP_003024974.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
 gi|291189052|gb|EFE44363.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
          Length = 435

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/351 (34%), Positives = 159/351 (45%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALALHK-KEDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYG    + +    CCN C+EVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        ++H I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD            + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRS------------SEQGRLQTLPGVFFFY 239
                         S  +I+++Q+SVT H RS              Q     +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHSRGGIPSVMFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431


>gi|344230637|gb|EGV62522.1| hypothetical protein CANTEDRAFT_131007 [Candida tenuis ATCC 10573]
          Length = 410

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 108/326 (33%), Positives = 172/326 (52%), Gaps = 37/326 (11%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK-PLQRHGGRLEHNE---- 54
           +D+SG   LD+  + F+K R+ S G  +  +     AP ID  PL+     L+  E    
Sbjct: 89  LDVSGNVELDILQNGFQKYRILSSGEEVLMKN----APLIDSTPLEVMAKGLDKPEDAEH 144

Query: 55  TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--E 110
           T CG CYG+   D    CCNNCE +R AY  K WA  + + I  C+ EG+++ I+ E   
Sbjct: 145 TPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDGENIKPCEDEGYVKAIQSEIFN 204

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE 168
            EGC + G  ++N+++GN HFAPG SF +   HVHD+  + +  D FN  H IN L+FG+
Sbjct: 205 NEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSLYNKFPDRFNFDHTINHLSFGK 264

Query: 169 HFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVS---GHTIQSNQFSVT 218
                 N       PLDG     +    +Y YF+KVV T Y  +       +++NQFS  
Sbjct: 265 DPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVSTRYEYLQEKLKAPLETNQFSAI 324

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R  + G+ +           LPG++F++D+SP+K+   E++  ++  F+  V + + 
Sbjct: 325 YHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKIINKEQYSKTWSGFVLGVISSIA 384

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  +  ++D  ++  ++AI+ K +I
Sbjct: 385 GVLMIGSLLDRSVWAAEKAIRAKKDI 410


>gi|302511557|ref|XP_003017730.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
 gi|291181301|gb|EFE37085.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
          Length = 435

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 121/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALALHK-KEDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYG    + +    CCN C+EVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        ++H I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD            + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
                         S  +I+++Q+SVT H RS      S  G  +       +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431


>gi|116181584|ref|XP_001220641.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
 gi|88185717|gb|EAQ93185.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
          Length = 438

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 168/353 (47%), Gaps = 73/353 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MDISGEQ   V+H + K RL  Q   G  I+++   + A        R       + +YC
Sbjct: 89  MDISGEQQHGVQHGVTKTRLRPQSEGGGDIDTKAVALHA--------RDEVATHLDPSYC 140

Query: 58  GSCYGAE----SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA+    +    CCN CEEV++AY +  WA    + I+QC+RE + +++ E+  EG
Sbjct: 141 GPCYGAQPPPNAKKPGCCNTCEEVKDAYAQAAWAFGRGEGIEQCEREHYSEKLDEQRNEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFP 171
           C I G L VNKV GNFH APG+SF    +HVHD+  +         SH+I+ L FG   P
Sbjct: 201 CRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVHDLKNYWDTPTKHTFSHQIHHLRFGPQLP 260

Query: 172 G-----------------VVNPLDGV------------------------RWT-QETPSG 189
                               NPLD                          RW  ++T +G
Sbjct: 261 DNLHKKLDARKNMRGRSTTFNPLDDTPPGDGTTSTTTTCTSSRSCPHRTCRWAGRKTWAG 320

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS---------SEQGRLQT---LPGVFF 237
             +     + +      G +++++Q+SVT H RS           Q RL     +PGVFF
Sbjct: 321 FREEHHAELGSFGASADG-SVETHQYSVTSHKRSLAGGDDSAEGHQERLHARGGIPGVFF 379

Query: 238 FYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            YD+SP+KV   EE   SFL F+  +CAIVGG  TV+  ID  ++ G   +KK
Sbjct: 380 SYDISPMKVINREEKAKSFLGFIAGLCAIVGGTLTVAAAIDRALFEGGVRLKK 432


>gi|255712984|ref|XP_002552774.1| KLTH0D01144p [Lachancea thermotolerans]
 gi|238934154|emb|CAR22336.1| KLTH0D01144p [Lachancea thermotolerans CBS 6340]
          Length = 402

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 110/320 (34%), Positives = 168/320 (52%), Gaps = 38/320 (11%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD +GE  L+V +  + K RLD  G V++++Q   G   +D   +        +E YCG 
Sbjct: 88  MDSAGEMQLEVLNKGWSKTRLDPSGQVLDTKQFKPGKDVVDYAPE--------DENYCGP 139

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYGA    ++         CC  C++VREAY +K WA  +   I+QC+REG+++++ E  
Sbjct: 140 CYGARDQSKNDEVNVDERVCCQTCDDVREAYAEKQWAFFDGKNIEQCEREGYVEQVNEHI 199

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
            EGC I G  ++N++ GN HFAPGK FH    H HD   +Q   S N +H I+ L+FG+ 
Sbjct: 200 EEGCRIKGMAKLNRIGGNLHFAPGKGFHNIRGHFHDASLYQNSPSLNFNHIIHHLSFGKE 259

Query: 170 FPGVVN------PLDGVRWTQE--TPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
              +        PLDG   + E  T    + YF K+VPT Y  +SG T+++ QF+ T H 
Sbjct: 260 VEDITGQGASTAPLDGTNVSPEFDTHKHQFSYFAKIVPTRYEYLSGETVETTQFTTTYHS 319

Query: 222 RSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVF 270
           R  + GR              P V+F++++SP+KV   +++  S+  F  N    +GGV 
Sbjct: 320 RPLKGGRDSDHPTTLHSQGGFPSVYFYFEMSPLKVINKQQYAQSWSGFWLNCITSIGGVL 379

Query: 271 TVSGIIDAFIYHGQRAIKKK 290
            V  ++D   Y  QR++  K
Sbjct: 380 AVGTVLDKITYKAQRSMWGK 399


>gi|169731514|gb|ACA64886.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           (predicted) [Callicebus moloch]
          Length = 237

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 85/153 (55%), Positives = 111/153 (72%), Gaps = 3/153 (1%)

Query: 145 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD 204
           HD+ +F  D+ N++H I  L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  
Sbjct: 84  HDLQSFGLDNINMTHYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMK 143

Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
           V G  +++NQFSVT H + +  G L  Q LPGVF  Y+LSP+ V  TE+H SF HFLT V
Sbjct: 144 VDGEVLRTNQFSVTRHEKVA-NGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 202

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           CAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 203 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 235


>gi|365982867|ref|XP_003668267.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
 gi|343767033|emb|CCD23024.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
          Length = 410

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 110/326 (33%), Positives = 168/326 (51%), Gaps = 51/326 (15%)

Query: 1   MDISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D +G+  LD+ +     K RLD  GNVIE     +   KID  +        ++E YCG
Sbjct: 90  LDDAGDLQLDILNQGQFTKTRLDRMGNVIE-----VSKFKIDDDVAEFP---PNDENYCG 141

Query: 59  SCYGA----------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
            CYG+             D+ CC  CE+VREAY K GWA  +   I+QC+REG++ +I +
Sbjct: 142 PCYGSIDQSGNDKIESVKDKICCQTCEQVREAYLKAGWAFFDGKNIEQCEREGYVTKINK 201

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFG 167
              EGC + G + +N++ GN HFAPGK+F     H HD   ++     N +H I+ L+FG
Sbjct: 202 HLNEGCRVKGNVLLNRIQGNIHFAPGKAFQNVKGHFHDSSLYETSPDLNFNHIIHHLSFG 261

Query: 168 EHFPGV---------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFS 216
           +    +          +PLDG + +    S +Y+  YF+K+VPT Y  +     ++ QFS
Sbjct: 262 KTIEQLAQLRGATVATSPLDGQQISPSFDSHLYRYSYFVKIVPTRYEYLDKMISETAQFS 321

Query: 217 VTEHF------RSSEQGRLQT----LPGVFFFYDLSPIKVTFTEEHVS-----FLHFLTN 261
            T H       R  E   ++     LPG+F ++++SP+K+  TE+H       FLH +T+
Sbjct: 322 ATFHQSLVTGERDPENPNIKYSRTGLPGLFIYFEMSPLKIINTEQHFKSWSGVFLHCITS 381

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAI 287
               +GG+  V  I+D F Y  QR +
Sbjct: 382 ----IGGILAVGTILDKFFYKAQRTV 403


>gi|406866287|gb|EKD19327.1| copii-coated vesicle membrane protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 453

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 168/368 (45%), Gaps = 88/368 (23%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGG-RLEH-NETYC 57
           MD+SGEQ   V H + K RL  +           G  +I  + L  HG  +  H +  YC
Sbjct: 89  MDVSGEQQTGVMHGVKKVRLGPEAE---------GGKEISIESLDLHGDDQATHLDPDYC 139

Query: 58  GSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYGA     +    CCN CEEVREAY    WA    + ++QC+RE + +++  +  EG
Sbjct: 140 GGCYGATAPPNAKKAGCCNTCEEVREAYASVSWAFGRGENVEQCEREHYGEKLDAQRKEG 199

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEH 169
           C I G + VNKV GNFH APG+SF    +HVHD+  +           +H I+ L FG  
Sbjct: 200 CRIEGGIRVNKVVGNFHIAPGRSFSNGNMHVHDLNNYFDTPVPGGHVFTHHIHSLRFGPQ 259

Query: 170 FPGVV----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS------- 206
            P  V                NPLD  R      +  + YF+KVVPT Y  +        
Sbjct: 260 LPESVTKKLGNKALPWTNHHINPLDDTRQVAPETAYNFMYFVKVVPTSYLPLGWDNSVTS 319

Query: 207 ------------GH----TIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                       GH    +++++QFSVT H RS   G         +L +   +PGVFF 
Sbjct: 320 EQRIDHVDIGSYGHLDDGSVETHQFSVTSHKRSLSGGDDGAEGHKEKLHSRGGIPGVFFS 379

Query: 239 Y----------------DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           Y                D+SP+KV   EE   S   FLT +CAI+GG  TV+  +D  +Y
Sbjct: 380 YVSSHFYPQKISTNKTQDISPMKVINREERAKSLAGFLTGLCAIIGGTLTVAAAVDRGVY 439

Query: 282 HGQRAIKK 289
            G   +KK
Sbjct: 440 EGTTRLKK 447


>gi|327296796|ref|XP_003233092.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
 gi|326464398|gb|EGD89851.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
          Length = 435

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/351 (34%), Positives = 161/351 (45%), Gaps = 70/351 (19%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD+SGE   DV H + K RL S    G VI+     +   K D P       L+ N  YC
Sbjct: 89  MDVSGELQTDVDHGVNKVRLSSAAEGGRVIDVTALSLHK-KEDSP-----AHLDPN--YC 140

Query: 58  GSCYG----AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           G CYG    + +    CCN C+EVR+AY +K WA    + + QC  EG+ QRI E+  EG
Sbjct: 141 GDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKNWAFGRGENVAQCIDEGYSQRIDEQRHEG 200

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHFP 171
           C I G L VNKVAGNFH APG+S      H HD+  +        ++H I+KL FG   P
Sbjct: 201 CRIEGILRVNKVAGNFHIAPGRSLTAGNFHAHDLDNYYHTPVPHTMTHIIHKLRFGPQLP 260

Query: 172 ------------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-------------- 205
                         +NPLD            + YF+KVV T Y  +              
Sbjct: 261 EELYSRWKWTHQDTINPLDKSEHKTNEVRYNFLYFVKVVSTSYLPLGWDPTLSSEAHSQA 320

Query: 206 --------------SGHTIQSNQFSVTEHFRS------SEQGRLQT------LPGVFFFY 239
                         S  +I+++Q+SVT H RS      S  G  +       +P V F Y
Sbjct: 321 HRDIPLGNHGVFFGSQGSIETHQYSVTSHQRSLDAEDASADGHKERQHARGGIPSVMFNY 380

Query: 240 DLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           ++SP+KV   E    S   F T VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 381 EISPMKVINRETRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVKK 431


>gi|190347075|gb|EDK39286.2| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 404

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 169/324 (52%), Gaps = 44/324 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN------ 53
           +D+SG+  +D+    F+K RL   G+ I            + P+    G LE        
Sbjct: 88  LDVSGDLQVDLLSSGFEKFRLLKDGSEIRD----------ESPVMSSAGELEERARGRAP 137

Query: 54  ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           +  CGSCYGA   DE+   CCN+CE VR AY +K W   + + I+QC+REG++ R+ E+ 
Sbjct: 138 DGSCGSCYGALPQDENSDYCCNDCETVRLAYAQKAWGFFDGENIEQCEREGYVARLNEKI 197

Query: 111 G--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF 166
              EGC I G  ++N+++GN HFAPG SF   G H HD+  F +  D F   H IN L+F
Sbjct: 198 NNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDLSLFNKYDDKFTFDHVINHLSF 257

Query: 167 GEHFPGV-------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSV 217
           G     +        +PLD      ++   +Y Y++KVV T +  ++ +T  +++NQFSV
Sbjct: 258 GSDPHNIQFFEKQSTHPLDKSSMILKSKDRLYSYYLKVVATRFEFLTPNTPALETNQFSV 317

Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
             H R    G+             LPGVFF +++SP+K+   E++  ++  F+  V + +
Sbjct: 318 ISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEISPMKIINKEQYAKTWSGFVLGVISSI 377

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
            GV  V  ++D  ++  +R I+ K
Sbjct: 378 AGVLMVGALLDRSVWAAERVIRAK 401


>gi|449705731|gb|EMD45722.1| endoplasmic reticulumgolgi intermediate compartment protein,
           putative [Entamoeba histolytica KU27]
          Length = 272

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 90/239 (37%), Positives = 142/239 (59%), Gaps = 10/239 (4%)

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE+ ++ CC  C++V+EAY+K+GW L + +++ QC+    +Q  K  + EGC +
Sbjct: 42  CRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRL 100

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
            G   +NK+ GNFH APG S    G H H++    +   ++SHK N+L+FGE+       
Sbjct: 101 IGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFGENSKKFTTE 160

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
               +      + M+QY++ ++P     ++G T     +S+ E+ RS E G  Q  PGVF
Sbjct: 161 KKDTQM-----NSMFQYYLTIIPIKNNFING-TSTFYDYSIQENIRSGE-GEGQ--PGVF 211

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            +YD+SP+ +  TE +  FLHFL  +C+IVGG+FT   + DA ++     +KKK+E+GK
Sbjct: 212 IYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELGK 270


>gi|443925078|gb|ELU44001.1| ER-derived vesicles protein ERV46 [Rhizoctonia solani AG-1 IA]
          Length = 383

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/298 (35%), Positives = 155/298 (52%), Gaps = 43/298 (14%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGE   D+ H++ K RLDS G +I   QDG    ++D  +++        + YCGSCY
Sbjct: 91  DISGEIQQDLTHNMVKTRLDSNGQII---QDGFHNNELDNDVEK--TMKARPQGYCGSCY 145

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           G E  +  CC  CE VR+AY  +GW+  +PD I+QC  E +  +I E+  EGC+I G + 
Sbjct: 146 GGEPPEGGCCQTCESVRQAYMNRGWSFGDPDAIEQCVAEHWTAKIHEQNSEGCHISGRVR 205

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF-GE-----HFPGV 173
           VNKV GNFHF+PG+SF  +  H  D++ + +D    +  H +++  F GE      + G 
Sbjct: 206 VNKVTGNFHFSPGRSFVLNRGHFQDLVPYLKDGNHHDFGHYVHEFRFEGESEAEDEWRGT 265

Query: 174 -------------VNPLDGVRW---TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
                         NPLD V          + M+QYF+KVV T +  + G  I+S+Q+SV
Sbjct: 266 DRGTRWRKKVGISANPLDQVSAHVVDDRASNYMFQYFMKVVSTEFKYLDGDIIRSHQYSV 325

Query: 218 TEHFRSSEQGR--------------LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
           T + R    G               +Q LPG FF +++SP+ V   E   +F HF T+
Sbjct: 326 TSYERDLTHGDGAERDSHGTLTAHGVQGLPGAFFNFEISPMMVVHRETRQTFAHFATS 383


>gi|19113757|ref|NP_592845.1| COPII-coated vesicle component Erv46 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|1351651|sp|Q09895.1|YAI8_SCHPO RecName: Full=Uncharacterized protein C24B11.08c
 gi|1061296|emb|CAA91773.1| COPII-coated vesicle component Erv46 (predicted)
           [Schizosaccharomyces pombe]
          Length = 390

 Score =  174 bits (440), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 113/312 (36%), Positives = 162/312 (51%), Gaps = 35/312 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SGE   D+ H + K RL   G +I      IG     + +   G         CG C
Sbjct: 89  LDVSGEFQRDIHHTVSKTRLSPSGEIISVDDLDIGN---QQSISDDGA------AECGDC 139

Query: 61  YGAES-SDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           YGA   + ED   CCN C+ VR+AY K  W + + D   QCK E F +  + ++ EGCN+
Sbjct: 140 YGAADFAPEDTPGCCNTCDAVRDAYGKAHWRIGDVDAFKQCKDENFKELYEAQKVEGCNL 199

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEHFPGVV 174
            G L VN++AGNFH APG+S      HVHD   +  + D  ++SH I+ L+FG      V
Sbjct: 200 AGQLSVNRMAGNFHIAPGRSTQNGNQHVHDTRDYINELDLHDMSHSIHHLSFGPPLDASV 259

Query: 175 ---NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTEHFRSSEQGRL 229
              NPLDG      T    Y+YFIK V   +  +S  T  I +N+++VT+H RS   GR 
Sbjct: 260 HYSNPLDGTVKKVSTADYRYEYFIKCVSYQFMPLSKSTLPIDTNKYAVTQHERSIRGGRE 319

Query: 230 QT----------LPGVFFFYDLSPIKVTFTEEHV---SFLHFLTNVCAIVGGVFTVSGII 276
           +           +PGV+F +D+SP++V   E  V   +F  FL+NV A++GG  T++  +
Sbjct: 320 EKVPTHVNFHGGIPGVWFQFDISPMRV--IERQVRGNTFGGFLSNVLALLGGCVTLASFV 377

Query: 277 DAFIYHGQRAIK 288
           D   Y  Q+  K
Sbjct: 378 DRGYYEVQKLKK 389


>gi|367007030|ref|XP_003688245.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
 gi|357526553|emb|CCE65811.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
          Length = 407

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 111/320 (34%), Positives = 162/320 (50%), Gaps = 43/320 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRL--EHNETYC 57
           MD +G   LD+    FKK RLD  G  +E R+           L+ +  R+  E    YC
Sbjct: 90  MDDAGGLQLDILDSGFKKTRLDPNGKQLEFRE---------FDLKDNSKRIVSEKGPNYC 140

Query: 58  GSCYGA--------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           GSCYGA        E + + CCN CE+VR AY    WA  +   I+QC+ EG+++RI E 
Sbjct: 141 GSCYGAIDQSHNDEEGAKKVCCNTCEDVRLAYVTANWAFFDGKNIEQCEDEGYVKRINEH 200

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGE 168
             EGC + G  ++N+V GN HFAPGK    S  H+HD   +++  + N  H I+  +FGE
Sbjct: 201 LNEGCRVTGKAKINRVKGNIHFAPGKPMQNSKGHLHDTSLYEKSPNMNFKHIIHHFSFGE 260

Query: 169 HFPG---------VVNPLD--GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
                        + NPLD   V+   +T    + Y++KVVPT Y  ++   +++ QFSV
Sbjct: 261 PIDRKAKSKGADVLTNPLDDYDVQPNIDTHYHQFSYYMKVVPTRYEYLNRMVVETAQFSV 320

Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
           T H R    G+ +           +PGVFFF+D+S IKV   E+   ++  F+ N    +
Sbjct: 321 TFHDRPLRGGKDEDHPNTIHARNGIPGVFFFFDISSIKVINNEQITQTWSGFILNCIITI 380

Query: 267 GGVFTVSGIIDAFIYHGQRA 286
           GGV  V  ++D   Y  Q+ 
Sbjct: 381 GGVLAVGSMVDRLSYKAQKT 400


>gi|344301666|gb|EGW31971.1| hypothetical protein SPAPADRAFT_50577 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 410

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/320 (32%), Positives = 173/320 (54%), Gaps = 27/320 (8%)

Query: 1   MDISGEQHLDVKHDIFKKR--LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEH--NETY 56
           +D +G+  L++ +  F+K   +  +GN++    D   A  +D+PL      L    +   
Sbjct: 91  LDETGDMQLNIINAGFQKLRLIKDKGNIVREISDDTPALNLDRPLSEVVKGLPEGGDPKT 150

Query: 57  CGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
           CGSCYGA  +   + CCN+C  V+ AY ++ W+  + + I+QC++EG+++R+++   + E
Sbjct: 151 CGSCYGALPQEKHQYCCNDCYSVKRAYAERRWSFFDGENIEQCEKEGYVKRLRQRINDNE 210

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG--- 167
           GC I G  ++N+V+G   FAPG SF   G HVHD+  + +  D FN  H IN L+FG   
Sbjct: 211 GCRIKGSAKINRVSGTMDFAPGASFTSDGRHVHDVSLYGKYQDKFNFDHIINHLSFGSND 270

Query: 168 --EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVTEHFRSS 224
             E     V+PLDG ++       +  Y++KVV T +  +     + +NQFSV  H R  
Sbjct: 271 AREEILNSVHPLDGYQFMLHKKHHVASYYLKVVATRFESLDQSKRLDTNQFSVITHDRPL 330

Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
             G+ +           +PGV F +D+SP+K+   E++  ++  F+  V + + GV  V 
Sbjct: 331 TGGKDEDHEHTLHARGGIPGVEFHFDISPLKIINKEQYAKTWSGFVLGVISSIAGVLMVG 390

Query: 274 GIIDAFIYHGQRAIKKKIEI 293
            +ID  +Y  Q+AI+ K +I
Sbjct: 391 TLIDRSVYATQQAIRGKKDI 410


>gi|401626934|gb|EJS44847.1| erv46p [Saccharomyces arboricola H-6]
          Length = 415

 Score =  172 bits (435), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/337 (34%), Positives = 164/337 (48%), Gaps = 59/337 (17%)

Query: 1   MDISGEQHLDVKHDIFK-KRLDSQGNVIESRQ------DGIGAPKIDKPLQRHGGRLEHN 53
           MD SGE  LD+    F   R+D  G+ +          +G GA   D P           
Sbjct: 88  MDDSGELQLDILDAGFTMTRVDKDGHPVGDATELHVGGNGEGATPNDDP----------- 136

Query: 54  ETYCGSCYGAESS---------DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 104
             YCG CYGA            D+ CC NC+ VR AY  KGWA  +   I+QC++EG++ 
Sbjct: 137 -NYCGQCYGARDQSNNENLAQEDKVCCQNCDSVRSAYLDKGWAFFDGKDIEQCEKEGYVN 195

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRD-SFNISHKIN 162
           +I +   EGC I G  ++N++ GN HFAPGK F  + G H HD   + +    N +H IN
Sbjct: 196 KINDHLHEGCRIEGSAQINRIQGNIHFAPGKPFQDTRGNHRHDTSLYDKTPDLNFNHIIN 255

Query: 163 KLAFGE--------------HFPGVV--NPLDGVRWTQETPSGMYQ--YFIKVVPTVYTD 204
           +L+FG+              H   VV  +PLDG +   + P+  +Q  YF K+VPT Y  
Sbjct: 256 RLSFGKPIQSHHKRLGNDKLHGGAVVSTSPLDGRQVFPDRPTHFHQFSYFAKIVPTRYEY 315

Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLP----------GVFFFYDLSPIKVTFTEEH-V 253
           +    I++ QFS T H R    GR Q  P          G++ F+++SP+KV   E+H  
Sbjct: 316 LDSTVIETAQFSATYHSRPLGGGRDQDHPNTFHARGGISGLYVFFEMSPLKVINKEQHGQ 375

Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           ++  F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 376 TWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|148674216|gb|EDL06163.1| ERGIC and golgi 3, isoform CRA_c [Mus musculus]
          Length = 261

 Score =  171 bits (434), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 85/160 (53%), Positives = 112/160 (70%), Gaps = 9/160 (5%)

Query: 144 VHDILAFQRDS------FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKV 197
           +HD+ +F  D+       N++H I  L+FGE +PG+VNPLD    T    S M+QYF+KV
Sbjct: 101 IHDLQSFGLDNPSDCLQINMTHYIKHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKV 160

Query: 198 VPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTEEHVSF 255
           VPTVY  V G  +++NQFSVT H + +  G L  Q LPGVF  Y+LSP+ V  TE+H SF
Sbjct: 161 VPTVYMKVDGEVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 219

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            HFLT VCAI+GG+FTV+G+ID+ IYH  RAI+KKI++GK
Sbjct: 220 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGK 259


>gi|340055752|emb|CCC50073.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 404

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 106/315 (33%), Positives = 158/315 (50%), Gaps = 28/315 (8%)

Query: 5   GEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAE 64
           GE        I K R+ +Q     S       P+ D+ +      + +    C SCYGAE
Sbjct: 94  GEYMTGAVRSITKVRVPTQDPAPVSE----ALPQSDRSVSTAALPVSNKMGGCVSCYGAE 149

Query: 65  SSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKE-EEGEGCNIYGFLEV 122
            S  DCCN+C++V  A+R+ GW +   D+ + QC  EG L  +      EGCNI+    V
Sbjct: 150 ESPGDCCNSCDDVHAAFRRNGWEIDENDIKLSQCT-EGQLHNVGPVSPSEGCNIHSKFSV 208

Query: 123 NKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD---- 178
            K+ GN HF PG+  +  G  ++ +        N+SH  + L FGE FPG VNPL+    
Sbjct: 209 RKIKGNIHFVPGRRLNHRGQPMYVVRREAIKKMNLSHVFHSLEFGERFPGQVNPLNGIAN 268

Query: 179 --GVRWTQETPSGMYQYFIKVVPTVYTDV----SGHTIQSNQFSVTEHFRSSEQGRLQTL 232
             GVR   E  SG + Y+++V+PT Y  V    S   +++NQ+SV +HF  S     +  
Sbjct: 269 ARGVRNASEVVSGRFSYYVQVLPTEYQFVPALGSRVRLETNQYSVKQHFTESWYTTDRRY 328

Query: 233 P---------GVFFFYDLSPIK--VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           P         GVF  YD+SP+K  V  T  + S +H L  +CA+ GG FTV+ +ID+ + 
Sbjct: 329 PGWSDPTLVAGVFIVYDVSPVKTLVMRTSPYPSLIHLLLRMCAVGGGAFTVASMIDSLLL 388

Query: 282 HGQRAIKKKIEIGKF 296
           +     ++K+   K+
Sbjct: 389 NILGHFRRKMRETKY 403


>gi|255732259|ref|XP_002551053.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
 gi|240131339|gb|EER30899.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
          Length = 414

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 109/326 (33%), Positives = 172/326 (52%), Gaps = 33/326 (10%)

Query: 1   MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
           +D++G+Q LD+     KK RL  + QG+VI +  +D   A   D  L+     L      
Sbjct: 89  LDVTGDQQLDIIDSGLKKVRLLKNKQGDVIINEIEDDKPALNSDVSLKELAKGLPEGSDQ 148

Query: 54  ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
             YCG CYGA   D+   CCN+C  VR AY +K W   + + I+QC++EG+++R++E   
Sbjct: 149 NAYCGPCYGALPQDKKQFCCNDCNTVRRAYAEKQWQFFDGENIEQCEKEGYVKRLRERIN 208

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
             EGC I G  ++N+V+G   FAPG SF+  G H HD+  +++  D FN  H IN L+FG
Sbjct: 209 NNEGCRIKGSTKINRVSGTMDFAPGSSFNHDGRHFHDLSLYKKYNDKFNFDHVINHLSFG 268

Query: 168 --------EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
                   E     ++PLD  ++       +  YF+KVV T Y  +     + +NQFSV 
Sbjct: 269 EVPTNNGAEEMFDSIHPLDDYQFMLHKKDHVVSYFLKVVATRYESLDYSKRVDTNQFSVI 328

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R    G+ +           +PGV F +D+SP+K+   +++  ++  F+  V + + 
Sbjct: 329 THDRPLIGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFILGVVSSIA 388

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  V  ++D  ++  Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414


>gi|74267709|gb|AAI02327.1| ERGIC and golgi 3 [Bos taurus]
          Length = 231

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 83/147 (56%), Positives = 103/147 (70%), Gaps = 11/147 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD----GIGAPKIDKPLQRHGGRLEHNETY 56
           MD++GEQ LDV+H++FKKRLD  G  + S  +    G    K+  P      R       
Sbjct: 89  MDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKVEVKVFDPDSLDPDR------- 141

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C SCYGAE  D  CCN+CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +
Sbjct: 142 CESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQV 201

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVH 143
           YGFLEVNKVAGNFHFAPGKSF QS VH
Sbjct: 202 YGFLEVNKVAGNFHFAPGKSFQQSHVH 228


>gi|323356370|gb|EGA88170.1| Erv46p [Saccharomyces cerevisiae VL3]
          Length = 415

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
           MD SGE  LD+        LD+      SR +  G P  D      GG       + ++ 
Sbjct: 88  MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTXPVNNDP 137

Query: 55  TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
            YCG CYGA+   ++         CC +C+ VR AY + GWA  +   I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
           I E   EGC I G  ++N++ GN HFAPGK +  +  H HD   + + S  N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257

Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
           +FG+              H   VV  +PLDG  V   + T    + YF K+VPT Y  + 
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
              I++ QFS T H R    GR +           +PG+F F+++SP+KV   E+H  ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|349576209|dbj|GAA21381.1| K7_Erv46p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 415

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
           MD SGE  LD+        LD+      SR +  G P  D      GG       + ++ 
Sbjct: 88  MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137

Query: 55  TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
            YCG CYGA+   ++         CC +C+ VR AY + GWA  +   I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
           I E   EGC I G  ++N++ GN HFAPGK +  +  H HD   + + S  N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257

Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
           +FG+              H   VV  +PLDG  V   + T    + YF K+VPT Y  + 
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
              I++ QFS T H R    GR +           +PG+F F+++SP+KV   E+H  ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|151941348|gb|EDN59719.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
 gi|190406692|gb|EDV09959.1| ER-Golgi transport vesicle protein [Saccharomyces cerevisiae
           RM11-1a]
 gi|207348028|gb|EDZ74008.1| YAL042Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256272276|gb|EEU07261.1| Erv46p [Saccharomyces cerevisiae JAY291]
 gi|259144662|emb|CAY77603.1| Erv46p [Saccharomyces cerevisiae EC1118]
 gi|323334778|gb|EGA76150.1| Erv46p [Saccharomyces cerevisiae AWRI796]
 gi|323338873|gb|EGA80087.1| Erv46p [Saccharomyces cerevisiae Vin13]
 gi|323349926|gb|EGA84136.1| Erv46p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365767200|gb|EHN08685.1| Erv46p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 415

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
           MD SGE  LD+        LD+      SR +  G P  D      GG       + ++ 
Sbjct: 88  MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137

Query: 55  TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
            YCG CYGA+   ++         CC +C+ VR AY + GWA  +   I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
           I E   EGC I G  ++N++ GN HFAPGK +  +  H HD   + + S  N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257

Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
           +FG+              H   VV  +PLDG  V   + T    + YF K+VPT Y  + 
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
              I++ QFS T H R    GR +           +PG+F F+++SP+KV   E+H  ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGMFVFFEMSPLKVINKEQHGQTW 377

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|6319274|ref|NP_009358.1| Erv46p [Saccharomyces cerevisiae S288c]
 gi|1723191|sp|P39727.2|ERV46_YEAST RecName: Full=ER-derived vesicles protein ERV46
 gi|1326054|gb|AAC04989.1| Yal042wp [Saccharomyces cerevisiae]
 gi|285810158|tpg|DAA06944.1| TPA: Erv46p [Saccharomyces cerevisiae S288c]
 gi|392301230|gb|EIW12318.1| Erv46p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 415

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 164/335 (48%), Gaps = 55/335 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGR------LEHNE 54
           MD SGE  LD+        LD+      SR +  G P  D      GG       + ++ 
Sbjct: 88  MDDSGEMQLDI--------LDA--GFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP 137

Query: 55  TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
            YCG CYGA+   ++         CC +C+ VR AY + GWA  +   I+QC+REG++ +
Sbjct: 138 NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYVSK 197

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKL 164
           I E   EGC I G  ++N++ GN HFAPGK +  +  H HD   + + S  N +H IN L
Sbjct: 198 INEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIINHL 257

Query: 165 AFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS 206
           +FG+              H   VV  +PLDG  V   + T    + YF K+VPT Y  + 
Sbjct: 258 SFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEYLD 317

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSF 255
              I++ QFS T H R    GR +           +PG+F F+++SP+KV   E+H  ++
Sbjct: 318 NVVIETAQFSATFHSRPLAGGRDKDHPNTLHVRGGIPGMFVFFEMSPLKVINKEQHGQTW 377

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             F+ N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 378 SGFILNCITSIGGVLAVGTVMDKLFYKAQRSIWGK 412


>gi|366987855|ref|XP_003673694.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
 gi|342299557|emb|CCC67313.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
          Length = 425

 Score =  170 bits (430), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 109/334 (32%), Positives = 166/334 (49%), Gaps = 48/334 (14%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SGE  LD+    F K R+D+ GN + S    +G   +   +Q+      ++  YCGS
Sbjct: 93  MDDSGELQLDLLDSAFTKIRVDADGNELGSSTLEVGTDDLASEVQQRN----NDPDYCGS 148

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYG++  DE+         CC  C +VREAY   GW   +   I+QC++EG++ +I E  
Sbjct: 149 CYGSKVQDENDKLPRESRVCCQTCNDVREAYLNIGWGFFDGKGIEQCEKEGYVAKINEHL 208

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSF-----HQSGVHVHDILAFQRDS-FNISHKINKL 164
            EGC + G   ++++ GN HFAPGKS+       S  H HD   + + S  N +HKIN L
Sbjct: 209 KEGCRVKGQTLLSRIQGNIHFAPGKSYTSYKRSTSASHYHDTSLYDKTSNLNFNHKINHL 268

Query: 165 AFGEHFPGV------------VNPLDG---VRWTQETPSGMYQYFIKVVPTVY--TDVSG 207
           +FG+    +            ++PLDG   +    +T   +Y Y+ K+VPT Y   +   
Sbjct: 269 SFGKPIDKLDEKVQDHSTEFSISPLDGREVIPTDIDTHYHVYSYYAKIVPTRYEFLNKKE 328

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFL 256
            +I++ QFS T H R    GR             +PG+F ++++S +KV   E H  S+ 
Sbjct: 329 KSIETAQFSTTFHSRPLRGGRDADHPTTMHSQGGIPGLFIYFEMSAVKVINKEHHFRSWS 388

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            FL N    VG V  V  + D   Y  Q++++ K
Sbjct: 389 SFLLNCITTVGSVLAVGTVSDKIFYRAQKSLQGK 422


>gi|12060847|gb|AAG48265.1|AF308298_1 serologically defined breast cancer antigen NY-BR-84, partial [Homo
           sapiens]
          Length = 239

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 79/143 (55%), Positives = 103/143 (72%), Gaps = 3/143 (2%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD++GEQ LDV+H++FK+RLD  G  + S  +     K++  +         +   C SC
Sbjct: 98  MDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELGKVEVTVFDPDSL---DPDRCESC 154

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGAE+ D  CCN CE+VREAYR++GWA  NPD I+QC+REGF Q+++E++ EGC +YGFL
Sbjct: 155 YGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 214

Query: 121 EVNKVAGNFHFAPGKSFHQSGVH 143
           EVNKVAGNFHFAPGKSF QS VH
Sbjct: 215 EVNKVAGNFHFAPGKSFQQSHVH 237


>gi|449016424|dbj|BAM79826.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 499

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 100/275 (36%), Positives = 148/275 (53%), Gaps = 33/275 (12%)

Query: 55  TYCGSCYGAESSDED-----------CCNNCEEVREAYRKKGWALSNP-DLIDQCKREGF 102
            YCGSCYGA    +            CCN C+E+R  Y ++ WA        +QC  + +
Sbjct: 224 AYCGSCYGAVPQTDQVGEANQITSGVCCNTCDEIRVLYEERNWAFDQVLRTAEQCAEKRY 283

Query: 103 LQRIKEE---EGEGCNIYGFLEVNKVAGNFHFAPGKS-FHQSGVHVHDIL-AFQRDSFNI 157
           L  + E    +  GC +   L++ +VAGNFHFAPGK   H+ G HVH +       ++N 
Sbjct: 284 LTLLHEAGRVQSGGCRVSARLQLPRVAGNFHFAPGKGHTHRMGHHVHSVDDQLLHRTYNF 343

Query: 158 SHKINKLAFGEHFPGVVNPLDG-VRWTQETPSG-----MYQYFIKVVPTVYTD--VSGHT 209
           SH+I  L FG  FP   NPLDG +R  ++ P G     M  Y+ K++PT Y      G  
Sbjct: 344 SHRIRHLRFGPLFPHQQNPLDGAMRILEQPPPGSPFGNMVLYYCKLIPTTYRRDRQRGDA 403

Query: 210 IQSNQFSVTEHFRSSEQGRLQ------TLPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNV 262
           ++S +++  +  +SSEQ R+        LPG+FFFY+  P+++ + E  +   LHF+  +
Sbjct: 404 LRSMEYAAADLTQSSEQDRVGITHSTGALPGIFFFYEPQPLQIAYFEGRMYGLLHFIVQL 463

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIK-KKIEIGKF 296
           CAIVGGVFTVS +ID F++     I+ +K  +GK 
Sbjct: 464 CAIVGGVFTVSSMIDRFVFGAGTFIRAQKRRLGKL 498


>gi|226292523|gb|EEH47943.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb18]
          Length = 435

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 115/352 (32%), Positives = 164/352 (46%), Gaps = 72/352 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQDGIGAPKIDKPLQRHGGRLEH-NETY 56
           MD+SGE    + H I K RL  +   G+VI++             L       +H +  Y
Sbjct: 89  MDVSGEMQSGIIHGISKVRLAPESEGGHVIDTTA---------LVLHTQTDAAKHLDPDY 139

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     ++        +EVREAY  + WA    + ++QC+REG+ + +  +  E
Sbjct: 140 CGPCYGAPPPSHATKPGVALPAKEVREAYASQSWAFGRGENVEQCEREGYSKNLDAQRNE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    +H HD+  +       ++SHKI++L FG   
Sbjct: 200 GCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAHDLDTYYHTPVPHHMSHKIHQLRFGPQL 259

Query: 171 PGVV------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDV------------- 205
              +            NPLD        P   + YF+KVV T Y  +             
Sbjct: 260 SDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNFMYFVKVVSTSYLPLGWSPEFSSSVHET 319

Query: 206 ---------------SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFF 238
                          S  +I+++Q+SVT H RS + G         RL +   +PGVF  
Sbjct: 320 TLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRSIDGGDDAAEGHKERLHSHGGIPGVFVN 379

Query: 239 YDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YD+SP+KV   E    +F  FLT VCA++GG  TV+  +D  +Y G   +KK
Sbjct: 380 YDISPMKVINREARTKTFSGFLTGVCAVIGGTLTVAAAVDRALYEGVARVKK 431


>gi|254581328|ref|XP_002496649.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
 gi|238939541|emb|CAR27716.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
          Length = 404

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 160/326 (49%), Gaps = 43/326 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD +GE  LD+    F K RLD  G  + S    +     D            +E YCG+
Sbjct: 89  MDSAGEIQLDLLESGFTKTRLDQNGQSLGSSSLKVSDESYDP----------KDENYCGA 138

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYGA+    +         CC  C +VR AY +  WA  +   I+QC+REG++ R+ E+ 
Sbjct: 139 CYGAKDQSRNNEVPKEERVCCQTCNDVRRAYLEANWAFFDGKNIEQCEREGYVDRVNEQL 198

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
            EGC + G   +N++ G  HFAPG +F     H HD+  +++  + N +H IN L+FG+ 
Sbjct: 199 NEGCRVQGSALLNRIQGTLHFAPGVAFQNPKGHFHDLSLYEKTHNLNFNHIINHLSFGKP 258

Query: 170 FPG---------VVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
                          PLDG +   +  + M+Q  YF K+VPT Y  +    +++ QFS T
Sbjct: 259 VTSNARGRGASVATAPLDGRQAFPDRDTHMHQFSYFTKIVPTRYEYMDKMVVETAQFSAT 318

Query: 219 EHFRS----SEQGRLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R     ++Q    TL      PG+F ++++SP+KV   E+H  ++  F+ N    +G
Sbjct: 319 LHDRPLHGGADQDHPTTLHTKGGFPGLFVYFEMSPLKVINREQHAQTWSGFILNCITSIG 378

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  V  ++D   Y  Q++I  K  +
Sbjct: 379 GVLAVGTVLDKITYKAQKSIWGKKSV 404


>gi|403215799|emb|CCK70297.1| hypothetical protein KNAG_0E00290 [Kazachstania naganishii CBS
           8797]
          Length = 408

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 108/324 (33%), Positives = 160/324 (49%), Gaps = 40/324 (12%)

Query: 1   MDISGEQHLDVKHD---IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNET-Y 56
           +D SG   LDV  +     K R+D +G  +++      A      L     +L   +  Y
Sbjct: 89  LDDSGVLLLDVDDENNHFTKTRIDQRGEPLDA------AAAASFKLDAEAAQLPPTDPDY 142

Query: 57  CGSCYGA---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 107
           CGSCYG+         + +++ CCN C  VREAY   GWA  +   I+QC+REG++ +I 
Sbjct: 143 CGSCYGSRDQTRNDELDPANKVCCNTCSSVREAYLDAGWAFFDGKNIEQCEREGYVDKIS 202

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAF 166
           +   EGC I G + +N+V GN HFAPG +F  +  H HD   + Q  S N  H I+ L+F
Sbjct: 203 QRITEGCRIKGGVRLNRVQGNIHFAPGDAFRSARGHFHDTSMYDQTGSLNFDHIIHHLSF 262

Query: 167 GEHFPGV----------VNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQ 214
           G     +          + PLDG +      S  YQ  YF K+VPT +   SG  I++ Q
Sbjct: 263 GPSVDNMQSLEKASNVAIAPLDGKQVLPRYDSHAYQYTYFTKIVPTRFEYFSGSVIETTQ 322

Query: 215 FSVTEHFRSSEQGRLQT-------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIV 266
           FS T   R    G  +T        PG++F  ++SP+KV   E++ +S+  FL N    +
Sbjct: 323 FSSTFSARPIGGGTTETATYTSGGTPGLYFNIEMSPLKVIHKEQNKISWSGFLLNCITSI 382

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
           GGV  V  ++D  +Y  +R +  K
Sbjct: 383 GGVLAVGTVVDKILYRAERTLLNK 406


>gi|169603005|ref|XP_001794924.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
 gi|111067148|gb|EAT88268.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
          Length = 351

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 102/297 (34%), Positives = 142/297 (47%), Gaps = 63/297 (21%)

Query: 56  YCGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG 111
           YCG CYGA S        CCN C+EVR+AY    W+    + ++QC+RE + + + ++  
Sbjct: 51  YCGECYGAPSPTNAIKAGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQ 110

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN--ISHKINKLAFGEH 169
           EGC + G + VNKV GNFH APGKSF    +HVHD+  + +D ++   +HKI+ L FG  
Sbjct: 111 EGCRLEGSIRVNKVVGNFHIAPGKSFSTGNMHVHDLENYFKDEYSHTFTHKIHHLRFGPQ 170

Query: 170 FPGVV---------------------NPLDGVRWTQETPSGMYQYFIKVVPTVYT----- 203
               V                     NPLD         +  + YF+KVV T Y      
Sbjct: 171 LSNAVIADMQKKHQNTGPGGWTSHHINPLDNTEQQTSEKAYNFMYFVKVVSTAYLPLGWE 230

Query: 204 ----------DVSGHTIQSN--------QFSVTEHFRSSEQGRLQT------------LP 233
                     ++ G TI+ N        Q+SVT H RS   G  +             +P
Sbjct: 231 KEAPRLTKHDELLGSTIEGNYKGSIETHQYSVTSHKRSLAGGNDEKEGHKERIHAKGGIP 290

Query: 234 GVFFFYDLSPIKVTFTE-EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           GVFF YD+SP+KV   E    +F  FL  +CA++GG  TV+  +D  +Y G   IKK
Sbjct: 291 GVFFSYDISPMKVINREVRDKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKK 347


>gi|356547537|ref|XP_003542168.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3-like [Glycine max]
          Length = 351

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 100/291 (34%), Positives = 157/291 (53%), Gaps = 35/291 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RL+S G++I       G   I   +++     EH++      
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSYGHII-------GTEYISDLVEKEHTNQEHDDNKDHDH 141

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EEGEGCNIYG 118
           +   S  +    N +E                       E  ++++KE  + GEGC +YG
Sbjct: 142 HHEHSEQKIHLQNLDE---------------------STENIIKKVKEALKNGEGCRVYG 180

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            L+V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+ NPLD
Sbjct: 181 VLDVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLD 236

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
                    SG ++Y+IKVVPT Y  +S   + +NQFSV+E++    Q   +T P V+F 
Sbjct: 237 DTTRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFL 295

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YDLSPI VT  EE  SFLHF+T +CA++GG F V+G++D ++Y    A+ K
Sbjct: 296 YDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMYRLLEALTK 346


>gi|146416067|ref|XP_001484003.1| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 404

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 107/324 (33%), Positives = 169/324 (52%), Gaps = 44/324 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN------ 53
           +D+SG+  +D+    F+K RL   G  +E R +         P+    G LE        
Sbjct: 88  LDVSGDLQVDLLLSGFEKFRLLKDG--LEIRDES--------PVMSSAGELEERARGRAP 137

Query: 54  ETYCGSCYGAESSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           +  CGSCYGA   DE+   CCN+CE VR AY +K W   + + I+QC+REG++ R+ E+ 
Sbjct: 138 DGLCGSCYGALPQDENLDYCCNDCETVRLAYAQKAWGFFDGENIEQCEREGYVARLNEKI 197

Query: 111 G--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAF 166
              EGC I G  ++N+++GN HFAPG SF   G H HD+  F +  D F   H IN L F
Sbjct: 198 NNFEGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDLSLFNKYDDKFTFDHVINHLLF 257

Query: 167 G------EHFPG-VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSV 217
           G      + F   + +PLD      ++   +Y Y++KVV T +  ++ +T  +++NQF V
Sbjct: 258 GLDPHNIQFFEKQLTHPLDKSSMILKSKDRLYSYYLKVVATRFEFLTPNTPALETNQFLV 317

Query: 218 TEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIV 266
             H R    G+             LPGVFF +++ P+K+   E++  ++  F+  V + +
Sbjct: 318 ISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEILPMKIINKEQYAKTWSGFVLGVISSI 377

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
            GV  V  ++D  ++  +R I+ K
Sbjct: 378 AGVLMVGALLDRSVWAAERVIRAK 401


>gi|356575088|ref|XP_003555674.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 347

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 98/289 (33%), Positives = 153/289 (52%), Gaps = 35/289 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RL+S G++I +      +  ++K    H      N  +    
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---ISDLVEKEHTHHKHDDNKNHEHSEQK 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              ++ DE   N  ++V+EA +                            GEGC +YG L
Sbjct: 146 IHLQNLDESTENIIKKVKEALKN---------------------------GEGCRVYGVL 178

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+ NPLD  
Sbjct: 179 DVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLDDT 234

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IKVVPT Y  +S   + +NQFSV+E++    Q   +T P V+F YD
Sbjct: 235 TRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFLYD 293

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           LSPI VT  EE  SFLHF+T +CA++GG F V+G++D ++Y     + K
Sbjct: 294 LSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMYRLLETLTK 342


>gi|45188262|ref|NP_984485.1| ADR389Cp [Ashbya gossypii ATCC 10895]
 gi|44983106|gb|AAS52309.1| ADR389Cp [Ashbya gossypii ATCC 10895]
          Length = 392

 Score =  167 bits (422), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 155/317 (48%), Gaps = 42/317 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA--PKIDKPLQRHGGRLEHNETYC 57
           +D +GE  L++  + F K RLD  G  +   +  +G   P  D            ++ YC
Sbjct: 88  IDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRVGETLPSTD------------DQDYC 135

Query: 58  GSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
           G CYGA   D++         CC  C EVR AY +  WA  +    +QCKREG+ +R++E
Sbjct: 136 GPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEMNWATFDGKGFEQCKREGYTERLQE 195

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG 167
           +  EGC + G  ++N+V GN HFAPG S H    H HD   ++     + +H I+ L+FG
Sbjct: 196 QINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHAHDDSFYKEHPHLSFNHVIHSLSFG 254

Query: 168 EHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
               G   PL+G     E P+G    + YF KVVP  Y  ++G   +S +FSVT H R  
Sbjct: 255 PEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSVTAHDRPV 312

Query: 225 EQGRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVS 273
             GR    P    F          +++SP+KV   E++ S +  F+ N    +GGV  V 
Sbjct: 313 HGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVG 372

Query: 274 GIIDAFIYHGQRAIKKK 290
            ++D   YH QR +  K
Sbjct: 373 TVLDRVTYHTQRTLMGK 389


>gi|156838396|ref|XP_001642904.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113483|gb|EDO15046.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 404

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 108/323 (33%), Positives = 164/323 (50%), Gaps = 48/323 (14%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD SG   LD+    F K R+ S G     +Q G    K+ + L  +  +   ++ YCGS
Sbjct: 88  MDDSGNVQLDITESGFTKTRIGSDG-----QQLGTTNFKVSEDLLEYSPK---DKNYCGS 139

Query: 60  CYGA---------ESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           CYGA         ES D+  CC  CE+V+ AY   GWA  +   I+QC+REG+++++ ++
Sbjct: 140 CYGARDQSKNDEAESVDKKVCCQTCEDVKNAYSDAGWAFFDGKNIEQCEREGYVEKMNDQ 199

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD--SFNISHKINKLAFG 167
             EGC I G   +N++ GN HFAPGK+F   G H HD  +F  D  + N  H I  L+FG
Sbjct: 200 LNEGCRISGEALLNRIHGNIHFAPGKAFQNRGGHFHDT-SFYNDHKNLNFKHMIEHLSFG 258

Query: 168 ---------EHFPGVVNPLDGVRWTQETPS-----GMYQYFIKVVPTVYTDVSGHTIQSN 213
                    +    + +PLDG    QE PS       + YF K+VPT +  ++    +++
Sbjct: 259 RPVAQFKSNKDLVAMTSPLDG---HQELPSIDAHNHQFIYFAKIVPTRFEYLNKQAQETS 315

Query: 214 QFSVTEHFR--------SSEQGRLQTLPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCA 264
           Q  VT H +        S+     Q +PG+F  Y++SP+KV   E+H  ++  FL N   
Sbjct: 316 QLVVTSHMKPIGDATDYSTTMNSRQGIPGLFIDYEISPLKVINREQHATTWSGFLLNCIT 375

Query: 265 IVGGVFTVSGIIDAFIYHGQRAI 287
            +GG+  V  + D  ++  QR +
Sbjct: 376 SIGGILAVGTVADKIVHATQRVV 398


>gi|219110527|ref|XP_002177015.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411550|gb|EEC51478.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 500

 Score =  166 bits (419), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 164/343 (47%), Gaps = 72/343 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLD-----SQGNVIESRQDGIGAPKIDKPLQRHGGRLEHN-- 53
           MD++G+  L+++  + K+++D      Q  +++S Q         +  Q    +L  +  
Sbjct: 161 MDVAGDSQLNIEDTLTKRKMDRTGRYGQAEILQSNQH--------EQEQSRKAKLRQDPL 212

Query: 54  -ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI----DQCKREGFLQRIKE 108
            +TYCG CYGA+   + CCNNC+ + +AY+ KGW     DL+    +QC REG  Q+   
Sbjct: 213 PDTYCGPCYGAQPDVDACCNNCDALLDAYKLKGW---RTDLVLYTAEQCIREGRDQKKLR 269

Query: 109 E--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
              +GEGCN+ GF+ +N+VAGNFH A G+   + G H+H       + +N SH I+ L+F
Sbjct: 270 PLIQGEGCNLSGFMSLNRVAGNFHIAMGEGLQRDGRHIHVFDPEDSEHYNASHVIHHLSF 329

Query: 167 GEHFPGVV-------NPLDGVRWTQETP----SGMYQYFIKVVPTVYTDVSGH-----TI 210
           G    G         + L+GV     TP    +G++QYFIKVVPT Y    G      T 
Sbjct: 330 GPEIQGKTKSGNLDSSSLNGVT-KMVTPEHGTTGLFQYFIKVVPTTYLGPGGRRDESGTF 388

Query: 211 QSNQFSVTEHFRS------SEQG------------------------RLQTLPGVFFFYD 240
           ++N++  TE FR        E+                         R   LPGVFF Y+
Sbjct: 389 ETNRYFYTERFRPLMKEYLPEEAVAEDPKQAAVHAGGGHRTHDHHHVRNSVLPGVFFLYE 448

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
           + P  V      V   H L  + A +GGVFT+   +D  +  G
Sbjct: 449 IYPFAVEIHPVSVPLTHLLIRLMATIGGVFTIVRWVDTAVLEG 491


>gi|374107698|gb|AEY96606.1| FADR389Cp [Ashbya gossypii FDAG1]
          Length = 392

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 105/317 (33%), Positives = 154/317 (48%), Gaps = 42/317 (13%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGA--PKIDKPLQRHGGRLEHNETYC 57
           +D +GE  L++  + F K RLD  G  +   +  +G   P  D            ++ YC
Sbjct: 88  IDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRVGETLPSTD------------DQDYC 135

Query: 58  GSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
           G CYGA   D++         CC  C EVR AY +  WA  +    +QCKREG+ +R++E
Sbjct: 136 GPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEMNWATFDGKGFEQCKREGYTERLQE 195

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG 167
           +  EGC + G  ++N+V GN HFAPG S H    H HD   ++     + +H I+ L+FG
Sbjct: 196 QINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHAHDDSFYKEHPHLSFNHVIHSLSFG 254

Query: 168 EHFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
               G   PL+G     E P+G    + YF KVVP  Y  ++G   +S +FS T H R  
Sbjct: 255 PEIAGNPGPLNGR--AMEVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSATAHDRPV 312

Query: 225 EQGRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVS 273
             GR    P    F          +++SP+KV   E++ S +  F+ N    +GGV  V 
Sbjct: 313 HGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVG 372

Query: 274 GIIDAFIYHGQRAIKKK 290
            ++D   YH QR +  K
Sbjct: 373 TVLDRVTYHTQRTLMGK 389


>gi|444321132|ref|XP_004181222.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
 gi|387514266|emb|CCH61703.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
          Length = 414

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 110/331 (33%), Positives = 167/331 (50%), Gaps = 47/331 (14%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHG-GRLEHNET-YC 57
           +D SGE  LDV    F K R+D+ GN ++   DG    ++D    R     L+ ++  YC
Sbjct: 87  VDDSGETSLDVLESGFTKIRVDTNGNELD---DG---SQLDVGTDRESLSSLDMDKAKYC 140

Query: 58  GSCYGA----------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIK 107
           G CYGA           +S++ CC  C +VR+AY   GWA  +   I+QC+REG++ RI 
Sbjct: 141 GPCYGALDQSGNDNIDVASEKVCCQTCYDVRKAYTDVGWAFFDGKDIEQCEREGYVDRIN 200

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAF 166
           +   EGC I G   +N++ GN HFAPG +F  +  H HD   + + +  N +H IN L+F
Sbjct: 201 DHLHEGCRIVGSALLNRIQGNVHFAPGAAFETAKGHFHDTSLYDKTEQLNFNHIINHLSF 260

Query: 167 GEHFPGVVN-------------PLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTI 210
           G+    ++              PLDG     E+ +     + YF K+VPT +  +SG   
Sbjct: 261 GKTGHELLTPKSSKSFSVSRRQPLDGRVMIPESRNTHFFQFSYFAKIVPTRFESLSGKVE 320

Query: 211 QSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFL 259
           ++ Q+SVT H R  + GR +           +PG+F ++ ++P+KV   E H  +F   L
Sbjct: 321 EAAQYSVTFHSRPLQGGRDEDHPNTFHGRSGIPGLFIYFQMAPLKVIDIEAHSQTFSGLL 380

Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            N    +GGV  V  ++D   Y  QR+I  K
Sbjct: 381 LNCITTIGGVLAVGTMMDKVFYKAQRSIWGK 411


>gi|149237735|ref|XP_001524744.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451341|gb|EDK45597.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 411

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 109/322 (33%), Positives = 172/322 (53%), Gaps = 31/322 (9%)

Query: 2   DISGEQHLDVKHDIFKK-RLDSQGN--VIESRQDGIGAPKIDKPLQRHGGRLEHNET-YC 57
           D +G+  LDV +   +K R+  +GN  V+E   D   A + ++PL      L  NE   C
Sbjct: 91  DETGDMKLDVINSGLEKYRIIKRGNNKVVEELDDQ-PALRREQPLHEICKGLGENEQGEC 149

Query: 58  GSCYGAESSD--EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
           GSCYGA   D  E CCN+C  VR AY  K W   + + I+QC++EG++Q++K+   + EG
Sbjct: 150 GSCYGALPQDKKEYCCNSCAAVRRAYAHKKWQFFDGENIEQCEKEGYVQKLKDRINQNEG 209

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGEHFP 171
           C + G  ++N+VAG   FAPG S   +G HVHD+  + +  D FN  H I+ L+FG+   
Sbjct: 210 CRVKGSAKINRVAGTMDFAPGISTTSNGQHVHDLSLYTKYPDKFNFDHVIHHLSFGKIPT 269

Query: 172 GVVN--------PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-HTIQSNQFSVTEHFR 222
            + N        PLDG  + Q     M  Y++K+V T + ++ G   + +NQFSV  H R
Sbjct: 270 AITNLQETDSLSPLDGHSFLQHKRYHMNNYYLKIVSTRFENLDGTKKVDTNQFSVITHDR 329

Query: 223 SSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFT 271
               G+ +           +P V F +D+SP+K+   E +  ++  F+  V + V GV  
Sbjct: 330 PLVGGKDEDHQHTLHARGGVPSVAFHFDISPLKIINRERYAKTWSGFVLGVVSSVAGVLM 389

Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
           V  ++D  ++  Q+A+K K ++
Sbjct: 390 VGALLDRSVFAAQQAMKGKKDL 411


>gi|255637400|gb|ACU19028.1| unknown [Glycine max]
          Length = 347

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 97/289 (33%), Positives = 152/289 (52%), Gaps = 35/289 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RL+S G++I +      +  ++K    H      N  +    
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSYGHIIGTEY---VSDLVEKEHTHHKHDDNKNHEHSEQK 145

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              ++ DE   N  ++V+EA +                            GEGC +YG L
Sbjct: 146 IHLQNLDESTENIIKKVKEALKN---------------------------GEGCRVYGVL 178

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+ NPLD  
Sbjct: 179 DVQRVAGNFHI----SVHGLNIYVAQMIFDGAKNVNVSHFIHDLSFGPKYPGLHNPLDDT 234

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IKVVPT Y  +S   + +NQFSV+E++    Q   +T P V+F YD
Sbjct: 235 TRILHDTSGTFKYYIKVVPTEYRYISKEVLPTNQFSVSEYYSPINQFD-RTWPAVYFLYD 293

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           LSPI VT  EE  SF HF+T +CA++GG F V+G++D ++Y     + K
Sbjct: 294 LSPITVTIKEERRSFFHFITRLCAVLGGTFAVTGMLDRWMYRLLETLTK 342


>gi|50294900|ref|XP_449861.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529175|emb|CAG62841.1| unnamed protein product [Candida glabrata]
          Length = 415

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 113/332 (34%), Positives = 172/332 (51%), Gaps = 43/332 (12%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIG-APKIDKPLQRHGGRLEHNETYCG 58
           MD SGE  LD+ +  F+K RL  +G V+ +    IG A K DK  Q    +L  N  YCG
Sbjct: 88  MDDSGEVQLDIMNAGFEKTRLSKEGKVLGTADMKIGEAAKKDKEAQL--AKLGAN--YCG 143

Query: 59  SCYGAESSDED----------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
           +CYGA    ++          CC  C++VR+AY +K WA  +   I+QC+REG++Q+I +
Sbjct: 144 NCYGARDQGKNNDDTPRDQWVCCQTCDDVRQAYFEKNWAFFDGKDIEQCEREGYVQKIAD 203

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DILAFQRDSFNISHKINKLAFG 167
           +  EGC + G  ++N++ GN HFA G  F     H H D L  Q  + N +H IN L+FG
Sbjct: 204 QLQEGCRVSGSAQLNRIDGNLHFAAGPGFQNIRGHFHDDSLYIQHPNLNFNHIINHLSFG 263

Query: 168 EHFPG------------VVNPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQS 212
           +                 VNPLDG  +   ++     Y Y+ K+VPT Y  ++  + +++
Sbjct: 264 KAVEPTKKGKVMGIEKVTVNPLDGHSMFPPRDAHFLQYSYYAKIVPTRYEGLNKKNMVET 323

Query: 213 NQFSVTEHFR----SSEQGRLQTL------PGVFFFYDLSPIKVTFTEEH-VSFLHFLTN 261
            QFS T H R     S+     T+      P ++  +++SP+KV   EEH  S+  F+ N
Sbjct: 324 AQFSSTFHIRPVGGGSDDDHPNTVHQRGGSPSMWINFEMSPLKVINREEHGQSWSGFVLN 383

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
               +GGV  V  ++D  +Y  QR I +K ++
Sbjct: 384 CITSIGGVLAVGTVLDKALYKAQRTIFQKKDV 415


>gi|30686584|ref|NP_188868.2| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|13877821|gb|AAK43988.1|AF370173_1 unknown protein [Arabidopsis thaliana]
 gi|51969000|dbj|BAD43192.1| unknown protein [Arabidopsis thaliana]
 gi|51970108|dbj|BAD43746.1| unknown protein [Arabidopsis thaliana]
 gi|51970556|dbj|BAD43970.1| unknown protein [Arabidopsis thaliana]
 gi|51970734|dbj|BAD44059.1| unknown protein [Arabidopsis thaliana]
 gi|62319967|dbj|BAD94071.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643097|gb|AEE76618.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 354

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 104/297 (35%), Positives = 161/297 (54%), Gaps = 43/297 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
           +D+SG+  +D+  +I+K RL+S G++I  E   D +  G      P  +H G+ EH NET
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
                                  EA    G+        DQ   E  ++++K+   +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
           C +YG L+V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
            NPLD         SG ++Y+IK+VPT Y  +S   + +NQ+SVTE+F    +   +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYFTPMTEFD-RTWP 292

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            V+F YDLSPI VT  EE  SFLH +T +CA++GG F ++G++D +++    +  KK
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTGMLDRWMFRFIESFNKK 349


>gi|115455745|ref|NP_001051473.1| Os03g0784400 [Oryza sativa Japonica Group]
 gi|14718311|gb|AAK72889.1|AC091123_8 unknown protein [Oryza sativa Japonica Group]
 gi|108711422|gb|ABF99217.1| Serologically defined breast cancer antigen NY-BR-84, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113549944|dbj|BAF13387.1| Os03g0784400 [Oryza sativa Japonica Group]
 gi|215737170|dbj|BAG96099.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625918|gb|EEE60050.1| hypothetical protein OsJ_12848 [Oryza sativa Japonica Group]
          Length = 350

 Score =  164 bits (414), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 35/289 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RLD  G++I       G   ++  +++  G   HN  +    
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLNDLVEKEHG--THNHDHDHEH 139

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              +   E   N  E+  +  +    A+ N                    GEGC +YG L
Sbjct: 140 EDEQKKQEHTFN--EDAEKMVKSVKQAMEN--------------------GEGCRVYGVL 177

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   + V + +       N+SH I+ L+FG  +PG+ NPLD  
Sbjct: 178 DVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVSHIIHDLSFGPKYPGIHNPLDET 233

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IK+VPT Y  +S   + +NQFSVTE+F           P V+F YD
Sbjct: 234 TRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYD 293

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           LSPI VT  EE  +FLHFLT +CA++GG F ++G++D ++Y    ++ K
Sbjct: 294 LSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDRWMYRLIESVTK 342


>gi|297830940|ref|XP_002883352.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329192|gb|EFH59611.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score =  164 bits (414), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 103/297 (34%), Positives = 161/297 (54%), Gaps = 43/297 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
           +D+SG+  +D+  +I+K RL+S G++I  E   D +  G      P  +H G+ EH NET
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
                                  EA    G+        DQ   E  ++++K+   +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
           C +YG L+V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
            NPLD         SG ++Y+IK+VPT Y  +S   + +NQ+SVTE++    +   +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYYTPMTEFD-RTWP 292

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            V+F YDLSPI VT  EE  SFLH +T +CA++GG F ++G++D +++    +  KK
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTGMLDRWMFRLIESFNKK 349


>gi|218193856|gb|EEC76283.1| hypothetical protein OsI_13786 [Oryza sativa Indica Group]
          Length = 350

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 96/289 (33%), Positives = 148/289 (51%), Gaps = 35/289 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RLD  G++I       G   ++  +++  G   HN  +    
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLNDLVEKEHG--THNHDHDHEH 139

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              +   E   N  E+  +  +    A+ N                    GEGC +YG L
Sbjct: 140 EDEQKKQEHTFN--EDAEKMVKSVKQAMEN--------------------GEGCRVYGVL 177

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   + V + +       N+SH I+ L+FG  +PG+ NPLD  
Sbjct: 178 DVQRVAGNFHI----SVHGLNIFVAEKIFDGSSHVNVSHIIHDLSFGPKYPGIHNPLDET 233

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IK+VPT Y  +S   + +NQFSVTE+F           P V+F YD
Sbjct: 234 TRILHDTSGTFKYYIKIVPTEYRYLSKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYD 293

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           LSPI VT  EE  +FLHFLT +CA++GG F ++G++D ++Y    ++ K
Sbjct: 294 LSPITVTIKEERRNFLHFLTRLCAVLGGTFAMTGMLDRWMYRLIESVTK 342


>gi|241955457|ref|XP_002420449.1| COPII-coated vesicle complex subunit, putative; ER-derived vesicle
           protein, putative [Candida dubliniensis CD36]
 gi|223643791|emb|CAX41527.1| COPII-coated vesicle complex subunit, putative [Candida
           dubliniensis CD36]
          Length = 414

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 107/326 (32%), Positives = 172/326 (52%), Gaps = 33/326 (10%)

Query: 1   MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
           +D++G+  L++     KK RL  + QG+VI +  +D   A   D  L      L      
Sbjct: 89  LDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDDEPAFNNDIELTDLAKGLPEGSDE 148

Query: 54  ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
             YCGSCYGA   D+   CCN+C  VR AY +K W+  + + I+QC++EG++ R++E   
Sbjct: 149 NAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSFYDGENIEQCEKEGYVARLRERIN 208

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
             EGC I G  ++N+V+G   FAPG SF + G H HD+  + +  D FN  H IN L+FG
Sbjct: 209 NNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHDLSLYTKYEDKFNFDHIINHLSFG 268

Query: 168 E--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
           E             ++PLD  ++     + +  Y++KVV T +  +   + I +NQFSV 
Sbjct: 269 EMPVDGQADQLFDSIHPLDDHQFMLHKKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVI 328

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R    G+ +           +PGV F +D+SP+K+   +++  ++  F+  V + + 
Sbjct: 329 THDRPLRGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFVLGVISSIA 388

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  V  ++D  ++  Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414


>gi|224137484|ref|XP_002322569.1| predicted protein [Populus trichocarpa]
 gi|222867199|gb|EEF04330.1| predicted protein [Populus trichocarpa]
          Length = 351

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 101/290 (34%), Positives = 155/290 (53%), Gaps = 36/290 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RL+S G++        G   +   +++     E +       
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSHGHIT-------GTEYLSDLVEKEH---EAHNHDHDKD 138

Query: 61  YGAESSDEDCCNNCEEVREAYRKK-GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           +  +S +E   +  ++  E   KK   AL+N                    GEGC +YG 
Sbjct: 139 HHKDSHEEQHTHGFDDAAETMIKKVKQALAN--------------------GEGCRVYGV 178

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           L+V +VAGNFH     S H   + V  ++       N+SH I+ L+FG  +PG+ NPLDG
Sbjct: 179 LDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNVSHIIHDLSFGPKYPGIHNPLDG 234

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
                   SG+++Y+IK+VPT Y  +S   + +NQFSVTE+F S      +T P V+F Y
Sbjct: 235 TARILRETSGIFKYYIKIVPTEYRYISKDVLPTNQFSVTEYF-SPITDFDRTWPAVYFLY 293

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           DLSPI VT  EE  SFLHF+T +CAI+GG F ++G++D ++Y    A+ K
Sbjct: 294 DLSPITVTIKEERRSFLHFITRLCAILGGTFALTGMLDRWMYRLLEALTK 343


>gi|365989554|ref|XP_003671607.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
 gi|343770380|emb|CCD26364.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
          Length = 438

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/346 (31%), Positives = 167/346 (48%), Gaps = 57/346 (16%)

Query: 1   MDISGEQHLDVKHDIF-KKRLDSQGNVIESRQDGIGAPKID-----KPLQRHGGR----- 49
           MD SGE  LD+    F K RLD QGN +++  + +     D       L ++G +     
Sbjct: 91  MDESGELQLDLLDSTFIKTRLDPQGNPLDN-DNNVADTDADLVIGVDDLTKNGEKRLKEI 149

Query: 50  LEHNETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKRE 100
           L  +  YCGSCYG++   E+         CC  C +VR++Y   GWA  +   I+QC+ E
Sbjct: 150 LAKDPDYCGSCYGSQDQTENESKSKDQKICCQTCNDVRDSYLNAGWAFFDGAQIEQCENE 209

Query: 101 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH----QSGVHVHDILAFQR-DSF 155
           G++ +I +   EGC I G   +N++ GN HFAPGKS+     +   H HD   + +    
Sbjct: 210 GYVAKINKHLEEGCRIKGQALLNRIQGNIHFAPGKSYSNYKAKGSTHRHDTSLYDKVKKM 269

Query: 156 NISHKINKLAFGEHFPGV---------------VNPLDGVRWTQE--TPS-GMYQYFIKV 197
           N +H I+ L+FG+    V               +NPLD  +   +   P+   + Y+ K+
Sbjct: 270 NFNHIIHHLSFGKSIDKVGKNDLKDYSDRKKFSINPLDDRKVIVKDFNPAFHQFSYYTKI 329

Query: 198 VPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIK 245
           VPT Y   D    +I++ QFS T H R  + G  +           +PG+FFF+++SPIK
Sbjct: 330 VPTRYEFLDEKISSIETAQFSATYHSRPIQGGTDEDHPTTFHSRGGIPGLFFFFEMSPIK 389

Query: 246 VTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           V   E H  ++  FL N    +G V  V  + D   Y  Q+ +K K
Sbjct: 390 VINKEHHFRTWSSFLLNCITSIGSVLAVGTVFDKIFYRAQKTLKAK 435


>gi|403215743|emb|CCK70242.1| hypothetical protein KNAG_0D05030 [Kazachstania naganishii CBS
           8797]
          Length = 422

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 109/338 (32%), Positives = 167/338 (49%), Gaps = 50/338 (14%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGR-----LEHNE 54
           MD SG   LDV  D F K R+D  GN++     G  A +  KP    G R     L+ + 
Sbjct: 90  MDESGNVQLDVLFDQFTKTRVDVNGNMV-----GGSASEPYKPNSLSGKRAGAKDLQMDA 144

Query: 55  TYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQR 105
            YCGSCYG+++ + +         CC  C++V +AY + GWA  +   I+QC+ EG+++R
Sbjct: 145 DYCGSCYGSKNQENNAELPPEQRICCQTCDDVHDAYLEAGWAFFDGANIEQCESEGYVKR 204

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--------HVHDILAFQRDS-FN 156
           I+E+  EGCN+ G   +N++ GN HFAPGK + Q           H HD+  ++R+   N
Sbjct: 205 IQEQLHEGCNVKGTALLNRIQGNLHFAPGKPYQQLAAGMPGQGLGHYHDVSLYERNRHMN 264

Query: 157 ISHKINKLAFGEHFPGVV--------NPLDGVRWTQETPS-GMYQYFIKVVPTVYTDV-S 206
           ++H IN+  FGE     +         PL+    + E P   ++ Y+  VVPT Y  + +
Sbjct: 265 LNHVINEFRFGEDPQSEIVAQKIQRSAPLEDTVASLENPHYYIFNYYTNVVPTRYEFLGA 324

Query: 207 GHTIQSNQFSVTEHFRSSEQGR----LQTL------PGVFFFYDLSPIKVTFTEEHV-SF 255
              + + Q+S T H R    GR      TL      PGV+F  + SP+K+   E     +
Sbjct: 325 SKPLDTAQYSATYHDRPIMGGRDADHPTTLHGRGGTPGVYFNLEFSPLKIINRERRPQQW 384

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
              L N    +GG+  V  + D  +Y  QR+I  K ++
Sbjct: 385 STLLLNWITTIGGILAVGTVTDKVVYKAQRSIGAKKQL 422


>gi|150866674|ref|XP_001386342.2| hypothetical protein PICST_85013 [Scheffersomyces stipitis CBS
           6054]
 gi|149387930|gb|ABN68313.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 407

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 166/317 (52%), Gaps = 29/317 (9%)

Query: 1   MDISGEQHLDVKHDIFKK-RL--DSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYC 57
           MD +G+  LD+    F+K R+  DS+  +I+     I A    + + +  G  E  +  C
Sbjct: 90  MDEAGDLQLDILKSGFEKFRIVKDSEEEIIDRESTPINADLSIEEMAK--GLKEGEDGEC 147

Query: 58  GSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG--EG 113
           GSCYGA   D+   CCN+CE V+ AY +K W   + + I+QC+ EG++QR++      EG
Sbjct: 148 GSCYGALPQDKKQYCCNDCETVKLAYAEKLWGFYDGENIEQCENEGYVQRVQSRINGKEG 207

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFG----E 168
           C I G   +N+++G   FAPG SF  SG HVHD+  + +    N  H +NKL FG    E
Sbjct: 208 CRIKGNARINRISGTMDFAPGASFTSSGHHVHDLSLYDKHPHLNFDHIVNKLTFGPIPDE 267

Query: 169 HFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTEHFRSS 224
             P     +PLD         + ++ Y++KVV T +  ++G +  + +NQFSV  H R  
Sbjct: 268 SVPTAESTHPLDNYGVALNDKNHVFTYYLKVVATRFEFLNGASKALDANQFSVITHDRPI 327

Query: 225 EQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVS 273
             G+             +PGV F +D+SP+K+   E++  S+  F+  V + V GV  V 
Sbjct: 328 SGGKDNDHQHTLHAKGGIPGVVFHFDISPLKIINREQYAKSWSGFVLGVVSSVAGVLIVG 387

Query: 274 GIIDAFIYHGQRAIKKK 290
            ++D  +Y  + AIK K
Sbjct: 388 SLLDRSVYAAESAIKGK 404


>gi|448086324|ref|XP_004196073.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
 gi|359377495|emb|CCE85878.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
          Length = 405

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 101/318 (31%), Positives = 167/318 (52%), Gaps = 27/318 (8%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQ---RHGGRLEHNETY 56
           +D+SG+   DV    F+K RL    N  E   D     + D  L+   R+  +       
Sbjct: 90  LDVSGDTQADVLKSGFEKYRLIPSSN--EEVLDNAPVLRNDLSLEDIARNPNKEGGGYCG 147

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE--EEGEGC 114
                  +  +E CCN+CE VR AY ++ WA  +   I+QC+ EG++ R+ +  E+ EGC
Sbjct: 148 SCYGALPQGDNEFCCNDCETVRVAYAERMWAFYDGANIEQCENEGYVTRLNQRIEQKEGC 207

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG----E 168
            I G  ++N+V+GN HFAPG +    G H+HD+  +++  D F+  H IN L+FG    +
Sbjct: 208 RIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDLSLYEKHFDKFSFDHVINHLSFGLDPAK 267

Query: 169 HFPG--VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
             P     +PLDG R      S +  Y++KVV T +  ++G ++++NQFS   H R    
Sbjct: 268 EDPNHQSTHPLDGYRLILNDKSRVISYYLKVVATRFEFLNGSSMETNQFSAIPHHRPYRG 327

Query: 227 GRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGI 275
           G+ +           +PGVFF +D+SP+K+   E++  ++  F+  V + + GV TV  +
Sbjct: 328 GKDEDHRHTMHAKGGIPGVFFHFDISPMKIINKEQYAKTWSGFVLGVISSIAGVLTVGAV 387

Query: 276 IDAFIYHGQRAIKKKIEI 293
           +D  ++  ++ IK K +I
Sbjct: 388 LDRSVWAAEKVIKSKKDI 405


>gi|353242343|emb|CCA73995.1| related to ERV46-component of copii vesicles [Piriformospora indica
           DSM 11827]
          Length = 420

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 102/322 (31%), Positives = 160/322 (49%), Gaps = 46/322 (14%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           D+SGE   +V H+I K RLDS+G    + QD I   + +    +  G+      YCGSCY
Sbjct: 94  DVSGEHMREVSHNIVKVRLDSEGKPYPN-QDHISDLRNEISRVKDIGK----PGYCGSCY 148

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           G    +  CCN CE+VR++Y  +GWA S P+ I+QC REG+ ++IK +  +GC I G + 
Sbjct: 149 GGLEPEGGCCNTCEDVRKSYLDRGWAFSAPEHIEQCVREGWTEKIKVQANDGCQISGRVR 208

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAF---GEHFPGVVN- 175
           + KVA +  F+ G+SF  +  H  +++ + +D    +  H I  L F    E+ P   N 
Sbjct: 209 IKKVASSLIFSFGRSFQANSFHAQELVPYLKDGLIHDFGHHIETLQFQSDDEYDPRRANE 268

Query: 176 -------------PLDGV---------RWTQETPSGMYQYFIKVVPTVYTDVSGHTI--- 210
                        PL+G          R   +  + M+QYFIKVV   +  +    +   
Sbjct: 269 AARLKKHLGVPKDPLNGFNSHYAKYSGRRGPDITTYMFQYFIKVVSADFETLDHEHVSSH 328

Query: 211 ------QSNQFSVTEHFRSSE----QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
                  +       H +++E           PG+F   D+SP++V  TE+   F HFLT
Sbjct: 329 LYSYSSHTRNVGEAYHLKNTEGIETTHGYDAAPGLFINIDVSPMQVIHTEKRKPFAHFLT 388

Query: 261 NVCAIVGGVFTVSGIIDAFIYH 282
             CAI+GGV TV+ ++D+ +++
Sbjct: 389 TFCAIIGGVLTVASLVDSALFN 410


>gi|413949705|gb|AFW82354.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
           partial [Zea mays]
          Length = 202

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 69/95 (72%), Positives = 85/95 (89%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI K+RL+S GNVIE+R++GIG  K+++PLQ+HGGRL+  E YCG+CY
Sbjct: 91  DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ 96
           GAE SDE CCN+CEEVREAY+KKGWAL+NPDLIDQ
Sbjct: 151 GAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQ 185


>gi|68483709|ref|XP_714213.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
 gi|68483794|ref|XP_714172.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
 gi|46435713|gb|EAK95089.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
 gi|46435761|gb|EAK95136.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
 gi|238882494|gb|EEQ46132.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 414

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/326 (32%), Positives = 172/326 (52%), Gaps = 33/326 (10%)

Query: 1   MDISGEQHLDVKHDIFKK-RL--DSQGNVIESR-QDGIGAPKIDKPLQRHGGRL---EHN 53
           +D++G+  L++     KK RL  + QG+VI +  +D   A   D  L      L      
Sbjct: 89  LDVTGDLSLNIIDSGLKKIRLLKNKQGDVIVNEIEDDEPAFNNDIELSDLAKGLPEGSDE 148

Query: 54  ETYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE-- 109
             YCGSCYGA   D+   CCN+C  VR AY +K W+  + + I+QC++EG++ R++E   
Sbjct: 149 NAYCGSCYGALPQDKKQFCCNDCNTVRRAYAEKHWSFYDGENIEQCEKEGYVGRLRERIN 208

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
             EGC I G  ++N+V+G   FAPG SF + G H HD+  + +  D FN  H IN L+FG
Sbjct: 209 NNEGCRIKGTTKINRVSGTMDFAPGASFTREGRHFHDLSLYTKYPDKFNFDHIINHLSFG 268

Query: 168 E--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-GHTIQSNQFSVT 218
           E             ++PLD  ++     + +  Y++KVV T +  +   + I +NQFSV 
Sbjct: 269 EMPVDGQADELFDSIHPLDDHQFMLHKKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVI 328

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R    G+ +           +PGV F +D+SP+K+   +++  ++  F+  V + + 
Sbjct: 329 THDRPLVGGKDEDHQHTLHARGGIPGVNFNFDISPLKIINRQQYAKTWSGFVLGVISSIA 388

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEI 293
           GV  V  ++D  ++  Q+AIK K +I
Sbjct: 389 GVLMVGTLLDRSVFAAQQAIKGKKDI 414


>gi|367017984|ref|XP_003683490.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
 gi|359751154|emb|CCE94279.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
          Length = 406

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 162/323 (50%), Gaps = 41/323 (12%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD +GE  LD+    F K R+DS G  I +                    +  +E YCGS
Sbjct: 89  MDNAGELQLDIMEAGFTKTRIDSNGKEISTSSFDASD--------SSSDYVPDDENYCGS 140

Query: 60  CYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYGA+  D++         CC  C++VR+AY +  WA  +   I+QC+REG+++RI ++ 
Sbjct: 141 CYGAKDQDKNDELPKEERVCCQTCDDVRKAYLEAEWAFYDGKNIEQCEREGYVERINQQL 200

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEH 169
            EGC + G   ++++ G  HFAPG+ F  +  H HD+  +      N +H I+ L+FG+ 
Sbjct: 201 NEGCRVQGNALLSRIQGTIHFAPGRGFQNNRGHFHDMSLYDNTPQLNFNHIIHHLSFGKP 260

Query: 170 F---------PGVVNPLDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVT 218
                         +PLDG +   +  + ++Q  YF K+VPT Y  +    +++ QFS T
Sbjct: 261 INSGAEDRGAATSTHPLDGRQVFPDRDTHLHQFSYFAKIVPTRYEYLDDVVVETAQFSTT 320

Query: 219 EHFRSSEQG----RLQTL------PGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVG 267
            H R    G       TL      PG+F ++++SP+KV   E+H  ++  FL N    +G
Sbjct: 321 YHDRPLRGGVDDDHPNTLHSRGGSPGMFVYFEMSPLKVINKEQHAQTWSGFLLNCITSIG 380

Query: 268 GVFTVSGIIDAFIYHGQRAIKKK 290
           GV  V  ++D  +Y  Q++I  K
Sbjct: 381 GVLAVGTVLDKVLYKAQKSIWGK 403


>gi|388493200|gb|AFK34666.1| unknown [Medicago truncatula]
          Length = 106

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 75/107 (70%), Positives = 92/107 (85%), Gaps = 2/107 (1%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
           M QYFIKVVPTVYTD+ G  I SNQ+SVTEHF+SSE G    +PGVFFFYD+SPIKV F 
Sbjct: 1   MCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELG--AAVPGVFFFYDISPIKVNFK 58

Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           EEH+ FLHFLTN+CAI+GG+FT++GI+D+ IY+GQ+ IKKK+EIGK+
Sbjct: 59  EEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIGKY 105


>gi|449479952|ref|XP_004155757.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 266

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 81/192 (42%), Positives = 120/192 (62%), Gaps = 7/192 (3%)

Query: 100 EGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           E  ++++K+  EE +GC +YG L+V +VAGNFH     S H   + V  ++       N+
Sbjct: 74  ENLVKKVKQALEEAQGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFGGSKHVNV 129

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           SH I+ L+FG  +PG+ NPLDG        SG ++Y+IK+VPT Y  +S   + +NQFSV
Sbjct: 130 SHMIHDLSFGPKYPGIHNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSV 189

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           TE+F S      ++ P V+F YDLSPI VT  EE  SFLHF+T +CA++GG F V+G++D
Sbjct: 190 TEYF-SPMTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLD 248

Query: 278 AFIYHGQRAIKK 289
            +++    A+ K
Sbjct: 249 RWMFRFLEALTK 260


>gi|118482697|gb|ABK93267.1| unknown [Populus trichocarpa]
          Length = 366

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 83/192 (43%), Positives = 120/192 (62%), Gaps = 7/192 (3%)

Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           E  ++++K+    GEGC +YG L+V +VAGNFH     S H   + V  ++       N+
Sbjct: 172 ETMIKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNV 227

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           SH I+ L+FG  +PG+ NPLDG        SG+++Y+IK+VPT Y  +S   + +NQFSV
Sbjct: 228 SHIIHDLSFGPKYPGIHNPLDGTARILRETSGIFKYYIKIVPTEYRYISKDVLPTNQFSV 287

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           TE+F S      +T P V+F YDLSPI VT  EE  SFLHF+T +CAI+GG F ++G++D
Sbjct: 288 TEYF-SPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAILGGTFALTGMLD 346

Query: 278 AFIYHGQRAIKK 289
            ++Y    A+ K
Sbjct: 347 RWMYRLLEALTK 358


>gi|449445069|ref|XP_004140296.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 388

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/193 (43%), Positives = 125/193 (64%), Gaps = 9/193 (4%)

Query: 100 EGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           E  ++++K+  EE +GC +YG L+V +VAGNFH     S H   + V  ++       N+
Sbjct: 196 ENLVKKVKQALEEAQGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFGGSKHVNV 251

Query: 158 SHKINKLAFGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 216
           SH I+ L+FG  +PG+ NPLDG VR  ++T SG ++Y+IK+VPT Y  +S   + +NQFS
Sbjct: 252 SHMIHDLSFGPKYPGIHNPLDGTVRILRDT-SGTFKYYIKIVPTEYKYISKAVLPTNQFS 310

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           VTE+F S      ++ P V+F YDLSPI VT  EE  SFLHF+T +CA++GG F V+G++
Sbjct: 311 VTEYF-SPMTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGML 369

Query: 277 DAFIYHGQRAIKK 289
           D +++    A+ K
Sbjct: 370 DRWMFRFLEALTK 382


>gi|302414546|ref|XP_003005105.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
 gi|261356174|gb|EEY18602.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
          Length = 349

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 108/303 (35%), Positives = 143/303 (47%), Gaps = 62/303 (20%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKID-KPLQRHGGRLEHNET---Y 56
           MD+SGEQ   +   I K RL SQ       +DG G   ID K L  H            Y
Sbjct: 89  MDVSGEQQHGIVSGISKVRLRSQ-------KDGGGV--IDTKALSLHAADEAATHLAPDY 139

Query: 57  CGSCYGAESS----DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA++      + CCN CEEVREAY +  WA    + ++QC RE + +R+ E+  E
Sbjct: 140 CGDCYGAKAPANAVKQGCCNTCEEVREAYAQASWAFGKGENVEQCTREHYAERLDEQRAE 199

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF--NISHKINKLAFGEHF 170
           GC I G L VNKV GNFH APG+SF    +HVHD+  +       + +H+I+ L F    
Sbjct: 200 GCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWDAEIIHDFTHQIHALRF---- 255

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
                                         V +D     +     S   H       RL 
Sbjct: 256 ------------------------------VLSDEPQAQLSGGDDSAEGHAE-----RLH 280

Query: 231 T---LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           T   +PGVFF YD+SP+KV   EE   SF  FLT +CA++GG  TV+  +D  ++ G   
Sbjct: 281 TRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAAAVDRGMFEGSLR 340

Query: 287 IKK 289
           +KK
Sbjct: 341 LKK 343


>gi|448531492|ref|XP_003870264.1| Erv46 protein [Candida orthopsilosis Co 90-125]
 gi|380354618|emb|CCG24134.1| Erv46 protein [Candida orthopsilosis]
          Length = 411

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 104/322 (32%), Positives = 165/322 (51%), Gaps = 31/322 (9%)

Query: 2   DISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDK--PLQRHGGRLEHNETY-- 56
           D SG+  LD+ +   +K R+   G+  +  +     P + +  PL++    L   +T   
Sbjct: 91  DESGDLKLDIINSQLEKFRIIKSGHSSKPTEIKDDQPPLQREMPLEQIAPGLPDGQTEGE 150

Query: 57  CGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGE 112
           CGSCYGA   D+   CCN+C  VR AY +  W   + + I QC+ EG++QR+++   + E
Sbjct: 151 CGSCYGAVPQDKKQYCCNSCAAVRRAYAEANWQFYDGENIAQCEEEGYVQRLRQRINDNE 210

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ--RDSFNISHKINKLAFGEHF 170
           GC + G  ++N+VAG   FAPG S  +   HVHD+  +   +D FN  H IN L+FG + 
Sbjct: 211 GCRVKGTTKINRVAGTMDFAPGASMTKER-HVHDLSLYMKYKDKFNFDHVINHLSFGNNP 269

Query: 171 P-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-TIQSNQFSVTEHFR 222
           P       G ++PLDG ++ Q        YF+K+V T +  + G     +NQFS   H R
Sbjct: 270 PDSQLVDTGSISPLDGHKFLQHKKLHSINYFLKIVATRFESLEGKDKFDTNQFSAITHDR 329

Query: 223 SSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEHVSFLH-FLTNVCAIVGGVFT 271
               G+             +PGV F +D+SP+K+   EE+      F+  V + + GV  
Sbjct: 330 PLAGGKDDDHQHTLHARAGVPGVAFNFDISPLKIINREEYAKTRSGFILGVVSSIAGVLM 389

Query: 272 VSGIIDAFIYHGQRAIKKKIEI 293
           V  ++D  ++  Q+AIK K ++
Sbjct: 390 VGSLMDRSVFAAQQAIKGKKDL 411


>gi|294657513|ref|XP_459821.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
 gi|199432751|emb|CAG88060.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
          Length = 402

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 102/317 (32%), Positives = 166/317 (52%), Gaps = 27/317 (8%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           +DISG+  LD+    F+K R+  + N      D       D  L+     +  N   CG 
Sbjct: 89  LDISGDLQLDILKSGFQKYRILKESN--HEILDEAPVLSNDLSLEEMAKGVGANGK-CGP 145

Query: 60  CYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEGCN 115
           CYGA  + ++E CCN+CE V+ AY +K WA  +   I+QC+ EG++ R+ E     EGC 
Sbjct: 146 CYGALPQDNNEYCCNSCETVKLAYAEKMWAFYDGKDIEQCENEGYVSRLTERINNNEGCR 205

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGE----- 168
           + G  ++N+++GN HFAPG S    G H+HD+  F++  D FN  H IN  +FG      
Sbjct: 206 VKGTAQINRISGNLHFAPGSSSTAPGRHIHDLSLFEKYEDKFNFDHVINHFSFGSDPHDN 265

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQG 227
           +     +PLD  +   +    +  Y++KVV T +  + +   + +NQFSV  H R    G
Sbjct: 266 NLQQSTHPLDNHQLVFDEKYHVASYYLKVVATRFEFIDTSLPLDTNQFSVISHHRPLRGG 325

Query: 228 RLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNVCAIVGGVFTVSGII 276
           + +           LPGVFF +++SP+K+   E++  ++  F+  V + V GV  V  ++
Sbjct: 326 KDEDHKHTLHARGGLPGVFFHFEISPMKIINKEQYAKTWSGFILGVISSVAGVLMVGTVL 385

Query: 277 DAFIYHGQRAIKKKIEI 293
           D  ++  ++AIK K ++
Sbjct: 386 DRSVWAAEKAIKGKKDM 402


>gi|255563175|ref|XP_002522591.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223538182|gb|EEF39792.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 191

 Score =  160 bits (405), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 119/191 (62%), Gaps = 9/191 (4%)

Query: 102 FLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
            ++++K+    GEGC +YG L+V +VAGNFH     S H   + V  ++       N+SH
Sbjct: 1   MIKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAIHVNVSH 56

Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
            I+ L+FG  FPG+ NPLDG        SG ++Y+IK+VPT Y  +S   + +NQFSVTE
Sbjct: 57  IIHDLSFGPKFPGLHNPLDGTARILHDASGTFKYYIKIVPTEYRYISKEVLPTNQFSVTE 116

Query: 220 HFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
           +F   SE  R  T P V+F YDLSPI VT  EE  SFLHF+T +CA++GG F ++G++D 
Sbjct: 117 YFSPMSEYDR--TWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLDR 174

Query: 279 FIYHGQRAIKK 289
           ++Y    A+ K
Sbjct: 175 WMYRLLEAVTK 185


>gi|303290895|ref|XP_003064734.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453760|gb|EEH51068.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 363

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 110/287 (38%), Positives = 157/287 (54%), Gaps = 32/287 (11%)

Query: 1   MDISGEQHLDVKHDIFKKR-LDSQGNVIES--RQDGIGAPK-IDKPLQRHGGRLEHNETY 56
           MD +GE   DV     KKR LDS G  +E   + +   A K I + ++ H   L  +E Y
Sbjct: 98  MDQAGEAFHDVHSGHLKKRRLDSDGKPLEGVFKHEKANAHKEIREDIESHALALSGDEEY 157

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREGFLQRIKEEEGEGCN 115
                  ++S+ED             ++G  + N   L+D+    G  +  K E  EGC 
Sbjct: 158 -------KTSEEDLM----------PEEGLTMFNLKQLLDKQFPGGIEKAFKNEAREGCE 200

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           + G+LEVN+V G+F  +PGKS      HV   L  Q    N+SH IN+ AFG+ FPG V+
Sbjct: 201 VIGYLEVNRVPGSFSVSPGKSIRLGMEHVQ--LNVQ-SRLNMSHTINRFAFGKSFPGFVS 257

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTL--- 232
           PLDG       P+ ++QYF+K+VPT +T + G  +QSNQ+SVTE   S+    L  +   
Sbjct: 258 PLDG-NARDLDPNYVHQYFLKIVPTSFTPLRGEYLQSNQYSVTE--ASAPAKALNVVGSK 314

Query: 233 -PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
             GV+F YDLSP++V + E   S   F+T+VCAIVGGV ++SG++ A
Sbjct: 315 PSGVYFNYDLSPLRVDYVESRNSMTEFITSVCAIVGGVASMSGLVQA 361


>gi|357112836|ref|XP_003558212.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3-like [Brachypodium distachyon]
          Length = 349

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 99/281 (35%), Positives = 146/281 (51%), Gaps = 36/281 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RLD  G +I +                     E+        
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGTIIGT---------------------EYLSDLVEKE 127

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +GA   D    ++ EE      KK     N D     K      R   E GEGC +YG L
Sbjct: 128 HGAHHHDNGHEHHDEE------KKPEHTFNEDADKMVKS----VRQALENGEGCRVYGML 177

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   ++V + +       N+SH I++L+FG  +PG+ NPLD  
Sbjct: 178 DVQRVAGNFHI----SVHGLNIYVAEKIFEGSSHVNVSHVIHELSFGPKYPGIHNPLDDT 233

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IKVVPT Y  +S   + +NQFSVTE+F        ++ P V+F YD
Sbjct: 234 TRILHDASGTFKYYIKVVPTEYRYLSKQVLPTNQFSVTEYFVPIRPAD-RSWPAVYFLYD 292

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           LSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 293 LSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 333


>gi|242059085|ref|XP_002458688.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
 gi|241930663|gb|EES03808.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
          Length = 350

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 99/284 (34%), Positives = 147/284 (51%), Gaps = 41/284 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RLD  G++I +        K       H    EH++      
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGHIIGTEYLSDLVEKGHGAHHDHDHGQEHHD------ 142

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
              +   E   N  EE  +  +    AL N                    GEGC +YG L
Sbjct: 143 --EQKKPEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGML 178

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   + V + +       N+SH I++L+FG  +PG+ NPLD  
Sbjct: 179 DVQRVAGNFHI----SVHGLNIFVAEKIFEGSSHVNVSHVIHELSFGPKYPGIHNPLDET 234

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVFF 237
                  SG ++Y+IKVVPT Y  +S   + +NQFSVTE+F   R S++      P V+F
Sbjct: 235 SRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPSDRA----WPAVYF 290

Query: 238 FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
            YDLSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 291 LYDLSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 334


>gi|11036454|dbj|BAB17274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 333

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 101/281 (35%), Positives = 152/281 (54%), Gaps = 43/281 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVI--ESRQDGI--GAPKIDKPLQRHGGRLEH-NET 55
           +D+SG+  +D+  +I+K RL+S G++I  E   D +  G      P  +H G+ EH NET
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNSHGHIIGTEYISDLVEKGHEHGHSP-HKHDGKEEHKNET 147

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--EGEG 113
                                  EA    G+        DQ   E  ++++K+   +GEG
Sbjct: 148 ET---------------------EALNILGF--------DQAA-ETMIKKVKQALADGEG 177

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGV 173
           C +YG L+V +VAGNFH     S H   ++V  ++     + N+SH I+ L+FG  +PG+
Sbjct: 178 CRVYGVLDVQRVAGNFHI----SVHGLNIYVAQMIFGGSKNVNVSHMIHDLSFGPKYPGI 233

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
            NPLD         SG ++Y+IK+VPT Y  +S   + +NQ+SVTE+F    +   +T P
Sbjct: 234 HNPLDDTNRILHDTSGTFKYYIKIVPTEYRYLSKDVLSTNQYSVTEYFTPMTEFD-RTWP 292

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            V+F YDLSPI VT  EE  SFLH +T +CA++GG F ++G
Sbjct: 293 AVYFLYDLSPITVTIKEERRSFLHLITRLCAVLGGTFALTG 333


>gi|224086657|ref|XP_002307923.1| predicted protein [Populus trichocarpa]
 gi|222853899|gb|EEE91446.1| predicted protein [Populus trichocarpa]
          Length = 351

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 81/192 (42%), Positives = 118/192 (61%), Gaps = 7/192 (3%)

Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           E  ++++K+    GEGC +YG L+V +VAGNFH     S H   + V  ++       N+
Sbjct: 157 ETMVKKVKQALANGEGCRVYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAKHVNV 212

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           SH I+ L+FG  +PG+ NPLDG        SG ++Y+IK+VPT Y  +S   + +NQFSV
Sbjct: 213 SHIIHDLSFGPKYPGIHNPLDGTTRILHETSGTFKYYIKIVPTEYRYISKEVLPTNQFSV 272

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           TE+F S      +T P V+F YDLSPI VT  EE  SFLHF+T +CA++GG F ++G++D
Sbjct: 273 TEYF-SPMTDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLD 331

Query: 278 AFIYHGQRAIKK 289
            ++     A+ K
Sbjct: 332 RWMCRLLEALTK 343


>gi|50305633|ref|XP_452777.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49641910|emb|CAH01628.1| KLLA0C12947p [Kluyveromyces lactis]
          Length = 405

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 102/323 (31%), Positives = 154/323 (47%), Gaps = 41/323 (12%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           +D SGE  +++    F K R+  +G  +   +  +G     +     G        YCG 
Sbjct: 88  LDDSGEFQINLLDSGFTKIRISPEGKELSKEKFQVGDKSSKQSFNEEG--------YCGP 139

Query: 60  CYGA-ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE 110
           CYGA + S  D        CC  C++VR AY +KGWA  +   ++QC+REG+++ I    
Sbjct: 140 CYGALDQSKNDELPQDQKVCCQTCDDVRAAYGQKGWAFKDGKGVEQCEREGYVESINARI 199

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISHKINKLAFGEH 169
            EGC + G  ++N++ G  HF PG S      H HD   +      N +H IN L FGE 
Sbjct: 200 HEGCRVQGRAQLNRIQGTIHFGPGSSMRNIRGHFHDTSLYDAYPHLNFNHIINTLTFGEK 259

Query: 170 F---------PGVVNPLDG--VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
                        ++PLD   V   ++T    + YF K++PT +  + G  +++ QFS T
Sbjct: 260 PKDGDSELIGSASISPLDSRQVFPDRDTHFHEFSYFCKIIPTRFEFLDGKKVETTQFSAT 319

Query: 219 EHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVG 267
            H R    GR +           +PGVFF +++SP+KV   E+H  S+  FL N    +G
Sbjct: 320 YHDRPLRGGRDEDHPNTVHSKGGVPGVFFNFEMSPLKVINKEQHATSWSGFLLNCITSIG 379

Query: 268 GVFTVSGIIDAFIYHGQRAIKKK 290
           GV  V  +ID   Y  Q++I  K
Sbjct: 380 GVLAVGTVIDKITYRAQKSIWGK 402


>gi|225446891|ref|XP_002284045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|296086333|emb|CBI31774.3| unnamed protein product [Vitis vinifera]
          Length = 351

 Score =  157 bits (396), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 97/294 (32%), Positives = 152/294 (51%), Gaps = 44/294 (14%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY---- 56
           +D+SG+  +D+  +I+K RL+  G +I       G   +   +++     +H+       
Sbjct: 89  IDMSGKHEVDLDTNIWKLRLNRDGFII-------GTEYLSDLVEKEHADHKHDHNKDHHG 141

Query: 57  -CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
                  A S D+D  N  ++V++A       L+N                    GEGC 
Sbjct: 142 DSDQKLHAHSFDQDAENMVKKVKQA-------LAN--------------------GEGCR 174

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           +YG L+V +VAGNFH     S H   + V  ++       N+SH I+ L+FG  +PG+ N
Sbjct: 175 VYGVLDVQRVAGNFHI----SVHGLNIFVAQMIFDGAIHVNVSHIIHDLSFGPKYPGLHN 230

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           PLDG        SG ++Y+IK+VPT Y  +S   + +NQFSV E+F    +   +T P V
Sbjct: 231 PLDGTVRILRGASGTFKYYIKIVPTEYRYISKEVLPTNQFSVMEYFSPMNEFD-RTWPAV 289

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +F YDLSP+ VT  EE  SFLHF+T +CA++GG F ++G++D ++Y     + K
Sbjct: 290 YFLYDLSPVTVTIKEERRSFLHFITRLCAVLGGTFALTGMLDRWMYRFLEMLTK 343


>gi|212275606|ref|NP_001131002.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
 gi|194690678|gb|ACF79423.1| unknown [Zea mays]
 gi|413952089|gb|AFW84738.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 293

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 152/285 (53%), Gaps = 43/285 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR-HGGRLEHNETYCGS 59
           +D+SG+  +D+  +I+K RLD  G++I       G   +   +++ HG   +H+  +   
Sbjct: 32  IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLSDLVEKGHGAHHDHDHDHDHH 84

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
               +   E   N  EE  +  +    AL N                    GEGC +YG 
Sbjct: 85  D--EQKKHEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGM 120

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           L+V +VAGNFH     S H   + V + +    +  N+SH I++L+FG  +PG+ NPLD 
Sbjct: 121 LDVQRVAGNFHI----SVHGLNIFVAEKIFEGSNHVNVSHVIHELSFGPKYPGIHNPLDE 176

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVF 236
                   SG ++Y+IKVVPT Y  +S   + +NQFSVTE+F   R +++      P V+
Sbjct: 177 TSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRA----WPAVY 232

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           F YDLSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 233 FLYDLSPITVTIKEERRNFLHFVTRLCAVLGGTFAMTGMLDRWMY 277


>gi|194708090|gb|ACF88129.1| unknown [Zea mays]
 gi|195607866|gb|ACG25763.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|195619788|gb|ACG31724.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|413952088|gb|AFW84737.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 350

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 98/285 (34%), Positives = 152/285 (53%), Gaps = 43/285 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQR-HGGRLEHNETYCGS 59
           +D+SG+  +D+  +I+K RLD  G++I       G   +   +++ HG   +H+  +   
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGHII-------GTEYLSDLVEKGHGAHHDHDHDHDHH 141

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
               +   E   N  EE  +  +    AL N                    GEGC +YG 
Sbjct: 142 D--EQKKHEQTFN--EEAEKMIKSVKQALGN--------------------GEGCRVYGM 177

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           L+V +VAGNFH     S H   + V + +    +  N+SH I++L+FG  +PG+ NPLD 
Sbjct: 178 LDVQRVAGNFHI----SVHGLNIFVAEKIFEGSNHVNVSHVIHELSFGPKYPGIHNPLDE 233

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGRLQTLPGVF 236
                   SG ++Y+IKVVPT Y  +S   + +NQFSVTE+F   R +++      P V+
Sbjct: 234 TSRILHDTSGTFKYYIKVVPTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRA----WPAVY 289

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           F YDLSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 290 FLYDLSPITVTIKEERRNFLHFVTRLCAVLGGTFAMTGMLDRWMY 334


>gi|367004394|ref|XP_003686930.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
 gi|357525232|emb|CCE64496.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
          Length = 439

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 102/343 (29%), Positives = 167/343 (48%), Gaps = 63/343 (18%)

Query: 4   SGEQHLDV-----KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           SG   LD+       +  K RL+++G VI   +      KI   L  +    E  E YCG
Sbjct: 95  SGNVQLDIDLEEASSNFVKTRLNNRGEVIGKAK----KFKITDDLGEYAP--EDKENYCG 148

Query: 59  SCYGAES----------SDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE 108
           SCYG++           +D+ CCN+CE+VR+AY + GWA  +   I+QC+REG+++ I E
Sbjct: 149 SCYGSKDQTKNEDIEKITDKVCCNSCEDVRQAYSEAGWAFFDGKNIEQCEREGYVKTINE 208

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFG 167
              EGC + G   +NK+ GN HFAPGK+F     H HD   F Q  + N  H IN L+FG
Sbjct: 209 RLSEGCRVKGEALLNKIHGNLHFAPGKAFQNRRGHFHDTSLFNQHKNLNFQHVINHLSFG 268

Query: 168 EHFPGVVN----------------PLDGVRWTQETPSG--------------MYQYFIKV 197
           +    +V                 P+DG +   +  +G               + Y+ ++
Sbjct: 269 KPIRQLVTSNFQDTMSDSLRAQTAPIDGHQAFIQDNTGDSDSASTTIAAHDYQFIYYAEI 328

Query: 198 VPTVYTDVSGHTIQSNQFSVTEHFRS----SEQGRLQTL------PGVFFFYDLSPIKVT 247
           + T +  + G   +++Q +VT H++     + Q  +Q +      PG++  +++SP+KV 
Sbjct: 329 ISTRFEYLKGDLEETSQLTVTSHYKKIGYQNGQDYMQGMQSRSGIPGLYIDFEVSPLKVI 388

Query: 248 FTEEH-VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
             E++  S+  +L      +GG+  V  +ID  +Y  Q A+K+
Sbjct: 389 NKEQYSTSWSGYLLKTITSIGGILAVGTVIDKVVYATQTALKQ 431


>gi|260950825|ref|XP_002619709.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
 gi|238847281|gb|EEQ36745.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
          Length = 415

 Score =  154 bits (388), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 99/331 (29%), Positives = 168/331 (50%), Gaps = 42/331 (12%)

Query: 1   MDISGEQHLDVKHDIFKK-RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY--- 56
           +D++G+ HLD+    F+  R+   G   E   D +      K  +   G L  +E     
Sbjct: 89  LDMTGDLHLDIVESGFEMFRVLPSG---EEISDDLPLLSGAKKFEDVCGPLTEDEISRGV 145

Query: 57  -CGSCYGA--ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRI--KEEEG 111
            CG CYGA  ++ ++ CCN CE VR AY  + W   +   I+QC+REG+++++  +    
Sbjct: 146 PCGPCYGAVDQTDNKRCCNTCEAVRMAYAVQEWGFFDGSNIEQCEREGYVEKMVSRINNN 205

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFGEH 169
           EGC I G  ++N+++GN HFAPG    ++G H HD+  + +  + F+I HKIN  +FGE 
Sbjct: 206 EGCRIKGSAKINRISGNLHFAPGVPLSRNGRHSHDLSLWTKYSNKFSIDHKINHFSFGED 265

Query: 170 FPGV--------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG--HTIQSN 213
            P                ++PLDG  +  +  + +  Y++ VV T +  + G    + +N
Sbjct: 266 -PSASRRLASTDDSQEPSIHPLDGFHFDLKKKNHVASYYLSVVSTRFEFLDGKKEAVDTN 324

Query: 214 QFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHFLTNV 262
           QFSV  H R    GR             +PG FF +D+SP+K+   EE+  ++  F+  V
Sbjct: 325 QFSVITHDRPIVGGRDDDHQNTMHAQGGVPGAFFHFDISPMKIISREEYAKTWSGFILGV 384

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
            + + GV TV   +D  ++  ++ ++ K ++
Sbjct: 385 VSSIAGVLTVGAALDRSVWTAEQVLRGKKDM 415


>gi|410083920|ref|XP_003959537.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
 gi|372466129|emb|CCF60402.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
          Length = 417

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 103/332 (31%), Positives = 160/332 (48%), Gaps = 53/332 (15%)

Query: 4   SGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
           +G+  LD+ +  + K R+DS G  + +    IG   + K         +  + YCGSCYG
Sbjct: 91  AGDLQLDLLESGLTKTRVDSNGVSLTTESFNIGNEALIKR--------DFPQDYCGSCYG 142

Query: 63  A---------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEG 113
           A          ++++ CC  CE+V +AY   GWA  +   I+QC+ EG++ RI E   EG
Sbjct: 143 ALDQGKNDELNANEKVCCQTCEDVHDAYLNIGWAFYDGKNIEQCETEGYVDRINEHLNEG 202

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQS------GVHVHDILAFQRD-SFNISHKINKLAF 166
           C + G   +N+V GN HFAPGKS+           H HD   + +  S + +H I+  +F
Sbjct: 203 CRVQGSARLNRVQGNIHFAPGKSYQDYSRRNSFATHFHDTSLYDKTHSLSFNHIIHHFSF 262

Query: 167 GE---------HFPGV----VNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT-- 209
           G+         H  G+     NPLDG +   +  S    Y YF ++VPT Y  ++  +  
Sbjct: 263 GKPIENSYVNNHNEGLSKISTNPLDGRKVFPDRDSHFIQYSYFAEIVPTRYEYLNNKSDP 322

Query: 210 IQSNQFSVTEHFRSSEQGRLQT----------LPGVFFFYDLSPIKVTFTEEHV-SFLHF 258
           +++ QFS T H R    GR +           +PG+F +++ SP+KV   E++  ++  F
Sbjct: 323 VETTQFSATFHSRPLRGGRDEDHPTTLHQRGGIPGLFIYFETSPLKVINKEQYSQAWSTF 382

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           L N    +GG+  V    D   Y  QR I  K
Sbjct: 383 LLNCITTIGGILAVGTSFDKITYKAQRTIWGK 414


>gi|221114903|ref|XP_002155889.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Hydra magnipapillata]
          Length = 399

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 78/173 (45%), Positives = 106/173 (61%), Gaps = 2/173 (1%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +E +GC IYG +EVNKVAGNFH   GKS      H H        ++N SH+I+ L+FGE
Sbjct: 174 KEFDGCRIYGNIEVNKVAGNFHITAGKSIPHPRGHAHLSALVSELNYNFSHRIDMLSFGE 233

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
             PG++NPLDG      TP  MYQY+I +VPT    +  +TI++NQ+SVT+  R  +   
Sbjct: 234 PHPGIINPLDGDLMITTTPYHMYQYYIAIVPTTIQTLK-NTIKTNQYSVTQRSRQLNLNS 292

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             Q +PG+FF YD + I V+  EE  SF  FL  +C I+GGVF  SG++ + I
Sbjct: 293 GSQGVPGIFFKYDFNAISVSVNEERRSFNEFLIRLCGIIGGVFATSGMLHSAI 345


>gi|397564627|gb|EJK44287.1| hypothetical protein THAOC_37187 [Thalassiosira oceanica]
          Length = 506

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 163/356 (45%), Gaps = 75/356 (21%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D++G+  L+V   +FK+RLD  G    +        A  ++   +R          YCG
Sbjct: 142 IDVAGDSQLEVSDKMFKQRLDLDGTPRPLAKISAEANAKALEDKKRREVVEKSVGPDYCG 201

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSN-PDLIDQCKREG---FLQRIKEEEGEGC 114
            CYGA+ + +DCCN C++V E Y+KK W  +    L +QC REG     +  +   GEGC
Sbjct: 202 PCYGAQENAQDCCNTCDDVIERYKKKRWNDNAVQPLAEQCIREGRAGVSEPKRMAGGEGC 261

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF-------- 166
           N+ G   VN+VAGNFH A G+   + G H+H  L   R +F  +H I++L+F        
Sbjct: 262 NLSGHFTVNRVAGNFHIAMGEGVERDGRHIHQFLPEDRVNFIANHVIHELSFLDDEYGDI 321

Query: 167 -GEHFPGVVNP--LDGVRWTQET---------PSGMYQYFIKVVPTVY------------ 202
            GE F  +++   ++G R    +          +G++QYFIKVVPT Y            
Sbjct: 322 EGEGFLNLMSKAGVNGERSMNGSVKTVTEETGTTGLFQYFIKVVPTKYKGDIIDDMGVST 381

Query: 203 -TDVSGHTIQSNQFSVTEHFRS------------------------SEQGRLQ------- 230
            +D     +++N++  TE FR                         S+ G  Q       
Sbjct: 382 LSDGQEKQLETNRYFYTERFRPLIGDIDEEALLAGDVEKGTAGAHVSKAGGTQHQQAEHH 441

Query: 231 -----TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
                 LPGVFF Y++ P  V  +   V F+H    + A VGGVFT+   ID  ++
Sbjct: 442 AATNAVLPGVFFVYEIYPFMVEVSRNRVPFMHLWIRIMATVGGVFTMMSWIDGALH 497


>gi|326490247|dbj|BAJ84787.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493774|dbj|BAJ85349.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 348

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 93/281 (33%), Positives = 146/281 (51%), Gaps = 37/281 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RLD  G +       IG   +   +++       ++   G  
Sbjct: 89  IDMSGKHEVDLHTNIWKLRLDKYGQI-------IGTEYLSDLVEK---EHGTHDHDHGHG 138

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +  +   E   N  E+  +  +    A+ N                    GEGC +YG L
Sbjct: 139 HDVQKQPEHTFN--EDADKMVKSVKLAMEN--------------------GEGCRVYGAL 176

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           +V +VAGNFH     S H   + V + +       N+SH I++L+FG  +PG+ NPLD  
Sbjct: 177 DVQRVAGNFHI----SVHGLNIFVANQIFDGSSHVNVSHVIHRLSFGPEYPGIHNPLDDT 232

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
                  SG ++Y+IKVVPT Y  +S   + +NQFSVTE+F        ++ P V+F YD
Sbjct: 233 SRILHDTSGTFKYYIKVVPTEYRYLSKGVLPTNQFSVTEYFVPIRPTD-RSWPAVYFLYD 291

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           LSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 292 LSPITVTIREERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 332


>gi|67623967|ref|XP_668266.1| serologically defined breast cancer antigen 84 [Cryptosporidium
           hominis TU502]
 gi|54659454|gb|EAL38030.1| serologically defined breast cancer antigen 84 [Cryptosporidium
           hominis]
          Length = 397

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 97/303 (32%), Positives = 159/303 (52%), Gaps = 37/303 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDK-------PLQRHGGR---L 50
           +D +    LD+  DI   RL  +   ++S  D +G  ++D        P+  +G     +
Sbjct: 87  VDNTINNKLDIMLDITFPRLRCEEISVDS-VDYVGENQVDSKEYMAKIPIDLNGQEVRNI 145

Query: 51  EHNE-----TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID---QCKREGF 102
           ++N+       C SCYGAE+++  CCN+C+ ++ AYR KGW  S  D++    QC     
Sbjct: 146 KYNQQNDLKIECMSCYGAETNEFLCCNDCDSLKTAYRSKGW--SYLDIVSKAPQCI---- 199

Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKI 161
                  E  GC I G ++VNKV+GN H A G +  ++G HVH+  +      FN SH I
Sbjct: 200 -------EKVGCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVSRGFNTSHII 252

Query: 162 NKLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTE 219
           ++L FG +  P + +PL+ ++      + M+ Y++K++PT Y   +G   +  NQ++ TE
Sbjct: 253 HELRFGSDRIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVNLYGNQYAFTE 312

Query: 220 HFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
             R    + G L  LPGVF  YD  P  +    + V   H +T+ CAIVGG++++  ++D
Sbjct: 313 RERDVHVQNGELSGLPGVFIVYDFQPFLLQKIYKRVPISHLITSFCAIVGGIYSIMSLLD 372

Query: 278 AFI 280
            F+
Sbjct: 373 TFV 375


>gi|123389547|ref|XP_001299739.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121880652|gb|EAX86809.1| hypothetical protein TVAG_100310 [Trichomonas vaginalis G3]
          Length = 351

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/297 (33%), Positives = 144/297 (48%), Gaps = 34/297 (11%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           D+ G  +   +  ++K R+D  GN I   Q                         CG CY
Sbjct: 86  DMMGSGNRPDQKTLYKVRVDQNGNPIPQTQIA---------------------EDCGPCY 124

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLE 121
           GAESS   CC  CE+V  AY++KGW + N     QC+ EG +   KE     C  YG L 
Sbjct: 125 GAESSQRKCCQTCEDVVAAYQEKGWGIGNLSSWAQCRAEGVMFDGKER----CQAYGNLH 180

Query: 122 VNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVR 181
           VN + G FH APG +      HVHD      D+ N++H+I  ++FG   P   +PLD  R
Sbjct: 181 VNAIEGGFHLAPGINVFSRFGHVHDFSPLV-DTLNLTHEIEHISFGA--PIDKSPLDNTR 237

Query: 182 WTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVT-EHFRSSEQGRLQTLPGVFFFY 239
             Q+ P  + Y+Y +K VPTV  +V+G   +  +F+V       + +GR    PG+FF Y
Sbjct: 238 VVQKKPGQIHYRYNLKAVPTV-KEVNGKVHRFFRFTVNYAEIPVTARGRYG--PGIFFVY 294

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
             +P+ +T T +  +    L  + +I GG F ++ +ID+F Y     I+ K  I KF
Sbjct: 295 SFAPVAITSTYDRPNITVLLARLISIFGGSFMLARLIDSFTYR-LNTIEGKDRINKF 350


>gi|66363024|ref|XP_628478.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
           and possible N region transmembrane [Cryptosporidium
           parvum Iowa II]
 gi|46229502|gb|EAK90320.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
           and possible N region transmembrane [Cryptosporidium
           parvum Iowa II]
          Length = 397

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 93/288 (32%), Positives = 151/288 (52%), Gaps = 36/288 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  GE  +D K  + K  +D  G  + +           K  Q++  ++E     C SC
Sbjct: 116 VDYVGENQVDSKEYMVKIPIDLNGQEVRNI----------KYNQQNDLKIE-----CMSC 160

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLID---QCKREGFLQRIKEEEGEGCNIY 117
           YGAE+++  CCN+C+ ++ AYR KGW  S  D++    QC            E  GC I 
Sbjct: 161 YGAETNEFLCCNDCDSLKTAYRSKGW--SYLDIVSKAPQCI-----------EKVGCRIN 207

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKINKLAFG-EHFPGVVN 175
           G ++VNKV+GN H A G +  ++G HVH+  +      FN SH I++L FG +  P + +
Sbjct: 208 GRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVSRGFNTSHIIHELRFGSDKIPFLFS 267

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRSS--EQGRLQTL 232
           PL+ ++      + M+ Y++K++PT Y   +G   +  NQ++ TE  R    + G L  L
Sbjct: 268 PLENIQKFVHKGTKMFHYYVKLIPTQYFSGNGEVNLYGNQYAFTERERDVHVQNGELSGL 327

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           PG+F  YD  P  +    + V   H +T+ CAIVGG++++  ++D F+
Sbjct: 328 PGIFIVYDFQPFLLQKIYKRVPISHLITSFCAIVGGIYSIMSLLDTFV 375


>gi|209876426|ref|XP_002139655.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555261|gb|EEA05306.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 395

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 98/304 (32%), Positives = 160/304 (52%), Gaps = 36/304 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRL-------DSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-- 51
           +D +  Q LD++ DI    L       D+  NV E++ +  G   +  P+  HG  ++  
Sbjct: 87  VDDNMNQKLDIRLDISFPSLRCSEISVDTVDNVGENQVNAHGNL-LKIPIDIHGNEVQEE 145

Query: 52  ----HNETY---CGSCYGAESSDEDCCNNCEEVREAYRKKGWA-LSNPDLIDQCKREGFL 103
               +NE+    C SC+GAES    CCN CE ++ A+R KGW+ L       QC      
Sbjct: 146 IMAQYNESTSMKCLSCFGAESIHYKCCNTCESLKSAFRYKGWSYLDIASKAPQCINT--- 202

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI-LAFQRDSFNISHKIN 162
                    GC ++G L+VNKV+GN H A G++  + G HVH+  +      FN SH I+
Sbjct: 203 --------VGCRLHGSLQVNKVSGNIHVALGQATVRDGKHVHEFNMNDISRGFNTSHTIH 254

Query: 163 KLAFG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT--IQSNQFSVTE 219
           +L FG ++   + +PL+  +    T + M+ Y++K+VPT +   SG++  + SNQ++ TE
Sbjct: 255 ELRFGKDNIEFIGSPLENTKKIVTTGTSMFHYYLKLVPTQFIK-SGYSKVLFSNQYTYTE 313

Query: 220 HFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
             +    + G L  LPGVF  YD  P  +      +   HFLT+ CAI+GG++++  ++D
Sbjct: 314 RQKDVLVKDGELSGLPGVFIVYDFQPFVIRKIHNSIPTTHFLTSFCAIIGGIYSLMSLVD 373

Query: 278 AFIY 281
           + ++
Sbjct: 374 SILF 377


>gi|428171090|gb|EKX40010.1| hypothetical protein GUITHDRAFT_154283 [Guillardia theta CCMP2712]
          Length = 331

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 95/279 (34%), Positives = 137/279 (49%), Gaps = 65/279 (23%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE  LD+ HD++K+        ++S+ + +G P I                     
Sbjct: 97  MDVSGEHELDIVHDVYKR-------AMDSKGNALG-PVI--------------------- 127

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
                         E+V+ A      ALS   + +Q +R            EGCNIYG L
Sbjct: 128 -------------SEKVKLARD----ALSISHIKEQLERH-----------EGCNIYGTL 159

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
              KV+GNFH     S H    HV   +   R + N SH +N L+FG  +PG+ NPLDG 
Sbjct: 160 NAQKVSGNFHL----SLHAQDFHVLAQVFPDRATVNTSHIVNHLSFGRDYPGLKNPLDGE 215

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYD 240
               +  SG ++Y+IK+VPT +  + G  I +NQ+SVT+HFR  + G     P V+F YD
Sbjct: 216 MKVLDQGSGTFEYYIKIVPTKFHHLDGTIIDTNQYSVTDHFRKLQDG----FPAVYFIYD 271

Query: 241 LSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
           +SPI V   +   SF H+ T +CAI GG++ V+G + A 
Sbjct: 272 ISPIMVRVKQWKQSFSHYATQLCAITGGMYVVTGQLHAL 310


>gi|384253563|gb|EIE27037.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 327

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 79/181 (43%), Positives = 109/181 (60%), Gaps = 15/181 (8%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-----DSFNISHKINKLAF 166
           EGCNI+G+L++ +VAGNF  +         VHV D  A  R        N SH I++++F
Sbjct: 152 EGCNIFGWLDLQRVAGNFRVS---------VHVEDFFALTRLQADTTGINSSHIIHRVSF 202

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G  FPG VNPLDG     +  SG ++YF+KVVPT Y   +G    +NQ+SVTE+     +
Sbjct: 203 GPTFPGQVNPLDGAERILDKESGTFKYFLKVVPTEYQWSAGTRTTTNQYSVTEYDTVVHK 262

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           G +Q +P V+F YD+SPI VT +E   SF H L   CA+VGGVF V+G+ D +++    A
Sbjct: 263 GEMQ-MPSVWFSYDISPISVTISEIRKSFAHLLVRFCAVVGGVFAVTGMFDRWVHRIVTA 321

Query: 287 I 287
           I
Sbjct: 322 I 322


>gi|384501765|gb|EIE92256.1| hypothetical protein RO3G_17063 [Rhizopus delemar RA 99-880]
          Length = 291

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 80/173 (46%), Positives = 107/173 (61%), Gaps = 11/173 (6%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD SGEQ      D+ K RLD+ GN+IES        K+          LE     CGSC
Sbjct: 81  MDESGEQSSGYSQDVTKIRLDTLGNIIESGH----TVKLGDHTNDAKKALEE-APECGSC 135

Query: 61  YGAESSDED-CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           YGA+   ED CC++C++VREAY K+GW L N   I+QC REG+L +++ +  EGCN++G 
Sbjct: 136 YGAKPLREDGCCHSCQDVREAYVKQGWGLVNTKEIEQCIREGWLAKLENQSNEGCNVHGH 195

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-----SFNISHKINKLAFG 167
           L VNKV GNFHFAPG +F    +HVHD+  + +      SF++SH+I+KL FG
Sbjct: 196 LLVNKVRGNFHFAPGGAFQAGSMHVHDLQEYTQGAPNGHSFDMSHRIHKLKFG 248


>gi|410078101|ref|XP_003956632.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
 gi|372463216|emb|CCF57497.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
          Length = 414

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 88/269 (32%), Positives = 134/269 (49%), Gaps = 35/269 (13%)

Query: 53  NETYCGSCYGAESS-----------DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREG 101
           +E YCG CYGA+             D  CC  C +V+ +Y   GWA  +   I+QC+REG
Sbjct: 137 DENYCGPCYGAKDQSINDKEGIKKEDRVCCQTCSDVKNSYLDAGWAFFDGKNIEQCEREG 196

Query: 102 FLQRIKEEEGEGCNIYGF-LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISH 159
           ++++I  +  EGC I G  + +N+V GN HFAPG+++H    H HD   +  +   N +H
Sbjct: 197 YIEKINSQLNEGCQIKGSNVLINRVNGNLHFAPGEAYHNPNGHYHDTSFYDLKPQLNFNH 256

Query: 160 KINKLAFGE--------HFPGVVN-PLDGVRWTQETPSGMY--QYFIKVVPTVYTDVSGH 208
            IN  +FG         H   ++N PLDG +   E  S  Y   YF K+V T Y  +   
Sbjct: 257 IINHFSFGNGAVDRDATHDTTLMNSPLDGTQVLPEYDSHAYAFTYFNKIVSTRYEYLERD 316

Query: 209 TIQSNQFSVTEHFRSSEQGR----------LQTLPGVFFFYDLSPIKVTFTEEH-VSFLH 257
            +++ QF+   H R    G              +PG+F ++D+SP+K+   E+H V++  
Sbjct: 317 PLETVQFTSMFHDRQINGGNDIHDEKIKHARGGIPGLFIYFDISPMKIINKEQHTVNWST 376

Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           F+ N    +GG+  V  +ID   Y  QR 
Sbjct: 377 FVLNCITSIGGILAVGTVIDKIFYKTQRT 405


>gi|302823246|ref|XP_002993277.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
 gi|302825185|ref|XP_002994225.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
 gi|300137936|gb|EFJ04730.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
 gi|300138947|gb|EFJ05698.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
          Length = 333

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 95/291 (32%), Positives = 158/291 (54%), Gaps = 52/291 (17%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K RL   G+++       G+  +   +++     EH        
Sbjct: 87  IDMSGKHEVDLDTNIWKLRLHKDGHIL-------GSEYLSDLVEK-----EH-------- 126

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
             A  +     ++ EE+R A +          ++++  +         ++GEGC ++G L
Sbjct: 127 --AHDNLTGIFHSHEELRSAVK----------VVNEINK-------ALQDGEGCRVFGVL 167

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD- 178
           +V +VAGNFH     S H   + + H +        N+SH IN L+FG  +PG+ NPLD 
Sbjct: 168 DVERVAGNFHI----SMHGMSLQIFHSV-----KEVNVSHIINDLSFGPKYPGIHNPLDR 218

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
            VR  ++T +G ++YFIK+VPT Y  ++G  + +NQFSV E++ ++    + + P V+F 
Sbjct: 219 TVRILRDT-AGTFKYFIKIVPTEYRYLNGGKLPTNQFSVGEYYLAARDDDI-SWPAVYFL 276

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           YDLSPI V   EE  SF H LT  CAIVGG F+++G++D +IY    +I +
Sbjct: 277 YDLSPITVLIKEERRSFGHLLTRFCAIVGGTFSLTGMLDRWIYRLVESITR 327


>gi|412992535|emb|CCO18515.1| predicted protein [Bathycoccus prasinos]
          Length = 428

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 108/314 (34%), Positives = 155/314 (49%), Gaps = 30/314 (9%)

Query: 1   MDISGEQHLDVKHD--IFKKRLDSQGNVIESR----QDGIGAPKIDKPLQRHGGRL---- 50
           +D +GE H DV HD  I K+RLD  G  I  R    +D +   +      +H  +L    
Sbjct: 121 LDAAGEVHHDV-HDGHITKRRLDRDGKPIPRRDSSAKDDVAVTREKPNKHKHIEKLVREK 179

Query: 51  -EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWA------LSNPDLIDQCKREGFL 103
            +  E         +   E      +E R   +    A           LI +    G  
Sbjct: 180 EKEEEGKKNEGEQEQEQQEQNHEQHDEKRRKLQNTALAGFGGGFFDINALIHEQFPNGLE 239

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           +  K +  EGC + G+LEVN+V G+F  +PGKS      H+   +       N+SH IN+
Sbjct: 240 EAFKNKNKEGCEVMGYLEVNRVPGSFSISPGKSLQIGMSHIQLNVV---SHLNMSHTINR 296

Query: 164 LAFGEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
           LAFGE FPG +N LD   R+    P+ ++QYF+KVVPT +  +   T+ +NQ+SVTE   
Sbjct: 297 LAFGEAFPGALNLLDKNTRYL--PPNAVHQYFLKVVPTSFARLKDTTLATNQYSVTESSS 354

Query: 223 SSEQ-----GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           S++Q     G      G++F Y+LSPI++ F E   SF  F+ +VC+I+GGV T SGI+ 
Sbjct: 355 SAKQSFFGMGSSGKPSGIYFHYELSPIRIDFKERRNSFGEFMLSVCSIIGGVATSSGILH 414

Query: 278 AFIYHGQ-RAIKKK 290
             I   Q RA  KK
Sbjct: 415 KLIVFIQTRARSKK 428


>gi|145351005|ref|XP_001419879.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580112|gb|ABO98172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 373

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 99/281 (35%), Positives = 150/281 (53%), Gaps = 27/281 (9%)

Query: 2   DISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           D +GE+H DV HD  I K+R+D  G VI++      + K +K  +      + NET   S
Sbjct: 95  DKAGEEHYDV-HDGHIEKRRIDKHGKVIDA---AFTSEKPNKHKEIEQALQKMNET--DS 148

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
            + A+S             E  +  G       L+ +   EG     + E  EGC + G+
Sbjct: 149 AHAADS----------HAMEHVQPFGGMFGLQSLLQEVFPEGVEHAFRNENQEGCEVKGY 198

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDG 179
           LEVN+V G F  +PG+S       V   L  Q  + N++H I++L+FGE FPG+V+PLDG
Sbjct: 199 LEVNRVPGRFSISPGRSLMMGMQMVK--LNVQ-TALNLTHTIHRLSFGESFPGLVSPLDG 255

Query: 180 VRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQGRLQTL----PG 234
              +   P+ + QYF+ VV T +  +     I ++Q+SVTE F SS++  + T     PG
Sbjct: 256 THRSLP-PNAVQQYFLNVVSTTFEPLGENKIISTHQYSVTETFTSSQRSIMGTSNGRDPG 314

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           V F Y++SPI+V F E   SF  F+  +C+++GGV T++GI
Sbjct: 315 VIFTYEISPIRVDFKETRTSFGAFVLGICSVIGGVVTMAGI 355


>gi|156402826|ref|XP_001639791.1| predicted protein [Nematostella vectensis]
 gi|156226921|gb|EDO47728.1| predicted protein [Nematostella vectensis]
          Length = 413

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/184 (41%), Positives = 105/184 (57%), Gaps = 1/184 (0%)

Query: 98  KREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           KRE        +E + C +YG  +VNKVAGNFH   GKS H    H H       +S N 
Sbjct: 156 KREESKDAANTKEHDACRVYGSFKVNKVAGNFHITSGKSIHHPRGHAHLSSMVPVESLNF 215

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           SH+I+ L+FG+  PG+V+PLDG     E    MYQY+I+VVPT    ++   I++NQ+S+
Sbjct: 216 SHRIDMLSFGKRVPGIVHPLDGEMQITEKRRMMYQYYIQVVPTSIKSLNSEEIKTNQYSM 275

Query: 218 TEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           T+  R  S       + G+FF YD+S I V    +H S + FL  +C IVGG+F  SG++
Sbjct: 276 TQRIREISHDSGSHGIAGLFFKYDMSSIMVRVKHQHHSMVGFLVRLCGIVGGIFATSGML 335

Query: 277 DAFI 280
             FI
Sbjct: 336 HDFI 339


>gi|168004249|ref|XP_001754824.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693928|gb|EDQ80278.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 92/282 (32%), Positives = 149/282 (52%), Gaps = 43/282 (15%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D+SG+  +D+  +I+K R+   G V+ S                        E      
Sbjct: 91  IDMSGKHEVDLDTNIWKLRIHRDGYVLGS------------------------EFVNDLV 126

Query: 61  YGAESSDEDCCNNCEEVREA-YRKKGWALSNPD-LIDQCKREGFLQRIKEEEGEGCNIYG 118
            G    +E   +  +E ++  +RKK     +P  +I++ K+         ++GEGC I+G
Sbjct: 127 EGEHRKEEPKADKKDEHKDGDHRKK-----DPQKVINEVKK-------AIDDGEGCQIFG 174

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
            L+V +VAGNFH     S H   ++V   +       N+SH I+ L+FG  +PG  NPLD
Sbjct: 175 VLDVERVAGNFHI----SMHGLSLYVASKIFEAGYEVNVSHVIHDLSFGPTYPGHHNPLD 230

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFF 238
           G        SG ++YF+K+VPT Y  + G  + +NQFSVTE+++ ++    ++ P V+F 
Sbjct: 231 GSERILHDTSGTFKYFLKIVPTEYHYLHGEVMPTNQFSVTEYYQRTKPSD-RSYPAVYFV 289

Query: 239 YDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           YDLSPI VT  E   +F HF+T +CA++GG F V+G++D ++
Sbjct: 290 YDLSPIVVTIREHRRNFGHFITRLCAVLGGTFAVTGMLDRWM 331


>gi|145476255|ref|XP_001424150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391213|emb|CAK56752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 339

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 27/205 (13%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLA 165
           +E EGC I G++ VNKV GNFH     S H  G  +H +  FQR    + ++SH IN ++
Sbjct: 146 KEKEGCQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQIQTLDLSHTINHIS 199

Query: 166 FGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQS 212
           FGE             GV+NPLD  +   +   G   M+QY+I VVPT Y DVSG     
Sbjct: 200 FGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSG----- 254

Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           N++ V +   +S +     LP  +F YDLSP+ V F +   SFLHFL  +CAI+GGVFT+
Sbjct: 255 NEYYVHQFTANSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTI 314

Query: 273 SGIIDAFIYHGQRAIKKKIEIGKFS 297
           + I+D  I+    A+ KK E+GK S
Sbjct: 315 ASIVDGMIHKSVVALLKKYEMGKLS 339


>gi|145511431|ref|XP_001441642.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408894|emb|CAK74245.1| unnamed protein product [Paramecium tetraurelia]
          Length = 329

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 85/205 (41%), Positives = 116/205 (56%), Gaps = 27/205 (13%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISHKINKLA 165
           +E EGC I G++ VNKV GNFH     S H  G  +H +  FQR    + ++SH IN ++
Sbjct: 136 KEKEGCQIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQIQTLDLSHTINHIS 189

Query: 166 FGEH----------FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQS 212
           FGE             GV+NPLD  +   +   G   M+QY+I VVPT Y DVSG+    
Sbjct: 190 FGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSGNEYYV 249

Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           +QF+      +S +     LP  +F YDLSP+ V F +   SFLHFL  +CAI+GGVFT+
Sbjct: 250 HQFTA-----NSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTI 304

Query: 273 SGIIDAFIYHGQRAIKKKIEIGKFS 297
           + I+D  I+    A+ KK E+GK S
Sbjct: 305 ASIVDGMIHKSVVALLKKYEMGKLS 329


>gi|195162746|ref|XP_002022215.1| GL25735 [Drosophila persimilis]
 gi|194104176|gb|EDW26219.1| GL25735 [Drosophila persimilis]
          Length = 313

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 87/201 (43%), Positives = 115/201 (57%), Gaps = 21/201 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY-CGS 59
           MD SG+ HL V HDIFK RLD +G            P  + P++        N+   CGS
Sbjct: 89  MDSSGDTHLRVDHDIFKHRLDLKGE-----------PLKETPIKEIVAVSPPNKNVTCGS 137

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYG 118
           CYGAE +   CCN CE+V +AYR   W +   D I+QCK  G  +R  E+   EGC I G
Sbjct: 138 CYGAEHNATHCCNTCEDVLDAYRLHKWNVQ-VDKIEQCK--GKYKRTDEDAFKEGCRIQG 194

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPL 177
            LEVN++AG+FHFAPGKSF     H+HD   FQ  +  +SH IN L+FGE       +PL
Sbjct: 195 HLEVNRMAGSFHFAPGKSFSIRQFHIHD---FQFSNVKLSHTINHLSFGEKIEFAKTHPL 251

Query: 178 DGVRW-TQETPSGMYQYFIKV 197
           DG+R    ET + M+ +++K+
Sbjct: 252 DGLRVDVAETKTEMFNHYLKI 272


>gi|156030895|ref|XP_001584773.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980]
 gi|154700619|gb|EDO00358.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 381

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 107/317 (33%), Positives = 144/317 (45%), Gaps = 79/317 (24%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQ---GNVIESRQ-DGIGAPKIDKPLQRHGGRLEHNETY 56
           MD+SGEQ + V H + K RL +Q   G VI++   D   A +    L         +  Y
Sbjct: 67  MDVSGEQQVGVMHGVKKVRLSAQEEGGKVIDTTALDLHNADEAATHL---------DPNY 117

Query: 57  CGSCYGA----ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
           CG CYGA     +  + CCN C+EVREAY    WA    + ++QC+RE + +R+  +  E
Sbjct: 118 CGPCYGATPPPNAKKQGCCNTCDEVREAYASVSWAFGRGENVEQCEREHYGERLDSQRKE 177

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGE 168
           GC I G L VNKV GNFH APG+SF    +HVHD+  +           SH I+ L FG 
Sbjct: 178 GCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVHDLNNYFDTPVPGGHVFSHHIHSLRFGP 237

Query: 169 HFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS----- 206
             P  V                 NPLD         +  + YF+KVV T Y  +      
Sbjct: 238 ELPEEVTKKLGSDSIIPWTNHHLNPLDNTEQITHEAAYNFMYFVKVVSTSYLPLGWETTY 297

Query: 207 --------------GH----TIQSNQFSVTEHFRS------SEQGRLQTL------PGVF 236
                         GH    +I+++Q+SVT H RS      S +G  + L      PGVF
Sbjct: 298 NSPPHDASVDIGTYGHSEDGSIETHQYSVTSHRRSLNGGDDSAEGHKEKLHARGGIPGVF 357

Query: 237 FFYDLSPIKVTFTEEHV 253
           F Y      V+F E H+
Sbjct: 358 FSY------VSFLEIHM 368


>gi|195997845|ref|XP_002108791.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
 gi|190589567|gb|EDV29589.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
          Length = 324

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/170 (45%), Positives = 99/170 (58%), Gaps = 4/170 (2%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G + +NKVAGNFH   G S +    H H      R+S N SH+I+ LAFG   P
Sbjct: 137 DACRIHGNIPLNKVAGNFHVTAGMSINHPMGHAHVSDLVPRESVNFSHRIDLLAFGVAAP 196

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE--QGRL 229
            V+NPLDGV +  +    MYQYFIK+VPT     S   I + Q+SVTEHF   +   G+ 
Sbjct: 197 NVINPLDGVEFITKITDKMYQYFIKIVPTKVKTFSV-AIDTYQYSVTEHFSKVDHMNGK- 254

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
             + G+FF YDLSPI V  TE  V F   L  +C IVGG+F  SG+I  F
Sbjct: 255 HGVSGLFFKYDLSPISVQVTEARVPFGQLLIRLCGIVGGIFATSGMIHIF 304


>gi|229594330|ref|XP_001024169.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila]
 gi|225566928|gb|EAS03924.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila
           SB210]
          Length = 348

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 87/212 (41%), Positives = 121/212 (57%), Gaps = 26/212 (12%)

Query: 103 LQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-DSFNISH 159
           L+R+K+   + EGC I GF+ VNKV GNFH     S H  G ++  I    R ++ ++SH
Sbjct: 146 LERVKKAFNDREGCKISGFMLVNKVPGNFHI----SSHAYGNYLQRIFQDARINTLDLSH 201

Query: 160 KINKLAFGEHF----------PGVVNPLDGVRWTQ----ETPSGMYQYFIKVVPTVYTDV 205
            IN L+FGE             G++ PLD  +  +     T    +QY+I VVPT Y D+
Sbjct: 202 VINHLSFGEENDLNRIKKTFQQGILQPLDHTKKIKPENLRTVGVTHQYYINVVPTTYKDL 261

Query: 206 SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
           S     + ++ V +   +S +   Q LP VFF YDLSP+ V F++   SFLHFL  VCAI
Sbjct: 262 S-----NRKYHVYQFVANSNEMTTQHLPAVFFRYDLSPVTVQFSQTRESFLHFLVQVCAI 316

Query: 266 VGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           +GGVFTV+GIID+ ++     I KK E+GK S
Sbjct: 317 IGGVFTVAGIIDSIVHRSVVHILKKAEMGKLS 348


>gi|123451578|ref|XP_001313964.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121895945|gb|EAY01112.1| hypothetical protein TVAG_442240 [Trichomonas vaginalis G3]
          Length = 375

 Score =  140 bits (354), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 97/293 (33%), Positives = 148/293 (50%), Gaps = 19/293 (6%)

Query: 7   QHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESS 66
           Q  DV +DI ++R+D  G  I    D +   ++ +  +    + E  + YCG CYGA   
Sbjct: 96  QSTDV-NDIKQQRIDENGFAI----DSVNWIRLKRAAKSKKQKKEQPQQYCGKCYGALPQ 150

Query: 67  DEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA 126
            + CCN+CE+V  A++ KGW +   D   QC  EG+    KE     CN+YG + V  ++
Sbjct: 151 GK-CCNSCEDVINAFKAKGWGIDGIDRWQQCIDEGYADLGKES----CNVYGDINVAHIS 205

Query: 127 GNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG---EHFPGVVNPLDGVRWT 183
           G  +FA  + +     H  DI       +N++H IN L FG    H PG   PLDG+   
Sbjct: 206 GFLYFAL-EDYKVGDKHPKDISRLSH-KYNLTHTINYLEFGPRVSHEPG---PLDGLTVL 260

Query: 184 QETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
           QE P  M Y Y ++VVPT +    G  + + +F      ++  +   + +PG+F  Y+L+
Sbjct: 261 QEEPGLMQYNYDLEVVPTKWFSSRGFPVSTYKFHPMITQKNFTEKVNRGVPGIFLNYNLA 320

Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           PI +   E   S    +T+VCAIVGG FT   + D   +    +I+ K +IGK
Sbjct: 321 PISLVQYEVISSPWKLITSVCAIVGGCFTCVSLADQIFFRTLSSIEGKRQIGK 373


>gi|405968654|gb|EKC33703.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Crassostrea gigas]
          Length = 345

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 98/295 (33%), Positives = 144/295 (48%), Gaps = 18/295 (6%)

Query: 14  DIFKKRLDSQGNVIESRQDGIGAPKIDKPLQ-RHG-GRLEHNETYCGSCYGAESSDEDCC 71
           D F K LD       S    IGA  +D   Q  HG G L++ ET+    +    +     
Sbjct: 20  DAFPKVLDDCQEKTASGGGTIGADVLDVTGQDTHGFGELKYEETH----FELSPNQRHYH 75

Query: 72  NNCEEVREAYRKKGWALSNPDLIDQ----CKREGFLQRIKEEEGE--GCNIYGFLEVNKV 125
              +E+ E  R +  AL +   + +      + G  +R    EGE   C +YG LEVNKV
Sbjct: 76  ETVQEISEFLRSEYHALQDVMWMSRGLIATYKTGMPKREIPAEGEPDACRVYGSLEVNKV 135

Query: 126 AGNFHFAPGKS---FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 182
           AGNFH   GKS   F +   H H  +      +N SH+I+  +FGE   G++NPLDG   
Sbjct: 136 AGNFHITAGKSVPVFPRG--HAHISMMVHEKEYNFSHRIDHFSFGESVKGIINPLDGEEQ 193

Query: 183 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDL 241
                  ++ YFIK+VPT     +   I + QFSVT+  R+    +    +PG+F  YDL
Sbjct: 194 VSSDNFHVFNYFIKIVPTEVRTYAAGNIDTYQFSVTQRNRTINHSKGSHGVPGIFVKYDL 253

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           + +K+   E+H  F  FL  +C IVGG+F VSG++  +       +  K ++GK+
Sbjct: 254 NALKIRVVEKHRPFSQFLIRLCGIVGGIFAVSGMLHNWTEFFMEVVCCKFKLGKY 308


>gi|148224086|ref|NP_001087666.1| ERGIC and golgi 2 [Xenopus laevis]
 gi|51950053|gb|AAH82468.1| MGC81917 protein [Xenopus laevis]
          Length = 377

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/175 (41%), Positives = 98/175 (56%), Gaps = 14/175 (8%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E+   C I+G L++NKVAGNFH   GK+      H H       DS+N SH+I+  +FGE
Sbjct: 165 EQPNACRIHGHLDINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHFSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTDVSGHTIQSNQFSVTEHFR 222
             P ++NPLDG     E  + MYQYFI +VPT      VY D       ++QFSVTE  R
Sbjct: 225 PLPAIINPLDGTEKIAEDSNQMYQYFITIVPTKLNTNKVYCD-------THQFSVTERER 277

Query: 223 SSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                     + G+F  YD+S + VT TE+H+    FL  +C I+GG+FT +G+I
Sbjct: 278 VINHATGSHGVSGIFMKYDISSLMVTVTEDHMPLWKFLVRLCGIIGGIFTTTGMI 332


>gi|345567560|gb|EGX50490.1| hypothetical protein AOL_s00075g219 [Arthrobotrys oligospora ATCC
           24927]
          Length = 354

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 71/175 (40%), Positives = 104/175 (59%), Gaps = 11/175 (6%)

Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKI 161
           L   K  +G+ C I+G ++VN+V G+FH  A G  +   G HV        D+FN SH +
Sbjct: 153 LNLPKRPKGKSCRIWGSMDVNRVMGDFHITAKGHGYWDPGQHV------DHDTFNFSHVV 206

Query: 162 NKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
           N+L+FGE +P +VNPLDGV    E     YQYF+ VVPT Y    G T+Q+NQ+SVTE  
Sbjct: 207 NELSFGEFYPKLVNPLDGVASVTEDKFYRYQYFMSVVPTTYK-AHGRTLQTNQYSVTEQG 265

Query: 222 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           RS      Q++PG+FF +D+ PI +T T+ H  +++ +  +  ++GGV    G +
Sbjct: 266 RSMNP---QSVPGIFFKFDIEPIMLTITDTHTPWIYLIVRLANVIGGVMVAGGWL 317


>gi|89272944|emb|CAJ82943.1| ptx1 [Xenopus (Silurana) tropicalis]
          Length = 377

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 73/170 (42%), Positives = 98/170 (57%), Gaps = 4/170 (2%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E    C I+G LE+NKVAGNFH   GK+      H H       DS+N SH+I+  +FGE
Sbjct: 165 EPPNACRIHGHLEINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHFSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQG 227
             PG+VNPLDG     E  + MYQYFI +VPT ++T+       ++QFSVTE  R     
Sbjct: 225 PLPGIVNPLDGTEKIAEDSNQMYQYFITIVPTKLHTNKVD--CDTHQFSVTERERVINHA 282

Query: 228 R-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                + G+F  YD+S + V  TE+H+    FL  +C IVGG+FT +G+I
Sbjct: 283 SGSHGVSGIFMKYDISSLMVMVTEDHMPLWKFLVRLCGIVGGIFTTTGMI 332


>gi|126339088|ref|XP_001363644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Monodelphis domestica]
          Length = 378

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 71/166 (42%), Positives = 99/166 (59%), Gaps = 2/166 (1%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
           G++NPLDG        + M+QYFI VVPT   +    +  ++QFSVTE  R+ +      
Sbjct: 228 GIINPLDGTEKIANDHNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERAINHAAGSH 286

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            + G+F  YDLS + VT TEEH+ F  FL  +C I+GG+F+ +G++
Sbjct: 287 GVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGML 332


>gi|313661438|ref|NP_001186332.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Gallus gallus]
          Length = 377

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 103/174 (59%), Gaps = 6/174 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
                 + G+F  YD+S + VT TEEH+ F  FL  +C I+GG+F+ +GI+  F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335


>gi|326911226|ref|XP_003201962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Meleagris gallopavo]
          Length = 377

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 103/174 (59%), Gaps = 6/174 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 ESPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
                 + G+F  YD+S + VT TEEH+ F  FL  +C I+GG+F+ +GI+  F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335


>gi|327273481|ref|XP_003221509.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Anolis carolinensis]
          Length = 377

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGELIP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI--QSNQFSVTEHFRSSEQGR- 228
           G++NPLDG        + M+QYFI VVP   T +  H I  +++QFSVTE  R       
Sbjct: 228 GIINPLDGTEKVASDHNQMFQYFITVVP---TKLHTHKISAETHQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YD+S + VT TEEH+ F  FL  +C I+GG+F+ +GI+
Sbjct: 285 SHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGIL 332


>gi|387015774|gb|AFJ50006.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2-like
           [Crotalus adamanteus]
          Length = 377

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 71/171 (41%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSADACRIHGHLYVNKVAGNFHVTVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI--QSNQFSVTEHFRSSEQ 226
             PG++NPLDG        + M+QYF+ VVP   T +  H I  +++QF+VTE  R    
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFVTVVP---TKLQTHKISAETHQFAVTERERIINH 281

Query: 227 GR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YD+S + VT TEEH+ F  FL  +C IVGG+F+ +GI+
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIVGGIFSTTGIL 332


>gi|395537817|ref|XP_003770886.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Sarcophilus harrisii]
          Length = 378

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 71/166 (42%), Positives = 99/166 (59%), Gaps = 2/166 (1%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
           G++NPLDG        + M+QYFI VVPT   +    +  ++QFSVTE  R+ +      
Sbjct: 228 GIINPLDGTEKIAIDHNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERAINHAAGSH 286

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            + G+F  YDLS + VT TEEH+ F  FL  +C I+GG+F+ +G++
Sbjct: 287 GVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGML 332


>gi|224093106|ref|XP_002193654.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Taeniopygia guttata]
          Length = 377

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/174 (42%), Positives = 103/174 (59%), Gaps = 6/174 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
                 + G+F  YD+S + VT TEEH+ F  FL  +C I+GG+F+ +GI+  F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335


>gi|449278843|gb|EMC86582.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Columba livia]
          Length = 377

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 74/174 (42%), Positives = 103/174 (59%), Gaps = 6/174 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LIPGIINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERERVINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
                 + G+F  YD+S + VT TEEH+ F  FL  +C I+GG+F+ +GI+  F
Sbjct: 282 AAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFSTTGILHGF 335


>gi|57208596|emb|CAI42845.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 129

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/128 (54%), Positives = 88/128 (68%), Gaps = 9/128 (7%)

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQSNQFSVTEHFRSSEQGRL 229
           PLD    T    S M+QYF+KVVPTVY  V G        +++NQFSVT H + +  G L
Sbjct: 1   PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAPLPPQVLRTNQFSVTRHEKVAN-GLL 59

Query: 230 --QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             Q LPGVF  Y+LSP+ V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IYH  RAI
Sbjct: 60  GDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAI 119

Query: 288 KKKIEIGK 295
           +KKI++GK
Sbjct: 120 QKKIDLGK 127


>gi|431908425|gb|ELK12022.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Pteropus alecto]
          Length = 377

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG     E  + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAEDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|355686514|gb|AER98081.1| ERGIC and golgi 2 [Mustela putorius furo]
          Length = 365

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/171 (43%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++  F
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGMLHGF 335


>gi|417399911|gb|JAA46936.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein 2 isoform 1 [Desmodus rotundus]
          Length = 376

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 75/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 167 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 226

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQGR- 228
           G+VNPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R       
Sbjct: 227 GIVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADT---HQFSVTERERVVNHAAG 283

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 284 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 331


>gi|348562091|ref|XP_003466844.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Cavia porcellus]
          Length = 377

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 74/171 (43%), Positives = 101/171 (59%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|345441780|ref|NP_001230861.1| ERGIC and golgi 2 [Sus scrofa]
          Length = 377

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|82074366|sp|Q5EHU7.1|ERGI2_GECJA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
          Length = 377

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|301783747|ref|XP_002927289.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Ailuropoda melanoleuca]
          Length = 377

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|410964074|ref|XP_003988581.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Felis catus]
          Length = 377

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|417399168|gb|JAA46612.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein 2 isoform 1 [Desmodus rotundus]
          Length = 337

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 77/173 (44%), Positives = 102/173 (58%), Gaps = 7/173 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 167 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 226

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQGR- 228
           G+VNPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R       
Sbjct: 227 GIVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADT---HQFSVTERERVVNHAAG 283

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G  D+F++
Sbjct: 284 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTG-KDSFLF 335


>gi|123438593|ref|XP_001310077.1| MGC83277 protein [Trichomonas vaginalis G3]
 gi|121891831|gb|EAX97147.1| MGC83277 protein, putative [Trichomonas vaginalis G3]
          Length = 355

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 140/277 (50%), Gaps = 15/277 (5%)

Query: 8   HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD 67
           H+DV  +I +     +G+V   R D  G P + K   ++   +  +  YCG+CYG +S  
Sbjct: 79  HVDVIDNIKESDESYEGHVRMERFDEKGNPILKKSYPKNSS-VTKDPGYCGNCYGQKSG- 136

Query: 68  EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
             CCN C+EVR+A++           I QC  EG+ + +   +GE C ++G L V++  G
Sbjct: 137 --CCNTCKEVRKAFKANNRPPPPIIHIQQCVDEGYKEELIAMKGEACRVHGTLTVHRAPG 194

Query: 128 NFHFAPGKSFHQSGVHVH--DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE 185
            FH APG+S++ +G H H  + L    D  N SH IN  + G        PLDG    Q+
Sbjct: 195 TFHVAPGESYNINGEHDHYYEDLGINIDEMNFSHTINHFSIGMPTANSYYPLDGHTEIQQ 254

Query: 186 TPSGMYQ-YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPI 244
               M   YF++ VP    ++ G    S   S  +++R S   +    PGVFF YD+S I
Sbjct: 255 KTGRMKMIYFLRAVP---INLDGRVF-SFGASSYQNYRGSNSTK---YPGVFFSYDVSLI 307

Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
            +  + ++ S +  +T + +I+GGVF ++  +D   Y
Sbjct: 308 GIV-SSQNSSLMDLVTELMSILGGVFAIATFLDMLSY 343


>gi|432943284|ref|XP_004083140.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oryzias latipes]
          Length = 372

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 68/173 (39%), Positives = 98/173 (56%), Gaps = 2/173 (1%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G + VNKVAGN H   GK  H    H H       +S+N SH+I++L FGE
Sbjct: 156 QSPDACRIHGDIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHESYNFSHRIDRLCFGE 215

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
             PG++NPLDG        + MYQYFI VVPT        T  ++QFSVTE  R  +   
Sbjct: 216 EIPGIINPLDGTEKITYDNNQMYQYFITVVPTKLKTYKI-TADTHQFSVTERERVINHTA 274

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
               + G+FF YD S + VT +E+H+    FL  +C I+GG+++ +G++ + I
Sbjct: 275 GSHGVSGIFFKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIYSTTGMLHSLI 327


>gi|21312962|ref|NP_080444.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           isoform 1 [Mus musculus]
 gi|81903633|sp|Q9CR89.1|ERGI2_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|12835992|dbj|BAB23451.1| unnamed protein product [Mus musculus]
 gi|12843481|dbj|BAB25998.1| unnamed protein product [Mus musculus]
 gi|12844310|dbj|BAB26318.1| unnamed protein product [Mus musculus]
 gi|13905198|gb|AAH06895.1| ERGIC and golgi 2 [Mus musculus]
 gi|17390417|gb|AAH18188.1| ERGIC and golgi 2 [Mus musculus]
 gi|20072972|gb|AAH26558.1| ERGIC and golgi 2 [Mus musculus]
 gi|26326029|dbj|BAC26758.1| unnamed protein product [Mus musculus]
 gi|40353061|gb|AAH64749.1| ERGIC and golgi 2 [Mus musculus]
 gi|74191314|dbj|BAE39481.1| unnamed protein product [Mus musculus]
 gi|148678796|gb|EDL10743.1| ERGIC and golgi 2, isoform CRA_c [Mus musculus]
          Length = 377

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332


>gi|12846043|dbj|BAB27008.1| unnamed protein product [Mus musculus]
          Length = 377

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332


>gi|12841082|dbj|BAB25070.1| unnamed protein product [Mus musculus]
          Length = 377

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332


>gi|47214843|emb|CAF95749.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 299

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 67/149 (44%), Positives = 93/149 (62%), Gaps = 19/149 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQD-----GIGAPKIDKPLQRHGGRLEHNET 55
           MD++GEQ LDV+H++FK+RLD     + +  +     G    ++  P       L+ N  
Sbjct: 89  MDVAGEQQLDVEHNLFKQRLDKNLKPVSTEAEKHELGGAEDVEVFDP-----STLDPNR- 142

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
            C SCYGAE+ D  CCN+C++VREAYR++GWA  N D I+QCKREGF Q+++E++ EGC 
Sbjct: 143 -CESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADTIEQCKREGFTQKMQEQKNEGCQ 201

Query: 116 IYGFLEVNKVA-------GNFHFAPGKSF 137
           +YG LEVNKV+       G F    GK F
Sbjct: 202 VYGVLEVNKVSLIAQEGGGKFSLCSGKKF 230


>gi|291392459|ref|XP_002712727.1| PREDICTED: PTX1 protein [Oryctolagus cuniculus]
 gi|291416214|ref|XP_002724342.1| PREDICTED: PTX1 protein-like [Oryctolagus cuniculus]
          Length = 377

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/171 (42%), Positives = 101/171 (59%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             PG++NPLDG        + M+QYFI +VPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPGIINPLDGTEKIAIDHNQMFQYFITIVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|149048933|gb|EDM01387.1| rCG29652, isoform CRA_c [Rattus norvegicus]
          Length = 377

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332


>gi|410918691|ref|XP_003972818.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Takifugu rubripes]
          Length = 378

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 72/175 (41%), Positives = 96/175 (54%), Gaps = 6/175 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E    C IYG + VNKVAGN H   GK  H    H H       +++N SH+I+ L+FGE
Sbjct: 164 EPHNACRIYGHIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGE 223

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS-SE 225
              G++NPLDG        + MYQYFI VVPT  V   VS  T   +QFSVTE  R  + 
Sbjct: 224 EITGIINPLDGTEKITSKHTQMYQYFITVVPTRLVTHKVSADT---HQFSVTERERVINH 280

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
                 + G+F  YD S + VT TE+H+    FL  +C IVGG+F+ +G++   +
Sbjct: 281 AAGSHGVSGIFVKYDTSSLTVTVTEQHMPLWQFLVRLCGIVGGIFSTTGMLHGLV 335


>gi|74189495|dbj|BAE22750.1| unnamed protein product [Mus musculus]
          Length = 303

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 100/168 (59%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 94  DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 153

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 154 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 210

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 211 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 258


>gi|344267803|ref|XP_003405755.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Loxodonta africana]
          Length = 377

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|320170541|gb|EFW47440.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 408

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 107/205 (52%), Gaps = 8/205 (3%)

Query: 76  EVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAP 133
           E R+   ++  +LS      +   +   + +  +EG  + C ++G +  +K+AGNFH   
Sbjct: 177 ENRKPLTREHLSLSGTTRKAKKNFQAMPRELSSQEGTPDACRLHGSVSADKIAGNFHIIA 236

Query: 134 GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQY 193
           G +    G H H      + + N +H+IN L+FGE  PG+  PLDG  W   + +  YQY
Sbjct: 237 GAAVEVPGGHAHMGQMIPQHALNFTHRINHLSFGEEMPGMEFPLDGDEWITTSHTMAYQY 296

Query: 194 FIKVVPTVYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEE 251
           FI+VVPTVYT  +     ++S QFSVT H    E      LPG+FF YD  PI VT    
Sbjct: 297 FIQVVPTVYTRHANDPEQLRSGQFSVTRH----ESPNSNRLPGLFFKYDTFPILVTVQYS 352

Query: 252 HVSFLHFLTNVCAIVGGVFTVSGII 276
             SF H L  +  I+GGVF  SG I
Sbjct: 353 PYSFWHLLIRLSGIIGGVFATSGFI 377


>gi|430811512|emb|CCJ31046.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 264

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 80/185 (43%), Positives = 99/185 (53%), Gaps = 19/185 (10%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD+SGE   DV H++ K RLD  G  I S    I      +P++           YCGSC
Sbjct: 89  MDVSGELQTDVSHNVVKNRLDKNGIFINST--SINTLNFQQPIKVLPS------DYCGSC 140

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           YGA+   E CCN CE+V  AY    W + N    +QCK    +    +   EGCN  G +
Sbjct: 141 YGAK---EGCCNTCEDVINAYIANNWPIPNKRTFEQCKDSNNM----DGPDEGCNFVGRI 193

Query: 121 EVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSF--NISHKINKLAFGEHFPGVV-NP 176
           EVNKV GNFHFAPG S    +G HVHDI  +  DS   + SH INKL+FG    G + NP
Sbjct: 194 EVNKVIGNFHFAPGHSSQTITGGHVHDIYDYLTDSLPHDFSHMINKLSFGPEIEGSLQNP 253

Query: 177 LDGVR 181
           LD V+
Sbjct: 254 LDNVK 258


>gi|432862155|ref|XP_004069750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oryzias latipes]
          Length = 373

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 70/178 (39%), Positives = 100/178 (56%), Gaps = 2/178 (1%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           QR        C I+G L VNKVAGNFH   GKS      H H       DS+N SH+I+ 
Sbjct: 157 QRDSSSPPNACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDH 216

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
           L+FGE  PG+++PLDG        + M+QYFI +VPT   +    + +++Q+SVTE  R 
Sbjct: 217 LSFGEAIPGLISPLDGTEKIAADYNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERV 275

Query: 224 -SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            +       + G+F  YD+S + V  TE+H+ F  FL  +C IVGG+F+ +G+I   +
Sbjct: 276 INHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWKFLVRLCGIVGGIFSTTGMIHGLV 333


>gi|149713890|ref|XP_001502984.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Equus caballus]
          Length = 377

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|340502903|gb|EGR29544.1| hypothetical protein IMG5_153610 [Ichthyophthirius multifiliis]
          Length = 342

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 83/218 (38%), Positives = 117/218 (53%), Gaps = 31/218 (14%)

Query: 100 EGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAP---GKSFHQSGVHVHDILAFQRDS 154
           E  L R+K    + EGC I G + VNK  GNFH +     +  HQ   HV+        +
Sbjct: 136 EARLNRLKSAFLDQEGCKIQGHIFVNKAPGNFHVSAHSFDRILHQIASHVN------IST 189

Query: 155 FNISHKINKLAFGEHFP-----------GVVNPLDGVRWT----QETPSGMYQYFIKVVP 199
            ++SH IN ++FG+              G+++PLD  R      Q+  S  YQY+I VV 
Sbjct: 190 IDVSHIINHISFGDETDIIRIKRQFKSQGILDPLDRTRKIKTEDQKNISISYQYYINVVH 249

Query: 200 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
           T Y +     IQ  ++SV +   ++ +     LP  FF YDLSP+ V F++  +SFLHF+
Sbjct: 250 TTYVN-----IQKKEYSVYQFTANNNELLSDRLPACFFRYDLSPVIVRFSQSRMSFLHFI 304

Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
             VCAI+GGVFTV+GIID+ I+     I KK E+GK S
Sbjct: 305 VQVCAIIGGVFTVAGIIDSIIHKSVVHILKKAEMGKLS 342


>gi|57106442|ref|XP_534852.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 isoform 1 [Canis lupus familiaris]
          Length = 377

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/168 (44%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGEVVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|432879813|ref|XP_004073560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Oryzias latipes]
          Length = 271

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 82/201 (40%), Positives = 110/201 (54%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +GEGC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 86  MKIPINQGEGCRFEGKFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHSIHK 133

Query: 164 LAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFG+     +  G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 134 LAFGDTLQVHNVKGAFNALGGADKLSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVA 193

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE    F  F+T +CAIVGG FTV+GII
Sbjct: 194 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 251

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 252 DSCIFTASEA-WKKIQIGKMS 271


>gi|395839293|ref|XP_003792530.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Otolemur garnettii]
          Length = 377

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/171 (43%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRISGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPGIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|328875761|gb|EGG24125.1| DUF1692 family protein [Dictyostelium fasciculatum]
          Length = 1172

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/319 (29%), Positives = 148/319 (46%), Gaps = 34/319 (10%)

Query: 1    MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKP-------LQRHGGRLEHN 53
            +D+S    +++  D+    L     ++ES     G P  D         L R G  LE  
Sbjct: 864  VDVSRGNRMNINFDVHFPSLICSDIIVESVDGVDGKPIKDAAHQIVKERLNRRGSPLERL 923

Query: 54   ETYCG--SCYGAESSDE-------DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQ 104
                G  SC   E   +        CCN+CE++R  YR         D   QC     + 
Sbjct: 924  HARAGLFSCTKCELPPKYQLLEKRKCCNSCEDLRTFYRTNKVPQHLADESPQCTIGKPVT 983

Query: 105  RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQS----GVHVHDI---LAFQRDSFNI 157
                 E EGC ++G L V K+ G+ H   G+   +S      HVH +   +A +   FNI
Sbjct: 984  -----EDEGCRVFGILSVQKMKGDIHIIAGRPHEESHDGHSHHVHKLTPEIAQRIHKFNI 1038

Query: 158  SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
            SH I+K +FG+   G++NPL+G         G+  Y+++VVPT+Y   + + +++NQ+S 
Sbjct: 1039 SHHIHKFSFGQDVEGLINPLEGFGIVVPMGLGLQTYYLQVVPTIYKQ-NNYILETNQYSY 1097

Query: 218  TEHFRSSEQGRLQTL-PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            T  ++S     L  L PG++F YDLSP+ +   +    F   +T++CAI GG++   G+ 
Sbjct: 1098 TREYKSINYNNLGYLFPGIYFKYDLSPLMIEVDQSSKPFSELITSICAIGGGMYVAFGL- 1156

Query: 277  DAFIYHGQRAIKKKIEIGK 295
                YH    I  KI+  K
Sbjct: 1157 ---FYHVTARIVGKIKKQK 1172


>gi|326672443|ref|XP_003199668.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Danio rerio]
          Length = 365

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 4/174 (2%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E    C I+G + VNKVAGNFH   GK       H H     + + +N SH+I+ L+FG 
Sbjct: 166 ESQNACRIHGKIYVNKVAGNFHITLGKPIETHKGHAHYASFIKDEVYNFSHRIDHLSFGN 225

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQ 226
             PG +NPLDG+  T    + ++QYFI VVPT     S  ++  +QFSVTE  R  S+E+
Sbjct: 226 DVPGHINPLDGMEKTTLEQNTLFQYFITVVPT-KLHTSNVSVDMHQFSVTERERVVSNEK 284

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           G  Q + G+FF Y LSP+ V  +EEH+    FL  +C IVGG+F+ S ++   I
Sbjct: 285 GN-QGVSGIFFKYKLSPLMVRVSEEHMPLAAFLVRLCGIVGGIFSTSDLLHRLI 337


>gi|426225295|ref|XP_004006802.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Ovis aries]
          Length = 377

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 73/168 (43%), Positives = 99/168 (58%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QF+VTE  R  +    
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADT---HQFAVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|443716796|gb|ELU08142.1| hypothetical protein CAPTEDRAFT_19918 [Capitella teleta]
          Length = 403

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 71/170 (41%), Positives = 96/170 (56%), Gaps = 11/170 (6%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           GC  YG L+VNKVAGNFH   GKS   + G H H  +  +   +N +H+I   +FG+   
Sbjct: 169 GCRFYGTLDVNKVAGNFHITAGKSVPLNIGGHAHMAMMVKESDYNFTHRIEHFSFGDKVS 228

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVP----TVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           G +NPLDG          MYQYFI+VVP    T++TD     I + QFSVTE  R+   G
Sbjct: 229 GRINPLDGEEKNTNDNYHMYQYFIQVVPTHVKTLFTD-----INTYQFSVTEQNRTISHG 283

Query: 228 R-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           +    +PG+F  YDL+P+ V   E H  F   L  +C I+GG+F  SG++
Sbjct: 284 KGSHGIPGIFVKYDLAPMMVKVIESHKPFSQLLIRLCGIIGGLFATSGML 333


>gi|229366152|gb|ACQ58056.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Anoplopoma fimbria]
          Length = 290

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 82/201 (40%), Positives = 108/201 (53%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +G+GC   G   +NKV GNFH           V  H   A Q  S +++H I+K
Sbjct: 105 MKIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQSPDMTHNIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFGE        G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 153 LAFGEKIQVQRVQGAFNALGGADRLSSNPLASHDYILKIVPTVYEDLSGKQRFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAIVGG FTV+GII
Sbjct: 213 NKEYVAYSHAGRI--IPAIWFRYDLSPITVKYTERRQPVYRFITTICAIVGGTFTVAGII 270

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290


>gi|388501278|gb|AFK38705.1| unknown [Medicago truncatula]
          Length = 148

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 62/134 (46%), Positives = 86/134 (64%)

Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           N+SH I+ L+FG  +PG+ NPLD         SG ++Y+IK+VPT Y  +S   + +NQF
Sbjct: 10  NVSHVIHDLSFGPKYPGIHNPLDETSRILHDASGTFKYYIKIVPTEYRYISKEVLPTNQF 69

Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           SVTE+F        +T P V+F YDLSPI VT  EE  SFLHF+T +CA++GG F V+G+
Sbjct: 70  SVTEYFSPITSQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGM 129

Query: 276 IDAFIYHGQRAIKK 289
           +D ++Y    A  K
Sbjct: 130 LDRWMYRLVEAATK 143


>gi|403269250|ref|XP_003926667.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Saimiri boliviensis boliviensis]
          Length = 377

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 73/171 (42%), Positives = 99/171 (57%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSSEQ 226
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R    
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 227 GRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSYGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|302853436|ref|XP_002958233.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
           nagariensis]
 gi|300256421|gb|EFJ40687.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
           nagariensis]
          Length = 337

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 74/187 (39%), Positives = 107/187 (57%), Gaps = 11/187 (5%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV--HVHDILAFQR--DSFNISHKIN 162
           + E  EGC++YG ++V +VAG  HF    S HQ+ V   +  +L   R     NISH I 
Sbjct: 156 EAEHHEGCHVYGTMDVKRVAGRLHF----SVHQNMVFQMLPQLLGAHRIPKVANISHTIK 211

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
            L FG H+PG +NPLDG     + P   ++YF+KVVPT Y +  G   +++Q+SVTE+ +
Sbjct: 212 HLGFGPHYPGQLNPLDGYVRMVKGPPQSFKYFLKVVPTEYYNRLGRVTETHQYSVTEYTQ 271

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
             E G + TL      YDLSPI +T  E   S LHF+  +CA+VGG F ++ + D ++  
Sbjct: 272 PLEPGYVPTLD---VHYDLSPIVMTINERPPSLLHFVVRLCAVVGGAFAITRMTDRWVDW 328

Query: 283 GQRAIKK 289
             R + K
Sbjct: 329 FVRLVTK 335


>gi|115497448|ref|NP_001069031.1| endoplasmic reticulum-Golgi intermediate compartment protein 2 [Bos
           taurus]
 gi|113912114|gb|AAI22616.1| ERGIC and golgi 2 [Bos taurus]
 gi|296487341|tpg|DAA29454.1| TPA: PTX1 protein [Bos taurus]
          Length = 377

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 70/168 (41%), Positives = 97/168 (57%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRS-SEQGR 228
           G++NPLDG        + M+QYFI +VP   T +  + I ++  QF+VTE  R  +    
Sbjct: 228 GIINPLDGTEKIALDHNQMFQYFITIVP---TKLQTYKISADTHQFAVTERERVINHAAG 284

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 285 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|443700340|gb|ELT99344.1| hypothetical protein CAPTEDRAFT_162161 [Capitella teleta]
          Length = 110

 Score =  130 bits (328), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 62/108 (57%), Positives = 83/108 (76%), Gaps = 2/108 (1%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVT 247
           M+ Y++KVVPT Y   +G  + SNQ+SVT+H +    G L  Q LPGVF  Y+LSP+ V 
Sbjct: 1   MFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVK 60

Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           +TE++ SF+HFLT VCAI+GGVFTV+G++DAFIYH  RAI+KKI++GK
Sbjct: 61  YTEKNRSFMHFLTGVCAIIGGVFTVAGLVDAFIYHSARAIQKKIDLGK 108


>gi|348516790|ref|XP_003445920.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Oreochromis niloticus]
          Length = 290

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +G+GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNQGDGCRFEGEFTINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFGE        G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 153 LAFGEKLQVQKVQGAFNALGGADKMSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGAFTVAGII 270

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290


>gi|291232448|ref|XP_002736170.1| PREDICTED: MGC81917 protein-like [Saccoglossus kowalevskii]
          Length = 395

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 82/235 (34%), Positives = 121/235 (51%), Gaps = 21/235 (8%)

Query: 82  RKKGW---------ALSNP----DLIDQCKREGFLQRIKEEEGE------GCNIYGFLEV 122
           R+K W         AL+N     DL+ +   +G    + E E +       C I+G + +
Sbjct: 120 RQKQWQKKLQAVRSALANEHAIQDLLFKVGFDGSPTSMPEREDKPAGAPNSCRIHGSMSL 179

Query: 123 NKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRW 182
           NKVAGNFH   GKS      H H      +  +N SH+I+  +FG   PG+VNPLDG + 
Sbjct: 180 NKVAGNFHITLGKSIPHPRGHAHLAAFISQSQYNFSHRIDHFSFGVPTPGIVNPLDGDQR 239

Query: 183 TQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDL 241
             +  + MYQYFI++VPT   +    +  ++Q++VTE  R  S       + G+FF YDL
Sbjct: 240 VTQENARMYQYFIQIVPT-RVNTRRASADTHQYAVTERDRVISHSSGSHGVAGIFFKYDL 298

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           S + V  TEE+  +  FL  +C I+GGVF  SG++ + I      I  K + GK+
Sbjct: 299 SSVSVKVTEEYQPYWQFLVRLCGIIGGVFATSGMLHSLIGCLYDLICCKYQFGKY 353


>gi|15010925|gb|AAK77355.1|AF302767_1 PTX1 protein [Homo sapiens]
          Length = 377

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 99/167 (59%), Gaps = 6/167 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE  P 
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
           ++NPLDG        + M+QYFI VVPT ++T  +S +T   +QFSVTE  R  +     
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISAYT---HQFSVTERERIINHAAGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|323306137|gb|EGA59869.1| Erv46p [Saccharomyces cerevisiae FostersB]
          Length = 349

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 89/254 (35%), Positives = 124/254 (48%), Gaps = 48/254 (18%)

Query: 1   MDISGEQHLDVKHDIFK-KRLDSQG----NVIESRQDGIG---APKIDKPLQRHGGRLEH 52
           MD SGE  LD+    F   RL+S+G    +  E    G G   AP  + P          
Sbjct: 88  MDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGGNGDGTAPVNNDP---------- 137

Query: 53  NETYCGSCYGAESSDED---------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
              YCG CYGA+   ++         CC +C+ VR AY + GWA  +   I+QC+REG++
Sbjct: 138 --NYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEAGWAFFDGKNIEQCEREGYV 195

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKIN 162
            +I E   EGC I G  ++N++ GN HFAPGK +  +  H HD   + + S  N +H IN
Sbjct: 196 SKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHFHDTSLYDKTSNLNFNHIIN 255

Query: 163 KLAFGE--------------HFPGVV--NPLDG--VRWTQETPSGMYQYFIKVVPTVYTD 204
            L+FG+              H   VV  +PLDG  V   + T    + YF K+VPT Y  
Sbjct: 256 HLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRNTHFHQFSYFAKIVPTRYEY 315

Query: 205 VSGHTIQSNQFSVT 218
           +    I++ QFS T
Sbjct: 316 LDNVVIETAQFSAT 329


>gi|75075986|sp|Q4R5C3.1|ERGI2_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|67970720|dbj|BAE01702.1| unnamed protein product [Macaca fascicularis]
          Length = 377

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|348505737|ref|XP_003440417.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oreochromis niloticus]
          Length = 374

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 97/169 (57%), Gaps = 2/169 (1%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GKS      H H       DS+N SH+I+ L+FGE  PG
Sbjct: 168 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVAHDSYNFSHRIDHLSFGEPLPG 227

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
           +++PLDG        + M+QYFI +VPT   +    + +++Q+SVTE  R  +       
Sbjct: 228 IISPLDGTEKIATDSNHMFQYFITIVPT-KLNTYKVSAETHQYSVTERERVINHAAGSHG 286

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           + G+F  YD+S + V  TE+H+    FL  +C I+GG+F+ +G+I   +
Sbjct: 287 VSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGLV 335


>gi|380787459|gb|AFE65605.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
 gi|383418929|gb|AFH32678.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
 gi|384941148|gb|AFI34179.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
          Length = 377

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|225717192|gb|ACO14442.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Esox lucius]
          Length = 379

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 67/169 (39%), Positives = 95/169 (56%), Gaps = 2/169 (1%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G + VNKVAGNFH   GK  H    H H       D++N SH+I+  +FGE  PG
Sbjct: 168 ACRIHGHVYVNKVAGNFHITVGKPIHHPRGHAHIAAFVSHDTYNFSHRIDHFSFGEEIPG 227

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
           ++NPLDG        + M+ YFI VVPT     S  +  ++QFSVTE  R  +       
Sbjct: 228 IINPLDGTEKVTTNNNHMFLYFITVVPT-KLHTSKVSADTHQFSVTERERVINHAAGSHG 286

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           + G+F  YD S + VT +E+H+    FL  +C I+GG+F+ +G+I  F+
Sbjct: 287 VSGIFMKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTTGMIHGFV 335


>gi|330803630|ref|XP_003289807.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
 gi|325080118|gb|EGC33688.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
          Length = 388

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 146/287 (50%), Gaps = 37/287 (12%)

Query: 3   ISGEQHLDVKHDIFKKRLDSQG-----NVIESRQDGIGAPKIDK---PLQRHGGRLEHNE 54
           I G+   D  + I K+RLDS+G      V  + + GI + +  +   P Q+ G  +   +
Sbjct: 122 IDGKPIKDAAYQIVKERLDSKGVPFAKGVALAGKKGIFSSRCTECEFPKQKKGSSVFFRQ 181

Query: 55  TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
                          CCN+C+++RE YR      +  D   QC  E  +Q     + EGC
Sbjct: 182 K--------------CCNSCDDLREYYRLNRIPQNFADDAPQCLIERPIQ-----DDEGC 222

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQS----GVHVHDILAF---QRDSFNISHKINKLAFG 167
            IYG L+V K+ G+FH   G S  +S      HVH I      +   FNI+H I+K +FG
Sbjct: 223 RIYGSLQVQKMKGDFHILAGLSADESHDGHAHHVHRITKENIGRVTQFNITHHIHKFSFG 282

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           +   G++NPL+G     ++   +  Y+I+VVP +Y   + + +++NQ+S T  +R+    
Sbjct: 283 DDIDGLINPLEGFGIVAQS-LAVQNYYIQVVPAIYKK-NDYVLETNQYSYTYDYRNVNVF 340

Query: 228 RL-QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
            L +  PG++F YD+SP+ +   +     +  +T++CAI GG+F +S
Sbjct: 341 NLGRIFPGIYFKYDMSPLMIEVDQTSKPIVELITSICAIGGGIFYIS 387


>gi|397517363|ref|XP_003828883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Pan paniscus]
 gi|410259224|gb|JAA17578.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410298004|gb|JAA27602.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410334949|gb|JAA36421.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410334951|gb|JAA36422.1| ERGIC and golgi 2 [Pan troglodytes]
          Length = 377

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|332233018|ref|XP_003265701.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 isoform 1 [Nomascus leucogenys]
          Length = 377

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 165 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 225 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 281

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 282 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|402885549|ref|XP_003906216.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Papio anubis]
          Length = 364

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 152 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 211

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 212 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 268

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 269 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 319


>gi|213512030|ref|NP_001133523.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
 gi|209154344|gb|ACI33404.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
          Length = 381

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/169 (39%), Positives = 95/169 (56%), Gaps = 2/169 (1%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       D++N SH+I+ L+FGE  PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQT 231
           ++NPLDG        + M+QYFI +VPT   +    +  +NQ+SVTE  R          
Sbjct: 229 IINPLDGTEKVCTDHNQMFQYFITIVPT-KLNTYQISADTNQYSVTERERVINHAVGSHG 287

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           + G+F  YD+S + V  TE+H+    FL  +C I+GG+F+ +G+I   +
Sbjct: 288 VSGIFMKYDISSLMVKVTEQHMPLWRFLVRLCGIIGGIFSTTGMIHGMV 336


>gi|297262047|ref|XP_001105686.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Macaca mulatta]
          Length = 374

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 100/171 (58%), Gaps = 6/171 (3%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE
Sbjct: 162 QSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGE 221

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSE 225
             P ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  + 
Sbjct: 222 LVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINH 278

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 279 AAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 329


>gi|410907774|ref|XP_003967366.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Takifugu rubripes]
          Length = 388

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 98/169 (57%), Gaps = 2/169 (1%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GKS      H H       DS+N SH+I+ L+FGE  PG
Sbjct: 167 ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGEDLPG 226

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
           +++PLDG        + ++QYFI +VPT   +    + +++Q+SVTE  R+ +       
Sbjct: 227 IISPLDGTEKVSADSNHIFQYFITIVPT-KLNTYRVSAETHQYSVTEQDRAINHAAGSHG 285

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           + G+F  YD++ + V  TE+H+    FL  +C I+GG+F+ +G+I   +
Sbjct: 286 VSGIFMKYDINSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGIV 334


>gi|322791472|gb|EFZ15869.1| hypothetical protein SINV_02690 [Solenopsis invicta]
          Length = 403

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
            C ++G L VNKVAGNFH   GKS      H+H I AF  D  +N +H+IN+ +FG   P
Sbjct: 183 ACRVHGSLNVNKVAGNFHITAGKSLSVPHGHIH-ISAFMTDRDYNFTHRINRFSFGGPSP 241

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
           G+V+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  +  +  
Sbjct: 242 GIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDHQRPIDHHKGS 299

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +PG+FF YD+S +K+  T+E  +   FL  +CA VGG+F  SG+I   +
Sbjct: 300 HGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGLIKNIV 350


>gi|50959176|ref|NP_057654.2| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Homo sapiens]
 gi|108935982|sp|Q96RQ1.2|ERGI2_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|22760017|dbj|BAC11037.1| unnamed protein product [Homo sapiens]
 gi|38173702|gb|AAH00887.2| ERGIC and golgi 2 [Homo sapiens]
 gi|78070782|gb|AAI07795.1| ERGIC and golgi 2 [Homo sapiens]
 gi|119616998|gb|EAW96592.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
 gi|119617000|gb|EAW96594.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
 gi|167773797|gb|ABZ92333.1| ERGIC and golgi 2 [synthetic construct]
          Length = 377

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 6/167 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE  P 
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
           ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +     
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|62897157|dbj|BAD96519.1| CDA14 variant [Homo sapiens]
          Length = 377

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/167 (43%), Positives = 98/167 (58%), Gaps = 6/167 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE  P 
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPA 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRL 229
           ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +     
Sbjct: 229 IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 286 HGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 332


>gi|357627966|gb|EHJ77470.1| putative PTX1 protein isoform 1 [Danaus plexippus]
          Length = 353

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 105/185 (56%), Gaps = 2/185 (1%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C ++G L +NKVAGNFH   GKS H    H+H  + F     N SH+IN+L+FG    
Sbjct: 142 DACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDTPQNFSHRINRLSFGSPAN 201

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
           G++ PL+G        S +YQYF++VVPT   D +  +I++ Q+SV E  R     +   
Sbjct: 202 GIIYPLEGDEKITSDESMLYQYFLEVVPT-DVDTTFESIKTFQYSVKELARPISHSKGSH 260

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            +PGVFF YD++ +KV   +E  + L F+  + +I+GG++ +   I+  +   +  + KK
Sbjct: 261 GVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVIISFINTIVLTAKTLLVKK 320

Query: 291 IEIGK 295
            E+ K
Sbjct: 321 PEVKK 325


>gi|41055383|ref|NP_956701.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Danio rerio]
 gi|82188148|sp|Q7T2D4.1|ERGI2_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|32451749|gb|AAH54593.1| ERGIC and golgi 2 [Danio rerio]
 gi|182890474|gb|AAI64472.1| Ergic2 protein [Danio rerio]
          Length = 376

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 67/175 (38%), Positives = 98/175 (56%), Gaps = 14/175 (8%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       +++N SH+I+ L+FGE  PG
Sbjct: 168 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHETYNFSHRIDHLSFGEEIPG 227

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT------VYTDVSGHTIQSNQFSVTEHFRS-SE 225
           ++NPLDG        + M+QYFI +VPT      VY D       ++Q+SVTE  R  + 
Sbjct: 228 ILNPLDGTEKVSADHNQMFQYFITIVPTKLQTYKVYAD-------THQYSVTERERVINH 280

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
                 + G+F  YD+S + V  TE+H+ F  FL  +C I+GG+F+ +G++   +
Sbjct: 281 AAGSHGVSGIFMKYDISSLMVKVTEQHMPFWQFLVRLCGIIGGIFSTTGMLHNLV 335


>gi|12857352|dbj|BAB30984.1| unnamed protein product [Mus musculus]
          Length = 377

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 72/170 (42%), Positives = 99/170 (58%), Gaps = 10/170 (5%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+  +FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHCSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRSS---EQ 226
           G++NPLDG        + M+QYFI V+PT ++T  +S  T   +QFSVTE  R S     
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVMPTKLHTYKISADT---HQFSVTE--RESIINHA 282

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 283 AGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 332


>gi|395744111|ref|XP_003780425.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Pongo abelii]
          Length = 387

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 72/169 (42%), Positives = 100/169 (59%), Gaps = 7/169 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       +S+N SH+I+ L+FGE  P
Sbjct: 177 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVP 236

Query: 172 GVVNPLDGV-RWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQG 227
            ++NPLDG  +   +    M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +   
Sbjct: 237 AIINPLDGTEKIAIDRKHQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAA 293

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
               + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 294 GSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 342


>gi|348529156|ref|XP_003452080.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oreochromis niloticus]
          Length = 379

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 65/173 (37%), Positives = 96/173 (55%), Gaps = 2/173 (1%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E    C I+G + VNKVAGN H   GK  H    H H       +++N SH+I+ L+FGE
Sbjct: 164 EPLNACRIHGHVYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGE 223

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQG 227
             PG++NPLDG        + M+QYFI VVPT   +    +  ++QFSVTE  R  +   
Sbjct: 224 ELPGIINPLDGTEKITYNNNQMFQYFITVVPT-KLNTYKISADTHQFSVTERERVINHAA 282

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
               + G+F  YD S + VT +E+H+    FL  +C I+GG+F+ +G++   +
Sbjct: 283 GSHGVSGIFVKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIFSTTGMLHGLV 335


>gi|410914052|ref|XP_003970502.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Takifugu rubripes]
          Length = 290

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +G GC   G   +NKV GNFH     S H +          Q  + +++H I+K
Sbjct: 105 MKIPLNQGAGCRFEGEFIINKVPGNFHI----STHSASA--------QPQNPDMTHFIHK 152

Query: 164 LAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFG+        G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 153 LAFGDKLQMHQEKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE    F  F+T +CAIVGG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 270

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290


>gi|307188057|gb|EFN72889.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Camponotus floridanus]
          Length = 386

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 69/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GKS      H+H I A+  D  +N +H+IN+ +FG   P
Sbjct: 169 ACRIHGSLVVNKVAGNFHITAGKSLSLPRGHIH-ISAYMTDQDYNFTHRINRFSFGGPSP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
           G+V+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  +  +  
Sbjct: 228 GIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDHQRPIDHHKGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +PG+FF YD+S +K+  T+E  +   FL  +CA VGG+F  SG++   +
Sbjct: 286 HGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVTSGLVKNIV 336


>gi|57208595|emb|CAI42844.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 156

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 71/157 (45%), Positives = 89/157 (56%), Gaps = 35/157 (22%)

Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH---------- 208
           H I  L+FGE +PG+VNPLD    T    S M+QYF+KVVPTVY  V G           
Sbjct: 1   HYIQHLSFGEDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAQQERGRSRG 60

Query: 209 ----------------------TIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPI 244
                                  +++NQFSVT H + +  G L  Q LPGVF  Y+LSP+
Sbjct: 61  GADGGWSQVLALALAQAPLPPQVLRTNQFSVTRHEKVAN-GLLGDQGLPGVFVLYELSPM 119

Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
            V  TE+H SF HFLT VCAI+GG+FTV+G+ID+ IY
Sbjct: 120 MVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIY 156


>gi|301093181|ref|XP_002997439.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262110695|gb|EEY68747.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 278

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 80/198 (40%), Positives = 112/198 (56%), Gaps = 15/198 (7%)

Query: 99  REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
           +E  LQ+  +EE  GE GC +YG ++V KVAG+  FA     H+  + V     F   +F
Sbjct: 92  KEIMLQKDIQEEPYGENGCRLYGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 144

Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           N SH +N L FG   P +  PL  V          Y+YF+ VVP+ Y  ++G ++ + Q+
Sbjct: 145 NSSHVVNHLRFGPQIPDMETPLIDVSKILTKNLATYKYFVSVVPSRYVYLNGRSVTTFQY 204

Query: 216 SVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           SVTEH  SS     Q + PGV F Y+ SPI V + E  +S LHFLT+  AIVGGVF V+ 
Sbjct: 205 SVTEHETSSRGPNGQVSFPGVIFSYEFSPIAVEYIESKLSVLHFLTSTSAIVGGVFAVAR 264

Query: 275 IIDAFIYHGQRAIKKKIE 292
           +ID  IY    ++ KK++
Sbjct: 265 MIDGAIY----SVSKKVD 278


>gi|47222972|emb|CAF99128.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 288

 Score =  127 bits (318), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 108/201 (53%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +G GC   G   +NKV GNFH     S H +          Q  + +++H I+K
Sbjct: 103 MKIPLNQGGGCRFEGEFNINKVPGNFHI----STHSASA--------QPQNPDMTHFIHK 150

Query: 164 LAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFG+        G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 151 LAFGDKLQMHQVKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVA 210

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE    F  F+T +CAIVGG FTV+GII
Sbjct: 211 NKEYVAYSHTGRI--VPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGII 268

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 269 DSCIFTASEA-WKKIQIGKMS 288


>gi|71480113|ref|NP_001025133.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Danio rerio]
 gi|78099248|sp|Q4V8Y6.1|ERGI1_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|66911928|gb|AAH97146.1| Zgc:114085 [Danio rerio]
          Length = 290

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/201 (39%), Positives = 107/201 (53%), Gaps = 22/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            ++    G GC   G   +NKV GNFH           V  H   A Q  S +++H I+K
Sbjct: 105 MKVPLNNGHGCRFEGEFSINKVPGNFH-----------VSTHSATA-QPQSPDMTHIIHK 152

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           LAFG     +H  G  N L G    Q      + Y +K+VPTVY ++ G    S Q++V 
Sbjct: 153 LAFGAKLQVQHVQGAFNALGGADRLQSNALASHDYILKIVPTVYEELGGKQRFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE    F  F+T +CAI+GG FTV+GII
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRRPFYRFITTICAIIGGTFTVAGII 270

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+    A  KKI+IGK S
Sbjct: 271 DSCIFTASEA-WKKIQIGKMS 290


>gi|332020071|gb|EGI60517.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Acromyrmex echinatior]
          Length = 390

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 70/194 (36%), Positives = 111/194 (57%), Gaps = 9/194 (4%)

Query: 86  WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 145
           W  +   L  +  +  +   + +     C ++G L +NKVAGNFH   GKS      H+H
Sbjct: 145 WKSNQVTLYSEMPKRSY---VPDYAPNACRVHGSLNINKVAGNFHITAGKSLSVPHGHIH 201

Query: 146 DILAFQRD-SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT 203
            I AF  D  +N +H+INK +FG   PG+V+PL+G     +    +YQYF++VVPT + T
Sbjct: 202 -ISAFMTDRDYNFTHRINKFSFGGPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRT 260

Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
            ++  T ++ Q+SV +H R  +  +    +PG+FF YD+S +K+  T+E  +   FL  +
Sbjct: 261 LLT--TSKTYQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKL 318

Query: 263 CAIVGGVFTVSGII 276
           CA VGG+F  SG++
Sbjct: 319 CATVGGIFVTSGLV 332


>gi|405119686|gb|AFR94458.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Cryptococcus neoformans var. grubii H99]
          Length = 431

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/188 (35%), Positives = 98/188 (52%), Gaps = 14/188 (7%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
           K E+G  C IYG +EV KV  N H              H  ++FQ       N+SH +++
Sbjct: 202 KVEDGPACRIYGSVEVKKVTANLHIT---------TLGHGYMSFQHTDHHLMNLSHVVHE 252

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
            +FG  FP +  PLD      E P  ++QYF++VVPT Y D S   + ++Q++VT++ RS
Sbjct: 253 FSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 312

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
            E G+   +PG+FF YDL P+ V   E   S   FL  +  +VGGV+TV+          
Sbjct: 313 FEHGK--GVPGLFFKYDLEPMSVVIRERTTSLYQFLIRLAGVVGGVWTVAAFALRVFNRA 370

Query: 284 QRAIKKKI 291
           QR + K +
Sbjct: 371 QREVSKAV 378


>gi|308808274|ref|XP_003081447.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
 gi|116059910|emb|CAL55969.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
          Length = 406

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 95/295 (32%), Positives = 153/295 (51%), Gaps = 39/295 (13%)

Query: 2   DISGEQHLDVKHD--IFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           D +GEQH DV HD  I K+R+D  G  I++       P   K + +   ++   ++  G+
Sbjct: 124 DKAGEQHYDV-HDGHIEKRRVDKDGKPIDATFTS-EKPNKHKEMVQALEKMNQTDSVVGN 181

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
               +  D            A+R  G       ++ +   EG     + E  EGC + G+
Sbjct: 182 ETALQKQDR-----------AHRFAG-VFGFESMLKEAFPEGIENAFRNEAREGCEVKGY 229

Query: 120 LEVNKVAGNFHFAPGK----SFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           LEVN+V G    +PG+       Q  ++VH  L       N++H I++L+FGE FPG+V+
Sbjct: 230 LEVNRVPGRISISPGRVVMMGMQQFKLNVHTDL-------NLTHTIHRLSFGERFPGLVS 282

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT-IQSNQFSVTEHFRSSEQ-------G 227
           PLDG   +   P+ + QYF+ VV T +  + G   I ++Q+SVTE F +S++       G
Sbjct: 283 PLDGTHRSLP-PNAVQQYFLNVVATTFQPLRGDARISTHQYSVTETFTTSQRSLGGSSNG 341

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           R    PGVFF Y++ PI+V F E   +F  F+  +C+I+GGV T++G++ + + H
Sbjct: 342 RD---PGVFFTYEIEPIRVDFKETRTTFGAFIIGICSIIGGVVTMAGVVQSAVEH 393


>gi|340709072|ref|XP_003393139.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Bombus terrestris]
          Length = 392

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GKS      H+H IL F  D  +N +H+INK +FG   P
Sbjct: 169 SCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKDYNFTHRINKFSFGGPSP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
           G+++PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  + Q   
Sbjct: 228 GIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPIDHQKGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
              PG+FF YD+S +K+  T++  +   FL  +CA VGG+F  SG++ + +
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGMVKSIV 336


>gi|350419069|ref|XP_003492060.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Bombus impatiens]
          Length = 392

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 71/171 (41%), Positives = 103/171 (60%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS-FNISHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GKS      H+H IL F  D  +N +H+INK +FG   P
Sbjct: 169 SCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIH-ILTFMTDKDYNFTHRINKFSFGGPSP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
           G+++PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  + Q   
Sbjct: 228 GIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPIDHQKGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
              PG+FF YD+S +K+  T++  +   FL  +CA VGG+F  SG+I   +
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGMIKNIV 336


>gi|224067439|ref|XP_002195791.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Taeniopygia guttata]
          Length = 290

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 108/200 (54%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L+G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 153 LSFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T++CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|326928384|ref|XP_003210360.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Meleagris gallopavo]
          Length = 321

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 108/200 (54%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 136 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 183

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L+G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 184 LSFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 243

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T++CAI+GG FTV+GI+
Sbjct: 244 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 301

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 302 DSCIFTASEA-WKKIQLGKM 320


>gi|322792513|gb|EFZ16471.1| hypothetical protein SINV_10123 [Solenopsis invicta]
          Length = 141

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 52/109 (47%), Positives = 75/109 (68%)

Query: 70  CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
           CCN CE+V EAYR+K WA  +P  + QC+ +  ++++K    +GC IYG++EVN+V G+F
Sbjct: 12  CCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMEKLKHAFTQGCQIYGYMEVNRVGGSF 71

Query: 130 HFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
           H APG SF  + VHVHD+  +    FN++HKI  L+FG + PG  NP+D
Sbjct: 72  HIAPGVSFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPGKTNPMD 120


>gi|145524934|ref|XP_001448289.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415833|emb|CAK80892.1| unnamed protein product [Paramecium tetraurelia]
          Length = 324

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/201 (39%), Positives = 111/201 (55%), Gaps = 28/201 (13%)

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD---SFNISH----------K 160
             I G++ VNKV GNFH     S H  G  +H +  FQR    + ++SH          K
Sbjct: 135 VKIAGYIIVNKVPGNFHV----SAHAFGGILHQV--FQRSQISTLDLSHTYQSYSHLVKK 188

Query: 161 INKLAFGEHF-PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFS 216
            + +   + F  GV+NPLD  +   +   G   M+QY+I VVPT Y DVSG     N++ 
Sbjct: 189 DDLVKIKKQFQKGVLNPLDNTKKIAQPQGGTGMMFQYYISVVPTTYIDVSG-----NEYY 243

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           V +   +S + +   LP V+F YDLSP+ V F +   SFLHFL  +CAI+GGVFT++ II
Sbjct: 244 VHQFTANSNEVQTDHLPAVYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTIASII 303

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D  I+    A+ KK E+GK S
Sbjct: 304 DGMIHKSVVALLKKYEMGKLS 324


>gi|194689880|gb|ACF79024.1| unknown [Zea mays]
 gi|413949702|gb|AFW82351.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 176

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 54/84 (64%), Positives = 70/84 (83%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           DISGEQH D++HDI K+RL+S GNVIE+R++GIG  K+++PLQ+HGGRL+  E YCG+CY
Sbjct: 91  DISGEQHHDIRHDIEKRRLNSHGNVIEARKEGIGGAKVERPLQKHGGRLDKGEQYCGTCY 150

Query: 62  GAESSDEDCCNNCEEVREAYRKKG 85
           GAE SDE CCN+CEE  +  R+KG
Sbjct: 151 GAEESDEQCCNSCEESGKHIRRKG 174


>gi|145551751|ref|XP_001461552.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429387|emb|CAK94179.1| unnamed protein product [Paramecium tetraurelia]
          Length = 317

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/193 (35%), Positives = 114/193 (59%), Gaps = 24/193 (12%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFG 167
           ++ EGC + G++ +++V GNFH     S H  G  V+ +L F +  + ++SH I  L+FG
Sbjct: 128 DQKEGCEMTGYIIISRVPGNFHI----SAHSYGGQVNIVLPFVEMSTIDLSHTIKHLSFG 183

Query: 168 ---------EHFP-GVVNPLDGVRW--TQETPSG--MYQYFIKVVPTVYTDVSGHTIQSN 213
                    E F  G++NPLDG+    TQE  +    +QY+I +VPT+Y D+       N
Sbjct: 184 NQNDIQKIREKFQQGLLNPLDGISRIKTQELKNVGVTHQYYISIVPTIYVDIDNREYFVN 243

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           QF+      ++ + +  ++P ++F YD+SP+ V FT+ + +F HF+  +CAI+GGVFT++
Sbjct: 244 QFTA-----NTNEAQTNSMPAIYFRYDISPVTVQFTKYYETFNHFIVQLCAILGGVFTIA 298

Query: 274 GIIDAFIYHGQRA 286
           GIID+  Y  Q+ 
Sbjct: 299 GIIDSVFYALQKT 311


>gi|145546125|ref|XP_001458746.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426567|emb|CAK91349.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 90/267 (33%), Positives = 135/267 (50%), Gaps = 57/267 (21%)

Query: 30  RQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALS 89
           +QD IG  +     Q   G L  + T  G       S  D  N  E  ++AY++K     
Sbjct: 82  QQDVIGTHQ-----QNVEGELYKSRTLNGKVIDKYLSTNDSLN-LERAQQAYQQK----- 130

Query: 90  NPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILA 149
                                 EGC++ G++ +++V GNFH     S H  G  V+ +L 
Sbjct: 131 ----------------------EGCDLAGYIIISRVPGNFHI----SAHPYGGQVNMVLP 164

Query: 150 FQRDS-FNISHKINKLAFG---------EHFP-GVVNPLDGVRW--TQE-TPSGM-YQYF 194
           F   S  ++SH I  L+FG         E F  G++NPLDG+R   TQE T  G+ +QY+
Sbjct: 165 FVGLSVIDLSHSIKHLSFGKQNDIQKIREKFKQGLLNPLDGIRRIKTQELTNVGVTHQYY 224

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
           I +VPT+Y D+       NQF+      ++ + +   +P V+F YD+SP+ V FT+ + S
Sbjct: 225 ISIVPTLYVDIDNKEYFVNQFAA-----NTNEAQTTQMPAVYFRYDISPVTVQFTKYYES 279

Query: 255 FLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           F HF+  +CAI+GGVFT++GIID+  Y
Sbjct: 280 FNHFIVQLCAILGGVFTIAGIIDSIFY 306


>gi|66500700|ref|XP_395190.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Apis mellifera]
          Length = 389

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 67/170 (39%), Positives = 99/170 (58%), Gaps = 4/170 (2%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GKS      H+H         +N +H+INK +FG   PG
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFGGPSPG 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
           +V+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  + Q    
Sbjct: 229 IVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPINHQKGSH 286

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             PG+FF YD+S +K+  T++  +   FL  +CA VGG+F  SG++   +
Sbjct: 287 GSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIV 336


>gi|380016475|ref|XP_003692209.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Apis florea]
          Length = 392

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 67/170 (39%), Positives = 99/170 (58%), Gaps = 4/170 (2%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GKS      H+H         +N +H+INK +FG   PG
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFGGPSPG 228

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQ 230
           +V+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV +H R  + Q    
Sbjct: 229 IVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLS--TSKTYQYSVKDHQRPINHQKGSH 286

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             PG+FF YD+S +K+  T++  +   FL  +CA VGG+F  SG++   +
Sbjct: 287 GSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIV 336


>gi|340505495|gb|EGR31815.1| hypothetical protein IMG5_101180 [Ichthyophthirius multifiliis]
          Length = 327

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 76/204 (37%), Positives = 115/204 (56%), Gaps = 25/204 (12%)

Query: 103 LQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ-RDSFNISH 159
           LQR  +   + EGCNI G + VNKV GNFH     S H  G  +  +L+   +++ ++SH
Sbjct: 126 LQRATQAYMDKEGCNISGTMLVNKVPGNFHI----SSHAYGHVLGQVLSNAGKNTIDLSH 181

Query: 160 KINKLAFGEHFP----------GVVNPLDGVRW--TQETPSGM-YQYFIKVVPTVYTDVS 206
           K+  L+FG+ F           G+++P+D  +    Q   +G+ YQY+I +VPT Y D  
Sbjct: 182 KVKHLSFGDEFDLKNIKRQFSQGLLHPMDNKQKDKPQNILNGITYQYYINIVPTTYVDTG 241

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
                  QF+    + S+EQ     LP V++ YDLSP+ V F+ +  SFLHFL  +CAI+
Sbjct: 242 NKNYHVYQFT----YNSNEQIN-NHLPTVYYRYDLSPVTVKFSMQKESFLHFLVQICAII 296

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
           GG+FTV+ I+D+ +Y     I K+
Sbjct: 297 GGIFTVASIVDSIVYRAVLNILKR 320


>gi|334311203|ref|XP_001380577.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Monodelphis domestica]
          Length = 321

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    GEGC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 136 MKIPLNNGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 183

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 184 LSFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 243

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 244 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 301

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 302 DSCIFTASEAW-KKIQLGKM 320


>gi|66813156|ref|XP_640757.1| DUF1692 family protein [Dictyostelium discoideum AX4]
 gi|60468793|gb|EAL66793.1| DUF1692 family protein [Dictyostelium discoideum AX4]
          Length = 421

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 87/286 (30%), Positives = 135/286 (47%), Gaps = 29/286 (10%)

Query: 3   ISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYG 62
           I G    D  + I K+RLDS G   E    G+        L    G    + T C     
Sbjct: 140 IDGNPIKDAAYQIVKQRLDSYG---EPFAQGVA-------LAGKKGIFSRSCTECEFPKS 189

Query: 63  AESSD----EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
              S     + CCN+CE++R+ YR      +  D   QC  E  +Q     + EGC IYG
Sbjct: 190 KRVSSVFYKQKCCNSCEDLRQYYRLNRIPQNLADDSPQCLIERPVQ-----DDEGCRIYG 244

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILA-FQRDS------FNISHKINKLAFGEHFP 171
            L V K+ G+FH   G    QS            R++      FNI+H I+K +FGE   
Sbjct: 245 SLSVQKMKGDFHILAGTGIDQSHDGHVHHAHHIPRENIGRIKHFNITHHIHKFSFGEDIE 304

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL-Q 230
           G++NPL+      ++   +  Y+++VVP +Y   +   +++NQ+S T  +R      L Q
Sbjct: 305 GLINPLEDFGIVAQS-LAVQTYYLQVVPAIYKK-NDFVLETNQYSYTYDYRIVNMFNLGQ 362

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             PG++F YDLSP+ +   +     +  +T++CAI GG++ V G++
Sbjct: 363 LFPGIYFKYDLSPLMIEVDQTSKPLVELITSICAIGGGMYVVLGLV 408


>gi|395505103|ref|XP_003756885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Sarcophilus harrisii]
          Length = 290

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 106/200 (53%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +GEGC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNDGEGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|387015778|gb|AFJ50008.1| ER Golgi intermediate [Crotalus adamanteus]
          Length = 290

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 106/200 (53%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV GNFH           +  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------ISTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 153 LSFGDKLQVPNIHGAFNALGGTDRLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|9963759|gb|AAG09679.1|AF183410_1 cd002 protein [Homo sapiens]
          Length = 387

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 98/168 (58%), Gaps = 7/168 (4%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH-DILAFQRDSFNISHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GK+      H H        +S+N SH+I+ L+FGE  P
Sbjct: 178 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCSTMESYNFSHRIDHLSFGELVP 237

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGR 228
            ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 238 AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 294

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 295 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 342


>gi|312376736|gb|EFR23738.1| hypothetical protein AND_12338 [Anopheles darlingi]
          Length = 265

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 76/182 (41%), Positives = 102/182 (56%), Gaps = 22/182 (12%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCY 61
           D +GEQHL ++H I+K+RLD +GN IE        PK +  +Q    R+   ET   S  
Sbjct: 37  DSTGEQHLHIEHSIYKRRLDLEGNQIEE-------PKKED-IQVSTKRVSSTETPVTS-- 86

Query: 62  GAESSDEDCCNNCEEVREAYRKKGWALSNPDLID--QCKREGFLQRIKEEEGEGCNIYGF 119
              S+ +  C N   V +AYR++ W   NP++ D  QCK         +   EGC+IYG 
Sbjct: 87  ---STIKPACGN---VIDAYRERKW---NPNVEDFEQCKNSNHGAIEGKAFNEGCHIYGT 137

Query: 120 LEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP-GVVNPLD 178
           +EVN+V G FH APGKSF    +HVHD+  +    FN SH+IN L+FGE F  G   PLD
Sbjct: 138 MEVNRVEGRFHIAPGKSFSIQNIHVHDVQPYSSSRFNTSHRINTLSFGEQFDFGTTQPLD 197

Query: 179 GV 180
           G+
Sbjct: 198 GL 199


>gi|345320110|ref|XP_001521132.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like, partial [Ornithorhynchus anatinus]
          Length = 283

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 98  MKIPLNNGDGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 145

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   Y Y +K+VPTVY D +G    S Q++V 
Sbjct: 146 LSFGDKLQVQNIHGAFNALGGADKRSSNPLASYDYILKIVPTVYEDKNGKQRYSYQYTVA 205

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 206 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 263

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 264 DSCIFTASEA-WKKIQLGKM 282


>gi|109079798|ref|XP_001099287.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Macaca mulatta]
          Length = 379

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 194 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 241

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 242 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 301

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 302 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 359

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 360 DSCIFTASEAW-KKIQLGKM 378


>gi|449272958|gb|EMC82607.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Columba livia]
          Length = 297

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 107/200 (53%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 112 MKIPLNNGDGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 159

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L+G       P   + Y +K+VPTVY D+ G    S Q++V 
Sbjct: 160 LSFGDKLQVHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMGGKQRYSYQYTVA 219

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T++CAI+GG FTV+GI+
Sbjct: 220 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 277

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 278 DSCIFTASEA-WKKIQLGKM 296


>gi|73953406|ref|XP_852891.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 isoform 1 [Canis lupus familiaris]
          Length = 290

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 76/200 (38%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            RI    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MRIPVNNGAGCRFEGHFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|392351111|ref|XP_001066818.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Rattus norvegicus]
          Length = 497

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/199 (37%), Positives = 104/199 (52%), Gaps = 22/199 (11%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+KL
Sbjct: 313 KIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 360

Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
           +FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V  
Sbjct: 361 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 420

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
            E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+D
Sbjct: 421 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILD 478

Query: 278 AFIYHGQRAIKKKIEIGKF 296
           + I+    A  KKI++GK 
Sbjct: 479 SCIFTASEAW-KKIQLGKI 496


>gi|6330243|dbj|BAA86495.1| KIAA1181 protein [Homo sapiens]
          Length = 336

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 151 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 198

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 199 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 258

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 259 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 316

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 317 DSCIFTASEAW-KKIQLGKM 335


>gi|58261152|ref|XP_567986.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134115843|ref|XP_773404.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256029|gb|EAL18757.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57230068|gb|AAW46469.1| ER to Golgi transport-related protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 431

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 98/188 (52%), Gaps = 14/188 (7%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
           K ++G  C IYG +EV KV  N H              H  ++FQ       N+SH +++
Sbjct: 202 KVQDGPACRIYGSVEVKKVTANLHIT---------TLGHGYMSFQHTDHHLMNLSHVVHE 252

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
            +FG  FP +  PLD      E P  ++QYF++VVPT Y D S   + ++Q++VT++ RS
Sbjct: 253 FSFGPFFPAIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 312

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHG 283
            E G+   +PG+FF YDL P+ V   E   S   FL  +  +VGGV+TV+          
Sbjct: 313 FEHGK--GVPGLFFKYDLEPMSVIIRERTTSLYQFLIRLAGVVGGVWTVAAFALRVFNRA 370

Query: 284 QRAIKKKI 291
           Q+ + K +
Sbjct: 371 QKHVSKAV 378


>gi|326427137|gb|EGD72707.1| hypothetical protein PTSG_04435 [Salpingoeca sp. ATCC 50818]
          Length = 357

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 99/174 (56%), Gaps = 3/174 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + E + C ++G L V KVA NFH   GKS H S  H H       D+ N SH+I++ +F 
Sbjct: 165 DSEPDACRLHGVLPVAKVAANFHITAGKSVHHSRGHSHVNSMVPPDAVNFSHRIDRFSFS 224

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTEHFRSSEQ 226
           E   G +  LDG   T + P  ++QYF++VVP+    +      +SNQ+SVTE  R  ++
Sbjct: 225 EEPRGAMA-LDGDLRTTDQPRQVFQYFLEVVPSTTQRLGQRQPFRSNQYSVTEQHRVLKE 283

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           G  + +PG++F +D+  I V+ +EEH      L  +C IVGG+   SG++ +FI
Sbjct: 284 G-ARGIPGIYFKFDIESIGVSVSEEHPPLSRLLIRLCGIVGGIVAASGMLHSFI 336


>gi|194382656|dbj|BAG64498.1| unnamed protein product [Homo sapiens]
          Length = 235

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 50  MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 97

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 98  LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 157

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 158 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 215

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 216 DSCIFTASEAW-KKIQLGKM 234


>gi|395817675|ref|XP_003782285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Otolemur garnettii]
          Length = 356

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 171 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 218

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 219 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 278

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 279 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 336

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 337 DSCIFTASEAW-KKIQLGKM 355


>gi|383865060|ref|XP_003707993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Megachile rotundata]
          Length = 392

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 104/171 (60%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
            C I+G L VNKV+GNFH   GKS      H+H I AF  D  +N +H+INK +FG   P
Sbjct: 169 ACRIHGSLNVNKVSGNFHITAGKSLSIPRGHIH-ISAFMIDRDYNFTHRINKFSFGGPSP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSE-QGRL 229
           GVV+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV ++ R  + Q   
Sbjct: 228 GVVHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLS--TSKTYQYSVKDYQRPIDHQKGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +PG+FF YD+S +K+  T++  +   FL  +CA VGG+F  SG++   +
Sbjct: 286 HGVPGIFFKYDMSALKIKVTQQRDTVSQFLVKLCATVGGIFVTSGLVKNIV 336


>gi|417409674|gb|JAA51332.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein, partial [Desmodus rotundus]
          Length = 318

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 133 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 180

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 181 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 240

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 241 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 298

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 299 DSCIFTASEA-WKKIQLGKM 317


>gi|114603487|ref|XP_001145588.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pan troglodytes]
          Length = 424

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 239 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 286

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 287 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 346

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 347 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 404

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 405 DSCIFTASEA-WKKIQLGKM 423


>gi|410949214|ref|XP_003981318.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Felis catus]
          Length = 398

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 213 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 260

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 261 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 320

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 321 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 378

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 379 DSCIFTASEA-WKKIQLGKM 397


>gi|281351238|gb|EFB26822.1| hypothetical protein PANDA_005115 [Ailuropoda melanoleuca]
          Length = 238

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 53  MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 100

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 101 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 160

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 161 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 218

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 219 DSCIFTASEAW-KKIQLGKM 237


>gi|115452719|ref|NP_001049960.1| Os03g0321400 [Oryza sativa Japonica Group]
 gi|113548431|dbj|BAF11874.1| Os03g0321400, partial [Oryza sativa Japonica Group]
          Length = 83

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 59/83 (71%), Positives = 70/83 (84%), Gaps = 1/83 (1%)

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           QFSVTEHFR +  G  +  PGV+FFY+ SPIKV FTEE+ S LHFLTN+CAIVGG+FTV+
Sbjct: 1   QFSVTEHFREA-IGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICAIVGGIFTVA 59

Query: 274 GIIDAFIYHGQRAIKKKIEIGKF 296
           GIID+F+YHG RAIKKK+EIGK 
Sbjct: 60  GIIDSFVYHGHRAIKKKMEIGKL 82


>gi|50510831|dbj|BAD32401.1| mKIAA1181 protein [Mus musculus]
          Length = 320

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 135 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 182

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 183 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 242

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 243 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 300

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 301 DSCIFTASEAW-KKIQLGKI 319


>gi|123449396|ref|XP_001313417.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121895300|gb|EAY00488.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 361

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 140/281 (49%), Gaps = 43/281 (15%)

Query: 19  RLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVR 78
           RLDSQG  IE+      +  ++  +Q            CGSCY A+     CC +C+EV 
Sbjct: 117 RLDSQGKPIEALD---LSTLVNTTVQEK----------CGSCYNAKDPKRICCRSCQEVF 163

Query: 79  EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH 138
           +AYR   +       I+QCK     +++ + EGEGC +    +  +VA   H APG S++
Sbjct: 164 DAYRDAAFKPPVLTEIEQCKPVA--EKVAKMEGEGCKVDASFKALRVASEMHIAPGYSWN 221

Query: 139 QSGVHVHDILAFQRD--SFNISHKINKLAFGEH---FPGVVNPLDGVRWTQETPSGMYQY 193
             G HVHD+  F ++  S N++H I+ L+F E    +P  +N L+ V    +T +G +  
Sbjct: 222 SEGWHVHDLSLFTKEFASLNLTHTIHYLSFSEKEGDYP--LNNLNNV----QTENGAW-- 273

Query: 194 FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIK-VTFTEEH 252
             +VV T       ++    Q    + F S          G+FF YD+SPI  VT+T+  
Sbjct: 274 --RVVYTADILEGNYSASKYQMYNPKSFAS----------GLFFKYDVSPISAVTYTDSE 321

Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
             F H LT +  ++GGV  +  +IDA  +H +R +K+  EI
Sbjct: 322 PVF-HLLTRILTVLGGVLGLCRLIDAITFHTRR-MKRTEEI 360


>gi|407927953|gb|EKG20833.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
          Length = 366

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 68/167 (40%), Positives = 97/167 (58%), Gaps = 14/167 (8%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L+ N+V G+FH  A G  + + G H+          FN SH+IN+L+FG ++
Sbjct: 171 DSCRIYGSLDANRVQGDFHITARGHGYMEFGEHL------DHSQFNFSHQINELSFGPYY 224

Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           P + NPLD  R    TP      +QY++ VVPTVYTD S HTI +NQ++VTE   S  + 
Sbjct: 225 PSLTNPLDYTRAVTPTPDDHFYKFQYYLSVVPTVYTDNS-HTIVTNQYAVTEQSHSVPE- 282

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              ++PGVF  +D+ PIK+T +E +  FL  L  +  +V GV    G
Sbjct: 283 --MSVPGVFVKFDIEPIKLTISEYNGGFLALLIRLVNVVSGVMVAGG 327


>gi|355686511|gb|AER98080.1| endoplasmic reticulum-golgi intermediate compartment 1 [Mustela
           putorius furo]
          Length = 312

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 128 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 175

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 176 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 235

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 236 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 293

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 294 DSCIFTASEA-WKKIQLGKM 312


>gi|390459630|ref|XP_002744599.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Callithrix jacchus]
          Length = 342

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 157 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 204

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 205 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 264

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 265 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 322

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 323 DSCIFTASEA-WKKIQLGKM 341


>gi|351705474|gb|EHB08393.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Heterocephalus glaber]
          Length = 305

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 120 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 167

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 168 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQWYSYQYTVA 227

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 228 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 285

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 286 DSCIFTASEA-WKKIQLGKM 304


>gi|301763094|ref|XP_002916978.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Ailuropoda melanoleuca]
          Length = 306

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 121 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 168

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 169 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 228

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 229 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 286

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 287 DSCIFTASEA-WKKIQLGKM 305


>gi|354477345|ref|XP_003500881.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Cricetulus griseus]
          Length = 333

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 148 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 195

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 196 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 255

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 256 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 313

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 314 DSCIFTASEA-WKKIQLGKI 332


>gi|340507573|gb|EGR33515.1| hypothetical protein IMG5_050820 [Ichthyophthirius multifiliis]
          Length = 290

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 74/191 (38%), Positives = 109/191 (57%), Gaps = 24/191 (12%)

Query: 103 LQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
           LQRI++  +  EGC + GF+ VN+V GNFH +   +F Q   +V  I     ++ ++SHK
Sbjct: 88  LQRIQQAIQNKEGCKLSGFMYVNRVPGNFHIS-CHAFGQILGYVFRITGI--NTIDLSHK 144

Query: 161 INKLAFGEH----------FPGVVNPLDGVRWTQ----ETPSGMYQYFIKVVPTVYTDVS 206
           IN L+FG+             GV+NP+D +  T+    E     Y Y++ VVPT Y D  
Sbjct: 145 INHLSFGDEDEIKIVKKQFTLGVLNPMDKLVKTKQKHFENYGISYNYYLNVVPTTYIDEW 204

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
           G+T   NQF  TE+     Q +   +P ++F YDLSP+ V F ++ + FLHFL  V AIV
Sbjct: 205 GYTYYVNQFVFTEN-----QIQTDYIPAIYFRYDLSPVTVMFKKDRMPFLHFLVQVSAIV 259

Query: 267 GGVFTVSGIID 277
           GG+FT++  +D
Sbjct: 260 GGIFTIAAFMD 270


>gi|13385678|ref|NP_080446.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Mus
           musculus]
 gi|52000733|sp|Q9DC16.1|ERGI1_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|12835932|dbj|BAB23423.1| unnamed protein product [Mus musculus]
 gi|13529617|gb|AAH05516.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
           musculus]
 gi|26351067|dbj|BAC39170.1| unnamed protein product [Mus musculus]
 gi|26353098|dbj|BAC40179.1| unnamed protein product [Mus musculus]
 gi|53236959|gb|AAH83144.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
           musculus]
 gi|71059789|emb|CAJ18438.1| 1200007D18Rik [Mus musculus]
 gi|74185526|dbj|BAE30231.1| unnamed protein product [Mus musculus]
 gi|148690563|gb|EDL22510.1| RIKEN cDNA 1200007D18 [Mus musculus]
 gi|158148953|dbj|BAF82010.1| MAA-136 protein [Mus musculus]
          Length = 290

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHTIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKI 289


>gi|410349413|gb|JAA41310.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
 gi|410349417|gb|JAA41312.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
          Length = 290

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|71409973|ref|XP_807304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871276|gb|EAN85453.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 393

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 98/299 (32%), Positives = 146/299 (48%), Gaps = 40/299 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D++G  +L+V  +IFK  +D+QGN   I +RQ G+G        +       ++  +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164

Query: 59  SCYGAE---SSDED---CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
            C+ +E   S  E+   CCN C +V  AY ++G      + ++QC  +  L RI      
Sbjct: 165 RCFISEHQLSMSENKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYD--LSRINP---- 218

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
           GCN  G L V K  G   FAP +     G  + D++ F  DS   SH INKL+ G+    
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIRDVMQF--DS---SHIINKLSIGDERVT 271

Query: 171 ----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
                GV +PL+G  +  +      +YF+KVVPT+Y  +SG    S  F+ T  +     
Sbjct: 272 RFSRRGVQHPLNGHEFDTQRRFTEIRYFLKVVPTMY--LSGK--NSASFNATYEYSVQWS 327

Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            RL  +     P V   +D  P++V       SF HFL  +C IVGG+F V G+ID  +
Sbjct: 328 HRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFLVQLCGIVGGLFVVLGLIDGLV 386


>gi|313247758|emb|CBY15879.1| unnamed protein product [Oikopleura dioica]
          Length = 285

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/191 (40%), Positives = 102/191 (53%), Gaps = 22/191 (11%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           K ++  GC  +G   VNKV GNFH +   S  Q   H HD        FN  HKINKL F
Sbjct: 108 KNQQKSGCRFHGEFYVNKVPGNFHVSTHASKKQP--HKHD--------FN--HKINKLFF 155

Query: 167 GE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF 221
           GE       PG    L G   T E PS  Y Y +K+VPTV+ D    T    Q++VT   
Sbjct: 156 GEDLSALELPGNQTSLAGQATTNE-PSLSYDYTLKIVPTVHNDNKRRTTFGYQYTVTSKT 214

Query: 222 RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
             + +G     P ++F Y+++PI V +T +   F H LT +CAIVGG FTV+G+ID+ I+
Sbjct: 215 FKNTRGT----PAIWFRYEIAPITVKYTHKKKPFYHLLTTICAIVGGTFTVAGMIDSMIF 270

Query: 282 HGQRAIKKKIE 292
              +A+KK  E
Sbjct: 271 SAHQAVKKASE 281


>gi|403290258|ref|XP_003936243.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Saimiri boliviensis boliviensis]
          Length = 415

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 230 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 277

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 278 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGRQQYSYQYTVA 337

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 338 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 395

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 396 DSCIFTASEAW-KKIQLGKM 414


>gi|338713524|ref|XP_001499596.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Equus caballus]
          Length = 356

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            ++    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 171 MKVPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 218

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 219 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 278

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 279 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 336

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 337 DSCIFTASEAW-KKIQLGKM 355


>gi|397485838|ref|XP_003814045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pan paniscus]
          Length = 290

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDMLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|402873423|ref|XP_003900575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Papio anubis]
 gi|380784387|gb|AFE64069.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
 gi|383408185|gb|AFH27306.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
 gi|384941372|gb|AFI34291.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
          Length = 290

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|348575225|ref|XP_003473390.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Cavia porcellus]
          Length = 345

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 160 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 207

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 208 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 267

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 268 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 325

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 326 DSCIFTASEAW-KKIQLGKM 344


>gi|432100023|gb|ELK28916.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Myotis davidii]
          Length = 298

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 113 MKIPLNSGAGCRFEGQFSINKVPGNFH-----------VSTHSASA-QPQNPDMTHVIHK 160

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 161 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 220

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 221 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 278

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 279 DSCIFTASEA-WKKIQLGKM 297


>gi|350594414|ref|XP_003134100.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Sus scrofa]
          Length = 313

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 106/200 (53%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I   +G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 128 MKIPLNDGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPPNPDMTHVIHK 175

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 176 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 235

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 236 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 293

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 294 DSCIFTASEA-WKKIQLGKM 312


>gi|72534712|ref|NP_001026881.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Homo sapiens]
 gi|332248275|ref|XP_003273290.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Nomascus leucogenys]
 gi|426351000|ref|XP_004043047.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Gorilla gorilla gorilla]
 gi|51701446|sp|Q969X5.1|ERGI1_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|15215343|gb|AAH12766.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [Homo sapiens]
 gi|15680269|gb|AAH14490.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [Homo sapiens]
 gi|119581826|gb|EAW61422.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1,
           isoform CRA_a [Homo sapiens]
 gi|208966210|dbj|BAG73119.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [synthetic construct]
 gi|410301142|gb|JAA29171.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
 gi|410349415|gb|JAA41311.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
          Length = 290

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|145349688|ref|XP_001419260.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579491|gb|ABO97553.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 310

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/184 (36%), Positives = 99/184 (53%), Gaps = 15/184 (8%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           R  + + EGC ++G LE  +VAG    + G   ++    ++D    +    ++ H +   
Sbjct: 134 REAKADVEGCRLHGELEARRVAGTLRASTGPESYEFLKEIYD----EPWEIDMRHAVKTF 189

Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------TIQSNQFS 216
            FG  FPG VNP++GVR   ET SG+Y+YF+KVVPT Y+               ++NQ+S
Sbjct: 190 TFGAEFPGAVNPMNGVR-RMETKSGIYKYFMKVVPTTYSSTRALFGFIPWTVRTRTNQYS 248

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           VTEHF   E      LP +FF YDLS I V  T    S ++FLT   A +GG+F ++  +
Sbjct: 249 VTEHF--IETPHWGALPQLFFIYDLSAIAVNITVTSKSIVYFLTKTLATMGGIFALTRTV 306

Query: 277 DAFI 280
           D +I
Sbjct: 307 DRYI 310


>gi|355691849|gb|EHH27034.1| hypothetical protein EGK_17136, partial [Macaca mulatta]
 gi|355750428|gb|EHH54766.1| hypothetical protein EGM_15664, partial [Macaca fascicularis]
          Length = 290

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|395736490|ref|XP_002816264.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pongo abelii]
          Length = 290

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|407424942|gb|EKF39210.1| hypothetical protein MOQ_000571 [Trypanosoma cruzi marinkellei]
          Length = 393

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 93/299 (31%), Positives = 144/299 (48%), Gaps = 40/299 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D++G  +L+V  +IFK  +D+QGN   I +RQ G+G        +       ++  +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164

Query: 59  SCY------GAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
            C+        + +   CCN C++V  AY ++G        ++QC  +  L RI      
Sbjct: 165 RCFINEHQVSVKENKNRCCNTCDDVLNAYDQQGLPRPRKSEVEQCIYD--LSRINP---- 218

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
           GCN  G L V K  G   FAP +     G  + D++ F  DS   SH INKL+ G+    
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--SGGFLIKDVMQF--DS---SHVINKLSIGDERVT 271

Query: 171 ----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
                GV +PL+G ++  +      +YF+K+VPT+Y  +SG    S  F+ T  +     
Sbjct: 272 RFSRRGVQHPLNGHKFDTQRRITEIRYFLKIVPTMY--LSGK--NSAPFNATYEYSVQWS 327

Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            RL  +     P V   +D  P++V       SF HF+  +C IVGG+F V G+ID  +
Sbjct: 328 QRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386


>gi|322792517|gb|EFZ16475.1| hypothetical protein SINV_13267 [Solenopsis invicta]
          Length = 110

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 58/110 (52%), Positives = 80/110 (72%), Gaps = 7/110 (6%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT----LPGVFFFYDLSPIK 245
           M+ ++IK+VPT Y    G T+ +NQFSVT H   ++Q  L T    +PG+FF Y+LSP+ 
Sbjct: 4   MFYHYIKIVPTTYVRADGSTLLTNQFSVTRH---AKQVSLLTGESGMPGIFFSYELSPLM 60

Query: 246 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           V +TE+  SF HF TN CAI+GGVFTV+G+ID+ +YH  RAI++KIE+GK
Sbjct: 61  VKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELGK 110


>gi|440902711|gb|ELR53466.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1,
           partial [Bos grunniens mutus]
          Length = 290

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|390337315|ref|XP_792272.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Strongylocentrotus purpuratus]
 gi|390337317|ref|XP_003724529.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Strongylocentrotus purpuratus]
          Length = 388

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 66/171 (38%), Positives = 98/171 (57%), Gaps = 4/171 (2%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C ++G L  NKVAGNFH   GKS      H H  L    +++N SH+I+  ++G   P
Sbjct: 169 DACRLHGSLTTNKVAGNFHVTIGKSIPHPRGHAHLALMIDPNNYNFSHRIDHFSYGTPVP 228

Query: 172 GVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
           G+VNPLDG ++ T E+   +YQYFI++VPT           ++Q++VTE  R    G   
Sbjct: 229 GIVNPLDGDLKVTNESLQ-IYQYFIQIVPT-KVKTRAAKAHTHQYAVTERERVINHGAGS 286

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             + G+FF Y+LS + ++  E +  F   L  +C IVGGVF  SGII++ +
Sbjct: 287 HGVTGIFFKYELSSLVISVEEVYDPFWKLLVRLCGIVGGVFATSGIINSLM 337


>gi|426246271|ref|XP_004016918.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Ovis aries]
          Length = 290

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|158292441|ref|XP_001688474.1| AGAP005044-PB [Anopheles gambiae str. PEST]
 gi|157016994|gb|EDO64057.1| AGAP005044-PB [Anopheles gambiae str. PEST]
          Length = 287

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 102/172 (59%), Gaps = 2/172 (1%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           I E+  + C I+G L +NKVAGNFH   GK+ H S  H+H    F     N SH+IN+ +
Sbjct: 79  IPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFANTQTNFSHRINRFS 138

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG+H  G+++PL+G     +    M QYFI+VVPT       H+ ++ Q++V E+ +  +
Sbjct: 139 FGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-KTYQYTVRENLQLID 197

Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + +Q + G++F YD+S ++V   ++  S  HF+  + +I+ G+  +SG++
Sbjct: 198 IDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGML 249


>gi|307206941|gb|EFN84785.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Harpegnathos saltator]
          Length = 396

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/171 (39%), Positives = 101/171 (59%), Gaps = 6/171 (3%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD-SFNISHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GKS      H+H I AF  D  +N +H+IN+ +FG   P
Sbjct: 169 ACRIHGSLNVNKVAGNFHITTGKSLSVPRGHIH-ISAFMTDRDYNFTHRINRFSFGGPSP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
           G+V+PL+G     +    +YQYF++VVPT + T +S  T ++ Q+SV ++ R        
Sbjct: 228 GIVHPLEGDEKIADYNMMLYQYFVEVVPTDIRTLLS--TSKTYQYSVKDYQRPINHNEGS 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +PG+F  Y++S +K+  T++  +   FL  +CA VGG+F  SG+I   +
Sbjct: 286 HGVPGIFIKYNMSALKIKVTQQRDTIFQFLVKLCATVGGIFVTSGLIKNIV 336


>gi|149052230|gb|EDM04047.1| rCG34297 [Rattus norvegicus]
          Length = 283

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 98  MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 145

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 146 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 205

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 206 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 263

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 264 DSCIFTASEA-WKKIQLGKI 282


>gi|392331685|ref|XP_003752358.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Rattus norvegicus]
          Length = 290

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKI 289


>gi|344265732|ref|XP_003404936.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Loxodonta africana]
          Length = 338

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 153 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 200

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G       P   + Y +K+VPTVY D +G    S Q++V 
Sbjct: 201 LSFGDTLQVQNVQGAFNALGGADRLHSNPLASHDYILKIVPTVYEDKNGKQRYSYQYTVA 260

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 261 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 318

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 319 DSCIFTASEAW-KKIQLGKM 337


>gi|158292439|ref|XP_313915.3| AGAP005044-PA [Anopheles gambiae str. PEST]
 gi|157016993|gb|EAA09437.3| AGAP005044-PA [Anopheles gambiae str. PEST]
          Length = 371

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 102/172 (59%), Gaps = 2/172 (1%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           I E+  + C I+G L +NKVAGNFH   GK+ H S  H+H    F     N SH+IN+ +
Sbjct: 163 IPEKPHDACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSIFANTQTNFSHRINRFS 222

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG+H  G+++PL+G     +    M QYFI+VVPT       H+ ++ Q++V E+ +  +
Sbjct: 223 FGDHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHS-KTYQYTVRENLQLID 281

Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + +Q + G++F YD+S ++V   ++  S  HF+  + +I+ G+  +SG++
Sbjct: 282 IDKGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGML 333


>gi|307105802|gb|EFN54050.1| hypothetical protein CHLNCDRAFT_136126 [Chlorella variabilis]
          Length = 319

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 66/180 (36%), Positives = 93/180 (51%), Gaps = 17/180 (9%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAFGEH 169
           EGCNI+G+LEV +VAGN HFA         ++   I+    D+   NISH          
Sbjct: 152 EGCNIHGWLEVQRVAGNVHFAVRPEALFLSMNAEAIMQLHPDASKLNISHA--------- 202

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
                NPL+GV     T +G+ +YF+KVVPT +  + G    + Q+SVTE++     G  
Sbjct: 203 -----NPLEGVAQIDRTATGIDKYFVKVVPTDFYTLWGRKTHTYQYSVTEYYHQFRGGEE 257

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           Q  P V+  YD SPI V   E     L  L  VCA+VGG F ++G+ D  ++    A+K+
Sbjct: 258 QP-PAVYLLYDASPIMVDIREMRPGLLRLLVRVCAVVGGAFALTGLFDKMVHRAVVAVKR 316


>gi|7341109|gb|AAF61208.1|AF216751_1 CDA14 [Homo sapiens]
          Length = 378

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/168 (42%), Positives = 97/168 (57%), Gaps = 7/168 (4%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI-SHKINKLAFGEHFP 171
            C I+G L VNKVAGNFH   GK+      H H     Q  +  I SH+I+ L+FGE  P
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCQPWNLTIFSHRIDHLSFGELVP 228

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGR 228
            ++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +    
Sbjct: 229 AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAG 285

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 286 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 333


>gi|115497382|ref|NP_001069885.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Bos
           taurus]
 gi|111308658|gb|AAI20358.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Bos
           taurus]
          Length = 290

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|308806572|ref|XP_003080597.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
 gi|116059058|emb|CAL54765.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
          Length = 327

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 102/191 (53%), Gaps = 29/191 (15%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPG-KSFHQSGVHVHDILAFQRDSFN------I 157
           R  + + EGC ++G +E  +VAG+   + G +SF            F R+ FN       
Sbjct: 141 RKAKADMEGCRLHGRVEARRVAGSLRISTGPESFE-----------FLREMFNEPWEIDA 189

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------HT 209
            H I   AFG  FPG VNPL+GV+  +E  SG+Y+YF+KVVPT Y +             
Sbjct: 190 RHAIKTFAFGPEFPGSVNPLNGVK-RKEKKSGIYKYFMKVVPTTYANSRNLFGMIPWTMR 248

Query: 210 IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
           +++NQ+SVTEHF  +E      LP + F YD+S I V    +  S ++FLT   A VGGV
Sbjct: 249 VRTNQYSVTEHF--TESAHWGMLPQILFSYDISAISVNVESQSKSGVYFLTKTIATVGGV 306

Query: 270 FTVSGIIDAFI 280
           F ++  ID ++
Sbjct: 307 FALTRTIDRYV 317


>gi|392577310|gb|EIW70439.1| hypothetical protein TREMEDRAFT_43159 [Tremella mesenterica DSM
           1558]
          Length = 435

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/168 (37%), Positives = 91/168 (54%), Gaps = 9/168 (5%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           K + G  C IYG +EV KV  N H       + S  H    L       N+SH +++ +F
Sbjct: 196 KADNGPACRIYGSVEVKKVTANLHITTLGHGYMSFEHTDHAL------MNLSHVVHEFSF 249

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G  FP +  PLD      + P    QYF++VVPT Y D +G  + ++Q++VT++ RS + 
Sbjct: 250 GPFFPAIAQPLDMTMQVSDNPFTAIQYFLRVVPTTYIDANGRKLVTSQYAVTDYLRSFQH 309

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC-AIVGGVFTVS 273
           G  Q +PG+FF YDL  + VT  E   S  HF+  +   IVGGV+TV+
Sbjct: 310 G--QGVPGIFFKYDLEAMAVTVRERTTSLYHFVIRLIGVIVGGVWTVA 355


>gi|296475934|tpg|DAA18049.1| TPA: endoplasmic reticulum-golgi intermediate compartment 32 kDa
           protein [Bos taurus]
          Length = 290

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGVGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHK 152

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 153 LSFGDTLQVHNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVA 212

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+GI+
Sbjct: 213 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 271 DSCIFTASEA-WKKIQLGKM 289


>gi|242006215|ref|XP_002423949.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
 gi|212507219|gb|EEB11211.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
          Length = 349

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 112/208 (53%), Gaps = 11/208 (5%)

Query: 86  WALSNPDLID-QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 144
           W  ++P  I+    R+    R      + C IYG L +NKVAGNFH + GKS      H+
Sbjct: 149 WKSASPSFINVYVPRKNLPNR----PYDACRIYGELVLNKVAGNFHISAGKSLQLPRGHI 204

Query: 145 HDILAFQRDS-FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VY 202
           H I  F  D  FN SH++N  +FG++ PG+V+PL+G           YQYFI+VVPT V 
Sbjct: 205 H-IATFMSDKEFNFSHRLNYFSFGDYSPGIVHPLEGDEKIATDAMMSYQYFIEVVPTEVK 263

Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
           T ++     + Q+SV ++ R          +PG+FF YD+S +KV   +E  S ++F   
Sbjct: 264 TFLTNQL--TYQYSVKDYQRPINHNTGSHGIPGIFFKYDMSALKVIVMQERDSPINFAVK 321

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
           +CA +GG+   SG+++  I +     KK
Sbjct: 322 LCASIGGIHITSGLVNNIILYLINFYKK 349


>gi|270003406|gb|EEZ99853.1| hypothetical protein TcasGA2_TC002635 [Tribolium castaneum]
          Length = 380

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/173 (38%), Positives = 107/173 (61%), Gaps = 8/173 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEH 169
           + C I+G L +NKV+GNFH   GKS +    H+H I AF  +RD +N SH+I+  +FG+ 
Sbjct: 175 DACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD-YNFSHRIDTFSFGDS 232

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
            PG+++PL+G          ++ YFI+VVPT V T ++   + + Q+SV E  R  +  +
Sbjct: 233 SPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNTYQYSVKELNRPIDHDK 290

Query: 229 -LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
               +PG+FF YD+S +KVT ++E      FL  +C+I+GG+F  SG +++F+
Sbjct: 291 GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFVCSGFVNSFV 343


>gi|189235693|ref|XP_966630.2| PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
          Length = 373

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 67/173 (38%), Positives = 107/173 (61%), Gaps = 8/173 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF--QRDSFNISHKINKLAFGEH 169
           + C I+G L +NKV+GNFH   GKS +    H+H I AF  +RD +N SH+I+  +FG+ 
Sbjct: 168 DACRIHGSLILNKVSGNFHITAGKSLNLPRGHIH-ISAFMSERD-YNFSHRIDTFSFGDS 225

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
            PG+++PL+G          ++ YFI+VVPT V T ++   + + Q+SV E  R  +  +
Sbjct: 226 SPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN--VNTYQYSVKELNRPIDHDK 283

Query: 229 -LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
               +PG+FF YD+S +KVT ++E      FL  +C+I+GG+F  SG +++F+
Sbjct: 284 GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSIIGGIFVCSGFVNSFV 336


>gi|301626814|ref|XP_002942582.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like, partial [Xenopus (Silurana) tropicalis]
          Length = 298

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/200 (37%), Positives = 105/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I      GC   GF  +NKV GNFH           V  H  +A Q  + ++ H I+K
Sbjct: 113 MKIPINNAHGCRFEGFFSINKVPGNFH-----------VSTHSAMA-QPANPDMRHIIHK 160

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     E+  G  N L G           + Y +K+VPTVY D++G    S Q++V 
Sbjct: 161 LSFGNTLQVENIHGAFNALGGADKLASQALESHDYVLKIVPTVYEDMNGEQQFSYQYTVA 220

Query: 219 E--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              +   S  GR+  +P ++F YDLSPI V +TE       F+T VCAI+GG FTV+GI+
Sbjct: 221 NKAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPIYRFITTVCAIIGGTFTVAGIL 278

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+FI+    A  KKI++GK 
Sbjct: 279 DSFIFTASEA-WKKIQLGKM 297


>gi|328862174|gb|EGG11276.1| hypothetical protein MELLADRAFT_33547 [Melampsora larici-populina
           98AG31]
          Length = 361

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/226 (33%), Positives = 112/226 (49%), Gaps = 24/226 (10%)

Query: 51  EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKE-- 108
           E  E   G     E++++   +  + VR+A  + GW            R  F ++ K   
Sbjct: 111 EGTEFSIGQAARLETNNDAGISASKMVRDA--QGGWT-----------RPTF-KKTKPLI 156

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
            EG  C I+G   V KV GN H       + S  H    L       N++H I++ +FGE
Sbjct: 157 PEGPACRIFGSTHVKKVTGNLHITTLGHGYLSWEHTDHQL------MNLTHVISEFSFGE 210

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
            FP +V PLD      + P  ++QYFI VVPT Y +  G  + +NQ+SVT+  RS+E GR
Sbjct: 211 FFPNMVQPLDNSVEITDKPFHIFQYFISVVPTTYINSGGRQVFTNQYSVTDMSRSTEHGR 270

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              +PG+FF YD+ P+ +T  E   + + FL  +  IVGG+   +G
Sbjct: 271 --GVPGIFFKYDIEPMYLTIRERTTTLVQFLVRLAGIVGGIVVCTG 314


>gi|407859749|gb|EKG07137.1| hypothetical protein TCSYLVIO_001725 [Trypanosoma cruzi]
          Length = 393

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 95/299 (31%), Positives = 142/299 (47%), Gaps = 40/299 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D++G  +L+V  +IFK  +D+QGN   I +RQ G+G        +       ++  +CG
Sbjct: 109 LDVTGTVNLNVTRNIFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPNSPQFCG 164

Query: 59  SCYGAE------SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
            C+ +E       +   CCN C +V  AY ++G      + ++QC  E  L  I      
Sbjct: 165 RCFISEHQLSMMDNKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYE--LSLINP---- 218

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
           GCN  G L V K  G   FAP +     G  + D++ F  DS   SH INKL+ G+    
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIKDVMQF--DS---SHIINKLSIGDERVT 271

Query: 172 -----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
                GV +PL+G  +  +      +YF+KVVPT+Y   SG    S  F+ T  +     
Sbjct: 272 RFSRRGVQHPLNGHEFVAQRRFTEIRYFLKVVPTMY--FSGK--NSASFNATYEYSVQWS 327

Query: 227 GRLQTL-----PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            RL  +     P V   +D  P++V       SF HF+  +C IVGG+F V G+ID  +
Sbjct: 328 HRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386


>gi|225712696|gb|ACO12194.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Lepeophtheirus salmonis]
          Length = 372

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/178 (37%), Positives = 99/178 (55%), Gaps = 6/178 (3%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           I +E  + C I+G L +NKVAGNFH +PGK+      HVH       + +N +H+I++ +
Sbjct: 166 IPDEPHDACRIHGSLTLNKVAGNFHISPGKTLPLFRAHVHFATFGGDEVYNFTHRIDRFS 225

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT---IQSNQFSVTEHFR 222
           FG    G+V PL+G        S  YQY I+VVP   TD+ G+T     + Q+SV EH R
Sbjct: 226 FGTPHGGIVQPLEGEEKIAMQDSMHYQYLIQVVP---TDIQGYTDLIWSTYQYSVKEHKR 282

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           ++++      PG++F YD+S +KV  +++      FL  + A VGG    S I+  FI
Sbjct: 283 ATKERGSGDTPGIYFKYDMSALKVLASQDREPIFKFLVRLLAAVGGRIATSQIVCVFI 340


>gi|148678794|gb|EDL10741.1| ERGIC and golgi 2, isoform CRA_a [Mus musculus]
          Length = 375

 Score =  117 bits (294), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 64/168 (38%), Positives = 91/168 (54%), Gaps = 16/168 (9%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 176 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 235

Query: 172 GVVNPLDGVR--WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGR 228
           G++NPLDG         P+ ++ Y I             +  ++QFSVTE  R  +    
Sbjct: 236 GIINPLDGTEKIAVDLVPTKLHTYKI-------------SADTHQFSVTERERIINHAAG 282

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              + G+F  YDLS + VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 283 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 330


>gi|154418008|ref|XP_001582023.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916255|gb|EAY21037.1| hypothetical protein TVAG_172950 [Trichomonas vaginalis G3]
          Length = 371

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 78/245 (31%), Positives = 116/245 (47%), Gaps = 14/245 (5%)

Query: 56  YCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           YCG+CY   S+D+ CCN C EV + ++ KG         +QC REG L    +   E C 
Sbjct: 134 YCGNCY--LSTDKKCCNTCREVMDVFKAKGLTYYASFRWEQCIREGVL----DFGNETCR 187

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQS-GVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
           I G L+V K +GNFH A G + + +   H HD+ +    S  ++H I+ L FGE      
Sbjct: 188 IKGKLKVKKQSGNFHIALGANTNDNYKGHSHDLSSVDA-SHKLNHVIHSLTFGEPVDYYK 246

Query: 175 NPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
             L  V       +G    M  Y++   P   +  +   I S ++S     R       +
Sbjct: 247 PQLTDVEMQLPELNGSNYWMVTYYLHAAPERIS--TTDKIDSYRYSAFPSRRKVTNKTKK 304

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             PG+ F+YD +P+ V +   H S    + ++C IVGG F+ + IIDA  +     I+ K
Sbjct: 305 GFPGIVFYYDFAPMIVVYQPTHGSIRSIIVDICGIVGGAFSFAAIIDALAFGALSGIRGK 364

Query: 291 IEIGK 295
             IGK
Sbjct: 365 TMIGK 369


>gi|321258600|ref|XP_003194021.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
 gi|317460491|gb|ADV22234.1| ER to Golgi transport-related protein, putative [Cryptococcus
           gattii WM276]
          Length = 444

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 61/171 (35%), Positives = 92/171 (53%), Gaps = 14/171 (8%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS---FNISHKINK 163
           K ++G  C IYG ++V KV  N H              H  ++FQ       N+SH +++
Sbjct: 204 KVQDGPACRIYGSVQVKKVTANLHITTLG---------HGYMSFQHTDHHLMNLSHVVHE 254

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
            +FG  FP +  PLD        P  ++QYF++VVPT Y D S   + ++Q++VT++ RS
Sbjct: 255 FSFGPFFPAIAQPLDQSYEITLQPFTIFQYFLRVVPTTYIDASRRKLITSQYAVTDYSRS 314

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            E G+   +PG+FF YDL P+ V   E   S   FL  +  +VGGV+TV+ 
Sbjct: 315 FEHGK--GVPGLFFKYDLEPMSVVIRERTTSLFQFLIRLAGVVGGVWTVAA 363


>gi|412991249|emb|CCO16094.1| predicted protein [Bathycoccus prasinos]
          Length = 409

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 109/235 (46%), Gaps = 69/235 (29%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E+ EGC +YG + V +V GNFH +     +++  H    +    +  NISH I  L+FG 
Sbjct: 173 EKKEGCRLYGRMHVQRVGGNFHISAHAEEYETLQHAFGAV----NKINISHTITHLSFGA 228

Query: 169 HFPGVVNPLDGV------------------------------------------------ 180
            +PG+VNPLDGV                                                
Sbjct: 229 GYPGLVNPLDGVARSGSDDEFHYDESSKDSRSSDRKNIEKEKEEEEKRKKKEQVRRSRLM 288

Query: 181 --RWTQETPSGMYQYFIKVVPTVYTDVSG---------HTIQSNQFSVTEHFRSSEQGRL 229
              W  E  SG+Y+YF+K+VPT Y               ++ +NQ+SVTE+FR ++    
Sbjct: 289 DLTW-DENGSGVYKYFLKLVPTFYRTHRSVFLGLFSWTKSVSTNQYSVTEYFRKTDAWS- 346

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT----VSGIIDAFI 280
            +LP V+F YD SPI VT   +   F++FLT +CA+ GGVF     +S ++DA +
Sbjct: 347 GSLPAVYFLYDFSPIAVTIDTKRPHFVYFLTRLCAVCGGVFAFAHMISNLVDALL 401


>gi|358333955|dbj|GAA52416.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Clonorchis sinensis]
          Length = 306

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 65/168 (38%), Positives = 91/168 (54%), Gaps = 6/168 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + CNI G   V KVAGN H  PG+ F    G HVH     +   FN SH+IN L+FG   
Sbjct: 86  DACNIVGTFHVQKVAGNMHVLPGRPFDGPGGSHVHIAPFVRLADFNFSHRINHLSFGAQV 145

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
              VNPLD V      P   ++Y+I +VPT  VY   +  ++ + Q+++T   R++E  +
Sbjct: 146 ANRVNPLDAVEEISYNPMETFRYYISIVPTRVVY---AFSSLDTYQYAITVKNRTAEGNK 202

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             ++PG+FF YD  P+ V  TE    F  FL  + A+VGG+F   G I
Sbjct: 203 SDSIPGIFFSYDTFPLLVQVTESRELFGTFLARLAALVGGLFATVGFI 250


>gi|331239265|ref|XP_003332286.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309311276|gb|EFP87867.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 366

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 92/175 (52%), Gaps = 12/175 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG  +V KV GN H       + S  H    L       N+SH I + +FG+ 
Sbjct: 157 DGPACRIYGNTQVKKVTGNLHITTLGHGYLSWEHTDHKL------MNLSHVITEFSFGQF 210

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +V PLD      + P  ++QYFI VVPT Y D  G  + +NQ+SVT+  R  E G  
Sbjct: 211 FPKIVQPLDNSVELTDKPFHIFQYFISVVPTTYIDRLGRQLHTNQYSVTDMSRPVEHG-- 268

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG----IIDAFI 280
           Q +PG+FF YD+ P+ +   E   S + FL  +  ++GG+   +G    ++D F+
Sbjct: 269 QGIPGLFFKYDMEPMSLILHERTTSLIQFLVRLAGMIGGIVVCTGWTFRLVDRFV 323


>gi|340372649|ref|XP_003384856.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Amphimedon queenslandica]
          Length = 347

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 92/169 (54%), Gaps = 1/169 (0%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C ++G ++VNKV+GNFH   G++      H H       +  N SH+I+   FG   PG
Sbjct: 164 SCRVHGHIQVNKVSGNFHITAGQAVPHPQGHAHLSAFVPTNMINFSHRIDSFGFGVSTPG 223

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS-SEQGRLQT 231
           +V+PL+G        + ++QY+I++VPT      G  + +NQ+SVTE  R+ S +     
Sbjct: 224 MVDPLEGTYVIARESNRLFQYYIQIVPTTLQMRGGSDLHTNQYSVTERNRAISHKAGSHG 283

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           LPG+FF Y++  + V   E       FL  +CAIVGGVF   G+I  F+
Sbjct: 284 LPGLFFKYEIYSLMVLMKEVDRPLSLFLVRLCAIVGGVFATLGMISQFL 332


>gi|198422133|ref|XP_002131157.1| PREDICTED: similar to ptx1 [Ciona intestinalis]
          Length = 391

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/172 (36%), Positives = 94/172 (54%), Gaps = 6/172 (3%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C  YG L +NKVAGNFH   GK     G H H  + F    +N SH+I+  +FG    
Sbjct: 173 DACRFYGNLPLNKVAGNFHIVAGKPIQMFGGHAHLSMMFSPIPYNFSHRIDHFSFGNMKT 232

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN--QFSVTEHFRSSEQGR- 228
           G +N LDG      + S ++QY++ VV    T ++   I ++  QFSV+E  R+ +    
Sbjct: 233 GFINALDGDERVTSSESYIFQYYLDVVS---TKINSRRITTDTFQFSVSEQSRALDHASG 289

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
               PGVFF Y+ SP+ V  TE+ + F   L  +C+IVGG+F  S +++A +
Sbjct: 290 SHGQPGVFFKYNFSPLSVMITEQKMPFYRLLVRLCSIVGGIFATSHVLNALL 341


>gi|156553212|ref|XP_001600226.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Nasonia vitripennis]
          Length = 391

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/176 (35%), Positives = 98/176 (55%), Gaps = 2/176 (1%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           I       C IYG L+VNKVAGNFH   GKS      H H        ++N +H+IN+ +
Sbjct: 161 IPSYPSNACRIYGSLDVNKVAGNFHVTSGKSVILPRGHFHFTSFHSSTAYNFTHRINRFS 220

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG+  PG+++PL+G          ++QYFI+VV T   ++  H  ++ Q+SV +H R   
Sbjct: 221 FGKPSPGIIHPLEGDEKITTDNMMLFQYFIEVVSTD-INMLMHKSKTYQYSVKDHQRPIN 279

Query: 226 QGR-LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +    +PG+FF YD S +K+  ++E  S   FL  +CA VG +F  +GI+++ +
Sbjct: 280 HAKGSHGIPGIFFKYDTSALKIKVSQERDSIGQFLVKLCATVGCIFVTNGILNSIV 335


>gi|385302035|gb|EIF46185.1| erv46p [Dekkera bruxellensis AWRI1499]
          Length = 266

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/180 (35%), Positives = 97/180 (53%), Gaps = 17/180 (9%)

Query: 1   MDISGEQHLDV-KHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           MD++G+   D+ + +  + RLD  G  I + +      K++K           +  YCGS
Sbjct: 89  MDLTGDVQADILEGNFLRTRLDRDGKEIATDE----PFKVNKEDXVKSELSTEDSQYCGS 144

Query: 60  CYGA--ESSDED--------CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE 109
           CYGA  +S +E         CCN+CE V+ AY K  W   + + I+QC++EG++ RI + 
Sbjct: 145 CYGAIDQSGNEKESDPTKWVCCNSCEAVKLAYSKAAWKFYDGEGIEQCEKEGYVDRINKR 204

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR--DSFNISHKINKLAFG 167
             EGC + G  ++N++ GN HFAPG S   +  HVHD+  F +  D FN  H IN  +FG
Sbjct: 205 LDEGCRVKGTAQLNRIGGNLHFAPGSSITMNDRHVHDLSLFDKHQDKFNFDHVINHFSFG 264


>gi|321465392|gb|EFX76393.1| hypothetical protein DAPPUDRAFT_306117 [Daphnia pulex]
          Length = 289

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 111/207 (53%), Gaps = 25/207 (12%)

Query: 101 GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           GF++   +    +G+GC       +N+V GNFH +          H  D    Q DS ++
Sbjct: 98  GFIENTLKTPWNKGKGCIFESRFHINRVPGNFHVS---------THSADK---QPDSADM 145

Query: 158 SHKINKLAFGE-----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS 212
           +H I  L FGE     + PG  NPL     +Q  P+  + Y +K+VPT+Y D +G T+ S
Sbjct: 146 AHYITSLTFGEMLDNKNLPGNFNPLARRDRSQADPAESHDYTMKIVPTIYEDSAGTTLVS 205

Query: 213 NQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
            Q++   + +   S  GR  +   ++F YDL+PI V + E       FLT+VCAI+GG F
Sbjct: 206 YQYTYAYSNYVSFSLGGR--SPAAIWFRYDLNPITVKYHERRQPIYAFLTSVCAIIGGTF 263

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           TV+GIID+F++     I KK E+GK S
Sbjct: 264 TVAGIIDSFVFTASE-IFKKFELGKLS 289


>gi|159464951|ref|XP_001690702.1| hypothetical protein CHLREDRAFT_180779 [Chlamydomonas reinhardtii]
 gi|158270379|gb|EDO96229.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 656

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/190 (37%), Positives = 100/190 (52%), Gaps = 27/190 (14%)

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGV-----------HVHDILAFQRDSFNISHKINKLA 165
           Y   +V +VAG  H     S HQ+ V           H+  IL       N+SH I  L 
Sbjct: 84  YHTPQVKRVAGRLHL----SVHQNMVFQMLPQLLGTHHIPKIL-------NMSHVIKHLG 132

Query: 166 FGEHFPGVVNPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
           FG H+PG +NPLDG VR     P   Y+YF+KVVPT Y +  G   +++Q+SVTE+ +  
Sbjct: 133 FGPHYPGQLNPLDGYVRMVGREPFS-YKYFLKVVPTEYYNRLGRATETHQYSVTEYAQPL 191

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
           ++G     P V   YDLSPI +T  E   S LHF+  +CA+VGGVF ++ + D ++    
Sbjct: 192 QRG---YAPAVDVHYDLSPIVMTINERPPSLLHFVVRLCAVVGGVFAITRLTDRWVDWLV 248

Query: 285 RAIKKKIEIG 294
           R + K    G
Sbjct: 249 RLVNKAAARG 258


>gi|324516732|gb|ADY46617.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Ascaris suum]
          Length = 286

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/206 (35%), Positives = 109/206 (52%), Gaps = 24/206 (11%)

Query: 101 GFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
           GF+  + +   E  GC      E+NKV GNFH     S H +        A Q +S+++ 
Sbjct: 96  GFITDVTKVPTEENGCRFEANFEINKVPGNFHL----STHSA--------ASQPESYDMR 143

Query: 159 HKINKLAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           H +N + FG+        G  NPL      Q  P   ++Y +KVVP+VY D++G T  S 
Sbjct: 144 HIVNSVKFGDDLQEKAQIGSFNPLQDRTALQGDPLNTHEYILKVVPSVYEDIAGRTKYSY 203

Query: 214 QFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           Q++    E+      GR+  +P V+F Y+L PI V +TE       F+T+VCA+VGG FT
Sbjct: 204 QYTYAHKEYIAYHHSGRI--IPAVWFKYELQPITVKYTERRQPLYAFITSVCAVVGGTFT 261

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGKFS 297
           V+GIID+ ++     + KK ++GK S
Sbjct: 262 VAGIIDSSLF-SLSELYKKHQLGKLS 286


>gi|340058906|emb|CCC53277.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 394

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 81/301 (26%), Positives = 141/301 (46%), Gaps = 32/301 (10%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLE-HNETYCGSC 60
           D +G    +V  ++ K  LD+ G  +   +        D  + ++  + +  +  +CG C
Sbjct: 111 DATGSTRFNVTMNVHKTPLDASGKSVFVGERHF---HTDYTVPQYNAKFDPTSPKFCGKC 167

Query: 61  YGA------ESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           +        +  +  C N CE+V E + ++  A  +   ++QC  E        EE  GC
Sbjct: 168 FVGRKYSYLQQPETPCRNTCEQVMEEFERRKLAKPSKSTVEQCIGE------LSEENPGC 221

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF---- 170
           N  G L++ K +G   FAP     ++   ++D++      FN SH INKL+ G+      
Sbjct: 222 NYRGSLKLKKASGTLIFAP--KMFENVFRINDLM-----QFNASHVINKLSIGDDLVRRF 274

Query: 171 --PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSN-QFSVTEHFRSSEQ 226
              GV  PL+  R+         +YF+K+VPT Y +D + + + S  ++SV    R    
Sbjct: 275 SKRGVYFPLNNQRFVTTKQFAQVRYFMKIVPTTYISDNTANPVASTYEYSVQWDHRQVPL 334

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           G  + +P V F +D S ++V    +  SF HF+ ++C IVGG+F V G++D  +    R 
Sbjct: 335 GSGE-IPSVVFSFDFSSMQVNNYFQRPSFCHFIVSLCGIVGGLFVVLGMVDGLVARVLRL 393

Query: 287 I 287
           +
Sbjct: 394 L 394


>gi|296415728|ref|XP_002837538.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633410|emb|CAZ81729.1| unnamed protein product [Tuber melanosporum]
          Length = 341

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 94/165 (56%), Gaps = 11/165 (6%)

Query: 114 CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           C IYG + VN++ G+FH  A G  + + G H+         SFN SH I +L+FG+++P 
Sbjct: 155 CRIYGSMGVNRILGDFHITAKGHGYWEDGAHI------DHRSFNFSHVITELSFGDYYPK 208

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
           +VNPLDGV    +     +QYF+ +VPT Y +  SG ++ +NQ++VTE  R        +
Sbjct: 209 LVNPLDGVVSKTDENFHKFQYFLSIVPTTYESQTSGKSLLTNQYAVTEQSRKISS---HS 265

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           +PG++F YD+ PI +  ++   + L F+  +  IV G+    G +
Sbjct: 266 VPGIYFKYDIEPISLKISDRRTALLAFVVRLVNIVSGILVGGGWV 310


>gi|440801547|gb|ELR22565.1| serologically defined breast cancer antigen 84 isoform 1, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 355

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 68/173 (39%), Positives = 93/173 (53%), Gaps = 8/173 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD----ILAFQRDSFNISHKINKLA 165
           +G GC ++G  EV KV GN H A G +  QS          I   Q  SFN+SH I  L+
Sbjct: 148 KGSGCRVFGKAEVQKVKGNLHIAAGSNAPQSHDGHQHHVHHITPEQVASFNVSHFIPHLS 207

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
           FG  FP   +PL   R  +  P+ M   + I++VPT+Y D  G+ I+  Q+S   +++  
Sbjct: 208 FGPAFPRRTDPLSWTRVIE--PNAMQVNHMIQLVPTIYEDWGGNVIEGYQYSAQTNYKHI 265

Query: 225 EQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             G     LPGVF  +D+SP  + + E   SF HFLT +CAI GG F V G+I
Sbjct: 266 VPGASSFPLPGVFIKWDMSPFVIQYRETGRSFAHFLTRLCAITGGTFVVLGLI 318


>gi|115623567|ref|XP_794044.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Strongylocentrotus purpuratus]
          Length = 289

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/201 (36%), Positives = 110/201 (54%), Gaps = 21/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           ++I    G+GC  Y    +NKV GNFH           V  H +   Q  S + +H I++
Sbjct: 103 KKIPLNNGQGCLFYSAFTINKVPGNFH-----------VSTHAVGMNQPQSTDFAHIIHE 151

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV 217
           ++FG+           NPL+G R  +++ S + + Y++K+VPTVY D+ G    S Q++ 
Sbjct: 152 VSFGDDIQNKTLGASFNPLEG-RDKRDSKSDLSHDYYMKIVPTVYEDLWGTKNVSYQYTY 210

Query: 218 T-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + + S   GR + LP ++F YD+SPI V + E+   F  F+T VCAIVGG FTV+GI 
Sbjct: 211 AYKDYGSQGHGR-RVLPAIWFRYDISPITVKYHEKRAPFYTFITTVCAIVGGTFTVAGIF 269

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+      KK  E+GK S
Sbjct: 270 DSIIFTAAEVFKKA-ELGKLS 289


>gi|148223633|ref|NP_001084786.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Xenopus laevis]
 gi|78099249|sp|Q6NS19.1|ERGI1_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|47125098|gb|AAH70532.1| MGC78834 protein [Xenopus laevis]
          Length = 290

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/200 (36%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I      GC   G   +NKV GNFH           V  H  +A Q  + ++ H I+K
Sbjct: 105 MKIPINNAYGCRFEGLFSINKVPGNFH-----------VSTHSAIA-QPANPDMRHIIHK 152

Query: 164 LAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG     ++  G  N L G           + Y +K+VPTVY D++G    S Q++V 
Sbjct: 153 LSFGNTLQVDNIHGAFNALGGADKLASKALESHDYVLKIVPTVYEDLNGKQQFSYQYTVA 212

Query: 219 E--HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              +   S  GR+  +P ++F YDLSPI V +TE       F+T VCAI+GG FTV+GI+
Sbjct: 213 NKAYVAYSHTGRV--VPAIWFRYDLSPITVKYTERRQPMYRFITTVCAIIGGTFTVAGIL 270

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+FI+    A  KKI++GK 
Sbjct: 271 DSFIFTASEA-WKKIQLGKM 289


>gi|325187435|emb|CCA21973.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 283

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/181 (36%), Positives = 94/181 (51%), Gaps = 8/181 (4%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           ++   EGC   G L + K+ G+  F  G S     + + +++   R  FN SH I KL F
Sbjct: 110 EDPHNEGCRYKGTLTIQKLQGDIFFCHGGS-----LSIFNLMEMFR--FNSSHVITKLNF 162

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G   P +  PL  V  T       Y+YF KVVP+ Y  + G +  + Q+SVTEH    + 
Sbjct: 163 GLSIPKMQTPLTDVHKTVLAQVATYKYFAKVVPSRYVYLDGKSTMTYQYSVTEHLLKMD- 221

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           G +  +PGV   YD SPI V + E   +  HF+TN CAI+GGV  V+ I DA +Y   + 
Sbjct: 222 GFVTNIPGVIISYDFSPIAVDYIETKPNIFHFITNTCAILGGVIAVARIFDAALYSMSKK 281

Query: 287 I 287
           +
Sbjct: 282 L 282


>gi|261334705|emb|CBH17699.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 391

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 82/298 (27%), Positives = 141/298 (47%), Gaps = 29/298 (9%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           D+SG   ++V  ++ K  +D  GN+  + +R+     P+     +R+     ++  +CG 
Sbjct: 111 DVSGTFSINVTENLLKTPVDVGGNLAYLGTRR-FFTDPRSPLYTRRND---PNSPDFCGR 166

Query: 60  CYG---AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C+    A +  ++CCN CEEV   + +KG    N ++++QC  E  L      E  GCN 
Sbjct: 167 CFTGNKAIAGGKNCCNTCEEVMAEHDRKGLPRPNKNVVEQCIGELSL------ENPGCNY 220

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF------ 170
            G L V KV+G   F P     ++ + + D+L      F+ SH INK + G+        
Sbjct: 221 RGALNVRKVSGVIFFTP--KVIKNTIKMEDLL-----KFDASHVINKFSIGDESVRRHSR 273

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG-RL 229
            GV+NPL+  R+         +Y++ +VPT Y   +   +    +  + ++ S E     
Sbjct: 274 RGVLNPLEKQRFNGSGRFMKVRYYLNIVPTTYGSGASSGLHPPTYEYSANWNSREVAIGY 333

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
              P V F +D  P++V    +     HFL  +C IVGG+F V G++D+ +    R +
Sbjct: 334 GGFPSVEFSFDFFPMQVNNNFKREPIYHFLVQLCGIVGGLFVVLGLVDSVVARLTRLV 391


>gi|363738942|ref|XP_414530.3| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 1 [Gallus gallus]
          Length = 291

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 72/200 (36%), Positives = 106/200 (53%), Gaps = 21/200 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G+GC   G   +NKV+           H   V  H   A Q  + +++H I+K
Sbjct: 105 MKIPLNNGDGCRFEGHFSINKVSP-------WXLH---VSTHSATA-QPQNPDMTHIIHK 153

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L+G       P   + Y +K+VPTVY D+SG    S Q++V 
Sbjct: 154 LSFGDKLQVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVA 213

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T++CAI+GG FTV+GI+
Sbjct: 214 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGIL 271

Query: 277 DAFIYHGQRAIKKKIEIGKF 296
           D+ I+    A  KKI++GK 
Sbjct: 272 DSCIFTASEA-WKKIQLGKM 290


>gi|388858415|emb|CCF48009.1| uncharacterized protein [Ustilago hordei]
          Length = 415

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 58/165 (35%), Positives = 91/165 (55%), Gaps = 8/165 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG +EV +V GN H       + S  H    L       N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSLEHTDHKL------MNLSHVIHEFSFGPY 224

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +  PLD    T +    ++QYFI  VPT++ D  G  + ++Q+SVT++ R  E G+ 
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFISAVPTLFVDARGRKLHTHQYSVTDYTRQIEHGK- 283

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             +PG+F  YD+ PI++T  E   +F+ FL  +  ++GGV+   G
Sbjct: 284 -GVPGIFIKYDIEPIQMTIRERSSTFVQFLVRLAGVLGGVWVCVG 327


>gi|71755761|ref|XP_828795.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834181|gb|EAN79683.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 391

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 81/298 (27%), Positives = 141/298 (47%), Gaps = 29/298 (9%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCGS 59
           D+SG   ++V  ++ K  +D  GN+  + +R+     P+     +R+     ++  +CG 
Sbjct: 111 DVSGTFSINVTENLLKTPVDVGGNLAYLGTRR-FFTDPRSPLYTRRND---PNSPDFCGR 166

Query: 60  CYG---AESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C+    A +  ++CCN CEEV   + +KG    N ++++QC  E  L      E  GCN 
Sbjct: 167 CFTGNKAIAGGKNCCNTCEEVMAEHDRKGLPRPNKNVVEQCIGELSL------ENPGCNY 220

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF------ 170
            G L V KV+G   F P     ++ + + D+L      F+ SH INK + G+        
Sbjct: 221 RGALNVRKVSGVIFFTP--KVIKNTIKMEDLL-----KFDASHVINKFSIGDESVRRHSR 273

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG-RL 229
            GV+NPL+  R+         +Y++ +VPT Y   +   +    +  + ++ S E     
Sbjct: 274 RGVLNPLEKQRFNGSGRFMKVRYYLNIVPTTYGSGASSGLHPPTYEYSANWNSREVAIGY 333

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
              P V F +D  P++V    +     HFL  +C I+GG+F V G++D+ +    R +
Sbjct: 334 GGFPSVEFSFDFFPMQVNNNFKREPIYHFLVQLCGIIGGLFVVLGLVDSVVARLTRLV 391


>gi|190346055|gb|EDK38054.2| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 407

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 101/181 (55%), Gaps = 13/181 (7%)

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
           REG+ +    E    C+I+G + VN+V+G+FH       ++   HV         + N S
Sbjct: 204 REGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------DPQALNFS 254

Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           H I + +FGE +P + NPLD    T +     Y+Y+ KVVPT+Y  + G  + +NQ+S+T
Sbjct: 255 HIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVDTNQYSIT 313

Query: 219 EHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           E  R  E    GR+Q +PG+FF Y+   IK+  +++ + F  F+  +  I+GGVF V+G 
Sbjct: 314 ESHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTSFVARLATIIGGVFIVAGY 373

Query: 276 I 276
           +
Sbjct: 374 L 374


>gi|327265232|ref|XP_003217412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Anolis carolinensis]
          Length = 291

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 69/199 (34%), Positives = 102/199 (51%), Gaps = 22/199 (11%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           +I    G+GC       +NK+ GNFH           V  H   A Q  + +++H I+KL
Sbjct: 107 KIPLNNGDGCRFESHFSINKIPGNFH-----------VSTHSATA-QPQNPDMTHVIHKL 154

Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
           +FG+        G  N L+G       P   + Y +K+VPTVY D+SG      Q++V  
Sbjct: 155 SFGDQLQAQKIRGSFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQQYPFQYTVAN 214

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
            E+   S  GR+   P ++F YDL+PI + + E       F+T +CAI+GG FTV+GI D
Sbjct: 215 KEYVVYSHTGRIT--PAIWFRYDLTPITLKYIERRQPLYRFITTICAIIGGTFTVAGIFD 272

Query: 278 AFIYHGQRAIKKKIEIGKF 296
           + I+    A  KKI++GK 
Sbjct: 273 SCIFTASEA-WKKIQLGKM 290


>gi|343427702|emb|CBQ71229.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 412

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 90/165 (54%), Gaps = 8/165 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG +EV +V GN H       + S  H    L       N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------MNLSHVIHEFSFGPY 224

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +  PLD    T +    ++QYF+  VPT++ D  G  + ++Q+SVT++ R  E G+ 
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFVSAVPTLFVDARGRKLHTHQYSVTDYTRQIEHGK- 283

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             +PG+F  YD+ P+++T  E   + L FL  +  ++GGV+   G
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRERSTTLLQFLVRLAGVLGGVWVCVG 327


>gi|361126303|gb|EHK98312.1| putative ER-derived vesicles protein 41 [Glarea lozoyensis 74030]
          Length = 343

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 71/176 (40%), Positives = 99/176 (56%), Gaps = 18/176 (10%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           R++   G+ C IYG LEVNKV G+FH  A G  + + G    D  AF     N SH +N+
Sbjct: 142 RLRGNVGDSCRIYGNLEVNKVQGDFHLTARGHGYQEWGAGHLDHTAF-----NFSHIVNE 196

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-DVSGH----TIQSNQFS 216
           L+FG  +P ++NPLD  R    TP+    +QYF+ VVPT YT D S      TI +NQ++
Sbjct: 197 LSFGAFYPSLLNPLD--RTVSTTPNHFHKFQYFLSVVPTAYTVDSSSRSARDTIFTNQYA 254

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           VTE    S +   +++PG+FF YD+ P+ +T  E   SFL F+  V  +  GV   
Sbjct: 255 VTEQ---SHEVNERSVPGIFFKYDIEPMLLTVEESRDSFLRFVVKVVNVFSGVLVA 307


>gi|260800124|ref|XP_002594986.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
 gi|229280225|gb|EEN50997.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
          Length = 292

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/220 (34%), Positives = 114/220 (51%), Gaps = 28/220 (12%)

Query: 92  DLIDQCKRE--GFLQ---RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
           D+ D+  R   GF++   ++    G GC   G   +NKV GNFH     S H + V    
Sbjct: 87  DIQDEMGRHEVGFVEDTEKVPVNNGLGCRFEGRFWINKVPGNFHM----STHSAHV---- 138

Query: 147 ILAFQRDSFNISHKINKLAFGE--------HFPGVVNPLDGVRWTQETPSGMYQYFIKVV 198
               Q  S +++H ++ L FGE        H  G  NPLD V          + YF+K+V
Sbjct: 139 ----QPASPDMTHVVHDLRFGEDLAAFLPDHIKGSFNPLDEVERLHANALSSHDYFLKIV 194

Query: 199 PTVYTDVSGHTIQSNQFSVT-EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
           PT++ + S     + Q++   + + S   G  + +P ++F YDLSPI V +T++   F H
Sbjct: 195 PTIFENRSDKKSFAFQYTYAYKDYISFGHGN-RVMPAIWFRYDLSPITVKYTDKRKPFYH 253

Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           F+T +CA+VGG FTV+GIID+ I+      KK  E+GK S
Sbjct: 254 FITTICAVVGGTFTVAGIIDSVIFTAAEVFKKA-ELGKLS 292


>gi|291244956|ref|XP_002742359.1| PREDICTED: endoplasmic reticulum-golgi intermediate compartment
           (ERGIC) 1-like [Saccoglossus kowalevskii]
          Length = 318

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 69/199 (34%), Positives = 97/199 (48%), Gaps = 17/199 (8%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I      GC    + ++NKV GNFH     S H +G       + Q    +  H I++
Sbjct: 132 NKIPLNNNAGCRFEAYFKINKVPGNFHV----STHAAG-------SRQPQKADFVHTIHE 180

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           +  G+           NPL G   +       + Y++KVVPTVY DV G    S Q++  
Sbjct: 181 IIIGDDIQNKSINAAFNPLAGYDRSDAAAESSHDYYMKVVPTVYEDVWGRVNLSYQYTYA 240

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
                S     + +P ++F YD+SPI V + E+   F  F+T +CAIVGG FTV+GIID+
Sbjct: 241 YKDYVSYGHGHRVMPAIWFRYDISPITVKYHEKRAPFYTFITTICAIVGGTFTVAGIIDS 300

Query: 279 FIYHGQRAIKKKIEIGKFS 297
            IY      KK  EIGK S
Sbjct: 301 MIYSASEVFKKA-EIGKLS 318


>gi|348667045|gb|EGZ06871.1| hypothetical protein PHYSODRAFT_319561 [Phytophthora sojae]
          Length = 469

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 110/228 (48%), Gaps = 44/228 (19%)

Query: 92  DLIDQCKREGFLQRIK--EEEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
           D ++  K+E F Q  K   E+G          EGC +YG L V +V GNFH         
Sbjct: 257 DAVEARKKELFEQDKKNAREQGKAIARSAVGPEGCRLYGHLYVKRVPGNFH--------- 307

Query: 140 SGVHVHDILAFQRDS--FNISHKINKLAFGEHFPG--------------VVNPLDGVRWT 183
             VH+ +  A+  DS   N SH +N+L FGEH                   + LD   +T
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEHLTSGEMSMLPRDAQMQLYTHRLDNQDYT 364

Query: 184 QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSP 243
               +  Y ++IKVV   Y       I  N +  T H  S+E      LP + F YDLSP
Sbjct: 365 SFYKNHTYVHYIKVVTNSYVQSDAADI--NVYKYTAH--SNEYLETDDLPSIMFRYDLSP 420

Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           + V  +E+ V F HFLT+ CAI+GGVFTV GI+D  I+   RA+ KK+
Sbjct: 421 MSVRISEDSVPFYHFLTSACAIIGGVFTVIGILDQIIHQTARALNKKV 468


>gi|146421059|ref|XP_001486481.1| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 407

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/181 (34%), Positives = 101/181 (55%), Gaps = 13/181 (7%)

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
           REG+ +    E    C+I+G + VN+V+G+FH       ++   HV         + N S
Sbjct: 204 REGYHE---AESAPACHIFGSIPVNQVSGDFHITAKGMGYRDRAHV------DPQALNFS 254

Query: 159 HKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           H I + +FGE +P + NPLD    T +     Y+Y+ KVVPT+Y  + G  + +NQ+S+T
Sbjct: 255 HIIAEFSFGEFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYERM-GLQVDTNQYSIT 313

Query: 219 EHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           E  R  E    GR+Q +PG+FF Y+   IK+  +++ + F  F+  +  I+GGVF V+G 
Sbjct: 314 ELHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTLFVARLATIIGGVFIVAGY 373

Query: 276 I 276
           +
Sbjct: 374 L 374


>gi|156406959|ref|XP_001641312.1| predicted protein [Nematostella vectensis]
 gi|156228450|gb|EDO49249.1| predicted protein [Nematostella vectensis]
          Length = 287

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/217 (35%), Positives = 110/217 (50%), Gaps = 26/217 (11%)

Query: 92  DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
           D+ D+  R   GF + ++  E   GEGC I     +NKV GNFH     S H +G     
Sbjct: 86  DIQDEMGRHEVGFKENVERREINNGEGCFISTRFTINKVPGNFHV----STHGAGK---- 137

Query: 147 ILAFQRDSFNISHKINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
               Q DS +++H IN + FG    +  PG    L   +         + Y +K+VPT+Y
Sbjct: 138 ----QPDSPDMNHIINAVNFGSRIMDKLPGAFTALKDRKRHDTNGLASHDYILKIVPTIY 193

Query: 203 TDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
             + G T  S Q++    E+   S  G  Q LP ++F YDLSPI V + E      HF+T
Sbjct: 194 QKLDGTTTFSYQYTWAYKEYVSYSHGG--QMLPAIWFRYDLSPITVKYIERRQPLYHFIT 251

Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
            VCAIVGG FTV+GIID+ ++      +K  ++GK S
Sbjct: 252 TVCAIVGGTFTVAGIIDSAVFTASEMWRKH-QLGKLS 287


>gi|388583623|gb|EIM23924.1| DUF1692-domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 396

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 91/166 (54%), Gaps = 7/166 (4%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           ++G  C IYG +E  KV GN H       + S  H    L       N+SH I++ +FG+
Sbjct: 158 KDGPACRIYGSVETKKVNGNMHITTLGHGYSSLEHTDHKL------MNLSHTIDEFSFGQ 211

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
           HFP +  PLD      +    +YQYF+ VVPT Y D SGH++ +NQ+S  E  +     +
Sbjct: 212 HFPYISQPLDKSVEITDNHFPVYQYFMHVVPTTYVDASGHSLSTNQYSAREDIKFIHNHQ 271

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            + +PG+FF Y+L PI ++ +   +SF   L  + A++GGV+  SG
Sbjct: 272 -RGIPGLFFRYELEPIHLSLSATTMSFTKLLIRLTALIGGVWCCSG 316


>gi|301100294|ref|XP_002899237.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262104154|gb|EEY62206.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 469

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 116/232 (50%), Gaps = 52/232 (22%)

Query: 92  DLIDQCKREGFLQRIKE--EEG----------EGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
           D+++  K+E F Q  K+  E+G          EGC ++G L V +V GNFH         
Sbjct: 257 DVVEARKKELFEQDKKDAREQGRAIARSAVGPEGCRLFGHLYVKRVPGNFH--------- 307

Query: 140 SGVHVHDILAFQRDS--FNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQY---- 193
             VH+ +  A+  DS   N SH +N+L FGEH    + P D  R  +E  + +Y +    
Sbjct: 308 --VHLANP-AYSMDSSLVNASHTVNELWFGEH----LAPGDMSRLPREAQTQLYTHRLEN 360

Query: 194 --------------FIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
                         +IKVV   Y  V G   + N +  T H  S+E      LP V F Y
Sbjct: 361 QDFTSLYKNHTYVHYIKVVTNSY--VQGDGSEINVYKYTAH--SNEYLETDDLPSVMFRY 416

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           DLSP+ V  +E+ V F HF+T+ CAI+GGVFTV GI+D  I+   RA+ KK+
Sbjct: 417 DLSPMSVRISEDTVPFYHFVTSACAIIGGVFTVIGIVDQIIHQTARALNKKV 468


>gi|310800159|gb|EFQ35052.1| hypothetical protein GLRG_10196 [Glomerella graminicola M1.001]
          Length = 377

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/168 (38%), Positives = 93/168 (55%), Gaps = 14/168 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           EG+ C IYG L+VN+V G+FH  A G  + + G H+         +FN SH I++L+FG 
Sbjct: 183 EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGAHL------DHAAFNFSHIISELSFGP 236

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
            +P +VNPLD            +QY++ VVPTVYT      S +TI +NQ++VTE  + +
Sbjct: 237 FYPSLVNPLDRTVNLARINFHKFQYYLSVVPTVYTVGKSASSSNTIFTNQYAVTEQSKET 296

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           +      +PG+FF YD+ PI ++  E    FL  L  +  IV GV   
Sbjct: 297 DD---HNIPGIFFKYDIEPILLSVEESRDGFLQLLMKIVNIVSGVLVA 341


>gi|71013590|ref|XP_758634.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
 gi|46098292|gb|EAK83525.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
          Length = 415

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 56/165 (33%), Positives = 89/165 (53%), Gaps = 8/165 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG +EV +V GN H       + S  H    L       N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSVEHTDHKL------MNLSHVIHEFSFGPY 224

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +  PLD    T E    ++QYF+  VPT++ D  G  + ++Q+SVT++ R  E G+ 
Sbjct: 225 FPEISQPLDSSVETTEKHFTVFQYFVSAVPTLFIDARGRKLHTHQYSVTDYTRQIEHGK- 283

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             +PG+F  YD+ P+++T  +   S   FL  +  ++GGV+   G
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRQRSTSLFQFLVRLAGVLGGVWVCVG 327


>gi|313220803|emb|CBY31643.1| unnamed protein product [Oikopleura dioica]
          Length = 289

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)

Query: 92  DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
           D+ D+  R   G+L+  +++    G+GC   G   VNKV GNFH     S H S V    
Sbjct: 86  DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137

Query: 147 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 199
               Q  + +++H+I++L+FGE   G+ +       PL+G +   E  +  + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192

Query: 200 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           TVY D+   T    QF+ V + F +   G  + +P ++F Y++SPI V +TE+     HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LT  CAI+GG FTV+G+ID+ I+   + +KK  E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289


>gi|313230728|emb|CBY08126.1| unnamed protein product [Oikopleura dioica]
          Length = 289

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 79/219 (36%), Positives = 121/219 (55%), Gaps = 28/219 (12%)

Query: 92  DLIDQCKRE--GFLQRIKEEE---GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHD 146
           D+ D+  R   G+L+  +++    G+GC   G   VNKV GNFH     S H S V    
Sbjct: 86  DIQDEHGRHEVGYLENTRKDPINGGKGCIFGGTFHVNKVPGNFHV----STHSSQV---- 137

Query: 147 ILAFQRDSFNISHKINKLAFGEHFPGVVN-------PLDGVRWTQETPSGMYQYFIKVVP 199
               Q  + +++H+I++L+FGE   G+ +       PL+G +   E  +  + Y +KVVP
Sbjct: 138 ----QPQNPDMNHEIHELSFGESMKGINSNLPANFIPLNGKKTGAEKMAS-HDYTLKVVP 192

Query: 200 TVYTDVSGHTIQSNQFS-VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           TVY D+   T    QF+ V + F +   G  + +P ++F Y++SPI V +TE+     HF
Sbjct: 193 TVYQDIKKRTKFGYQFTAVYKDFVAFGHGH-RVMPAIWFRYEVSPITVKYTEKSKPLYHF 251

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           LT  CAI+GG FTV+G+ID+ I+   + +KK  E GK S
Sbjct: 252 LTTFCAIIGGTFTVAGMIDSMIFSAHQMVKKAGE-GKLS 289


>gi|336269097|ref|XP_003349310.1| hypothetical protein SMAC_05593 [Sordaria macrospora k-hell]
 gi|380089883|emb|CCC12416.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 379

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 93/169 (55%), Gaps = 10/169 (5%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           R+     + C ++G LE+NKV G+FH  A G  + + G H+         +FN SH I++
Sbjct: 181 RLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL------DHSAFNFSHIISE 234

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
           L++G   P +VNPLD       +    +QYFI VVPTVY+   G +I +NQ++VTE    
Sbjct: 235 LSYGPFLPSLVNPLDQTVNLATSNFHKFQYFISVVPTVYSVSGGRSIVTNQYAVTEQ--- 291

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           S++   + +PG+F  YD+ PI +   EE  SFL FL  V  ++ G    
Sbjct: 292 SQEVTERIIPGIFVKYDIEPILLNIVEERDSFLLFLIKVVNVISGALVA 340


>gi|348690307|gb|EGZ30121.1| COPII vesicle trafficking protein [Phytophthora sojae]
          Length = 306

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 115/227 (50%), Gaps = 44/227 (19%)

Query: 99  REGFLQRIKEEE--GE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
           +E  L++  +EE  GE GC ++G ++V KVAG+  FA     H+  + V     F   +F
Sbjct: 91  KEILLKKDIQEEPFGENGCRLFGTVQVQKVAGDLSFA-----HEGSLTVFSFFDFL--NF 143

Query: 156 NISHKINKLAFGEHFPGVVNPLDGV------RWTQET----------------------- 186
           N SH +N L FG   P +  PL  V        TQE+                       
Sbjct: 144 NSSHVVNHLRFGPQIPDMETPLIDVSKILERNCTQESCWLARSWDSVAALLTSFIALLLF 203

Query: 187 PSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIK 245
               Y+YF+ VVP+ Y  ++G ++ + Q+SVTEH  SS     Q + PGV F Y+ SPI 
Sbjct: 204 TVATYKYFVNVVPSRYVYLNGRSVTTFQYSVTEHETSSRGPNGQVSFPGVIFSYEFSPIA 263

Query: 246 VTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           V + E   S LHFLT+  AIVGGVF V+ +ID  IY    ++ KKI+
Sbjct: 264 VEYIESKPSVLHFLTSTSAIVGGVFAVARMIDGAIY----SVSKKID 306


>gi|67623433|ref|XP_667999.1| serologically defined breast cancer antigen 84 like (42.9 kD)
           (XQ234) [Cryptosporidium hominis TU502]
 gi|54659178|gb|EAL37768.1| serologically defined breast cancer antigen 84 like (42.9 kD)
           (XQ234) [Cryptosporidium hominis]
          Length = 388

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 78/265 (29%), Positives = 119/265 (44%), Gaps = 38/265 (14%)

Query: 57  CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
           CG CY A  +++    +CCN C++V   Y KKG  L +     QC  +   +RI      
Sbjct: 117 CGPCYDASINNDLGVVNCCNTCKDVFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 175

Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
               EGC I     + KV G    +     H+  V    + D+   +   FN S+K+N L
Sbjct: 176 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 230

Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
            FGE  PG+ N      + Q +                   + +  +PT Y  ++  +I 
Sbjct: 231 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFDDAYIDFDMHCIPTQYNTINNKSIN 290

Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
           S+QFSV   ++    S   G+     ++PG+   YD +P  V  TE   SFL F+T  CA
Sbjct: 291 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKMTESRRSFLSFITECCA 350

Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
           I+GG+F  SG+ID F +    ++ K
Sbjct: 351 IIGGIFAFSGMIDIFFFKFLSSVNK 375


>gi|336472105|gb|EGO60265.1| hypothetical protein NEUTE1DRAFT_56465 [Neurospora tetrasperma FGSC
           2508]
 gi|350294686|gb|EGZ75771.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
           2509]
          Length = 379

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 14/185 (7%)

Query: 92  DLIDQCKREGFLQRIKEEEG---EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 147
           D++   +R+    R     G   + C ++G LE+NKV G+FH  A G  + + G H+   
Sbjct: 166 DIVSLGRRKAKWARTPRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL--- 222

Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
                 +FN SH I++L+FG   P +VNPLD            +QYFI VVPTVY+  SG
Sbjct: 223 ---DHSAFNFSHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SG 278

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            +I +NQ++VTE    S++   + +PG+F  YD+ PI +   EE  SFL F+  V  ++ 
Sbjct: 279 KSIVTNQYAVTEQ---SQEVTERIIPGIFVKYDIEPILLNIEEERDSFLVFIIKVVNVIS 335

Query: 268 GVFTV 272
           G    
Sbjct: 336 GALVA 340


>gi|443683891|gb|ELT87978.1| hypothetical protein CAPTEDRAFT_224400 [Capitella teleta]
          Length = 292

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 106/202 (52%), Gaps = 23/202 (11%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           ++     EGC      ++NKV GNFH +   S  Q                N+ H +++L
Sbjct: 105 KVPINNNEGCRFKSSFKINKVPGNFHISTHASKEQP------------PQPNMKHIVHEL 152

Query: 165 AFGE------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
            FG+      H PG  NPL     ++      + Y++K+VP V+ D SG T+  + +  T
Sbjct: 153 IFGDRVPQTIHIPGSFNPLLEKDKSESNALSSHDYYLKIVPAVFNDYSGKTLM-HPYQYT 211

Query: 219 EHFRSS--EQGRLQTLPGVFFFYDLSPIKVTFTEEH-VSFLHFLTNVCAIVGGVFTVSGI 275
             +R S  ++G    +P ++F Y L+P+ V ++E+  + F HFLT VCAIVGG FTV+GI
Sbjct: 212 FAYRHSIRQRGGQVVIPAIWFKYKLNPMCVKYSEQRPIPFYHFLTAVCAIVGGTFTVAGI 271

Query: 276 IDAFIYHGQRAIKKKIEIGKFS 297
            D+F++     I KK E+GK S
Sbjct: 272 FDSFLFTAAE-IFKKAELGKLS 292


>gi|325184531|emb|CCA19024.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 466

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 72/192 (37%), Positives = 96/192 (50%), Gaps = 26/192 (13%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           EGC +YG L V +V GNFH       H S    H   +      N SH +N+L FGE   
Sbjct: 288 EGCQLYGHLIVKRVPGNFHI------HLS----HPFYSMNSSLVNASHTVNELWFGEVLS 337

Query: 172 GVV-------NPLDGVRWT-QETPSGM----YQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                       LD  R   QE  + M    Y ++IKVV   Y   +G  I + +++   
Sbjct: 338 ASALAKLPPNTRLDSHRLARQEFTAYMQNYTYVHYIKVVTNTYVQRNGEVISAYRYTA-- 395

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
              S+E    + LP V F YDLSP+ V  TE  + F HF+T+ CAI+GGVFTV GIID  
Sbjct: 396 --HSNEYLETEDLPSVMFRYDLSPMSVRITERSMPFYHFVTSACAIIGGVFTVIGIIDQL 453

Query: 280 IYHGQRAIKKKI 291
           ++   RA+ KK+
Sbjct: 454 VHQTVRAMNKKV 465


>gi|62319241|dbj|BAD94459.1| hypothetical protein [Arabidopsis thaliana]
          Length = 56

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 53/56 (94%), Positives = 56/56 (100%)

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           SPIKVTFTEEH+SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ+AIKKK+EIGKFS
Sbjct: 1   SPIKVTFTEEHISFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQKAIKKKMEIGKFS 56


>gi|402085784|gb|EJT80682.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 379

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 106/206 (51%), Gaps = 22/206 (10%)

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
           R G   R+     + C ++G L++NKV G+FH  A G  + + G H+        D+FN 
Sbjct: 174 RWGKTPRLWGSTADSCRLFGSLDLNKVQGDFHITARGHGYMEFGEHL------DHDAFNF 227

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS-----GHTIQS 212
           +H IN+ +FGE +P +VNPLD       T    +QYF+ VVPTVY+  S     G TI +
Sbjct: 228 THIINEFSFGEFYPSLVNPLDRTINGANTHFHKFQYFLSVVPTVYSVKSSAGGFGSTIFT 287

Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV--- 269
           NQ++VTE      +   + +PG+FF YD+ P+ +   E   +FL FL  V  I+ G    
Sbjct: 288 NQYAVTEQNAEISE---RAIPGIFFKYDIEPVLLNIEESRDTFLLFLVKVVNILSGAMVA 344

Query: 270 ----FTVSGIIDAFIYHGQRAIKKKI 291
               FT++  I   +   +RA    I
Sbjct: 345 GHWGFTMTEWIKEIMGKRRRATSGMI 370


>gi|403337257|gb|EJY67839.1| hypothetical protein OXYTRI_11647 [Oxytricha trifallax]
          Length = 279

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 75/218 (34%), Positives = 116/218 (53%), Gaps = 28/218 (12%)

Query: 99  REGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN 156
           R+  L+RIK+E  + +GC + GF  +N+V GNFH +   S  Q  + V+  L  Q  +F+
Sbjct: 71  RQDLLKRIKDEMDQKQGCQLKGFFNINRVPGNFHIS---SHSQKDLIVN--LEMQGYTFD 125

Query: 157 ISHKINKLAFG--EHFP---------GVVNPLDGVRWTQE-----TPSGM-YQYFIKVVP 199
            +HKIN ++FG  E F          GV+NPLDG+ ++        P  +   +F+  V 
Sbjct: 126 FTHKINHVSFGRQEDFKVIQKNFKQQGVLNPLDGLEFSANQDNKGKPQALATNFFMVAVS 185

Query: 200 TVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
           + Y D + +T    Q + T   +S+       L    F Y+LSPIKV F +E  + + F+
Sbjct: 186 SYYMDTNRNTYNMYQLTSTHKSQSNANVNENML---VFSYELSPIKVLFNQEKENIVDFM 242

Query: 260 TNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
             +CAI+GGVFT+S ++D  I H   ++  K  IGK S
Sbjct: 243 IQLCAIIGGVFTISSVVDTII-HRSVSLLFKQRIGKLS 279


>gi|289741661|gb|ADD19578.1| cOPII vesicle protein [Glossina morsitans morsitans]
          Length = 418

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 62/205 (30%), Positives = 112/205 (54%), Gaps = 3/205 (1%)

Query: 86  WALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVH 145
           + + +P+ +D+   E   + + EE+ + C ++G L +NKVAG  H   G       +  H
Sbjct: 153 YIIQSPE-VDETATEEDEKPLSEEQYDACRLHGTLGINKVAGVLHLVGGTQPVVDLLGEH 211

Query: 146 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV 205
            ++ F+  + N +H+IN+L+FG++   +V PL+G          + QYF+ +VPT     
Sbjct: 212 LMIGFRHIAANFTHRINRLSFGQYARRIVQPLEGDETFVSEEGTIVQYFLNIVPT-EIHK 270

Query: 206 SGHTIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
           +  TI + Q+SVTE+ R  +  R     PG++F YD S +K+    +  + L F+  +C+
Sbjct: 271 TFTTISTYQYSVTENVRVLDSDRNSYGSPGIYFKYDWSALKIIVRTDRDNMLQFIIRLCS 330

Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
           I+ G+  +SGI++ F+   +R I K
Sbjct: 331 IISGIVVLSGILNVFLLTLRRNIIK 355


>gi|429862433|gb|ELA37083.1| copii-coated vesicle protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 375

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 62/168 (36%), Positives = 93/168 (55%), Gaps = 14/168 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           EG+ C IYG L+VN+V G+FH  A G  + + G H+         +FN SH I++++FG 
Sbjct: 183 EGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAAFNFSHIISEMSFGP 236

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
            +P +VNPLD            +QY++ VVPTVYT      + +TI +NQ++VTE  +  
Sbjct: 237 FYPSLVNPLDRTVNAARINFHKFQYYLSVVPTVYTVGKSASTSNTIFTNQYAVTEQSKEV 296

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           +      +PG+FF YD+ PI ++  E    FL FL  +  +V GV   
Sbjct: 297 DD---HNVPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLVA 341


>gi|198421328|ref|XP_002120997.1| PREDICTED: similar to Endoplasmic reticulum-Golgi intermediate
           compartment protein 1 (ER-Golgi intermediate compartment
           32 kDa protein) (ERGIC-32) [Ciona intestinalis]
          Length = 289

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 110/201 (54%), Gaps = 21/201 (10%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           +++   +G GC      ++NKV GNFH           V  H   + Q D+ +++H+I +
Sbjct: 103 EKVPTHDGNGCLFTSRFQINKVPGNFH-----------VSTHSARS-QPDNPDMTHEIKE 150

Query: 164 LAFGEHF--PGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS- 216
           L  G++   PGV     N L+G     + P   + Y +K+VPTVY  + G+     Q++ 
Sbjct: 151 LRIGDNMVIPGVKSQSFNALEGKTTFDKHPLSSHDYIMKIVPTVYESIDGNLRYLYQYTN 210

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             + + +   G+ + +P ++F Y+++PI V +TE    F HF+T VCAI+GG FTV+GII
Sbjct: 211 AYKDYIAYGHGQ-RVMPAIWFRYEMTPITVKYTERRKPFYHFITMVCAIIGGTFTVAGII 269

Query: 277 DAFIYHGQRAIKKKIEIGKFS 297
           D+ I+     + KK+ IGK S
Sbjct: 270 DSMIFSATE-MYKKLTIGKLS 289


>gi|332373256|gb|AEE61769.1| unknown [Dendroctonus ponderosae]
          Length = 382

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 59/170 (34%), Positives = 95/170 (55%), Gaps = 2/170 (1%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C IYG L +NKVAGNF  + GK +     +           +N +H+IN+ +FG   P
Sbjct: 175 DACRIYGTLGLNKVAGNFLISGGKRYMFGLGYQQFRTLISEGEYNFTHRINRFSFGHSSP 234

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
           G+V+PL+G       P  +  YFI++VPT   +   +TI + Q+SV E  R  +  +   
Sbjct: 235 GIVHPLEGDELILPDPMTVVNYFIEIVPTT-VNTFMYTISTYQYSVKELTRPIDHNKGSH 293

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             P ++F YD+S ++VT ++E      FL  +C+IVGGV+  SGI+++ +
Sbjct: 294 GTPAIYFKYDMSALRVTVSQERDHLGMFLARLCSIVGGVYVCSGILNSIV 343


>gi|380492334|emb|CCF34678.1| hypothetical protein CH063_01185 [Colletotrichum higginsianum]
          Length = 377

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/168 (35%), Positives = 94/168 (55%), Gaps = 14/168 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +G+ C +YG L+VN+V G+FH  A G  + + G H+         +FN SH +++L+FG 
Sbjct: 183 DGDSCRVYGNLDVNRVQGDFHITARGHGYMEFGEHL------DHAAFNFSHIVSELSFGP 236

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----DVSGHTIQSNQFSVTEHFRSS 224
            +P +VNPLD            +QY++ +VPTVYT      S +TI +NQ++VTE  + +
Sbjct: 237 FYPSLVNPLDRTVNLARINFHKFQYYLSIVPTVYTVGKSASSSNTIFTNQYAVTEQSKET 296

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           +      +PG+FF YD+ PI ++  E    FL FL  +  +V GV   
Sbjct: 297 DD---HNIPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLVA 341


>gi|85101064|ref|XP_961083.1| hypothetical protein NCU04293 [Neurospora crassa OR74A]
 gi|11611445|emb|CAC18610.1| conserved hypothetical protein [Neurospora crassa]
 gi|28922621|gb|EAA31847.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 379

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/185 (35%), Positives = 99/185 (53%), Gaps = 14/185 (7%)

Query: 92  DLIDQCKREGFLQRIKEEEG---EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDI 147
           D++   +R+    R     G   + C ++G LE+NKV G+FH  A G  + + G H+   
Sbjct: 166 DIVSLGRRKAKWARTPRLWGATPDSCRVFGSLELNKVQGDFHITAKGHGYMEFGQHL--- 222

Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
                 +FN SH I++L+FG   P +VNPLD            +QYFI VVPTVY+  SG
Sbjct: 223 ---DHSAFNFSHIISELSFGPFLPSLVNPLDQTVNIASANFHKFQYFISVVPTVYSS-SG 278

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            +I +NQ++VTE    S++   + +PG+F  YD+ PI +   EE  SFL F+  V  ++ 
Sbjct: 279 KSIVTNQYAVTEQ---SQEVTERIIPGIFVKYDIEPILLHIDEERDSFLVFIIKVVNVIS 335

Query: 268 GVFTV 272
           G    
Sbjct: 336 GALVA 340


>gi|342878666|gb|EGU79974.1| hypothetical protein FOXB_09504 [Fusarium oxysporum Fo5176]
          Length = 376

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/183 (36%), Positives = 102/183 (55%), Gaps = 18/183 (9%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L++NKV G+FH  A G  +  +G H+          FN SH I++L++G  +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYRGNGEHL------DHSKFNFSHIISELSYGPFY 242

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           P +VNPLDG   T       +QY++ VVPTVY+ V+  +I +NQ++VTE  ++ ++   +
Sbjct: 243 PSLVNPLDGTVNTAPDNFHKFQYYLSVVPTVYS-VNSKSILTNQYAVTEQSKAVDE---R 298

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
            +PG+FF YD+ PI +T  E     +  L  V  I+ GV       FT+S  I   I   
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLLVKVINIMSGVLVAGHWGFTISDWIHDVIGRR 358

Query: 284 QRA 286
           +R+
Sbjct: 359 RRS 361


>gi|66360024|ref|XP_627190.1| ERV41 like membrane associated protein involved in vesicular
           transport with a transmembrane region near the
           C-terminus [Cryptosporidium parvum Iowa II]
 gi|46228832|gb|EAK89702.1| ERV41 like membrane associated protein involved in vesicular
           transport with a transmembrane region near the
           C-terminus [Cryptosporidium parvum Iowa II]
          Length = 403

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 118/265 (44%), Gaps = 38/265 (14%)

Query: 57  CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
           CG CY A   ++    +CCN C+++   Y KKG  L +     QC  +   +RI      
Sbjct: 132 CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 190

Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
               EGC I     + KV G    +     H+  V    + D+   +   FN S+K+N L
Sbjct: 191 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 245

Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
            FGE  PG+ N      + Q +                   + +  +PT Y  ++  +I 
Sbjct: 246 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYIDFDMHCIPTQYNTINNKSIN 305

Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
           S+QFSV   ++    S   G+     ++PG+   YD +P  V  TE   SFL F+T  CA
Sbjct: 306 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKITESRRSFLSFITECCA 365

Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
           I+GG+F  SG+ID F +    ++ K
Sbjct: 366 IIGGIFAFSGMIDIFFFKFLSSVNK 390


>gi|323509323|dbj|BAJ77554.1| cgd8_2900 [Cryptosporidium parvum]
 gi|323510503|dbj|BAJ78145.1| cgd8_2900 [Cryptosporidium parvum]
          Length = 388

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 77/265 (29%), Positives = 118/265 (44%), Gaps = 38/265 (14%)

Query: 57  CGSCYGAESSDE----DCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEE--- 109
           CG CY A   ++    +CCN C+++   Y KKG  L +     QC  +   +RI      
Sbjct: 117 CGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLPHVISFKQCDYDK-SKRISNALSS 175

Query: 110 --EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV---HVHDILAFQRDSFNISHKINKL 164
               EGC I     + KV G    +     H+  V    + D+   +   FN S+K+N L
Sbjct: 176 NLNSEGCKIKVNGYIPKVKGKIEIS-----HKRWVKYKEMTDLEIAESHLFNFSYKMNYL 230

Query: 165 AFGEHFPGVVNPLDGVRWTQET-------------PSGMYQYFIKVVPTVYTDVSGHTIQ 211
            FGE  PG+ N      + Q +                   + +  +PT Y  ++  +I 
Sbjct: 231 DFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYIDFDMHCIPTQYNTINNKSIN 290

Query: 212 SNQFSVTEHFR----SSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
           S+QFSV   ++    S   G+     ++PG+   YD +P  V  TE   SFL F+T  CA
Sbjct: 291 SHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPFLVKITESRRSFLSFITECCA 350

Query: 265 IVGGVFTVSGIIDAFIYHGQRAIKK 289
           I+GG+F  SG+ID F +    ++ K
Sbjct: 351 IIGGIFAFSGMIDIFFFKFLSSVNK 375


>gi|328771759|gb|EGF81798.1| hypothetical protein BATDEDRAFT_86854 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 333

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/212 (32%), Positives = 100/212 (47%), Gaps = 21/212 (9%)

Query: 84  KGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGV 142
           KG   S+ DL D     G          + C   G  + NKV G  HF A G  +   GV
Sbjct: 138 KGLRDSSRDLEDHASESG--------TPDACRFRGSFQANKVEGMLHFTALGHGYF--GV 187

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           H         D+ N +H+I++L+FG  +P + NPLD       T    + YF+ VVPT+Y
Sbjct: 188 HT------PHDAINFTHRIDELSFGARYPDLHNPLDHTLEIGTTNFDSFMYFLGVVPTIY 241

Query: 203 TDVS----GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            D +    G T+ +NQ++VTE   + +      LPG+F  Y + PI V  TE  +  + F
Sbjct: 242 VDKARSLFGATLLTNQYAVTEFSHAVDPQNPDALPGIFIKYHIEPISVRITESRLGLVQF 301

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
            T +C I+GG F   G I  F  + +  +  K
Sbjct: 302 TTRMCGIIGGAFVTIGAILGFFRNVRTMLSAK 333


>gi|367025937|ref|XP_003662253.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
           42464]
 gi|347009521|gb|AEO57008.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
           42464]
          Length = 380

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/175 (37%), Positives = 95/175 (54%), Gaps = 19/175 (10%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           R+   E + C IYG LE+NKV G+FH  A G  + + G H+        ++FN SH I++
Sbjct: 181 RLWGAEADSCRIYGSLELNKVQGDFHITARGHGYMEFGEHL------DHNAFNFSHIISE 234

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTIQSNQFS 216
           L+FG   P +VNPLD  R     P+  Y  QYF+ VVPT Y+     +    ++ +NQ++
Sbjct: 235 LSFGPFLPSLVNPLD--RTVNTAPAHFYKFQYFLSVVPTTYSVGHPEERGSRSVLTNQYA 292

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           VTE  ++  +    T+PG+F  YD+ PI +   E   SF  FL  V  +V GV  
Sbjct: 293 VTEQSKAVPE---NTVPGIFVKYDIEPILLNIVETRDSFFVFLIKVINVVSGVLV 344


>gi|449303002|gb|EMC99010.1| hypothetical protein BAUCODRAFT_120300 [Baudoinia compniacensis
           UAMH 10762]
          Length = 387

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/176 (35%), Positives = 92/176 (52%), Gaps = 17/176 (9%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           K +E + C IYG +  NKV G+FH  A G  + + G H+      +  SFN SH IN+L+
Sbjct: 184 KSKEADSCRIYGSMHGNKVQGDFHITARGHGYMEFGQHL------EHSSFNFSHHINELS 237

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-------DVSGHTIQSNQFSVT 218
           FG  +P + NPLD      E     +QY++ VVPT+YT        ++  T+ +NQ++VT
Sbjct: 238 FGPFYPSLTNPLDNTLAATEFNFFKFQYYLSVVPTIYTTNAKALRKITKSTVFTNQYAVT 297

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           E  R   + +   +PGVF  YD+ PI +   EE  SF      +  ++ GV    G
Sbjct: 298 EQSRPVPENQ---VPGVFVKYDIEPILLMIAEERNSFPALFIRLVNVISGVLVAGG 350


>gi|238880883|gb|EEQ44521.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 345

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)

Query: 90  NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
            PDL D+  +E      + E     EG   C+I+G + VN+V G+F    GK F ++   
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           HV        +S N SH I + +FGE +P + NPLD      E     Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLY 237

Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
             + G  I +NQ+S+TE  H    +Q   R   +PG++F YD  PIK+   E+ + F  F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296

Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
           +  +  I GG+   +G +        FI++GQ+A+++  E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336


>gi|167383125|ref|XP_001736415.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165901233|gb|EDR27345.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 116

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 54/115 (46%), Positives = 75/115 (65%), Gaps = 9/115 (7%)

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQG 227
           +VNP+DG+     T + MYQYF++VVP  YT +    I +N +SVTEH+R     S EQG
Sbjct: 1   MVNPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNRIINTNGYSVTEHYRPGNLKSPEQG 60

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
               +PGVF  YD+S I+V + EE  SF H LT++C I+GGVF +  ++D FI+H
Sbjct: 61  ----IPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFIFH 111


>gi|68465583|ref|XP_723153.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|68465876|ref|XP_723006.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|46445018|gb|EAL04289.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|46445174|gb|EAL04444.1| likely COPII secretory vesicle component [Candida albicans SC5314]
          Length = 345

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)

Query: 90  NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
            PDL D+  +E      + E     EG   C+I+G + VN+V G+F    GK F ++   
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           HV        +S N SH I + +FGE +P + NPLD      E     Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLY 237

Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
             + G  I +NQ+S+TE  H    +Q   R   +PG++F YD  PIK+   E+ + F  F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296

Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
           +  +  I GG+   +G +        FI++GQ+A+++  E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336


>gi|241560364|ref|XP_002401002.1| COPII vesicle protein, putative [Ixodes scapularis]
 gi|215501827|gb|EEC11321.1| COPII vesicle protein, putative [Ixodes scapularis]
 gi|442749161|gb|JAA66740.1| Putative copii vesicle protein [Ixodes ricinus]
          Length = 285

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 73/204 (35%), Positives = 106/204 (51%), Gaps = 22/204 (10%)

Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
           GF++   K   G GC   G   ++KV GNFH     S H +        A Q D  +++H
Sbjct: 97  GFVENTEKTPVGAGCRFEGKFYIHKVPGNFHM----STHAA--------AKQPDKIDMTH 144

Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
            I+ L FG    E   G  N LD +  ++      + Y +K+VPTV+       I+S Q+
Sbjct: 145 IIHDLTFGNKMVEGVRGSFNSLDEMDKSEANGLESHDYVMKIVPTVFEKSPSERIESYQY 204

Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           +     +   S  GR+  +P ++F YDL+PI V +T   V    FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRSVPLYSFLTSVCAIVGGTFTVA 262

Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
           GI+D+ ++     I KK E+GK S
Sbjct: 263 GIVDSLVFTASE-IFKKYEMGKLS 285


>gi|302882273|ref|XP_003040047.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256720914|gb|EEU34334.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 376

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/205 (32%), Positives = 108/205 (52%), Gaps = 20/205 (9%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++   +++    +  + +G  + C +YG L +NKV G+FH  A G  +  +G H     
Sbjct: 167 DIVALGRKKAKWAKTPKVKGRADSCRVYGSLHLNKVQGDFHITARGHGYMGNGEH----- 221

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
                +FN SH I++L++G  +P +VNPLDG           +QY++ +VPTVY+ V   
Sbjct: 222 -LDHKNFNFSHIISELSYGPFYPSLVNPLDGTVNAASDNFHKFQYYLSIVPTVYS-VGSR 279

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           +I +NQ++VTE  +S  +     +PG+FF YD+ PI +T  E     L FL  +  IV G
Sbjct: 280 SILTNQYAVTEQSKSVNE---HYIPGIFFKYDIEPILLTVHESRDGILTFLVKIINIVSG 336

Query: 269 V-------FTVSGIIDAFIYHGQRA 286
           V       FT+S  +   I   +R+
Sbjct: 337 VLVAGHWGFTISDWVKDVIGRRRRS 361


>gi|401427507|ref|XP_003878237.1| hypothetical protein, unknown function [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494484|emb|CBZ29786.1| hypothetical protein, unknown function [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 309

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
            EGC + G+++V KV GNFH +     H    H         +  N+ H I+ L+FG   
Sbjct: 130 AEGCRLEGYIKVGKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTTD 182

Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
                    ++PLDG     E P  +YQYF+ +VPT+Y + S  T+ + QF+ T    SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                + +  V F Y LSPI V ++   VS  HFLT VCAI+GGV+TV+G++  F++   
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297

Query: 285 RAIKKKIEIGK 295
              ++++ +GK
Sbjct: 298 AQFQRRV-LGK 307


>gi|119497911|ref|XP_001265713.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
           fischeri NRRL 181]
 gi|119413877|gb|EAW23816.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
           fischeri NRRL 181]
          Length = 397

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 100/203 (49%), Gaps = 27/203 (13%)

Query: 94  IDQCKREGFLQRIKEEEGEG---CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 149
           + +  R+ F +  K   G+    C IYG LE NKV G+FH  A G  +H S  H+     
Sbjct: 170 VRRNPRKKFAKGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYHNSAPHL----- 224

Query: 150 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD----- 204
            +  +FN SH I +L+FG H+P ++NPLD    T E     YQYF+ +VPT+Y+      
Sbjct: 225 -EHKTFNFSHMITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLAL 283

Query: 205 -----------VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
                       S + I +NQ++ T    S+       +PG+FF Y++ PI +  +EE  
Sbjct: 284 DTYANAPPTSRYSKNLIFTNQYAATSQ-SSAIPENPYFIPGIFFKYNIEPILLMISEERT 342

Query: 254 SFLHFLTNVCAIVGGVFTVSGII 276
           SFL  L  +   + GV    G +
Sbjct: 343 SFLSLLVRLVNTISGVMVTGGWL 365


>gi|398021306|ref|XP_003863816.1| hypothetical protein, unknown function [Leishmania donovani]
 gi|322502049|emb|CBZ37133.1| hypothetical protein, unknown function [Leishmania donovani]
          Length = 309

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
            EGC + G+++V KV GNFH +     H    H         +  N+ H I+ L+FG   
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182

Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
                    ++PLDG     E P  +YQYF+ +VPT+Y + S  T+ + QF+ T    SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                + +  V F Y LSPI V ++   VS  HFLT VCAI+GGV+TV+G++  F++   
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297

Query: 285 RAIKKKIEIGK 295
              ++++ +GK
Sbjct: 298 AQFQRRV-LGK 307


>gi|146097219|ref|XP_001468078.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
 gi|134072444|emb|CAM71154.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
          Length = 309

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 102/191 (53%), Gaps = 19/191 (9%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
            EGC + G+++V KV GNFH +     H    H         +  N+ H I+ L+FG   
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182

Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
                    ++PLDG     E P  +YQYF+ +VPT+Y + S  T+ + QF+ T    SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEVPM-VYQYFLDIVPTIY-ESSFSTVHTYQFTGTS---SS 237

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                + +  V F Y LSPI V ++   VS  HFLT VCAI+GGV+TV+G++  F++   
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297

Query: 285 RAIKKKIEIGK 295
              ++++ +GK
Sbjct: 298 AQFQRRV-LGK 307


>gi|241953329|ref|XP_002419386.1| COPii-coated vesicle-associated protein, putative [Candida
           dubliniensis CD36]
 gi|223642726|emb|CAX42980.1| COPii-coated vesicle-associated protein, putative [Candida
           dubliniensis CD36]
          Length = 345

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 74/220 (33%), Positives = 113/220 (51%), Gaps = 26/220 (11%)

Query: 90  NPDLIDQCKREGFLQRIKEE-----EGE-GCNIYGFLEVNKVAGNFHFAPGKSF-HQSGV 142
            PDL D+  +E      + E     EG   C+I+G + VN+V G+F    GK F ++   
Sbjct: 126 TPDL-DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRIT-GKGFGYRDRS 183

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           HV        +S N SH I + +FGE +P + NPLD      E     Y Y+ KVVPT+Y
Sbjct: 184 HV------PFESLNFSHVIQEFSFGEFYPYLNNPLDATGKITEERLQTYMYYAKVVPTLY 237

Query: 203 TDVSGHTIQSNQFSVTE--HFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
             + G  I +NQ+S+TE  H    +Q   R   +PG++F YD  PIK+   E+ + F  F
Sbjct: 238 EQL-GLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQF 296

Query: 259 LTNVCAIVGGVFTVSGII------DAFIYHGQRAIKKKIE 292
           +  +  I GG+   +G +        FI++GQ+A+++  E
Sbjct: 297 IAKLATIGGGLLIAAGYLFRLYEKLLFIFYGQKAVQQNRE 336


>gi|400594740|gb|EJP62573.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Beauveria bassiana ARSEF 2860]
          Length = 374

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 63/162 (38%), Positives = 91/162 (56%), Gaps = 10/162 (6%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
            + C IYG L++NKV G+FH  A G  + + G H+        D FN SH I++L++G  
Sbjct: 185 ADSCRIYGSLDLNKVQGDFHITARGHGYMEFGQHL------DHDKFNFSHVISELSYGAF 238

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           +P +VNPLD            +QY++ VVPTVY+ V   TIQ+NQ++VTE  +S E    
Sbjct: 239 YPSLVNPLDRTVNVAAAHFHKFQYYLSVVPTVYS-VGRSTIQTNQYAVTE--QSKEIDEH 295

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
             +PG+F  YD+ PI +   E   SF+ FL  +  +V GV  
Sbjct: 296 SAVPGIFVKYDIEPILLAVHESRDSFIVFLLKLINVVSGVLV 337


>gi|308494873|ref|XP_003109625.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
 gi|308245815|gb|EFO89767.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
          Length = 286

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 18/190 (9%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
           GC      E+NKV GNFH +   +            A Q D++++ H I+ + FG+    
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPDNYDMRHTIHSIKFGDDVSH 157

Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +  G  +PL     +QE     ++Y +K+VP+V+ D SG+ + S Q++       +   
Sbjct: 158 KNLKGSFDPLANRDTSQENGLNTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P V+F Y+L PI +  TE+  SF  FLT++CA+VGG FTV+GIID+  +     +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277

Query: 288 KKKIEIGKFS 297
           KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286


>gi|358058634|dbj|GAA95597.1| hypothetical protein E5Q_02253 [Mixia osmundae IAM 14324]
          Length = 682

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 59/166 (35%), Positives = 85/166 (51%), Gaps = 8/166 (4%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E G  C IYG + V KV GN H       + S  H    L       N+SH I++ +FG 
Sbjct: 170 ENGPACRIYGTMAVKKVTGNLHITTLGHGYLSWEHTDHKL------MNLSHVIHEFSFGP 223

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
            FPG+  PLD      E+   ++QYF+ +V T Y D   + +++ Q+SVT+  R++  GR
Sbjct: 224 LFPGISQPLDNTLEVTESSFHIFQYFMSIVSTTYVDHHRNVLETAQYSVTDMSRATVHGR 283

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              +PG+F  YD  P+ +T  E   +   FL  +  IVGGV   SG
Sbjct: 284 --GVPGIFLKYDPEPMMLTLRERTTTLGQFLIRLAGIVGGVIVCSG 327


>gi|312081872|ref|XP_003143209.1| HT034 [Loa loa]
 gi|307761627|gb|EFO20861.1| hypothetical protein LOAG_07628 [Loa loa]
          Length = 292

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 20/204 (9%)

Query: 101 GFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNIS 158
           GF+Q  ++      GC   G  E++KV GNFH +          H  D    Q +++++ 
Sbjct: 102 GFVQNTEKIPIGTSGCRFEGKFEISKVPGNFHLS---------THAADT---QPETYDMR 149

Query: 159 HKINKLAFGEHF-----PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           H I+ + FG++       G  NPL      Q   S  + Y +K+VP+VY D++G+T  S 
Sbjct: 150 HTIHSVVFGDNIITSQNLGSFNPLKNREALQTDGSFTHDYVLKIVPSVYEDINGNTKYSY 209

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q++       +     + +P ++F Y+L PI + +TE    F  F+T++CA+VGG FTV+
Sbjct: 210 QYTYAHKEYVTYHYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVA 269

Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
           GIIDA ++     + +K +IGK S
Sbjct: 270 GIIDASLF-SLTELYRKHQIGKLS 292


>gi|212527292|ref|XP_002143803.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           marneffei ATCC 18224]
 gi|210073201|gb|EEA27288.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           marneffei ATCC 18224]
          Length = 402

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 64/183 (34%), Positives = 94/183 (51%), Gaps = 25/183 (13%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  +++ G H+         +FN +H + +L+FG H+
Sbjct: 191 DSCRIYGSLESNKVHGDFHITARGHGYNEVGQHL------DHSNFNFTHMVTELSFGPHY 244

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----------------DVSGHTIQSN 213
           P ++NPLD    + ET    +QYFI VVPT+Y                  + S +TI +N
Sbjct: 245 PSLLNPLDKTVASTETHYYKFQYFINVVPTIYAKGNNAVEKYTANPAKAFEKSRNTIFTN 304

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q+S T       +    T PG+FF Y++ PI +  +EE  SFL  L  +  +V GV    
Sbjct: 305 QYSATSQSHPLPESPFNT-PGIFFKYNIEPILLFVSEERGSFLALLVRLVNVVSGVIVTG 363

Query: 274 GII 276
           G +
Sbjct: 364 GWL 366


>gi|427788003|gb|JAA59453.1| Putative copii vesicle protein [Rhipicephalus pulchellus]
          Length = 285

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 105/204 (51%), Gaps = 22/204 (10%)

Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
           GF++   K   G GC   G   ++KV GNFH     S H +        A Q D  +++H
Sbjct: 97  GFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA--------AKQPDKIDMTH 144

Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
            I+ L FG    +   G  N LD +  +       + Y +K+VPTVY    G  I+S Q+
Sbjct: 145 IIHDLTFGVKMTDEVRGSFNSLDEMDKSGANGIESHDYVMKIVPTVYEKSKGERIESYQY 204

Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           +     +   S  GR+  +P ++F YDL+PI V +T   +    FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHSGRI--MPAIWFRYDLTPITVKYTRRGIPLYSFLTSVCAIVGGTFTVA 262

Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
           GI+D+ ++       +K E+GK S
Sbjct: 263 GIVDSLVFTASEVF-RKFEMGKLS 285


>gi|390594538|gb|EIN03948.1| DUF1692-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 551

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 82/167 (49%), Gaps = 8/167 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C IYG L+V KV  N H       + S  HV        D  N+SH I + +FG
Sbjct: 173 QPDGGACRIYGTLQVKKVTANLHITTAGHGYASVQHV------PHDQMNLSHVITEFSFG 226

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +FP +  PLD        P   YQYF+ VVPT Y       +++ Q+SVT + R  E G
Sbjct: 227 PYFPDITQPLDDSFEITTDPFIAYQYFLHVVPTTYVAPRSSPLKTAQYSVTHYTRVLEHG 286

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           R    PG+FF ++L P+ +T  +   +       V  +VGG+F  +G
Sbjct: 287 R--GTPGIFFKFELDPLSITVNQRTTTLAQLFIRVIGVVGGIFVCAG 331


>gi|195439332|ref|XP_002067585.1| GK16119 [Drosophila willistoni]
 gi|194163670|gb|EDW78571.1| GK16119 [Drosophila willistoni]
          Length = 443

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 63/202 (31%), Positives = 111/202 (54%), Gaps = 4/202 (1%)

Query: 89  SNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
           S P ++D   ++  LQ+  E + + C ++G L +NKVAG  H   G          H ++
Sbjct: 179 SLPAVLD-LHQDTHLQQ-PEAKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFQDHWMI 236

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
            F+R   N +H+IN+L+FG++   +V PL+G     +  +   QYF+K+VPT   + +  
Sbjct: 237 EFRRMPANFTHRINRLSFGQYSRRIVQPLEGDETIIQEEATTVQYFLKIVPT-EIEQTFS 295

Query: 209 TIQSNQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
           TI + Q+SVTE+ R  +  R     PG++F YD S +K+  + +    L F+  +C+I+ 
Sbjct: 296 TINTFQYSVTENVRKLDSERNSYGSPGIYFKYDWSALKIVVSNDRDHILTFVIRLCSIIS 355

Query: 268 GVFTVSGIIDAFIYHGQRAIKK 289
           G+  +SG I++ +   QR + +
Sbjct: 356 GIIVLSGAINSLLLGMQRRLLR 377


>gi|398412138|ref|XP_003857398.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
 gi|339477283|gb|EGP92374.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
          Length = 407

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 39/207 (18%)

Query: 98  KREGFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSF 155
           KR  +  R  + EE + C IYG +  NKV G+FH  A G  +     H+         +F
Sbjct: 172 KRYQYTPRTPRNEEADSCRIYGSMHSNKVQGDFHITARGHGYMAYSQHL------DHSAF 225

Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------------ 203
           N SH IN+L+FG ++P +VNPLD      E     +QY++ VVPT+YT            
Sbjct: 226 NFSHHINELSFGPYYPKLVNPLDSTYARTEAHFHKFQYYLSVVPTIYTVDVNALKRMDSK 285

Query: 204 ----------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
                            V+ H++ +NQ++VTE   S  +     +PG+FF YD+ P+++T
Sbjct: 286 YETPSSGDDGLNQHPRRVTQHSVFTNQYAVTEQSHSVPENH---VPGIFFKYDIEPLQLT 342

Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             EE  S    L  +  +V G+    G
Sbjct: 343 IAEEWTSVPALLLRIVNVVSGLLVAGG 369


>gi|157874469|ref|XP_001685717.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
           Friedlin]
 gi|68128789|emb|CAJ08922.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
           Friedlin]
          Length = 309

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/191 (35%), Positives = 101/191 (52%), Gaps = 19/191 (9%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE-- 168
            EGC + G+++V KV GNFH +     H    H         +  N+ H I+ L+FG   
Sbjct: 130 AEGCRLEGYIKVAKVPGNFHISSHGRQHLLAQHF-------PNGINVEHSIHHLSFGTID 182

Query: 169 ----HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
                    ++PLDG     E P  +YQYF+ +VPT+Y + S  T+ + QF+ T    SS
Sbjct: 183 VKKLAKKAALHPLDGKEHRSEMPM-VYQYFLDIVPTIY-ESSFSTVYTYQFTGTS---SS 237

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                + +  V F Y LSPI V ++   VS  HFLT VCAI+GGV+TV+G++  F++   
Sbjct: 238 TPVPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSA 297

Query: 285 RAIKKKIEIGK 295
              ++ + +GK
Sbjct: 298 AQFQRHV-LGK 307


>gi|19112857|ref|NP_596065.1| COPII-coated vesicle component Erv41 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74582843|sp|O94283.1|ERV41_SCHPO RecName: Full=ER-derived vesicles protein 41
 gi|3850069|emb|CAA21880.1| COPII-coated vesicle component Erv41 (predicted)
           [Schizosaccharomyces pombe]
          Length = 333

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/184 (34%), Positives = 97/184 (52%), Gaps = 12/184 (6%)

Query: 97  CKREGFLQRIKEEEGEG--CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRD 153
            + E F ++   E G G  C IYG L VN+V G  H  APG  + +S +  H        
Sbjct: 133 ARTEKFRKKNNAEPGSGTACRIYGQLVVNRVNGQLHITAPGWGYGRSNIPFH-------- 184

Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           S N +H I +L+FGE++P +VN LDG           +QY++ V+PT Y   S  + ++N
Sbjct: 185 SLNFTHYIEELSFGEYYPALVNALDGHYGHANDHPFAFQYYLSVLPTSYKS-SFRSFETN 243

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q+S+TE+    + G     PG+F  YDL P+ V   ++H +    L  + AI GG+ TV+
Sbjct: 244 QYSLTENSVVRQLGFGSLPPGIFIDYDLEPLAVRVVDKHPNVASTLLRILAISGGLITVA 303

Query: 274 GIID 277
             I+
Sbjct: 304 SWIE 307


>gi|448105220|ref|XP_004200441.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|448108351|ref|XP_004201072.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|359381863|emb|CCE80700.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|359382628|emb|CCE79935.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
          Length = 344

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/194 (34%), Positives = 100/194 (51%), Gaps = 17/194 (8%)

Query: 89  SNPDLIDQCKREGFLQRIK------EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 142
           + PDL D+  RE              E+   C+IYG + VNKVAG+FH   GK F  +  
Sbjct: 125 NTPDL-DEVMRETVRAEFNVAGTRMNEDASACHIYGSIPVNKVAGDFHIT-GKGFGYADR 182

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           H    + F++   N SH I + +FGE +P + NPLD            Y+YF+  VPT+Y
Sbjct: 183 HR---VPFEK--LNFSHVIMEFSFGEFYPMIKNPLDFTGKIASQKLQSYKYFMTAVPTLY 237

Query: 203 TDVSGHTIQSNQFSVTEHFR---SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
             + G  + + Q+S+TE  R   + E G    +PG++F YD   IK+   E+ + FL F+
Sbjct: 238 EKL-GIEVDTYQYSLTEQHRAITTDETGLPSDIPGLYFKYDFDTIKLLIAEKRIPFLQFV 296

Query: 260 TNVCAIVGGVFTVS 273
             +  IV G+F V+
Sbjct: 297 ARLATIVSGLFIVA 310


>gi|336370998|gb|EGN99338.1| hypothetical protein SERLA73DRAFT_108802 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336383753|gb|EGO24902.1| hypothetical protein SERLADRAFT_449635 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 503

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 87/181 (48%), Gaps = 8/181 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C IYG L+V KV  N H       + S VHV           N+SH I + +FG
Sbjct: 166 QADGSACRIYGTLQVKKVTANLHITTLGHGYTSNVHV------DHTKMNLSHVITEFSFG 219

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +FP +  PLD      + P   YQYF+ VVPT +       + +NQ+SVT H+    +G
Sbjct: 220 PYFPDITQPLDYSFEVAKDPFVAYQYFLHVVPTTFIAPRSEPLHTNQYSVT-HYTRVLKG 278

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
              T PG+FF +DL P+ +T  +   SFL        ++GGVFT +     F      A+
Sbjct: 279 HHGT-PGIFFKFDLDPMVITIHQRTTSFLQLFIRCVGVIGGVFTCTSYFLRFTTRAVDAV 337

Query: 288 K 288
            
Sbjct: 338 S 338


>gi|119191516|ref|XP_001246364.1| hypothetical protein CIMG_00135 [Coccidioides immitis RS]
 gi|392864406|gb|EAS34753.2| COPII-coated vesicle protein [Coccidioides immitis RS]
          Length = 399

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 114/242 (47%), Gaps = 52/242 (21%)

Query: 64  ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +  D+   +   EVR +++++   G  L   D++D C+                 IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRQFPPGPKLKRKDVVDSCR-----------------IYGSL 199

Query: 121 EVNKVAGNFHF-APGKSFHQSG--VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           E NKV GNFH  A G  ++     V+V+D+        N +H I +L+FG H+P ++NPL
Sbjct: 200 EGNKVQGNFHITAKGLGYYDPTGMVNVNDM--------NFTHLITELSFGPHYPTLLNPL 251

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSV 217
           D      +     YQY++ VVPT+YT                        +TI +NQ++V
Sbjct: 252 DKTVAATKDKFYKYQYYLSVVPTIYTRAGTVDPYSQRLPDPSTITVSQRKNTIFTNQYAV 311

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           T   R+  QG   ++PG+FF +D+ PI +  +EE  S L  L  +  +V GV    G + 
Sbjct: 312 TSQSRTISQGPY-SVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWVF 370

Query: 278 AF 279
            F
Sbjct: 371 NF 372


>gi|346469653|gb|AEO34671.1| hypothetical protein [Amblyomma maculatum]
          Length = 285

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 105/204 (51%), Gaps = 22/204 (10%)

Query: 101 GFLQRI-KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
           GF++   K   G GC   G   ++KV GNFH     S H +        A Q +  +++H
Sbjct: 97  GFVENTEKTPVGSGCRFEGKFFIHKVPGNFHV----STHAA--------AKQPEKIDMTH 144

Query: 160 KINKLAFG----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
            I+ L FG    +   G  N LD +  +       + Y +K+VPTVY    G  I+S Q+
Sbjct: 145 IIHDLTFGVKMTDEVKGSFNSLDEMDKSGGNGIESHDYVMKIVPTVYEKSRGERIESYQY 204

Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           +     +   S  GR+  +P ++F YDL+PI V +T   V    FLT+VCAIVGG FTV+
Sbjct: 205 TYAYKSYVSISHTGRI--MPAIWFRYDLTPITVKYTRRGVPLYSFLTSVCAIVGGTFTVA 262

Query: 274 GIIDAFIYHGQRAIKKKIEIGKFS 297
           GI+D+ I+       +K E+GK S
Sbjct: 263 GIVDSLIFTASEVF-RKFEMGKLS 285


>gi|303313533|ref|XP_003066778.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240106440|gb|EER24633.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320036232|gb|EFW18171.1| COPII-coated vesicle protein [Coccidioides posadasii str. Silveira]
          Length = 399

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/242 (30%), Positives = 114/242 (47%), Gaps = 52/242 (21%)

Query: 64  ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +  D+   +   EVR +++++   G  L   D++D C+                 IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRQFPPGPKLKRKDVVDSCR-----------------IYGSL 199

Query: 121 EVNKVAGNFHF-APGKSFHQSG--VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           E NKV GNFH  A G  ++     V+V+D+        N +H I +L+FG H+P ++NPL
Sbjct: 200 EGNKVQGNFHITAKGLGYYDPTGMVNVNDM--------NFTHLITELSFGPHYPTLLNPL 251

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSV 217
           D      +     YQY++ VVPT+YT                        +TI +NQ++V
Sbjct: 252 DKTVAATKDKFYKYQYYLSVVPTIYTRAGTVDPYSQRLPDPSTITPSQRKNTIFTNQYAV 311

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           T   R+  QG   ++PG+FF +D+ PI +  +EE  S L  L  +  +V GV    G + 
Sbjct: 312 TSQSRTISQGPY-SVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWVF 370

Query: 278 AF 279
            F
Sbjct: 371 NF 372


>gi|402591333|gb|EJW85263.1| hypothetical protein WUBG_03826, partial [Wuchereria bancrofti]
          Length = 244

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 101/190 (53%), Gaps = 18/190 (9%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
           GC + G  E++KV GNFH +          H  D    Q +++++ H I+ + FG+    
Sbjct: 68  GCRLEGKFEISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDIST 115

Query: 172 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
               G  NPL      +   S  + Y +K+VP+VY D++G+   S Q++       +   
Sbjct: 116 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 175

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P ++F Y+L PI + +TE    F  F+T++CA+VGG FTV+GIIDA ++     +
Sbjct: 176 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 234

Query: 288 KKKIEIGKFS 297
            +K ++GK S
Sbjct: 235 YRKHQMGKLS 244


>gi|17570549|ref|NP_508375.1| Protein Y102A11A.6 [Caenorhabditis elegans]
 gi|351063407|emb|CCD71590.1| Protein Y102A11A.6 [Caenorhabditis elegans]
          Length = 286

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 101/190 (53%), Gaps = 18/190 (9%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
           GC      E+NKV GNFH +   +            A Q +S+++ H I+ + FG+    
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ATQPESYDMRHLIHSIKFGDDVSH 157

Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +  G  +PL     +QE     ++Y +K+VP+V+ D SG  + S Q++       +   
Sbjct: 158 KNLKGSFDPLAKRNTSQENGLNTHEYILKIVPSVHEDYSGTILNSYQYTFGHKSYITYHH 217

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P V+F Y+L PI +  TE+  SF  FLT++CA+VGG FTV+GIID+  +     +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277

Query: 288 KKKIEIGKFS 297
           KK+  +GK +
Sbjct: 278 KKQ-RLGKLT 286


>gi|70988875|ref|XP_749289.1| COPII-coated vesicle protein (Erv41) [Aspergillus fumigatus Af293]
 gi|66846920|gb|EAL87251.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           fumigatus Af293]
 gi|159128703|gb|EDP53817.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           fumigatus A1163]
          Length = 379

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 78/252 (30%), Positives = 110/252 (43%), Gaps = 53/252 (21%)

Query: 44  QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKRE 100
           Q H  RL   E           +D    +   EVR   RKK   G  L   D +D C+  
Sbjct: 130 QEHADRLSEQE-----------ADAHVHHVLGEVRRNPRKKFAKGPKLRRGDAVDSCR-- 176

Query: 101 GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISH 159
                          IYG LE NKV G+FH  A G  +H +  H+      +  +FN SH
Sbjct: 177 ---------------IYGSLEGNKVQGDFHITARGHGYHNNAPHL------EHKTFNFSH 215

Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------DVSGHTIQSN 213
            I +L+FG H+P ++NPLD    T E     YQYF+ +VPT+Y+      D   +   SN
Sbjct: 216 MITELSFGPHYPTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLALDTYANAPPSN 275

Query: 214 Q----FSVTEHFRSSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
           +       T  +  + Q  +       +PG+FF Y++ PI +  +EE  SFL  L  +  
Sbjct: 276 RRGKNLVFTNQYAVTSQSSVIPESPYFIPGLFFKYNIEPILLLISEERTSFLSLLVRLVN 335

Query: 265 IVGGVFTVSGII 276
            V GV    G +
Sbjct: 336 TVSGVMVTGGWL 347


>gi|346970151|gb|EGY13603.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium dahliae VdLs.17]
          Length = 373

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/190 (35%), Positives = 100/190 (52%), Gaps = 20/190 (10%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++ Q K+     R     G  + C I+G L++NKV G+FH  A G  +  +G H+    
Sbjct: 160 DIVAQSKKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL---- 215

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT--- 203
                SFN SH +N+L+FG  +P + NPLD  R     P+    +QY++ +VPTVYT   
Sbjct: 216 --DHTSFNFSHIVNELSFGAFYPNLENPLD--RTVNLAPANFHKFQYYLSIVPTVYTVGR 271

Query: 204 -DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
                +T+ +NQF+VTE  +S E G   ++PGVF  YD+ PI +   E    F+ F   V
Sbjct: 272 SASKANTVYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKV 328

Query: 263 CAIVGGVFTV 272
             ++ GV   
Sbjct: 329 INVLSGVLVA 338


>gi|320591987|gb|EFX04426.1| copii-coated vesicle protein [Grosmannia clavigera kw1407]
          Length = 385

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/182 (33%), Positives = 96/182 (52%), Gaps = 19/182 (10%)

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
           R G   R++    + C I+G L++N+V G++H  A G  + + G H+         SFN 
Sbjct: 179 RWGKTPRLRGAAPDSCRIFGSLDLNRVQGDYHITARGHGYMEMGDHL------DHTSFNF 232

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMY--QYFIKVVPTVYT-----DVSGHTI 210
           SH +N+L+FG  +P +VNPLD      E  +  Y  QYF+ +VPTVY+       S  +I
Sbjct: 233 SHVVNELSFGPFYPSLVNPLDQT--VNEATANFYRFQYFMSIVPTVYSVGHAGSRSARSI 290

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
            +NQ++VTE     +Q   + +PG+FF YD+ PI +   E    FL F+  +  ++ G  
Sbjct: 291 VTNQYAVTEQSAEIDQ---RAIPGIFFKYDIEPILLYIEESRDGFLVFVLKIVNVLSGAL 347

Query: 271 TV 272
             
Sbjct: 348 VA 349


>gi|294655234|ref|XP_457337.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
 gi|199429792|emb|CAG85341.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
          Length = 354

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 56/172 (32%), Positives = 101/172 (58%), Gaps = 11/172 (6%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSF-HQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E    C+I+G + VN+V G+FH   GK F +  G     ++ F+  + N +H I++ ++G
Sbjct: 150 EGAPACHIFGSIPVNQVKGDFHIT-GKGFGYNDG---RSVVPFE--ALNFTHVISEFSYG 203

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
           + +P + NPLD      E     Y+Y+ KVVPT+Y  + G  I +NQ+S+TE    ++ +
Sbjct: 204 DFYPFINNPLDFTGKVTEQKLQAYKYYSKVVPTIYEKL-GMIIDTNQYSLTEQHNVYKVN 262

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
               ++ +PG+FF Y+  PIK+  +E+ + F+ F++ +  I+GG+  V+G +
Sbjct: 263 RFNNVEGIPGIFFKYEFEPIKLIISEKRIPFIQFVSRLATIIGGLLIVAGYL 314


>gi|164661257|ref|XP_001731751.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
 gi|159105652|gb|EDP44537.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
          Length = 454

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 56/173 (32%), Positives = 98/173 (56%), Gaps = 20/173 (11%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFH---FAP---GKSFHQSGVHVHDILAFQRDSFNISHKIN 162
           EE   C +YG + V KV GN H   F P     + H++G+ +           ++SH I+
Sbjct: 217 EEARACRVYGSILVKKVTGNLHISTFVPTFMAVNAHENGMGI-----------DMSHIIH 265

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
           + +FG++FP +  PLD      + P+  +QYF+ VVPT +       I++NQ+SV + ++
Sbjct: 266 EFSFGDYFPNIAEPLDASLELTDDPAAAFQYFLSVVPTHFIH-GRRVIKTNQYSVHD-YK 323

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
            + QG L T PG++F YD+ P+ +  T + VS + F+  VC+++GG++  + +
Sbjct: 324 RNPQGSL-TFPGLYFKYDIEPLTMKVTHKSVSLVAFIVRVCSVLGGLWICTDL 375


>gi|443897407|dbj|GAC74748.1| CDK9 kinase-activating protein cyclin T [Pseudozyma antarctica
           T-34]
          Length = 414

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 52/154 (33%), Positives = 84/154 (54%), Gaps = 8/154 (5%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG +EV +V GN H       + S  H    L       N+SH I++ +FG +
Sbjct: 171 DGPACRIYGSMEVKRVTGNLHITTLGHGYLSMEHTDHKL------MNLSHVIHEFSFGPY 224

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +  PLD    T +    ++QYF+  +PT++ D  G  + ++Q+SVT++ R  E G+ 
Sbjct: 225 FPEISQPLDSSVETTDKHFTVFQYFVSAIPTLFIDARGRRLHTHQYSVTDYARPIEHGK- 283

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
             +PG+F  YD+ P+++T  E  VS + FL  + 
Sbjct: 284 -GVPGIFIKYDIEPLQMTIRERSVSLVQFLVRLA 316


>gi|121710902|ref|XP_001273067.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           clavatus NRRL 1]
 gi|119401217|gb|EAW11641.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           clavatus NRRL 1]
          Length = 401

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 88/302 (29%), Positives = 130/302 (43%), Gaps = 64/302 (21%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGI--GAPKIDKPLQRHGGRLEHNETYCGS 59
           D SG++ L    ++ ++   S    +E R   I  GA +     Q HG RL   E     
Sbjct: 106 DASGDRIL--AGELLQRERTSWNLWMEKRNYEIHGGAHEYQTLNQEHGDRLAEQE----- 158

Query: 60  CYGAESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
                  D    +   EVR   RKK   G  L   D++D C+                 I
Sbjct: 159 ------QDAHVHHVLGEVRRNPRKKFPRGPRLRRGDVVDSCR-----------------I 195

Query: 117 YGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           YG LE NKV G+FH  A G  +H +  H+      +  +FN SH + +L+FG H+P ++N
Sbjct: 196 YGSLEGNKVQGDFHITARGHGYHAAAPHL------EHSTFNFSHMVTELSFGPHYPTILN 249

Query: 176 PLDGVRWTQETPSGMYQYFIKVVPTVYT---------------------DVSGHTIQSNQ 214
           PLD    T E     YQYF+ VVPT+Y+                     + + + I +NQ
Sbjct: 250 PLDKTIATTEEHYYKYQYFLSVVPTIYSKGNLALDAYSGSAPTLHDPNRNRNRNLIFTNQ 309

Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           ++ T    +  +     +PG+FF Y + PI +  +EE  SFL  L  +   V GV    G
Sbjct: 310 YAATSQSTALPESPY-FVPGIFFKYSIEPILLIISEERGSFLTLLVRLVNTVSGVIVTGG 368

Query: 275 II 276
            +
Sbjct: 369 WL 370


>gi|123483410|ref|XP_001324018.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121906894|gb|EAY11795.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 384

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/242 (30%), Positives = 110/242 (45%), Gaps = 13/242 (5%)

Query: 51  EHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC--KREGFLQRIKE 108
           E   + C SCYG    +  CCN+CE+    +   G A +  D   QC  K  G     K 
Sbjct: 121 ETISSICHSCYGL-LPEGSCCNSCEQTLLLHIMNGKAANTKDW-PQCQGKNPG-----KV 173

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
            E E C I G + +NK  GNFH APG +  +   HVHD L+ Q  +F++SH I  +  G 
Sbjct: 174 YENEKCRIKGKVCLNKAQGNFHIAPGTNMKERYGHVHD-LSGQLPNFDLSHVIQGMRVGP 232

Query: 169 HFPGVVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
             P   NPL  V+  Q      +Y+Y + V P VY   SG+ I    +  T        G
Sbjct: 233 KIPLTYNPLRYVQQIQNPNQPVVYRYDLVVTPAVYK--SGNRILGKGYDYTAMINRFFVG 290

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
                PG++F Y  +P  VT    +++     T++   + G + +  IID  ++   + +
Sbjct: 291 NSGGAPGIYFHYSFTPYGVTVNATYLTIAQIFTSIFGFMSGAYAIFSIIDESMFKDDKRM 350

Query: 288 KK 289
            K
Sbjct: 351 AK 352


>gi|46137745|ref|XP_390564.1| hypothetical protein FG10388.1 [Gibberella zeae PH-1]
          Length = 376

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 99/183 (54%), Gaps = 18/183 (9%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L++NKV G+FH  A G  +   G H+          FN SH I++L++G  +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNFSHIISELSYGPFY 242

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           P + NPLDG   T +     +QY++ VVPTVY+ V+  +I +NQ++VTE  ++ +    +
Sbjct: 243 PSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAVTEQSKAVDD---R 298

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
            +PG+FF YD+ PI +T  E     +     +  I+ GV       FT+S  I   I   
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGFTISDWIHDVIGRR 358

Query: 284 QRA 286
           +R+
Sbjct: 359 RRS 361


>gi|408393109|gb|EKJ72376.1| hypothetical protein FPSE_07400 [Fusarium pseudograminearum CS3096]
          Length = 376

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 99/183 (54%), Gaps = 18/183 (9%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L++NKV G+FH  A G  +   G H+          FN SH I++L++G  +
Sbjct: 189 DSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL------DHSKFNFSHIISELSYGPFY 242

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           P + NPLDG   T +     +QY++ VVPTVY+ V+  +I +NQ++VTE  ++ +    +
Sbjct: 243 PSLENPLDGTVNTADGNFHKFQYYLSVVPTVYS-VNSRSILTNQYAVTEQSKAVDD---R 298

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------FTVSGIIDAFIYHG 283
            +PG+FF YD+ PI +T  E     +     +  I+ GV       FT+S  I   I   
Sbjct: 299 YIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVLVAGHWGFTISDWIHDVIGRR 358

Query: 284 QRA 286
           +R+
Sbjct: 359 RRS 361


>gi|389749487|gb|EIM90658.1| DUF1692-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 533

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 57/163 (34%), Positives = 82/163 (50%), Gaps = 7/163 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C +YG LEV KV  N H       + S VHV           N+SH I + +FG
Sbjct: 169 QADGSACRVYGSLEVKKVTANLHITSLGHGYASKVHV------DHTKINMSHVITEFSFG 222

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            HFP +V PLD            YQYF++VVPT Y       + +NQ+SVT + R+ EQ 
Sbjct: 223 PHFPDIVQPLDNSFEITHDHFTAYQYFMRVVPTTYVAPRSAPLNTNQYSVTHYTRTFEQ- 281

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
                PG+FF +++ P+++   +   +F  F      +VGGVF
Sbjct: 282 HSGLAPGIFFKFEIEPVRLIQHQRTTTFAQFFVRWAGVVGGVF 324


>gi|346322712|gb|EGX92310.1| COPII-coated vesicle protein (Erv41), putative [Cordyceps militaris
           CM01]
          Length = 376

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/163 (37%), Positives = 90/163 (55%), Gaps = 10/163 (6%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
            + C +YG L++NKV G+FH  A G  + + G H+        + FN SH I++L++G  
Sbjct: 186 ADSCRVYGSLDLNKVQGDFHITARGHGYMEFGQHL------DHNQFNFSHVISELSYGAF 239

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           +P +VNPLD            +QY++ VVPT+Y+ V   TIQ+NQ++VTE  +S E    
Sbjct: 240 YPSLVNPLDRTVNLAAAHFHKFQYYLSVVPTIYS-VGSSTIQTNQYAVTE--QSKEIDEH 296

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
             +PG+F  YD+ PI +   E   SF  FL  +  IV GV   
Sbjct: 297 SAVPGIFVKYDIEPILLAVHESRDSFPVFLLKLINIVSGVLVA 339


>gi|302422316|ref|XP_003008988.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
 gi|261352134|gb|EEY14562.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
          Length = 374

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 65/188 (34%), Positives = 97/188 (51%), Gaps = 16/188 (8%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++ Q K+     R     G  + C I+G L++NKV G+FH  A G  +  +G H+    
Sbjct: 161 DIVAQSKKRQKWARTPRLRGPPDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL---- 216

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT----D 204
                SFN SH +N+L+FG  +P + NPLD            +QY++ +VPTVYT     
Sbjct: 217 --DHTSFNFSHIVNELSFGAFYPNLENPLDRTVNLASANFHKFQYYLSIVPTVYTVGRSA 274

Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
              +T+ +NQF+VTE  +S E G   ++PGVF  YD+ PI +   E    F+ F   V  
Sbjct: 275 SKANTVYTNQFAVTE--QSKEVGD-HSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVIN 331

Query: 265 IVGGVFTV 272
           ++ GV   
Sbjct: 332 VLSGVLVA 339


>gi|358390077|gb|EHK39483.1| hypothetical protein TRIATDRAFT_302881 [Trichoderma atroviride IMI
           206040]
          Length = 372

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 105/205 (51%), Gaps = 20/205 (9%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D+I   +R     R     G  + C ++G +++NKV G+FH  A G  +   G H+    
Sbjct: 163 DIIALTQRRAKWARTPRPRGKPDSCRMFGSMDLNKVQGDFHITARGHGYMGMGQHL---- 218

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
               D FN SH I+++++G ++P +VNPLD    +       +QY++ VVPTVY   +  
Sbjct: 219 --DHDKFNFSHIISEMSYGPYYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANRR 275

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
            + +NQ++VTEH ++        +PG+FF YD+ PI ++  E    FL F+  +  I  G
Sbjct: 276 IVNTNQYAVTEHSKTISD---HQIPGIFFKYDIEPILLSVEESRDGFLSFVIKIVNIFSG 332

Query: 269 V-------FTVSGIIDAFIYHGQRA 286
           V       FT+S  I   I   +R+
Sbjct: 333 VMVAGHWGFTLSDWIREVIGKRRRS 357


>gi|123425245|ref|XP_001306773.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121888365|gb|EAX93843.1| hypothetical protein TVAG_177510 [Trichomonas vaginalis G3]
          Length = 353

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 91/301 (30%), Positives = 139/301 (46%), Gaps = 42/301 (13%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETY--CG 58
           +D SG    + + DI ++RLD                   KPL++     +    +  CG
Sbjct: 86  IDASGNPQPNARQDISRQRLDVHF----------------KPLEQLISDSDPKSVFQTCG 129

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           +C GA  S   CC  C ++  ++R+    + N   ++QC R+    +   E+ E C I  
Sbjct: 130 NCLGANVSK--CCLTCTDIANSFRQMEEFIPNLQNVEQCNRD----KKAIEDKETCRIVA 183

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD------SFNISHKINKLAFGEHFPG 172
            L       N HF  GK    +G  V   + ++ D      + N++H I+ L FG  F G
Sbjct: 184 KL-------NTHFTKGKLTIMAGGIVPTPVNYKFDLSHFGDNVNLTHTIHTLRFGRDFEG 236

Query: 173 VVNPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
           + NPLD     Q   S  MY Y I +VPT+  DV    I ++Q+S +   +   +   + 
Sbjct: 237 LKNPLDNYTNNQLKKSQFMYNYKIDLVPTITNDVENQ-IPAHQYSASSSSKEITKMITKK 295

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
            PG+ F +D +P+   F  E  S   FLT +CAI+GG FT+ G ID+FI+   R   KK 
Sbjct: 296 HPGITFDFDTAPVAARFIVEKQSLSSFLTQLCAILGGGFTLGGFIDSFIF---RVRAKKF 352

Query: 292 E 292
           E
Sbjct: 353 E 353


>gi|341874049|gb|EGT29984.1| hypothetical protein CAEBREN_24080 [Caenorhabditis brenneri]
          Length = 286

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 102/190 (53%), Gaps = 18/190 (9%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
           GC      E+NKV GNFH +   +            A Q +++++ H I+ + FG+    
Sbjct: 110 GCRFESRFEINKVPGNFHLSTHSA------------ASQPENYDMKHIIHSIKFGDDVSH 157

Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +  G  +PL      QE     ++Y +K+VP+V+ D SG+ + S Q++       +   
Sbjct: 158 KNLKGSFDPLANRDSLQENGLSTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYITYHH 217

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P V+F Y+L PI +  TE+  SF  FLT++CA+VGG FTV+GIID+  +     +
Sbjct: 218 SGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTISELV 277

Query: 288 KKKIEIGKFS 297
           KK+ ++GK +
Sbjct: 278 KKQ-QMGKLT 286


>gi|169778245|ref|XP_001823588.1| COPII-coated vesicle protein (Erv41) [Aspergillus oryzae RIB40]
 gi|83772325|dbj|BAE62455.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 390

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  +   G H+         +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGTHY 239

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
           P ++NPLD      E+    YQYF+ VVPT+Y+      + S  ++           T  
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299

Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           + ++ QG         +PG+FF Y++ PI +  +EE  SFL  L  +   V GV    G 
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359

Query: 276 I 276
           +
Sbjct: 360 L 360


>gi|170108190|ref|XP_001885304.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
 gi|164639780|gb|EDR04049.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
          Length = 398

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 83/167 (49%), Gaps = 8/167 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           +  G  C ++G L+V +V  N H       + S  HV        +  N+SH I + +FG
Sbjct: 169 QPHGNACRVWGSLQVKRVTANLHITTLGHGYASYEHV------DHNQMNLSHVITEFSFG 222

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            HFP +  PLD    + +     YQYF+ VVPT Y       +Q++Q+SVT + R  +  
Sbjct: 223 PHFPDITQPLDNSFESTDERFVAYQYFLHVVPTTYIAPRSAPLQTHQYSVTHYTRVMQHN 282

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             Q  PG+FF +DL P+ +T  +   +FL  L     ++GGVF   G
Sbjct: 283 --QGTPGIFFKFDLDPLAITQHQRTTTFLQLLIRCVGVIGGVFVCMG 327


>gi|123430864|ref|XP_001307985.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121889642|gb|EAX95055.1| hypothetical protein TVAG_428580 [Trichomonas vaginalis G3]
          Length = 358

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 74/299 (24%), Positives = 133/299 (44%), Gaps = 35/299 (11%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           MD  G Q   +K+ +  +RL++ G VI    D +                      C  C
Sbjct: 90  MDSLGFQRSYIKNTVTFRRLNNLGRVIGYTNDTLSD-------------------VCEPC 130

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKRE---GFLQRIKEEEGEGCNIY 117
           Y   ++ ++CCN+C +V+        +L     +D  K      + ++      E C + 
Sbjct: 131 YNLSTNPDECCNSCLKVQL------LSLMQNKPVDFSKYRVCNNYEKKPNVSLSEKCLVK 184

Query: 118 GFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           G L VN++ G+FH APG +  QS  ++HD+ + Q    +++H I +L FG H P   NPL
Sbjct: 185 GKLTVNRIPGSFHIAPGTNVPQSA-YLHDLSSMQM-FHDMTHSIQRLRFGPHIPRTSNPL 242

Query: 178 DGVRWTQETPS--GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           D  +  Q+ P+    Y Y + + P ++       ++  +++       + Q      PG+
Sbjct: 243 DNFKSFQQIPTHDRTYFYNLLITPVIFYRDGVEYLKGYEYTAFSEAIDTFQ-LFGISPGL 301

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
           FF Y  +P  +  +    +FL F++N   ++ G++    I+D  I  G+      +EIG
Sbjct: 302 FFQYQFTPYTIVVSANRQNFLQFISNTFGVISGIYACLSILDKLI--GEDIGSNVVEIG 358


>gi|391872305|gb|EIT81439.1| COPII vesicle protein [Aspergillus oryzae 3.042]
          Length = 390

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  +   G H+         +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGPHY 239

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
           P ++NPLD      E+    YQYF+ VVPT+Y+      + S  ++           T  
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299

Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           + ++ QG         +PG+FF Y++ PI +  +EE  SFL  L  +   V GV    G 
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359

Query: 276 I 276
           +
Sbjct: 360 L 360


>gi|238495520|ref|XP_002378996.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
           NRRL3357]
 gi|220695646|gb|EED51989.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
           NRRL3357]
          Length = 390

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 89/181 (49%), Gaps = 22/181 (12%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  +   G H+         +FN SH I +L+FG H+
Sbjct: 186 DSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHSTFNFSHMITELSFGPHY 239

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----------VTEH 220
           P ++NPLD      E+    YQYF+ VVPT+Y+      + S  ++           T  
Sbjct: 240 PTLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALDSTLYTSKPSHSKNVIFTNQ 299

Query: 221 FRSSEQG-----RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           + ++ QG         +PG+FF Y++ PI +  +EE  SFL  L  +   V GV    G 
Sbjct: 300 YAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFLSLLIRLVNTVSGVMVTGGW 359

Query: 276 I 276
           +
Sbjct: 360 L 360


>gi|242783317|ref|XP_002480163.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218720310|gb|EED19729.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 400

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 63/183 (34%), Positives = 92/183 (50%), Gaps = 25/183 (13%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  +++ G H+         +FN +H I +L+FG H+
Sbjct: 190 DSCRIYGSLESNKVHGDFHITARGHGYNELGEHL------DHKTFNFTHMITELSFGPHY 243

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----------------DVSGHTIQSN 213
           P ++NPLD      E     +QYF+ VVPT+Y                    S +TI +N
Sbjct: 244 PSLLNPLDKTVAYTEDHYYKFQYFLNVVPTIYAKGNNAVEKYTANPALAFKKSRNTIFTN 303

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q+S T    +  +    T PG+FF Y++ PI +  +EE  SFL  L  +  +V GV    
Sbjct: 304 QYSATSQSHALPENPYNT-PGIFFKYNIEPILLFVSEERGSFLALLVRLVNVVSGVIVTG 362

Query: 274 GII 276
           G +
Sbjct: 363 GWL 365


>gi|300123494|emb|CBK24766.2| unnamed protein product [Blastocystis hominis]
          Length = 235

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 69/228 (30%), Positives = 106/228 (46%), Gaps = 24/228 (10%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVI---ESRQDGIGAPKIDKPLQR-----HGGRLEHN 53
           D  G    D++++I K  LD  GN I   +  Q  +  P  ++ L+          +  +
Sbjct: 8   DALGNDRADIENEILKTNLDVNGNPIGKTDKSQVTVTVPTKEEVLENTKHDDDEIVVIDD 67

Query: 54  ETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGE 112
           +  CG C+GA+   E CCN CEE+  AYRKK W +        QC    +LQ+ K     
Sbjct: 68  KKECGDCFGAKEKSE-CCNTCEELIAAYRKKNWDVDRIKAQAPQCAGFNYLQKWKNGVER 126

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR-----DSFNISHKINKLAFG 167
           GC + G L + KV G+    PG+        ++D+L+        +S N++H I+  + G
Sbjct: 127 GCRLEGKLSITKVQGHVFIIPGR--------INDLLSNSEIRQIANSLNVTHTIHHFSLG 178

Query: 168 EHFPGVVNPLDGVRWTQETP-SGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
           E  P   NP    R       + MYQYF+  +PT Y + SG  ++S Q
Sbjct: 179 EAIPEQKNPFVDHRGVMAVDHASMYQYFVNAIPTTYINKSGKELKSYQ 226


>gi|225712562|gb|ACO12127.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Lepeophtheirus salmonis]
          Length = 290

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 66/195 (33%), Positives = 100/195 (51%), Gaps = 20/195 (10%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G GC       +NKV GNFH +          H  D+   Q D +N SH+I++++FG  
Sbjct: 109 DGVGCLFEAHFHINKVPGNFHVS---------THSVDV---QPDEYNFSHEIHEVSFGSK 156

Query: 170 FP-------GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
                    G  N L G   ++      ++Y +K+VPT Y  + G  + + Q++      
Sbjct: 157 IKKISSKNIGTFNSLSGRDSSESGALDSHEYVMKIVPTTYESLGGAKLFAYQYTYAYRSY 216

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
            S     + +P ++F YDL+PI V + E      HFLT VCAIVGG FTV+GIID+ ++ 
Sbjct: 217 VSFGHGGRVVPALWFRYDLNPITVKYHETRPPIYHFLTTVCAIVGGTFTVAGIIDSTLFT 276

Query: 283 GQRAIKKKIEIGKFS 297
             + + KK E+GK S
Sbjct: 277 ATQ-LFKKFELGKLS 290


>gi|358374656|dbj|GAA91246.1| COPII-coated vesicle protein [Aspergillus kawachii IFO 4308]
          Length = 399

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 75/237 (31%), Positives = 109/237 (45%), Gaps = 48/237 (20%)

Query: 63  AESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           A  +D    +   EVR+  R+K   G  L   D +D C+                 IYG 
Sbjct: 156 AREADAHVHHVLGEVRKNPRRKFAKGPRLRRGDTVDSCR-----------------IYGS 198

Query: 120 LEVNKVAGNFHF-APGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           LE NKV G+FH  A G  +   G H+ H +       FN SH + +L+FG H+P ++NPL
Sbjct: 199 LEGNKVQGDFHITARGHGYRNFGEHLDHGV-------FNFSHMVTELSFGPHYPTLLNPL 251

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVY------------------TDVSGHTIQSNQFSVTE 219
           D    T ET    YQYF+ VVPT+Y                  T+ + + + +NQ++ T 
Sbjct: 252 DKTIATTETHYYKYQYFLSVVPTLYSKGASALDTYTNHPDLIATNRNRNLVFTNQYAATT 311

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             +   +     +PG+FF Y++ PI +  +EE  SFL  L  +   V GV    G I
Sbjct: 312 QAQELPENPY-FIPGIFFKYNIEPILLMISEERTSFLSLLIRLVNTVSGVMVTGGWI 367


>gi|321479391|gb|EFX90347.1| hypothetical protein DAPPUDRAFT_309719 [Daphnia pulex]
          Length = 369

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 98/183 (53%), Gaps = 9/183 (4%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDILAFQRDSFNISHKINKL 164
           +  +  + C ++G L++ KVAGNFH   GK        H H       + FN SH+I+K 
Sbjct: 162 VPSQPSDACRLHGTLQLTKVAGNFHITAGKVLPLPMRAHAHLSPMMDDERFNYSHRIDKF 221

Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYTDVSGHT-----IQSNQFSVT 218
           +FG H   ++ PL+G     +  + ++QYF+  VPT + + VS  +     +++ Q+SV 
Sbjct: 222 SFG-HSSTLIQPLEGDEVITDKGAMLFQYFVTAVPTEIESLVSASSGIHGSMKTWQYSVR 280

Query: 219 EHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
              R    Q     +PG++F YD++P++V    +    L F+  +CAIVGGV+T +GI+ 
Sbjct: 281 NQSRIIGHQKGSHGIPGIYFKYDVAPLRVRVVPDAPPLLRFVLRLCAIVGGVYTSAGIVH 340

Query: 278 AFI 280
             I
Sbjct: 341 KVI 343


>gi|154286632|ref|XP_001544111.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407752|gb|EDN03293.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 315

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/205 (31%), Positives = 99/205 (48%), Gaps = 32/205 (15%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E+ + C IYG LE NKV G+FH  A G  + + G H+        D+FN SH + +L+FG
Sbjct: 102 EKADSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------SHDAFNFSHMVTELSFG 155

Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
            H+P ++NPLD  +    TP+    +QY++ VVPT+YT                      
Sbjct: 156 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 213

Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            G TI +NQ++ T         +   +PG+FF Y++ PI +  +EE  S L  L  +  +
Sbjct: 214 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGSLLALLVRLVNV 272

Query: 266 VGGVFTVSGIIDAFIYHGQRAIKKK 290
           + GV    G +          +KK+
Sbjct: 273 LAGVVVAGGWLFQISTWAMENLKKR 297


>gi|344229081|gb|EGV60967.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
 gi|344229082|gb|EGV60968.1| hypothetical protein CANTEDRAFT_115996 [Candida tenuis ATCC 10573]
          Length = 352

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 98/198 (49%), Gaps = 19/198 (9%)

Query: 90  NPDLIDQCKREGFL-------QRIKEEEGE-GCNIYGFLEVNKVAGNFHFAPGKSFHQSG 141
            PDL D   RE          Q+I +  G   C+I+G + VN V G FH          G
Sbjct: 127 TPDL-DHVMRENIRAEFYISGQKINQVAGAPACHIFGTIPVNHVQGEFHIT------AKG 179

Query: 142 VHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTV 201
           V   D L    +  N SH I + +FG  +P + NPLD            Y+Y+  VVPT+
Sbjct: 180 VGYQDSLHTPWERMNFSHVIQEFSFGTFYPMIDNPLDMSGKITHESLQSYKYYSNVVPTL 239

Query: 202 YTDVSGHTIQSNQFSVTEH---FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           Y  + G  + +NQ+S++E     R    GR+ + PG+FF Y+  PIK+T  E+ + F+ F
Sbjct: 240 YERL-GIVVDTNQYSISEQHLVIRKDSNGRIYSPPGIFFKYEFEPIKLTIVEKRLPFIQF 298

Query: 259 LTNVCAIVGGVFTVSGII 276
           +  +  I+GG+  ++G +
Sbjct: 299 VARLGTILGGLLILAGYV 316


>gi|123472317|ref|XP_001319353.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121902134|gb|EAY07130.1| hypothetical protein TVAG_342940 [Trichomonas vaginalis G3]
          Length = 358

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/223 (30%), Positives = 111/223 (49%), Gaps = 16/223 (7%)

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           CGSCYGA S    CCN C++V+ A++KKG    +   I QC R+  +        E C++
Sbjct: 135 CGSCYGAASG---CCNTCKDVKNAFKKKGRVPPSLSTIRQC-RDAVIDY-NHIRNESCHV 189

Query: 117 YGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNP 176
           YG + V    G      G S+          L    D FN +HKIN +  GE+  G  +P
Sbjct: 190 YGTVIVPPTHGTIVMNSGDSYGAQMNTTTSSLGISIDDFNFTHKINDIYIGENDLG-DHP 248

Query: 177 LDGVRWTQETPSGMYQ--YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           L G++  Q+   G Y+  YFI+ +      +  +   S+      H+    +G     PG
Sbjct: 249 LKGIKKVQKE-VGRYKGLYFIRTLREQKGSLQVYRATSS------HYDRYREGTTGKFPG 301

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           ++F YD+SPI V +  +  + L+F+  + AI+GG++++  ++D
Sbjct: 302 LYFNYDVSPIIVMYKRD-TTVLNFVIELMAILGGIYSLGSLLD 343


>gi|440632946|gb|ELR02865.1| hypothetical protein GMDG_05797 [Geomyces destructans 20631-21]
          Length = 384

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 61/170 (35%), Positives = 91/170 (53%), Gaps = 14/170 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +G+ C I+G + +NKV G+FH  A G  + ++    H        SFN SH +++ +FG 
Sbjct: 187 DGDSCRIFGSMMLNKVQGDFHITARGHGYQEAFGTKH----LDHSSFNFSHIVSEFSFGA 242

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH------TIQSNQFSVTEHFR 222
            +P ++NPLD    T        QYF+ VVPT+YT  S +      TI +NQ++VT   R
Sbjct: 243 FYPKLINPLDQTITTTANQFYKSQYFMSVVPTIYTVSSPNPLSSKSTIFTNQYAVTHEDR 302

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
              +   +T+PG+FF YD+ P+ +T  E   SFL F   V  I+ GV   
Sbjct: 303 KINE---RTVPGIFFKYDIEPLMLTIEERRDSFLRFAIKVVNILSGVLVA 349


>gi|452988546|gb|EME88301.1| hypothetical protein MYCFIDRAFT_25415 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 380

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 85/169 (50%), Gaps = 11/169 (6%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
            + C IYG +  NKV G+FH  A G  + +   H+          FN SH+IN+L+FG  
Sbjct: 181 ADSCRIYGTMHGNKVQGDFHITARGHGYLEFAEHL------DHSKFNFSHRINELSFGPF 234

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY-TDVSGHTIQSNQFSVTEHFRSSEQGR 228
           +P + NPLD    T +     +QYF+ VVPTVY TD     +  N F  T  +  +EQ R
Sbjct: 235 YPSLENPLDNTFATTDINYYKFQYFLSVVPTVYTTDARALRLLDNNFVFTNQYAVTEQSR 294

Query: 229 LQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             +   +PG+F  +D+ PI +T  EE  SF      +  +V G+    G
Sbjct: 295 KVSENFVPGIFIKFDMEPIGLTIAEEWSSFPALFIRIVNVVSGLLVAGG 343


>gi|240275142|gb|EER38657.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Ajellomyces capsulatus H143]
 gi|325094499|gb|EGC47809.1| COPII-coated vesicle protein [Ajellomyces capsulatus H88]
          Length = 401

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 95/191 (49%), Gaps = 32/191 (16%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E+ + C IYG LE NKV G+FH  A G  + + G H+        D+FN SH + +L+FG
Sbjct: 188 EKADSCRIYGSLEGNKVQGDFHITARGHGYPEYGEHL------SHDAFNFSHMVTELSFG 241

Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
            H+P ++NPLD  +    TP+    +QY++ VVPT+YT                      
Sbjct: 242 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 299

Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            G TI +NQ++ T         +   +PG+FF Y++ PI +  +EE  S L  L  +  +
Sbjct: 300 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGSLLALLVRLVNV 358

Query: 266 VGGVFTVSGII 276
           + GV    G +
Sbjct: 359 LAGVVVAGGWL 369


>gi|351707253|gb|EHB10172.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Heterocephalus glaber]
          Length = 211

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/137 (42%), Positives = 78/137 (56%), Gaps = 10/137 (7%)

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           H H       DS+N SH+I+ L+FGE  PG++NPLDG        + M+QYFI VVPT  
Sbjct: 80  HAHLAALVNHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAIDHNQMFQYFITVVPT-- 137

Query: 203 TDVSGHTIQ----SNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
                HT +    ++QFSVTE  R  +       + G+F  YDLS + VT TEEH+ F  
Sbjct: 138 ---KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQ 194

Query: 258 FLTNVCAIVGGVFTVSG 274
           F   +C IVGG+F+ +G
Sbjct: 195 FFVRLCGIVGGIFSTTG 211


>gi|12006037|gb|AAG44724.1|AF267855_1 HT034 [Homo sapiens]
          Length = 199

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 58/143 (40%), Positives = 82/143 (57%), Gaps = 10/143 (6%)

Query: 161 INKLAFG-----EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           I+KL+FG     ++  G  N L G       P   + Y +K+VPTVY D SG    S Q+
Sbjct: 59  IHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQY 118

Query: 216 SVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           +V   E+   S  GR+  +P ++F YDLSPI V +TE       F+T +CAI+GG FTV+
Sbjct: 119 TVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVA 176

Query: 274 GIIDAFIYHGQRAIKKKIEIGKF 296
           GI+D+ I+    A  KKI++GK 
Sbjct: 177 GILDSCIFTASEAW-KKIQLGKM 198


>gi|389640739|ref|XP_003718002.1| hypothetical protein MGG_00949 [Magnaporthe oryzae 70-15]
 gi|351640555|gb|EHA48418.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae 70-15]
 gi|440464580|gb|ELQ33987.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae Y34]
 gi|440481695|gb|ELQ62250.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae P131]
          Length = 376

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/196 (35%), Positives = 104/196 (53%), Gaps = 23/196 (11%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           R+     + C I+G L++NKV G+FH  A G  + + G H+         +FN SH +N+
Sbjct: 176 RLWGATPDSCRIFGSLDLNKVQGDFHITARGHGYIEFGDHL------DHSAFNFSHIVNE 229

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------HTIQSNQFSV 217
            +FG+ +P +VNPLD    T E     +QYF+ VVPT+Y+  S        TI +NQ++V
Sbjct: 230 FSFGDFYPSLVNPLDKTVNTCEKNFHKFQYFLSVVPTLYSVKSSTGAFGYSTIFTNQYAV 289

Query: 218 TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV-------F 270
           TE  +SSE   +  +PG+FF YD+ PI +   E   + L FL  V  I+ G        F
Sbjct: 290 TE--QSSEISEMN-VPGIFFKYDIEPILLDIEESRDTILVFLIKVINILSGAMVAGHWGF 346

Query: 271 TVSGIIDAFIYHGQRA 286
           T+S  I   +   +RA
Sbjct: 347 TMSEWIKEVLGKRRRA 362


>gi|145235453|ref|XP_001390375.1| COPII-coated vesicle protein (Erv41) [Aspergillus niger CBS 513.88]
 gi|134058058|emb|CAK38286.1| unnamed protein product [Aspergillus niger]
 gi|350632895|gb|EHA21262.1| hypothetical protein ASPNIDRAFT_191708 [Aspergillus niger ATCC
           1015]
          Length = 399

 Score =  102 bits (253), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/237 (31%), Positives = 108/237 (45%), Gaps = 48/237 (20%)

Query: 63  AESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGF 119
           A  +D    +   EVR+  R+K   G  L   D +D C+                 IYG 
Sbjct: 156 AREADAHVHHVLGEVRKNPRRKFAKGPRLRRGDTVDSCR-----------------IYGS 198

Query: 120 LEVNKVAGNFHF-APGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPL 177
           LE NKV G+FH  A G  +   G H+ H +       FN SH + +L+FG H+P ++NPL
Sbjct: 199 LEGNKVQGDFHITARGHGYRNFGEHLDHGV-------FNFSHMVTELSFGPHYPTLLNPL 251

Query: 178 DGVRWTQETPSGMYQYFIKVVPTVY------------------TDVSGHTIQSNQFSVTE 219
           D    T ET    YQYF+ VVPT+Y                  T+ + + + +NQ++ T 
Sbjct: 252 DKTIATTETHYYKYQYFLSVVPTLYSKGASALDTYTNHPDLIATNRNRNLVFTNQYAATT 311

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
                 +     +PG+FF Y++ PI +  +EE  SFL  L  +   V GV    G +
Sbjct: 312 QATELPENPY-FIPGIFFKYNIEPILLMISEERTSFLSLLIRLVNTVSGVMVTGGWV 367


>gi|322710423|gb|EFZ01998.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium
           anisopliae ARSEF 23]
          Length = 372

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/184 (32%), Positives = 97/184 (52%), Gaps = 13/184 (7%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++   +R     +    +G  + C IYG L++NKV G+FH  A G  +   G H+    
Sbjct: 163 DIVALGQRRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL---- 218

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
               + FN SH I++L+FG ++P +VNPLD      E     +QY++ VVPT Y+ V   
Sbjct: 219 --DHEQFNFSHIISELSFGSYYPSLVNPLDRTLNIAENHFHKFQYYVSVVPTRYS-VGSS 275

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           +I +NQ++VTE  +   +     +PGVF  YD+ PI ++  E+    L F+  +  ++ G
Sbjct: 276 SIFTNQYAVTEQSKGVSE---YNVPGVFVKYDIEPILLSVNEDRDGILMFVVKLINVLSG 332

Query: 269 VFTV 272
           V   
Sbjct: 333 VLVA 336


>gi|261193579|ref|XP_002623195.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
 gi|239588800|gb|EEQ71443.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
 gi|239613876|gb|EEQ90863.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ER-3]
 gi|327349942|gb|EGE78799.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 401

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 97/204 (47%), Gaps = 30/204 (14%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E  + C IYG L  NKV G+FH  A G  + + G H+        DSFN SH I +L+FG
Sbjct: 188 ENADSCRIYGSLVGNKVQGDFHITARGHGYFEFGEHL------SHDSFNFSHMITELSFG 241

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS--------------------G 207
            H+  ++NPLD    T       YQY++ +VPT+YT                       G
Sbjct: 242 PHYSTLLNPLDKTISTTPAHFHKYQYYMSIVPTIYTRAGVVDPYSQALPDPSTITPSQRG 301

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
           +TI +NQ++VT   RS E    +  +PG+FF Y + PI +  +EE  S L  L  +  ++
Sbjct: 302 NTIFTNQYAVTS--RSHELPDAEYDVPGIFFKYTIEPILLVVSEERGSLLALLVRLVNVL 359

Query: 267 GGVFTVSGIIDAFIYHGQRAIKKK 290
            GV    G +          +KK+
Sbjct: 360 AGVVVAGGWLFQIFTWAMDNLKKR 383


>gi|426372082|ref|XP_004052960.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Gorilla gorilla
           gorilla]
          Length = 354

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 58/147 (39%), Positives = 84/147 (57%), Gaps = 6/147 (4%)

Query: 133 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 192
           P ++      H H       +S+N SH+I+ L+FGE  P ++NPLDG        + M+Q
Sbjct: 166 PPRAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVPAIINPLDGTEKIAIDHNQMFQ 225

Query: 193 YFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFT 249
           YFI VVPT ++T  +S  T   +QFSVTE  R  +       + G+F  YDLS + VT T
Sbjct: 226 YFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVT 282

Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
           EEH+ F  F   +C IVGG+F+ +G++
Sbjct: 283 EEHMPFWQFFVRLCGIVGGIFSTTGML 309


>gi|344301277|gb|EGW31589.1| hypothetical protein SPAPADRAFT_62204 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 353

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 90/174 (51%), Gaps = 12/174 (6%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           QR+ E     C+I+G + +N+V G+F           G    D++A   D  N SH I +
Sbjct: 146 QRVNEN-APACHIFGSIPINQVKGDFRIT------AKGYGYRDVIAAPIDKLNFSHVIQE 198

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF-- 221
            ++GE +P + NPLD      E     Y Y  KVVPT Y  + G  +++NQ+SVTE+   
Sbjct: 199 FSYGEFYPFINNPLDATGKVTEEKFQKYMYSAKVVPTSYEKL-GLIVETNQYSVTENHQV 257

Query: 222 --RSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
             ++S+ G    +PG++  YD  PIK+   E+ + F+ F+  +  I GG+   +
Sbjct: 258 LQKNSQTGVPIGVPGIYIKYDFEPIKMVIKEKRMPFMQFVAKLATIAGGILITA 311


>gi|339233696|ref|XP_003381965.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316979152|gb|EFV61980.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 331

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 57/172 (33%), Positives = 89/172 (51%), Gaps = 4/172 (2%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF-QRDSFNISHKINKLAFGEHF 170
           + C I+G+  +NK+ G       ++     V    I A  Q + FN SH+I K  FG   
Sbjct: 144 DACRIHGYFLMNKLRGKLRIKFKETVRLEAVSNFIIFARRQNEGFNFSHRIEKFGFGPRI 203

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSEQGR 228
            G++NPLDG +        M+ Y+I+VVPT  TD++G    ++Q+SVT   R    +QG 
Sbjct: 204 AGIINPLDGFQKESFDRRDMFYYYIQVVPTKITDLNGMETFTSQYSVTHKRRIIDHDQGS 263

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
             +  G+F ++D +P+ V   +   S   F   +CAIVGG+F  +  I A +
Sbjct: 264 HGSC-GIFIYFDFAPMMVLIRKSKTSLFVFALRICAIVGGIFACTDFIIALM 314


>gi|170587366|ref|XP_001898447.1| HT034 [Brugia malayi]
 gi|158594071|gb|EDP32661.1| HT034, putative [Brugia malayi]
          Length = 286

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 100/190 (52%), Gaps = 18/190 (9%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP- 171
           GC   G  +++KV GNFH +          H  D    Q +++++ H I+ + FG+    
Sbjct: 110 GCRFEGKFDISKVPGNFHIS---------THAADT---QPETYDMRHTIHSVVFGDDVST 157

Query: 172 ----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
               G  NPL      +   S  + Y +K+VP+VY D++G+   S Q++       +   
Sbjct: 158 SQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTYHY 217

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P ++F Y+L PI + +TE    F  F+T++CA+VGG FTV+GIIDA ++     +
Sbjct: 218 SGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLF-SLTEL 276

Query: 288 KKKIEIGKFS 297
            +K ++GK S
Sbjct: 277 YRKHQMGKLS 286


>gi|255944653|ref|XP_002563094.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211587829|emb|CAP85889.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 396

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 26/185 (14%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
            + C IYG LE NKV G+FH  A G  + ++  H+         SF+ SH I +L+FG H
Sbjct: 189 ADACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHSSFDFSHMITELSFGPH 242

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG------------------HTIQ 211
           +P + NPLD      E     +QYF+ VVPT+Y+   G                   T+ 
Sbjct: 243 YPTLQNPLDKTIAETEEHYYKFQYFLSVVPTLYSRGKGALDAYTRSPDAAASRYGRDTVF 302

Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           +NQ++ T    +  +  +  +PG+FF Y++ PI +  +EE  SFL  L  V   + GV  
Sbjct: 303 TNQYAATSQSSAIPESPM-VVPGIFFKYNIEPILLLVSEERASFLSLLVRVINTISGVLV 361

Query: 272 VSGII 276
             G +
Sbjct: 362 TGGWL 366


>gi|154343635|ref|XP_001567763.1| hypothetical protein, unknown function [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065095|emb|CAM43209.1| hypothetical protein, unknown function [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 309

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 98/193 (50%), Gaps = 19/193 (9%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
              EGC + G+++V KV GNFH +     H    H         +  N  H I+ L+FG 
Sbjct: 128 SAAEGCRLEGYIKVGKVPGNFHISSHGRQHLLMTHF-------PNGTNAEHSIHHLSFGT 180

Query: 169 ------HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR 222
                      ++PLDG     E P  +YQYF+ +VPT+Y + S  T  + QF+ T    
Sbjct: 181 LDVKKLDKKAQLHPLDGKEHRSEVPK-IYQYFLDIVPTIY-ESSFSTAHTYQFTGTSSSS 238

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
                ++     V F Y +SPI V ++   VS  HFLT VCAI+GGV+TV+G++  F++ 
Sbjct: 239 PVPSSQMA---AVVFQYQMSPITVRYSSARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHS 295

Query: 283 GQRAIKKKIEIGK 295
                +++I +GK
Sbjct: 296 SAAQFQRRI-LGK 307


>gi|300121843|emb|CBK22417.2| unnamed protein product [Blastocystis hominis]
          Length = 251

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 70/225 (31%), Positives = 107/225 (47%), Gaps = 23/225 (10%)

Query: 60  CYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQ----CK--REGFLQRIKEEEGEG 113
           CYGA  ++  CCN C  + EAY  +GW+   P  + Q    C+  R   L         G
Sbjct: 35  CYGA-GAEGQCCNTCSAIVEAYNSRGWS---PHFVLQFSPLCRNSRPSVLSF-----KSG 85

Query: 114 CNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           C I+G ++V++VAG+ H           G  V+D     +     SH I   +FG+H PG
Sbjct: 86  CMIWGAIDVHQVAGDIHIQTTTGMIDILGAPVYDAEIISK--LKSSHFIEHFSFGKHIPG 143

Query: 173 VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSSEQGRL 229
           V NPL+G R+     +  + Y I+++P +Y +  G  I+SN+ SV E          G  
Sbjct: 144 VENPLNGRRFLANQLTS-HAYQIEILPAIY-ERGGVEIRSNEISVYETDKVVTVEPSGTA 201

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              PG+FF Y +SP +    E+   F   +  +C ++GG+  V G
Sbjct: 202 DVEPGLFFKYRISPFEHVIREDRKEFWSLVVRLCGVMGGMMAVGG 246


>gi|268577857|ref|XP_002643911.1| Hypothetical protein CBG02175 [Caenorhabditis briggsae]
          Length = 282

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 63/190 (33%), Positives = 102/190 (53%), Gaps = 19/190 (10%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---- 168
           GC      E+NKV GNFH     S H +          Q D +++ H I+ + FG+    
Sbjct: 107 GCRFESRFEINKVPGNFHL----STHSATT--------QPDGYDMRHIIHSIKFGDDVSH 154

Query: 169 -HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +  G  +PL   R  +E+    ++Y +K+VP+V+ D SG+ + S Q++       +   
Sbjct: 155 KNLKGSFDPLAN-REAKESGLNTHEYILKIVPSVHEDYSGNILNSYQYTYGHKSYVTYHH 213

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             + +P V+F Y+L PI +  TE   SF  FLT++CA+VGG FTV+GIID+  +     +
Sbjct: 214 SGKIIPAVWFKYELQPITLKQTEHRQSFYIFLTSICAVVGGTFTVAGIIDSTFFTISEMV 273

Query: 288 KKKIEIGKFS 297
           KK+ ++GK +
Sbjct: 274 KKQ-QMGKLT 282


>gi|367038975|ref|XP_003649868.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
 gi|346997129|gb|AEO63532.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
          Length = 380

 Score =  100 bits (250), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 89/174 (51%), Gaps = 15/174 (8%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           R+   E + C IYG LE+NKV G+FH  A G  +   G H+        ++FN SH I++
Sbjct: 181 RLWGAEPDSCRIYGSLELNKVQGDFHITARGHGYMAFGDHL------DHNAFNFSHIISE 234

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTIQSNQFSVT 218
           L+FG   P + NPLD            +QYF+ VVPT Y+      +   +I +NQ++VT
Sbjct: 235 LSFGPFLPSLANPLDRTVNIATAHFHKFQYFLSVVPTTYSVGRPGALGARSIFTNQYAVT 294

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           E    S++    T+PG+F  YD+ PI +   E    F  FL  V  +V GV   
Sbjct: 295 EQ---SQEVPDTTIPGIFVKYDIEPILLNIVETRDGFFVFLLRVINVVSGVLVA 345


>gi|340514865|gb|EGR45124.1| predicted protein [Trichoderma reesei QM6a]
          Length = 372

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 58/184 (31%), Positives = 95/184 (51%), Gaps = 13/184 (7%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++   +++    +  +  G  + C +YG L++NKV G+FH  A G  +   G H+    
Sbjct: 163 DIVSLSRKKAKWAKTPKPRGRTDSCRMYGSLDLNKVQGDFHITARGHGYSGIGGHL---- 218

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
               D FN SH I++L++G  +P ++NPLD    T       +QY++ VVPTVY   S  
Sbjct: 219 --DHDKFNFSHIISELSYGPFYPSLINPLDRTVNTAIVHFHKFQYYLSVVPTVYI-ASHR 275

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
            + +NQ++VTE  ++        +PG+FF YD+ PI ++  E    F  FL  +  +  G
Sbjct: 276 IVNTNQYAVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEETRDGFFAFLLKLVNVFSG 332

Query: 269 VFTV 272
           V   
Sbjct: 333 VMVA 336


>gi|322697212|gb|EFY88994.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium acridum
           CQMa 102]
          Length = 372

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 96/184 (52%), Gaps = 13/184 (7%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++   +R     +    +G  + C IYG L++NKV G+FH  A G  +   G H+    
Sbjct: 163 DIVALGQRRAKWAKTPRVKGPPDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHL---- 218

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
                 FN SH I++L+FG ++P +VNPLD      E     +QY++ VVPT Y+ V   
Sbjct: 219 --DHSQFNFSHIISELSFGSYYPSLVNPLDRTINIAENHFHKFQYYVSVVPTRYS-VGSS 275

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           +I +NQ++VTE  +   +     +PG+F  YD+ PI ++  E+    L F+  +  ++ G
Sbjct: 276 SIFTNQYAVTEQSKGVSE---YNVPGIFVKYDIEPILLSVNEDRDGILMFVVKLINVLSG 332

Query: 269 VFTV 272
           V   
Sbjct: 333 VLVA 336


>gi|225558748|gb|EEH07032.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 401

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 94/191 (49%), Gaps = 32/191 (16%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E+ + C IYG LE NKV G+FH  A G  + + G H+        D+FN SH + +L+FG
Sbjct: 188 EKADSCRIYGSLEGNKVQGDFHITARGHGYPEFGEHL------SHDAFNFSHMVTELSFG 241

Query: 168 EHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVS------------------- 206
            H+P ++NPLD  +    TP+    +QY++ VVPT+YT                      
Sbjct: 242 PHYPSLLNPLD--KTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNHVLPDPTTIRPSE 299

Query: 207 -GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            G TI +NQ++ T         +   +PG+FF Y++ PI +  +EE    L  L  +  +
Sbjct: 300 RGSTIFTNQYAATSQSHEVPDPQYH-IPGIFFKYNIEPILLVVSEERGGLLALLVRLVNV 358

Query: 266 VGGVFTVSGII 276
           + GV    G +
Sbjct: 359 LAGVVVAGGWL 369


>gi|425765498|gb|EKV04175.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
           digitatum PHI26]
 gi|425783511|gb|EKV21358.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
           digitatum Pd1]
          Length = 396

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 90/184 (48%), Gaps = 26/184 (14%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  + ++  H+         +FN SH I +L+FG H+
Sbjct: 190 DACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHSAFNFSHMITELSFGPHY 243

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY---------------TDVSGH---TIQS 212
           P + NPLD      E     +QYF+ +VPT+Y               T  + H   T+ +
Sbjct: 244 PTLQNPLDKTIAETEEHYYKFQYFLSIVPTLYSRGKSALDLYTRSPETLAARHGRNTVFT 303

Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
           NQ++ T    +  +  +  +PG+FF YD+ PI +  +EE   FL  L  V   V GV   
Sbjct: 304 NQYAATSQSSAIPESPM-VVPGIFFKYDIEPILLLVSEERAGFLSLLIRVINTVSGVLVT 362

Query: 273 SGII 276
            G +
Sbjct: 363 GGWL 366


>gi|366998832|ref|XP_003684152.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
 gi|357522448|emb|CCE61718.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
          Length = 349

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/172 (31%), Positives = 90/172 (52%), Gaps = 12/172 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC++YG + VN+VAG             G    D     +D  + +H +N+ +FG+ 
Sbjct: 155 ELTGCHVYGSVTVNRVAGEMQIT------AKGYGYRDRKRAPKDLIDFNHVVNEFSFGDF 208

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF----RSS 224
           +P + NPLDG  +    +P   Y YF+ VVPT Y  + G  I +NQ+S+ E+      S+
Sbjct: 209 YPYIENPLDGTCKMYPNSPFSSYNYFMSVVPTFYQKL-GAEIDTNQYSIREYHVDLKNSN 267

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              +L T+PG+F  YD  P+ +  ++  ++FL F+  + AI+  V  ++  I
Sbjct: 268 VNAKLSTIPGIFLKYDFEPLAIIISDVRLTFLQFIVRLVAILSFVLYIASWI 319


>gi|426200953|gb|EKV50876.1| hypothetical protein AGABI2DRAFT_113626 [Agaricus bisporus var.
           bisporus H97]
          Length = 542

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 8/165 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG + V +V  N H       + S  HV        +  N+SH I + +FG +
Sbjct: 173 DGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV------DHNQMNLSHVITEFSFGPY 226

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +V PLD      +     YQYF+ VVPT Y       +++NQ+SVT + R  E  + 
Sbjct: 227 FPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSPLRTNQYSVTHYTRQVEHNK- 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              PG+FF +DL P+ +T  ++  + +  L     ++GGVF   G
Sbjct: 286 -GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFVCMG 329


>gi|409083992|gb|EKM84349.1| hypothetical protein AGABI1DRAFT_32491 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 542

 Score = 99.8 bits (247), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 8/165 (4%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G  C IYG + V +V  N H       + S  HV        +  N+SH I + +FG +
Sbjct: 173 DGGACRIYGTMPVKRVTANLHITTVGHGYSSYQHV------DHNQMNLSHVITEFSFGPY 226

Query: 170 FPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           FP +V PLD      +     YQYF+ VVPT Y       +++NQ+SVT + R  E  + 
Sbjct: 227 FPEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSPLRTNQYSVTHYTRQVEHNK- 285

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              PG+FF +DL P+ +T  ++  + +  L     ++GGVF   G
Sbjct: 286 -GTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFVCMG 329


>gi|340914937|gb|EGS18278.1| hypothetical protein CTHT_0063020 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 388

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 59/169 (34%), Positives = 86/169 (50%), Gaps = 13/169 (7%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           + + C IYG LE+NKV G+FH  A G  + + G   H        +FN SH I++L+FG 
Sbjct: 192 QADSCRIYGSLELNKVQGDFHITARGHGYLEGGNAQH----LDHSAFNFSHIISELSFGP 247

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-----DVSGHTIQSNQFSVTEHFRS 223
             P + NPLD            +QYF+ +VPT Y+     ++   +I +NQ++VTE    
Sbjct: 248 FLPSLSNPLDRTVNLASHHFHRFQYFLSIVPTTYSVGRPGEMGSQSIFTNQYAVTEQSHP 307

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
             +   + +PG+FF YD+ PI +   E   S   FL  V  IV GV   
Sbjct: 308 VSE---RNIPGIFFKYDIEPILLNIVETRDSVFKFLVKVVNIVSGVLVA 353


>gi|392564830|gb|EIW58008.1| DUF1692-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 539

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 55/180 (30%), Positives = 83/180 (46%), Gaps = 8/180 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C ++G +   +V  N H       + S  HV   L       N+SH I + +FG
Sbjct: 175 QPDGSACRVFGTITAKRVTANLHITTLGHGYASQTHVDHKL------MNLSHVITEFSFG 228

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +FP +  PLD        P   YQY++ VVPT Y       + +NQ+SVT + R  +  
Sbjct: 229 PYFPDITQPLDNSFELTSEPFVAYQYYLHVVPTTYIAPRTKPLNTNQYSVTHYTRVLDHH 288

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
           R    PG+FF +DL P+K+T  +   SF+        ++GGVF   G       H   A+
Sbjct: 289 R--GTPGIFFKFDLEPMKLTIHQRTTSFVQLFIRTVGVIGGVFVCMGYAVKITGHAVDAV 346


>gi|414586932|tpg|DAA37503.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 63

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 46/54 (85%), Positives = 53/54 (98%)

Query: 244 IKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           ++VTFTE+HVSFLHFLTNVCAIVGGVFTVSGIID+F+YH QRAIKKK+EIGKF+
Sbjct: 10  LQVTFTEQHVSFLHFLTNVCAIVGGVFTVSGIIDSFVYHSQRAIKKKMEIGKFN 63


>gi|395326723|gb|EJF59129.1| hypothetical protein DICSQDRAFT_156384 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 559

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 80/168 (47%), Gaps = 10/168 (5%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
           + +G  C IYG +   +V  N H       + S  HV H  +       N+SH I + +F
Sbjct: 178 QPDGSACRIYGTITAKRVTANLHVTTLGHGYASHEHVDHKFM-------NLSHVITEFSF 230

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G +FP +  PLD        P   YQYF+ VVPT Y       + +NQ+SVT + R  + 
Sbjct: 231 GPYFPDITQPLDNSFEMAHDPFVAYQYFLHVVPTTYIAPRSKPLHTNQYSVTHYTRVLDH 290

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            R    PG+FF +DL PI +T  +   S   FL     +VGGVF   G
Sbjct: 291 HR--GTPGIFFKFDLEPIHMTIHQRTTSLAAFLLRCAGVVGGVFVCMG 336


>gi|308198100|ref|XP_001386838.2| predicted protein [Scheffersomyces stipitis CBS 6054]
 gi|149388859|gb|EAZ62815.2| putative ER to golgi transport [Scheffersomyces stipitis CBS 6054]
          Length = 352

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/160 (35%), Positives = 82/160 (51%), Gaps = 9/160 (5%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E    C+I+G + V+ V G+FH       +    HV        ++ N SH I + +FG+
Sbjct: 150 EGAPACHIFGSIPVSHVKGDFHITAKGLGYSDRSHV------PLEALNFSHVIQEFSFGD 203

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE--HFRSSEQ 226
            +P + NPLD      E P   Y YF KVVPT+Y  + G  + +NQ+S+TE  H    E 
Sbjct: 204 FYPFINNPLDASGKLTEEPLISYSYFAKVVPTLYQRL-GLVVDTNQYSLTENNHVFKLEH 262

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
            R   +PG+FF YD  PIK+   E  + F+ F+  +  IV
Sbjct: 263 KRPTGIPGIFFKYDFEPIKLIIIERRLPFIQFVARLATIV 302


>gi|440794754|gb|ELR15909.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 306

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/191 (32%), Positives = 94/191 (49%), Gaps = 16/191 (8%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGK----SFHQSGVHVHDIL-----AFQRDS--FNISH 159
            + C + G + V K+ G F  +  +    S + S ++ H            DS  FN++H
Sbjct: 118 ADRCLLTGHMAVRKIRGQFQISSRRFNPFSIYGSSLNKHTPTEDHPHPHPEDSLPFNVTH 177

Query: 160 KINKLAFGEHFPGVVNPLDGVRWT-QETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           +I +L+FG      V PLDG+  T +E     Y YF+++VP  Y    G  ++S  F+ T
Sbjct: 178 RIRELSFGPKVLPDVGPLDGIVQTMREGERSQYSYFLQIVPASYHYADGRVVESYSFAFT 237

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
            H     + R +  PGVF+ YD SP   +  E   SF HF+T  CA++GG F V G++ A
Sbjct: 238 MH----TESRSELAPGVFWKYDFSPYATSLREVPKSFSHFITRCCAVIGGTFVVFGLLSA 293

Query: 279 FIYHGQRAIKK 289
                + A KK
Sbjct: 294 LASRLETAAKK 304


>gi|258573091|ref|XP_002540727.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900993|gb|EEP75394.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 398

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 105/236 (44%), Gaps = 47/236 (19%)

Query: 64  ESSDEDCCNNCEEVREAYRKK---GWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +  D+   +   EVR ++++K   G  L + D +D C+                 IYG L
Sbjct: 157 QEEDQHVGHVLGEVRRSWKRKFPKGPKLKSKDAMDSCR-----------------IYGSL 199

Query: 121 EVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV 180
           E NKV GNFH          G+   D   F  +  N +H I +L+FG  +  ++NPLD  
Sbjct: 200 EGNKVQGNFHIT------ARGLGYWDPSGFHLEGLNFTHLITELSFGPRYSTLLNPLDKT 253

Query: 181 RWTQETPSGMYQYFIKVVPTVYTDVSG--------------------HTIQSNQFSVTEH 220
               +     YQY++ VVPT+YT                        +TI +NQ++VT  
Sbjct: 254 VAGTKDAFYKYQYYLSVVPTIYTRAGTVDPYNQELPDPSTITSRQRKNTIFTNQYAVTSQ 313

Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
             +  Q  ++ +PG+FF +D+ PI +  +EE  S L  L  +  +V GV    G +
Sbjct: 314 SHAIPQ-NVRAVPGIFFKFDIEPILLVVSEERGSLLALLVRLVNVVSGVLVAGGWV 368


>gi|296821254|ref|XP_002850059.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
 gi|238837613|gb|EEQ27275.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
          Length = 399

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 78/274 (28%), Positives = 119/274 (43%), Gaps = 52/274 (18%)

Query: 44  QRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRKK---GWALSNPDLIDQCK 98
           QR GG  E+        +  E  +ED    +   EVR   +KK      L   D +D C+
Sbjct: 135 QRGGGSPEYQTLSKEDPFRLEEQEEDLHVEHVLGEVRRGRKKKFPKAPKLKKSDAVDSCR 194

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
                            ++G LE NKV GN H  A G  + + G   +        S N 
Sbjct: 195 -----------------VFGSLEGNKVQGNLHITARGFGYLEWGQPTNP------HSLNF 231

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------- 208
           +H I +L+FG H+  ++NPLD    T       YQY + VVPT+YT  SGH         
Sbjct: 232 THLITELSFGPHYARLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-SGHIDPNHRSLP 290

Query: 209 ------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
                       T+ +NQ++VT  +    Q R++++PG+FF Y++ PI +  ++E  S L
Sbjct: 291 DPSSITAKDSKTTVSTNQYAVTS-YSQPVQPRIESIPGIFFKYNIEPILLIVSQERDSLL 349

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
             L  +  +V GV    G +         A++K+
Sbjct: 350 ALLVRLVNVVSGVLVTGGWLFQIGSWAVEAMRKR 383


>gi|403216157|emb|CCK70655.1| hypothetical protein KNAG_0E04020 [Kazachstania naganishii CBS
           8797]
          Length = 351

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 55/171 (32%), Positives = 97/171 (56%), Gaps = 12/171 (7%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V G F           G+   D+ A  ++  N +H IN+ +FG+ 
Sbjct: 154 EFNGCHIFGSIPVNRVRGEFQIT------AKGLGYRDMNAAPKEKINFAHVINEWSFGDF 207

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-FRSSEQG 227
           +P + NPLD   ++ ++ P   + Y++ VVPT+Y  + G  + +NQ+SV+E+ F S+++ 
Sbjct: 208 YPYIDNPLDATAKFDKDDPLTAFVYYLSVVPTIYQKL-GAEVDTNQYSVSEYRFNSTDKT 266

Query: 228 RLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGI 275
              T  +PG+FF Y+   + +  T+  +SFL F+  + AI+   V+  S I
Sbjct: 267 FRDTGYVPGIFFRYNFESLSIVMTDRRLSFLQFIVRLVAIMSFAVYIASWI 317


>gi|67901384|ref|XP_680948.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
 gi|40742675|gb|EAA61865.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
 gi|259484020|tpe|CBF79887.1| TPA: COPII-coated vesicle protein (Erv41), putative
           (AFU_orthologue; AFUA_2G01530) [Aspergillus nidulans
           FGSC A4]
          Length = 394

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 99/207 (47%), Gaps = 30/207 (14%)

Query: 93  LIDQCKREG---FLQRIKEEEGE---GCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVH 145
           ++++ +R G   F +  K   G+    C IYG LE NKV G+FH  A G  +     H+ 
Sbjct: 164 VLNELRRNGKRKFAKGPKLRRGDVVDSCRIYGSLEGNKVQGDFHITARGHGYRDGREHL- 222

Query: 146 DILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-- 203
                   +FN SH I +L+FG H+P + NPLD    T E     YQYF+ +VPT+Y+  
Sbjct: 223 -----DHSAFNFSHIITELSFGPHYPSLHNPLDKTIATTEFHYYKYQYFLSIVPTIYSRN 277

Query: 204 --------------DVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
                           + + I +NQ++ T    +  +     +PG+FF Y++ PI +  +
Sbjct: 278 QNLRLDALPSSSSARSNKNLIFTNQYAATSQSDAIPESPY-VIPGIFFKYNIEPIMLLIS 336

Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
           EE   FL+ L  +   V GV    G +
Sbjct: 337 EERTGFLNLLIRIVNTVSGVLVTGGWV 363


>gi|115433364|ref|XP_001216819.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114189671|gb|EAU31371.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 449

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 98/207 (47%), Gaps = 31/207 (14%)

Query: 94  IDQCKREGFLQRIKEEEGEG---CNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILA 149
           + +  R+ F +  K   G+    C IYG LE NKV G+FH  A G  +     H+     
Sbjct: 218 VRRNPRKKFPKSPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDFAPHL----- 272

Query: 150 FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT------ 203
               +FN SH I +L+FG H+P ++NPLD      ET    +QYF+ VVPT+Y+      
Sbjct: 273 -DHQTFNFSHMITELSFGPHYPTLLNPLDKTIAETETHYYKFQYFLSVVPTIYSKGNRVL 331

Query: 204 -----------DVSGHT---IQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFT 249
                      D S H    + +NQ++ T    +  +     +PG+FF Y++ PI +  +
Sbjct: 332 DTYSIAPPTLHDNSRHNKNLVFTNQYAATSQSDALPESPF-FVPGIFFKYNIEPILLLIS 390

Query: 250 EEHVSFLHFLTNVCAIVGGVFTVSGII 276
           EE  SFL  L  +   V GV    G +
Sbjct: 391 EERGSFLSLLIRLVNTVSGVMVTGGWL 417


>gi|213408569|ref|XP_002175055.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
           yFS275]
 gi|212003102|gb|EEB08762.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
           yFS275]
          Length = 331

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 102/189 (53%), Gaps = 12/189 (6%)

Query: 94  IDQCKREGFLQRIKE--EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAF 150
           + + +R+ F ++ K   + G  C  YG + V++  G  H  APG  +  S + +      
Sbjct: 133 LRRTRRKKFNKKSKTLPDGGSACRFYGAVTVHRTQGLLHITAPGWGYGMSNIPL------ 186

Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
             ++ N +H I++L+FG+++P +VN LDG     +  +  +QY+  ++PT YT  +   +
Sbjct: 187 --NALNFTHAIDELSFGDYYPSLVNALDGSYGFTDEHAFAFQYYTSIIPTTYTS-TFRNV 243

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           Q+NQ++VTE+    + G     PG+F  YD+ P+ +   E + S  + +  + AI GG+ 
Sbjct: 244 QTNQYAVTENSVRRQTGFRSDPPGIFISYDIEPLGIHIRETYPSLGNTILRILAISGGLV 303

Query: 271 TVSGIIDAF 279
           TV+  ++ F
Sbjct: 304 TVTTWVERF 312


>gi|225685292|gb|EEH23576.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 386

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 64/190 (33%), Positives = 94/190 (49%), Gaps = 30/190 (15%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E  + C IYG LE NKV G+FH  A G  + + G H+         +FN SH I +L+FG
Sbjct: 173 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHL------DHHAFNFSHMITELSFG 226

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
            H+  ++NPLD    T       YQY++ +VPT+YT                        
Sbjct: 227 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTIDPYSQVLPDPSTISPSQRK 286

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
           +TI +NQ++VT   RS E   +Q  +PG+FF Y++ PI +  +EE  S L  L  +  ++
Sbjct: 287 NTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLALLVRLVNVM 344

Query: 267 GGVFTVSGII 276
            GV    G +
Sbjct: 345 SGVVVAGGWL 354


>gi|396485364|ref|XP_003842153.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
 gi|312218729|emb|CBX98674.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
          Length = 486

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 89/188 (47%), Gaps = 34/188 (18%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E + C I+G +E NKV G+FH  A G  + + GVH+         +FN SH I +L+FG 
Sbjct: 267 ETDSCRIFGSIEGNKVQGDFHITARGHGYIEYGVHL------DHKTFNFSHIIRELSFGP 320

Query: 169 HFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD--------------------- 204
           ++P + NPLD       TP      +QYF+ +VPT+YTD                     
Sbjct: 321 YYPSLTNPLDNTIAITPTPDDHFYKFQYFLSIVPTIYTDDPSLIPYLDILNRYGKNPDLF 380

Query: 205 VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
            S H +++NQ++VT       +     +PGVF  +D+ PI +   EE   F   L  +  
Sbjct: 381 NSAHAVKTNQYAVTSQSHPVSE---YYVPGVFVKFDIEPIMLNVVEEWGGFWRLLVRLVN 437

Query: 265 IVGGVFTV 272
           ++ GV   
Sbjct: 438 VISGVMVA 445


>gi|406607484|emb|CCH41148.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Wickerhamomyces ciferrii]
          Length = 359

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 62/196 (31%), Positives = 99/196 (50%), Gaps = 23/196 (11%)

Query: 89  SNPDLIDQCKREGFLQRI------KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGV 142
           + PDL D+   +G +         K+     C+IYG + VNKV+G+FH       ++   
Sbjct: 140 NTPDL-DEVMAQGIIAEFRDRGDAKDSGAPACHIYGSIPVNKVSGDFHITAQGYGYRGNS 198

Query: 143 HVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
             H  +    D  N +H I++ +FGE +P + NPLD      +     YQY++ VVPTVY
Sbjct: 199 RSHVGI----DGLNFTHIISEFSFGEFYPYIHNPLDATVQITKEHLQSYQYYLSVVPTVY 254

Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGRLQT-----LPGVFFFYDLSPIKVTFTEEHVSFLH 257
             + G  I++NQ+S      +S Q +L +     +PG+FF YD  PI +   ++ + F  
Sbjct: 255 KKL-GVEIETNQYS------TSLQKKLYSFENKGVPGLFFKYDFEPISLIVEDKRIPFST 307

Query: 258 FLTNVCAIVGGVFTVS 273
           FL  +  I GG+  V+
Sbjct: 308 FLVRLATIYGGIIVVA 323


>gi|449542382|gb|EMD33361.1| hypothetical protein CERSUDRAFT_117979 [Ceriporiopsis subvermispora
           B]
          Length = 530

 Score = 97.4 bits (241), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 78/180 (43%), Gaps = 6/180 (3%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C ++G +   KV  N H       H    H H          N+SH I + +FG
Sbjct: 174 QPDGSACRVFGSITAKKVTANLHIT--TLGHGYATHSH----VDHSKMNLSHVITEFSFG 227

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            HFP +  PLD        P   YQYF+ VVPT Y       + ++Q+SVT + R  +  
Sbjct: 228 PHFPDITQPLDNSFEVAHDPFVAYQYFLHVVPTTYIAPRSSPLHTHQYSVTHYTRILDPS 287

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
             +  PG+FF +DL P+ +   +   S +        ++GGVF   G       H   A+
Sbjct: 288 HHRHTPGIFFKFDLDPLAIKIEQRTTSLVQLAIRCVGVIGGVFVCMGYAVKITTHAVDAV 347


>gi|392594239|gb|EIW83563.1| DUF1692-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 506

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 55/183 (30%), Positives = 90/183 (49%), Gaps = 16/183 (8%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C IYG L V KV  N H       + S +HV           N+SH I + +FG
Sbjct: 169 QPDGSACRIYGTLAVKKVTANLHVTTLGHGYTSHMHV------DHTKMNLSHVITEFSFG 222

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
            +FP +  PLD      + P   +QY++ VVPT Y       +++NQ+SVT +   +++ 
Sbjct: 223 PYFPDISQPLDYSFEVAKDPYTAFQYYMHVVPTNYIAPRSKPLETNQYSVTHYTHIYKTP 282

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
            +G    +PG+FF +DL P+ ++  +   S    +     ++GGVFT +     F+    
Sbjct: 283 HEG----IPGIFFKFDLDPMVLSIHQRTTSLTALIIRCVGVIGGVFTCA---TYFVRASM 335

Query: 285 RAI 287
           RA+
Sbjct: 336 RAV 338


>gi|453088947|gb|EMF16987.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 404

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 92/194 (47%), Gaps = 38/194 (19%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E + C IYG +  NKV G+FH  A G  + + G H+         +FN SH+I +L+FG 
Sbjct: 184 EADSCRIYGSMHGNKVKGDFHITARGHGYMEFGQHL------DHSTFNFSHRITELSFGP 237

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTD------------------------ 204
           ++P + NPLD    T E+    +QY++ VVPT+YT                         
Sbjct: 238 YYPSLTNPLDNTFATTESNFYKFQYYLSVVPTIYTADAKALRKIDKYHESPTSGDDGLSQ 297

Query: 205 ----VSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
                S +T+ +NQ++VTE      +    ++PG+F  +D+ PI++T  E   S    L 
Sbjct: 298 QPKRYSKNTVFTNQYAVTEQSHPVSE---SSVPGIFVKFDIEPIQLTIAENWSSVPALLI 354

Query: 261 NVCAIVGGVFTVSG 274
            +  +V G+    G
Sbjct: 355 RIVNVVSGLLVAGG 368


>gi|255726548|ref|XP_002548200.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240134124|gb|EER33679.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 355

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 62/197 (31%), Positives = 96/197 (48%), Gaps = 18/197 (9%)

Query: 90  NPDLIDQCKREGFLQRIKE------EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
            PDL D+  +E      +       E    C+I+G + V +V G+F        ++   H
Sbjct: 126 TPDL-DEIMQESLRAEFRSQGARVNEGAPACHIFGSIPVTQVRGDFRITAKGFGYRDRSH 184

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
           V        ++FN SH I + +FGE +P + NPLD      E     Y Y+ KVVPT+Y 
Sbjct: 185 V------PIEAFNFSHVIQEFSFGEFYPFINNPLDATGKITEEKLQTYLYYAKVVPTMYE 238

Query: 204 DVSGHTIQSNQFSVTEH---FRSSEQG-RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
            + G  I +NQ+S+TE     +  EQ  R   +PG++F YD  PIK+   E+ + F  F+
Sbjct: 239 QL-GLEIDTNQYSLTESQHVIQVDEQTKRPNGIPGIYFRYDFEPIKLVIREKRIPFFQFI 297

Query: 260 TNVCAIVGGVFTVSGII 276
             +  I GG+   +G +
Sbjct: 298 AKLGTIGGGIMIAAGYL 314


>gi|320580226|gb|EFW94449.1| COPii-coated vesicle-associated protein, putative [Ogataea
           parapolymorpha DL-1]
          Length = 901

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 54/167 (32%), Positives = 84/167 (50%), Gaps = 12/167 (7%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           +E    C I+G + VN+V G  H          G    D      +  N +H I++ +FG
Sbjct: 707 DEGAPACRIFGAIPVNRVKGELHIT------AKGYGYRDRTRIPAEGLNFTHAISEFSFG 760

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           E FP + NPLD    T +     ++Y I VVPT+Y  + G  I +NQ+S+     S  + 
Sbjct: 761 EFFPYLDNPLDMTLKTTDAHLHTFKYHINVVPTLYRKL-GVEIDTNQYSL-----SLTES 814

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             + +PG+FF Y+  PIK+   E  +SF  F+  +  I+GG+  V+G
Sbjct: 815 SGKYVPGIFFQYEFEPIKLVVEETRLSFWQFVVRLATIMGGILVVAG 861


>gi|154305556|ref|XP_001553180.1| hypothetical protein BC1G_08547 [Botryotinia fuckeliana B05.10]
          Length = 381

 Score = 97.1 bits (240), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 61/161 (37%), Positives = 89/161 (55%), Gaps = 26/161 (16%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           G+ C +YG LEVNKV G+FH  A G  + + G H+         +FN SH IN+L+FG  
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNFSHIINELSFGPF 238

Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT--------IQSNQFSVT- 218
           +P ++NPLD  R    TP+    YQYF+ VVPT+Y+              +++NQ++VT 
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSVVPTLYSLSPSTFSPSSSPTLLRTNQYAVTS 296

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            EH         +++PG+FF YD+ P+ +T  E    FL F
Sbjct: 297 QEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332


>gi|47219772|emb|CAG03399.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 378

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/224 (30%), Positives = 98/224 (43%), Gaps = 62/224 (27%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK-----------SFHQSGVHVHDILAFQR-------- 152
             C I+G L VNKVAGNFH   GK           S H   + V   L   R        
Sbjct: 130 RACRIHGHLYVNKVAGNFHITVGKYVTSLLGYSVVSLHSIPIGVTLFLLLSRSIPHPRGH 189

Query: 153 ---------DSFNISHKINKLAFGEHFPGVVNPLDGVRWTQE--------TP-------- 187
                    DS+N SH+I+ L+FGE  PG+++PLDG              TP        
Sbjct: 190 AHLAALVSHDSYNFSHRIDHLSFGEDLPGIISPLDGTEKVSADCTAVLSLTPLHRCDFFL 249

Query: 188 ----------------SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-LQ 230
                           + ++QYFI +VPT   +    + +++Q+SVTE  R+        
Sbjct: 250 PRLFFKMCDFRFSLLANHIFQYFITIVPT-KLNTYKVSAETHQYSVTEQDRAINHAAGSH 308

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            + G+F  YD+S + V  TE+H+    FL  +C IVGG+F+ + 
Sbjct: 309 GVSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIVGGIFSTTA 352


>gi|146079597|ref|XP_001463805.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398011570|ref|XP_003858980.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134067893|emb|CAM66174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322497192|emb|CBZ32265.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 368

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 105/224 (46%), Gaps = 22/224 (9%)

Query: 65  SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
           +++  CC+ CE V + Y++ G  +   + I QC  E   QR       GC + G L++ K
Sbjct: 148 AAELKCCDTCESVLDLYKELGKGIPGTEYIPQC-LEQLYQR-----ASGCTVMGSLDLKK 201

Query: 125 VAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLD 178
           V     F P ++ H     + D++       + SH I KL  G    E F   GV  PL 
Sbjct: 202 VPVTVIFGPRRTGH--FYSLKDVI-----RLDTSHFIRKLRIGDETVERFSKNGVAEPLS 254

Query: 179 GVRWTQETPSGMYQYFIKVVPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVF 236
           G + + +T S   +Y +KVVPT Y  T        + ++S     R+   G    +P V 
Sbjct: 255 GHKSSSKTYSET-RYLVKVVPTTYRKTKTKNAKASTYEYSAQWSRRTIVVGFAGAVPAVL 313

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           F ++ +PI+V    E   F HFL  +C IVGG+F V G ID  +
Sbjct: 314 FEFEPAPIQVNNVFERQPFSHFLVQLCGIVGGLFVVLGFIDNVV 357


>gi|353236810|emb|CCA68797.1| related to ERV41-component of copii vesicles involved in transport
           between the ER and golgi complex [Piriformospora indica
           DSM 11827]
          Length = 559

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 82/166 (49%), Gaps = 9/166 (5%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +G  C +YG   V K+ GNFH    G  +     H         D+ N+SH I + +FG 
Sbjct: 198 DGGACRVYGSFAVRKLTGNFHITTLGHGYGGHNAHA------SHDNINMSHVITEFSFGP 251

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
           ++P +V PLD    T +     +QYFI VVPT Y       + ++Q+SVT + +  E   
Sbjct: 252 YYPDIVQPLDYSFETTQEHFVAFQYFITVVPTTYVAPRSKPLHTHQYSVTHYVK--ELPH 309

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
            Q  PG+FF YD+ P+ +   +   +   FL  +  ++GGV+   G
Sbjct: 310 SQGTPGIFFKYDIDPVALEIHQRTTTLTQFLVRIVGVIGGVWVCFG 355


>gi|297803392|ref|XP_002869580.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315416|gb|EFH45839.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 480

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 39/204 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC I G++ V KV GN   +      +SG H     +F     N+SH +N L+FG+    
Sbjct: 293 GCRIEGYIRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGQRIMP 342

Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
                      + G+  + LDG  +  +    P+   ++++++V T         ++SN 
Sbjct: 343 QKFSELKRLSPYLGLSHDRLDGRPFINQRDLGPNVTIEHYLQIVKT-------EVVKSNG 395

Query: 215 FSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
            ++ E +  +    +     LP   F ++LSP++V  TE   SF HF+TNVCAI+GGVFT
Sbjct: 396 QALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFT 455

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+D+ ++H    + KKIE+GK
Sbjct: 456 VAGILDSILHHSM-TLMKKIELGK 478


>gi|391338468|ref|XP_003743580.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Metaseiulus occidentalis]
          Length = 292

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 69/202 (34%), Positives = 98/202 (48%), Gaps = 32/202 (15%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           +G+GCN      +NKV GNFH     S H +          Q D  ++SH+I+ L FGE 
Sbjct: 109 DGKGCNFVSKFTINKVPGNFHV----STHAAKT--------QPDDIDMSHEIHSLTFGEQ 156

Query: 170 F--------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS----- 216
                     G  N L      +      + Y +K+VPTVY   SG ++   Q++     
Sbjct: 157 LIYELGDDIKGSFNALQNHDRLKADGKESHDYVMKIVPTVYELSSGDSLVGYQYTHAHKS 216

Query: 217 -VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
            +T  F +   GR+  +P ++F YDL+PI V +         FLTNVCAIVGG FTV GI
Sbjct: 217 YITLSFSA---GRI--IPAIWFKYDLNPITVRYHRRTQPLYSFLTNVCAIVGGTFTVVGI 271

Query: 276 IDAFIYHGQRAIKKKIEIGKFS 297
           I++  +       +K E+GK S
Sbjct: 272 INSICFTAGEVF-RKFEMGKLS 292


>gi|451847161|gb|EMD60469.1| hypothetical protein COCSADRAFT_98785 [Cochliobolus sativus ND90Pr]
          Length = 395

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 91/190 (47%), Gaps = 36/190 (18%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L  NKV G+FH  A G  + + G H+      +  SFN SH I +++FG ++
Sbjct: 178 DSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------EHSSFNFSHIIREMSFGPYY 231

Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD-----------VS---------- 206
           P + NPLD       TP+     +QY++ +VPT+YTD           VS          
Sbjct: 232 PSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPALMPIMESMVSTNDQPSSNMF 291

Query: 207 --GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
              H I++NQ++VT      +      +PG+F  +D+ PI +   EE  SF   +  +  
Sbjct: 292 RMAHAIKTNQYAVTSQSHKVDDSY---VPGIFVKFDIEPIMLAIVEESKSFWKLVITLVN 348

Query: 265 IVGGVFTVSG 274
           +V GV    G
Sbjct: 349 VVSGVMVAGG 358


>gi|326470603|gb|EGD94612.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
          Length = 399

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 30/192 (15%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           K +  + C ++G LE NKV GN H  A G  + + G       A    S N +H I +L+
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
           FG H+  ++NPLD    +       YQY++ VVPT+YT  SGH                 
Sbjct: 240 FGPHYGRLLNPLDKTVSSTSINFYKYQYYLSVVPTIYTK-SGHIDPNRRSLPDASTITAK 298

Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
               T+ +NQ++VT  +    Q R+ + PG+FF Y++ PI +  ++E  S L  +  +  
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLALMVRLVN 357

Query: 265 IVGGVFTVSGII 276
           +V GV    G +
Sbjct: 358 VVSGVLVTGGWL 369


>gi|347828541|emb|CCD44238.1| similar to endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Botryotinia fuckeliana]
          Length = 381

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 60/161 (37%), Positives = 89/161 (55%), Gaps = 26/161 (16%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           G+ C +YG LEVNKV G+FH  A G  + + G H+         +FN SH IN+L+FG  
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHL------DHSAFNFSHIINELSFGPF 238

Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHT--------IQSNQFSVT- 218
           +P ++NPLD  R    TP+    YQYF+ +VPT+Y+              +++NQ++VT 
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSPTLLRTNQYAVTS 296

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            EH         +++PG+FF YD+ P+ +T  E    FL F
Sbjct: 297 QEHIVGE-----RSVPGIFFKYDIEPLLLTVEESRDGFLRF 332


>gi|167523643|ref|XP_001746158.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775429|gb|EDQ89053.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1400

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Composition-based stats.
 Identities = 51/149 (34%), Positives = 84/149 (56%), Gaps = 7/149 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + E +GC ++G + V +V+ NFHF+ GKS H +  H H  +   + + N SH+I++ +F 
Sbjct: 165 DAEPDGCRVHGTMPVARVSSNFHFSAGKSVHHASGHAHVPIDPNQKTINFSHRIDRFSFS 224

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDV-SGHTIQSNQFSVTE--HFRSS 224
               G +  LDG     ++   ++QYF+KVVPT    +      +SNQ+SVTE  H  ++
Sbjct: 225 SEQRGAM-ALDGDMKVSDSNKQLFQYFLKVVPTTTKRMDEAEPFRSNQYSVTEQHHILAA 283

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
            +   + LPG+ F Y++ PI V   E+ V
Sbjct: 284 NE---RKLPGIHFKYEIEPIGVLVHEQAV 309


>gi|149241719|ref|XP_001526345.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450468|gb|EDK44724.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 353

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 94/177 (53%), Gaps = 12/177 (6%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           QRI E     C+I+G + VN+V G+F    GK F  S   +H  LA    + N +H I +
Sbjct: 146 QRINEG-APACHIFGSIPVNQVKGDFRIT-GKGFGYSD-RLHVPLA----ALNFTHVIQE 198

Query: 164 LAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRS 223
            ++GE FP + NPLD      E     Y Y  +VVPT+Y  + G  + +NQ+S+TE+   
Sbjct: 199 FSYGEFFPFLNNPLDATGKVTEEKLQAYIYNAQVVPTLYEKL-GLEVDTNQYSLTENHHV 257

Query: 224 SE----QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            +      R Q +PG++F Y+  PIK+T  E+ + F  F+  +  I GG+   +G +
Sbjct: 258 IKLDEISNRPQGVPGIYFRYEFEPIKLTIREKRIPFFQFVARLGTICGGLLVAAGYL 314


>gi|403413226|emb|CCL99926.1| predicted protein [Fibroporia radiculosa]
          Length = 546

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 89/190 (46%), Gaps = 15/190 (7%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           +++G  C IYG +   K   N H       + S  HV           N+SH IN+ +FG
Sbjct: 177 QKDGSACRIYGTITAKKATANLHITTIGHGYASRDHV------DHKYMNLSHVINEFSFG 230

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSE 225
             FP +V PLD        P   YQY++ VVPT Y       + ++Q+SVT + R  S+ 
Sbjct: 231 PFFPEIVQPLDNSFELALDPFVAYQYYLHVVPTTYIAPRSTPLHTHQYSVTHYTRTMSTH 290

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQR 285
           QG     PG+FF +DL P+ +T  +   +   FL     +VGG+F   G     +  G R
Sbjct: 291 QG----TPGIFFKFDLEPMHLTIHQRTTTLAQFLIRCVGVVGGIFVCMGYA---VRVGTR 343

Query: 286 AIKKKIEIGK 295
           A++    + +
Sbjct: 344 AVEAATGVDR 353


>gi|22328963|ref|NP_567765.2| protein PDI-like 5-4 [Arabidopsis thaliana]
 gi|75213708|sp|Q9T042.1|PDI54_ARATH RecName: Full=Protein disulfide-isomerase 5-4; Short=AtPDIL5-4;
           AltName: Full=Protein disulfide-isomerase 7; Short=PDI7;
           AltName: Full=Protein disulfide-isomerase 8-2;
           Short=AtPDIL8-2; Flags: Precursor
 gi|4490704|emb|CAB38838.1| putative protein [Arabidopsis thaliana]
 gi|7269561|emb|CAB79563.1| putative protein [Arabidopsis thaliana]
 gi|15450832|gb|AAK96687.1| putative protein [Arabidopsis thaliana]
 gi|20259836|gb|AAM13265.1| putative protein [Arabidopsis thaliana]
 gi|332659897|gb|AEE85297.1| protein PDI-like 5-4 [Arabidopsis thaliana]
          Length = 480

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/204 (31%), Positives = 105/204 (51%), Gaps = 39/204 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC + G++ V KV GN   +      +SG H     +F     N+SH +N L+FG     
Sbjct: 293 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 342

Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
                      + G+  + LDG  +  +    P+   ++++++V T         ++SN 
Sbjct: 343 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKT-------EVVKSNG 395

Query: 215 FSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
            ++ E +  +    +     LP   F ++LSP++V  TE   SF HF+TNVCAI+GGVFT
Sbjct: 396 QALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFT 455

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+D+ ++H    + KKIE+GK
Sbjct: 456 VAGILDSILHHSM-TLMKKIELGK 478


>gi|156841160|ref|XP_001643955.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114586|gb|EDO16097.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 349

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/167 (30%), Positives = 97/167 (58%), Gaps = 13/167 (7%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           +GC+IYG +++N+VAG   F A G  +  +G    D + F       +H IN+ +FG+ +
Sbjct: 157 DGCHIYGSVKLNRVAGELQFTAKGWGYRDNGRAPLDQIDF-------NHVINEFSFGDFY 209

Query: 171 PGVVNPLDGVRWTQETPS-GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL 229
           P + NPLDG    ++  S   Y Y   VVPT++  + G  + +NQ+S+ E+  + + G++
Sbjct: 210 PYIDNPLDGTAKIEKQKSISRYIYSTSVVPTIFQKL-GAEVDTNQYSLAEYHTAPKDGKI 268

Query: 230 Q---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           +   ++PG+FF YD  P+ +  +++ +SF+ F+  + AI+  +  ++
Sbjct: 269 KLTTSIPGIFFRYDFEPLSIVISDKRLSFVQFIVRLVAILSFILYMA 315


>gi|378726952|gb|EHY53411.1| hypothetical protein HMPREF1120_01605 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 326

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 94/208 (45%), Gaps = 47/208 (22%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG LE NKV G+FH  A G  + + G+  H         FN SH IN+L+FG H+
Sbjct: 86  DSCRIYGSLEGNKVQGDFHITARGHGYMEFGMQQH----LDHSRFNFSHHINELSFGPHY 141

Query: 171 PGVVNPLDGVRW-TQETPSGMYQYFIKVVPTVYT-------------------------- 203
           PG++NPLD     T +     YQY++ +VPT++T                          
Sbjct: 142 PGLLNPLDKTSAVTTDVHFMRYQYYLSIVPTIFTKRRVSTSSGALDPAAIPQPPTLDLTP 201

Query: 204 ----DVSG--------HTIQSNQFSVTEHFRSSEQGRL---QTLPGVFFFYDLSPIKVTF 248
               D  G        H  + ++   T  + ++ Q R     T+PGVFF YD+ PI +  
Sbjct: 202 NDHRDKDGVVRHVPNPHAGRDSKSVFTNQYAATSQSREVPGNTVPGVFFKYDIEPILLIV 261

Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           +E   SFL  +  +  ++ GV    G +
Sbjct: 262 SERRSSFLGLIVRLVNVISGVLVAGGWM 289


>gi|238480964|ref|NP_680742.2| protein PDI-like 5-4 [Arabidopsis thaliana]
 gi|332659898|gb|AEE85298.1| protein PDI-like 5-4 [Arabidopsis thaliana]
          Length = 532

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 102/201 (50%), Gaps = 33/201 (16%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC + G++ V KV GN   +      +SG H     +F     N+SH +N L+FG     
Sbjct: 345 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 394

Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
                      + G+  + LDG  +  +    P+   ++++++V T     +G  +    
Sbjct: 395 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQALV-EA 453

Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           +  T H   S       LP   F ++LSP++V  TE   SF HF+TNVCAI+GGVFTV+G
Sbjct: 454 YEYTAH---SSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGVFTVAG 510

Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
           I+D+ ++H    + KKIE+GK
Sbjct: 511 ILDSILHHSM-TLMKKIELGK 530


>gi|169860063|ref|XP_001836668.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Coprinopsis cinerea okayama7#130]
 gi|116502344|gb|EAU85239.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Coprinopsis cinerea okayama7#130]
          Length = 516

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 80/163 (49%), Gaps = 8/163 (4%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +   C I+G + V KV  N H       + S  HV   L       N+SH I + +FG
Sbjct: 169 QADASACRIWGTMYVKKVTANLHVTTLGHGYASYEHVDHHL------MNLSHVIQEFSFG 222

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            HFP +V PLD            YQYF+ VVPT Y       +++NQ+SVT + R  E  
Sbjct: 223 PHFPEIVQPLDNSFEATHEHFIAYQYFLHVVPTTYVAPRTAPLETNQYSVTHYTRVLEHN 282

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           R    PG+FF ++L P+K+T  +   + L  +     ++GGVF
Sbjct: 283 R--GTPGIFFKFELDPLKITQYQRTTTLLQLMIRCVGVIGGVF 323


>gi|358388143|gb|EHK25737.1| hypothetical protein TRIVIDRAFT_33251 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/184 (30%), Positives = 95/184 (51%), Gaps = 15/184 (8%)

Query: 92  DLIDQCKREGFLQRIKEEEG--EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDIL 148
           D++   +++    +    +G  + C +YG L++N+V G+FH  A G  +   G H+    
Sbjct: 163 DIVALSRKKAKWAKTPSPKGRPDSCRMYGSLDLNRVQGDFHITARGHGY--GGQHL---- 216

Query: 149 AFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH 208
               D FN SH I+++++G  +P +VNPLD    +       +QY++ VVPTVY   +  
Sbjct: 217 --DHDKFNFSHIISEMSYGPFYPSLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYL-ANNR 273

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
            + +NQ++VTE  ++        +PG+FF YD+ PI ++  E    F  FL  +  I  G
Sbjct: 274 IVNTNQYAVTEQSKTISD---HQVPGIFFKYDIEPIMLSVEESRDGFFTFLVKIVNIFSG 330

Query: 269 VFTV 272
           V   
Sbjct: 331 VMVA 334


>gi|451997913|gb|EMD90378.1| hypothetical protein COCHEDRAFT_27091 [Cochliobolus heterostrophus
           C5]
          Length = 395

 Score = 95.5 bits (236), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 37/207 (17%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C IYG L  NKV G+FH  A G  + + G H+         SFN SH I +++FG ++
Sbjct: 178 DSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHL------DHSSFNFSHIIREMSFGPYY 231

Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTD-----------VS---------- 206
           P + NPLD       TP+     +QY++ +VPT+YTD           VS          
Sbjct: 232 PSLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPSLMPLMESVVSTNDQPSSNMF 291

Query: 207 --GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
              H I++NQ++VT     S +     +PG+F  +D+ PI +   EE  SF   L  +  
Sbjct: 292 RMAHAIKTNQYAVTSQ---SHKVDDTYVPGIFVKFDIEPIMLAIVEESKSFWKLLITLVN 348

Query: 265 IVGGVFTV-SGIIDAFIYHGQRAIKKK 290
           +V GV    S +   F +  +   K+K
Sbjct: 349 VVSGVMVAGSWVWQMFDWASEFVGKRK 375


>gi|21618302|gb|AAM67352.1| unknown [Arabidopsis thaliana]
          Length = 317

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC + G++ V KV GN   +      +SG H     +F     N+SH +N L+FG     
Sbjct: 130 GCRVEGYMRVKKVPGNLMVSA-----RSGSH-----SFDSSQMNMSHVVNHLSFGRRIMP 179

Query: 170 -----------FPGVV-NPLDGVRWTQET---PSGMYQYFIKVVPTVYTDVSGHTIQSNQ 214
                      + G+  + LDG  +  +    P+   ++++++V T     +G  +    
Sbjct: 180 QKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIVKTEVVKSNGQAL---- 235

Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
               E+   S       LP   F ++LSP++V  TE   SF HF+TNVCAI+GG FTV+G
Sbjct: 236 VEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNVCAIIGGAFTVAG 295

Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
           I+D+ ++H    + KKIE+GK
Sbjct: 296 ILDSILHHSM-TLMKKIELGK 315


>gi|328352874|emb|CCA39272.1| Peroxisomal membrane protein PEX28 [Komagataella pastoris CBS 7435]
          Length = 849

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 17/189 (8%)

Query: 94  IDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 147
           +D+  RE  L   +E+      +   C+I+G + VNKV G FH   GK     G    D 
Sbjct: 644 LDEVMRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDR 697

Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
               +++ N +H I++ +FGE +P + NPLD    T       + Y++ VVPT Y  + G
Sbjct: 698 SIVPKEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-G 756

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
             I + Q+S+T     +E   L   PG+FF Y   PI ++  E+ +SF+ FL  +  I G
Sbjct: 757 IVIDTTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICG 812

Query: 268 GVFTVSGII 276
           G+  V+  I
Sbjct: 813 GIMVVAKWI 821


>gi|345325542|ref|XP_001508860.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Ornithorhynchus anatinus]
          Length = 372

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 53/116 (45%), Positives = 69/116 (59%), Gaps = 5/116 (4%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +  + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE
Sbjct: 165 QPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSYNFSHRIDHLSFGE 224

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFR 222
             PG++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE  R
Sbjct: 225 LVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAET---HQFSVTERER 277


>gi|260950511|ref|XP_002619552.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
 gi|238847124|gb|EEQ36588.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
          Length = 347

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 87/168 (51%), Gaps = 8/168 (4%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E+   C+I+G + VN V G F   P  S ++      D  +    ++N SH I++ +FG+
Sbjct: 150 EDAPACHIFGTIPVNHVRGEFFIVPKGSMYR------DRSSIDPKAYNFSHVISEFSFGD 203

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
            +P + NPLD      E     Y+YF K+VPT Y  + G  + + Q+S+TE   + +  R
Sbjct: 204 FYPFITNPLDFTAKVTEENRQAYRYFAKLVPTHYEKL-GLVVDTYQYSLTE-IHNVDHNR 261

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
               PG+FF Y   PIK+T  E+ + F  F+  +  ++ G+   +G +
Sbjct: 262 GIPPPGIFFDYSFEPIKLTIREKRIGFFAFVARLMTVLSGLLIAAGYL 309


>gi|452847826|gb|EME49758.1| hypothetical protein DOTSEDRAFT_58941 [Dothistroma septosporum
           NZE10]
          Length = 402

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 69/238 (28%), Positives = 104/238 (43%), Gaps = 49/238 (20%)

Query: 75  EEVREAYRKKGWALSNPDL---IDQCKREGFLQRIKE----EEGEGCNIYGFLEVNKVAG 127
           E +R  Y  KG      D+   +   KR+   ++        + + C IYG +  NKV G
Sbjct: 140 ERIRSGYDGKGAEYEEEDVHNYLGAAKRQKKFKKTPGLPWGAQADSCRIYGSMHGNKVQG 199

Query: 128 NFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET 186
           +FH  A G  + + G H+         +FN SH +N+L+FG  +P + NPLD       T
Sbjct: 200 DFHITARGHGYMEFGAHL------DHSTFNFSHTVNELSFGPFYPSLTNPLDNT--VATT 251

Query: 187 PSGMY--QYFIKVVPTVYTD----------------------------VSGHTIQSNQFS 216
           P   Y  QY++ VVPT+YT                              S +T+ +NQ++
Sbjct: 252 PDHFYKFQYYLSVVPTIYTTDAKTLRKIDKHHESPSSGEDGLSQYPHRYSRNTVFTNQYA 311

Query: 217 VTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           VTE    S +     +PGVF  +D+ PI +T  EE  S    L  +  +V G+    G
Sbjct: 312 VTEQ---SHRVPENAVPGVFIKFDIEPIGLTIAEEWSSIPALLIRLVNVVSGLLVAGG 366


>gi|402224967|gb|EJU05029.1| DUF1692-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 517

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/155 (32%), Positives = 81/155 (52%), Gaps = 11/155 (7%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           ++G  C +YG +EV KV  N H    G  +H +    H ++       N+SH I + +FG
Sbjct: 174 KDGSACRVYGSMEVKKVQANLHITTLGHGYHSNEHTDHSLM-------NLSHIITEFSFG 226

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            +FP +V PLD    + + P   +QYF+ VVPT Y    G  +++NQ+SV  H +  + G
Sbjct: 227 PYFPDIVQPLDYTIESSDDPFTAFQYFLTVVPTEYRTSKG-VVKTNQYSVGSHMQHIQHG 285

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
           R    P +FF YDL P+ +   +   + + FL  +
Sbjct: 286 R--GTPVIFFKYDLEPLSLIVEQRTTTLIQFLIRL 318


>gi|406868300|gb|EKD21337.1| copii-coated vesicle protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 382

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/160 (38%), Positives = 84/160 (52%), Gaps = 16/160 (10%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           K  E + C I+G LEVNKV G  H       +Q     H        +FN SH +++L+F
Sbjct: 184 KSAEMDSCRIFGNLEVNKVQGELHITARGHGYQELAAGH----LDHHAFNFSHVVSELSF 239

Query: 167 GEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVY-----TDVSGHTIQSNQFSVTE 219
           G  +P + NPLD  R    TP+    +QYF+ VVPTVY     T  S  T+ +NQ++VTE
Sbjct: 240 GPFYPSLHNPLD--RTVSTTPNNFHKFQYFLSVVPTVYSVDSSTTYSSQTLFTNQYAVTE 297

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
                 +    ++PG+FF YD  P+ +T  E   SFL FL
Sbjct: 298 QSHVVSEF---SVPGIFFKYDFEPMLLTVQESRDSFLRFL 334


>gi|393231429|gb|EJD39021.1| DUF1692-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 518

 Score = 95.1 bits (235), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 47/152 (30%), Positives = 76/152 (50%), Gaps = 8/152 (5%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           G  C ++G + V KV  N H       + S  H    +       N+SH I++ +FG   
Sbjct: 178 GSACRVFGSMFVKKVTANLHITTAGHGYSSNAHTDHTM------MNLSHIISEFSFGPFM 231

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           P +  PLD +    + P   YQYF+ VVPT Y     + +++NQ+SVT + R  E GR  
Sbjct: 232 PDISQPLDNLFEVAKEPFTAYQYFLTVVPTTYVAPRSYPMRTNQYSVTNYKRVFEHGR-- 289

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
             PG+FF +D+ P+++T  +   +F   +  +
Sbjct: 290 ATPGIFFKFDIDPMQLTVIQRTTTFTQLIIRI 321


>gi|156065931|ref|XP_001598887.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980]
 gi|154691835|gb|EDN91573.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 421

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/161 (37%), Positives = 88/161 (54%), Gaps = 26/161 (16%)

Query: 111 GEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           G+ C +YG LEVNKV G+FH  A G  + + G H+        ++FN SH IN+L+FG  
Sbjct: 185 GDSCRVYGSLEVNKVQGDFHITAKGHGYPELGQHL------DHNAFNFSHIINELSFGPF 238

Query: 170 FPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYTDVSGHTI--------QSNQFSVT- 218
           +P ++NPLD  R    TP+    YQYF+ +VPT+Y+               ++NQ++VT 
Sbjct: 239 YPSLLNPLD--RTIAGTPNHFHKYQYFLSIVPTLYSLSPSTFSPSSSPSLLRTNQYAVTS 296

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
            EH         + +PG+FF YD+ P+ +T  E    FL F
Sbjct: 297 QEHIVGE-----RNVPGIFFKYDIEPLLLTVEESRDGFLRF 332


>gi|315054535|ref|XP_003176642.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
 gi|311338488|gb|EFQ97690.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
          Length = 399

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/192 (31%), Positives = 92/192 (47%), Gaps = 30/192 (15%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           K +  + C ++G LE NKV GN H  A G  + + G       A    S N +H I +L+
Sbjct: 186 KSDVVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
           FG H+  ++NPLD    T       YQY + VVPT+YT  SGH                 
Sbjct: 240 FGPHYGRLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTK-SGHMDPSRRSLPDSSTITAK 298

Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
               T+ +NQ++VT  +    Q R+ + PG+FF Y++ PI +  ++E  S L  +  +  
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLGLMIRLVN 357

Query: 265 IVGGVFTVSGII 276
           +V GV    G +
Sbjct: 358 VVSGVLVTGGWL 369


>gi|448521200|ref|XP_003868450.1| Erv41 protein [Candida orthopsilosis Co 90-125]
 gi|380352790|emb|CCG25546.1| Erv41 protein [Candida orthopsilosis]
          Length = 352

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 102/208 (49%), Gaps = 23/208 (11%)

Query: 85  GWALSNPDL------IDQCKREGF------LQRIKEEEGEGCNIYGFLEVNKVAGNFHFA 132
           G++++NP+       +D+  +E        L R   E    C+I+G + VN+V G F   
Sbjct: 114 GFSINNPNDFHETPDLDEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGEFRIT 173

Query: 133 PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQ 192
                   G+   D      ++ N SH I + ++G+ FP + NPLD      E    +Y 
Sbjct: 174 ------AKGLGYKDRSFVPVEALNFSHVIQEFSYGDFFPFLNNPLDATGKVTEENLQIYL 227

Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTF 248
           Y  KVVPT+Y  + G  + + Q+S+TE+      +    + Q +PG++F Y+  PIK+  
Sbjct: 228 YHSKVVPTLYEKL-GLEVDTTQYSLTENHHIVKVNPHSKKPQGIPGIYFAYEFEPIKLII 286

Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            E+ + FL F+  +  IVGG+   +G +
Sbjct: 287 REKRIPFLQFIAKLGTIVGGIIVAAGYL 314


>gi|66773206|ref|NP_080631.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           isoform 2 [Mus musculus]
 gi|12854944|dbj|BAB30175.1| unnamed protein product [Mus musculus]
          Length = 302

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 5/110 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 168 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 227

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTE 219
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE
Sbjct: 228 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTE 274


>gi|326479518|gb|EGE03528.1| COPII-coated vesicle protein [Trichophyton equinum CBS 127.97]
          Length = 399

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 92/192 (47%), Gaps = 30/192 (15%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           K +  + C ++G LE NKV GN H  A G  + + G       A    S N +H I +L+
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWG------RATNPHSLNFTHLITELS 239

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH----------------- 208
           FG H+  ++NPLD    +       YQY + VVPT+YT  SGH                 
Sbjct: 240 FGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLPDASTITAK 298

Query: 209 ----TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
               T+ +NQ++VT  +    Q R+ + PG+FF Y++ PI +  ++E  S L  +  +  
Sbjct: 299 DSKTTVSTNQYAVTS-YSQPIQPRIDSTPGIFFKYNIEPILLIVSQERDSLLALMVRLVN 357

Query: 265 IVGGVFTVSGII 276
           +V GV    G +
Sbjct: 358 VVSGVLVTGGWL 369


>gi|148678795|gb|EDL10742.1| ERGIC and golgi 2, isoform CRA_b [Mus musculus]
          Length = 310

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 52/110 (47%), Positives = 67/110 (60%), Gaps = 5/110 (4%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           + C I+G L VNKVAGNFH   GK+      H H       DS+N SH+I+ L+FGE  P
Sbjct: 176 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 235

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTE 219
           G++NPLDG        + M+QYFI VVPT ++T  +S  T   +QFSVTE
Sbjct: 236 GIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADT---HQFSVTE 282


>gi|410082748|ref|XP_003958952.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
 gi|372465542|emb|CCF59817.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
          Length = 354

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 49/169 (28%), Positives = 90/169 (53%), Gaps = 11/169 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+++G + VN+V G             G+   D      D  N +H IN+L+FG+ 
Sbjct: 160 EFNGCHVFGSIPVNRVTGELQIT------AKGMGYPDREKAPIDEVNFAHVINELSFGDF 213

Query: 170 FPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
           +P + NPLD   ++ QE P   Y Y + V+PT+Y  + G  + +NQ+SV+E+  +     
Sbjct: 214 YPYIDNPLDNSAKFDQENPISAYVYHMNVIPTIYQKL-GAEVDTNQYSVSEYHYTEADNA 272

Query: 229 LQT---LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           ++    +PG+F  Y+  P+ +  T++ +SF+ F+  + AI+  +  ++ 
Sbjct: 273 IRKAGRVPGIFLKYNFEPLSIVVTDKRLSFIQFVIRLVAILSFIVYIAS 321


>gi|123361353|ref|XP_001295947.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121875215|gb|EAX83017.1| hypothetical protein TVAG_111750 [Trichomonas vaginalis G3]
          Length = 338

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 77/291 (26%), Positives = 133/291 (45%), Gaps = 36/291 (12%)

Query: 8   HLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSD 67
           HLD+        LDS G+      D +   ++++  ++    L + +  C SCY     +
Sbjct: 61  HLDI--------LDSIGHKQLLVNDTLKWRRVNQ--EKGFMELYNKKKQCHSCYDF-YDN 109

Query: 68  EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAG 127
             CCN CE+++E Y       + P+   QCK E    + K +  E C++ G + VN+V G
Sbjct: 110 RFCCNGCEKLKEIYHSNN-KTATPENWTQCKPEN---KQKFDPNEKCHVKGKISVNRVPG 165

Query: 128 NFHFAPGKSFHQSGVHVHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQET 186
           +FH A G+S    G H H +L  +Q  +F+  H I  L FG + P   +PL G       
Sbjct: 166 SFHLAIGQSIEDYG-HQHILLDDYQTITFD--HDIIDLRFGANIPMTSHPLRGTHIKSTG 222

Query: 187 PSGMYQYFIKVVPTVYTDVSGHTIQSNQ-----FSVTEHFRSSEQGRLQTLPGVFFFYDL 241
                +Y + + P V+    G  I+        +S+T H           +PG++F+Y  
Sbjct: 223 EPLATEYNLIITPIVFY-ADGQYIEKGFEYVYFYSMTYHL----------VPGIYFYYSF 271

Query: 242 SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           +P  +  T +  SF  FL +   ++ G++ +  ++  F+    +  KKK+E
Sbjct: 272 TPYTIAVTWQSRSFRSFLISTGGLLSGIYAIFSMVSTFLEKSDQK-KKKVE 321


>gi|225461068|ref|XP_002281649.1| PREDICTED: protein disulfide isomerase-like 5-4 [Vitis vinifera]
 gi|297735969|emb|CBI23943.3| unnamed protein product [Vitis vinifera]
          Length = 482

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 104/204 (50%), Gaps = 38/204 (18%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC I GF+ V KV GN   +      +SG H     +F     N+SH I+ L+FG    P
Sbjct: 294 GCRIEGFVRVKKVPGNLVISA-----RSGSH-----SFDPSQMNMSHVISHLSFGRKIAP 343

Query: 172 GVVNPLDGV-------------RWTQETPSG-----MYQYFIKVVPTVYTDVSGHTIQSN 213
            V++ +  V             R     PS        +++++VV T       H +   
Sbjct: 344 RVMSDMKRVLPYIGGSHDRLNGRSYISHPSDSNANVTIEHYLQVVKTEVITTRDHKL--- 400

Query: 214 QFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
              V E+  ++    +Q+L  P   F ++LSP++V  TE   SF HF+TNVCAI+GGVFT
Sbjct: 401 ---VEEYEYTAHSSLVQSLYIPVAKFHFELSPMQVLVTENRKSFWHFITNVCAIIGGVFT 457

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+D+ +++  R + KKIE+GK
Sbjct: 458 VAGILDSVLHNTMR-LMKKIELGK 480


>gi|401416963|ref|XP_003872975.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489202|emb|CBZ24457.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 368

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 64/221 (28%), Positives = 105/221 (47%), Gaps = 26/221 (11%)

Query: 70  CCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNF 129
           CC+ CE V + Y++ G  +   + + QC  + +      ++  GCN+ G L++ KV    
Sbjct: 153 CCDTCESVLDLYKELGKGIPGTEYLPQCLEQLY------QQASGCNVVGSLDLKKVHVTV 206

Query: 130 HFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLDGVR 181
            F P ++  F+     + D++       + SH I KL  G    E F   GV  PL G +
Sbjct: 207 IFGPRRTGRFYS----LKDVI-----RLDTSHSIRKLRIGDEAVERFSKNGVAEPLSGHK 257

Query: 182 WTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF--RSSEQGRLQTLPGVFFFY 239
              +T S   +Y +KVVPT Y        +++ +  +  +  R+   G    +P V F +
Sbjct: 258 SFSKTYSET-RYLVKVVPTTYRKTKKRNAKASTYEYSAQWSKRTIVVGFAGAVPAVLFEF 316

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           + +PI+V    E   F HF+  +C IVGG+F V G ID  +
Sbjct: 317 EPAPIQVNNVFERQPFSHFVVQLCGIVGGLFVVLGFIDNVV 357


>gi|302808800|ref|XP_002986094.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
 gi|300146242|gb|EFJ12913.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
          Length = 475

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 82/283 (28%), Positives = 130/283 (45%), Gaps = 48/283 (16%)

Query: 35  GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
           G P I    + H  + EH      S YG   +D     +  +  EA   K   L+  D  
Sbjct: 217 GFPSIRIFRKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269

Query: 95  DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
              K  G ++R     G GC I GF+   KV GN   +       SG H     +F   +
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVPGNIIISA-----HSGSH-----SFDASA 315

Query: 155 FNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYFI 195
            N++H +++ +FG              +P + +  D V        +  +  +  + +++
Sbjct: 316 MNMTHYVSQFSFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYL 375

Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEEH 252
           +VV T    +     +  +FS+ E +  +S    +Q   +P   F Y+LSP++V   E  
Sbjct: 376 QVVKTEVVSLQ----KRKEFSLLEQYDYTSHSNTVQNTNVPVAKFHYELSPMQVLVKENP 431

Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            SF HF+TNVCAI+GGVFTV+GI+D+ + HG   + KKIE+GK
Sbjct: 432 KSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 473


>gi|444706692|gb|ELW48018.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Tupaia chinensis]
          Length = 821

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 47/108 (43%), Positives = 68/108 (62%), Gaps = 5/108 (4%)

Query: 191 YQYFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTF 248
           + Y +K+VPTVY D SG    S Q++V   E+   S  GR+  +P ++F YDLSPI V +
Sbjct: 716 HDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKY 773

Query: 249 TEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           TE       F+T +CAI+GG FTV+GI+D+ I+    A  KK+++GK 
Sbjct: 774 TERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW-KKVQLGKM 820


>gi|189207969|ref|XP_001940318.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187976411|gb|EDU43037.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 394

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 59/193 (30%), Positives = 89/193 (46%), Gaps = 37/193 (19%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E + C IYG L+ NKV G+FH  A G  + + G H+         SFN SH I +++FG 
Sbjct: 174 ETDSCRIYGSLDGNKVQGDFHITARGHGYIEFGQHL------DHSSFNFSHIIREMSFGP 227

Query: 169 HFPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDV-------------------- 205
           ++P + NPLD       TP      +QY++ +VPT+YTD                     
Sbjct: 228 YYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPSLIPLLELVGSTSNHPGAA 287

Query: 206 ----SGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
                 H I++NQ++VT     S +     +PG+F  +D+ PI +   EE   F   +  
Sbjct: 288 SMFHGAHAIKTNQYAVTSQ---SHKVPENYVPGIFVKFDIEPIVLRVVEEWGGFWRLIVT 344

Query: 262 VCAIVGGVFTVSG 274
           +  +V GV    G
Sbjct: 345 LINVVSGVMVAGG 357


>gi|254572003|ref|XP_002493111.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv46p [Komagataella pastoris GS115]
 gi|238032909|emb|CAY70932.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv46p [Komagataella pastoris GS115]
          Length = 333

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/189 (32%), Positives = 95/189 (50%), Gaps = 17/189 (8%)

Query: 94  IDQCKREGFLQRIKEE------EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDI 147
           +D+  RE  L   +E+      +   C+I+G + VNKV G FH   GK     G    D 
Sbjct: 128 LDEVMRESALAEFREKKSFTHGDAPACHIFGSIPVNKVHGFFHIT-GK-----GYGYRDR 181

Query: 148 LAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
               +++ N +H I++ +FGE +P + NPLD    T       + Y++ VVPT Y  + G
Sbjct: 182 SIVPKEALNFTHVISEFSFGEFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTEYKKL-G 240

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
             I + Q+S+T     +E   L   PG+FF Y   PI ++  E+ +SF+ FL  +  I G
Sbjct: 241 IVIDTTQYSMT----VTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTICG 296

Query: 268 GVFTVSGII 276
           G+  V+  I
Sbjct: 297 GIMVVAKWI 305


>gi|209877186|ref|XP_002140035.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555641|gb|EEA05686.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 384

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 71/249 (28%), Positives = 108/249 (43%), Gaps = 27/249 (10%)

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCK---REGFLQRIKEEEG-E 112
           CGSCY   S    CCN C EV  +Y++    L      +QCK   RE   + I       
Sbjct: 117 CGSCYNP-SKKNHCCNTCSEVIRSYQEDNIKLPQKINFEQCKFDPRERLEKAISAPLNIS 175

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC I   + + KV G    +  +  + + +   DI   +   +N S+ +  L +G+  PG
Sbjct: 176 GCKIKVDINIPKVKGRIEISHKRWMNYNEMTNLDIS--EAHLYNFSYIVKYLHYGDDLPG 233

Query: 173 VVNPLDGVRWTQETP-------------SGMYQYFIKVVPTVYTDV-SGHTIQSNQFSV- 217
           + N  +   + Q                       +  +PT +  + S  T   +QFSV 
Sbjct: 234 INNIWNNQEYIQTAKFTHNKESDNLFLEDAHLDIDMHCIPTQFNSINSKKTKIGHQFSVR 293

Query: 218 --TEHFRSSEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
             ++       GR     +LPG++  YD +P  V  TE   SFL FLT  CAI+GG+F  
Sbjct: 294 KQSKQVNVLNNGRFVPETSLPGIYINYDFTPFIVKITESRRSFLSFLTECCAIIGGIFAF 353

Query: 273 SGIIDAFIY 281
           S +ID F++
Sbjct: 354 SSMIDIFMF 362


>gi|330935325|ref|XP_003304912.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
 gi|311318248|gb|EFQ86993.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
          Length = 395

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 74/260 (28%), Positives = 113/260 (43%), Gaps = 54/260 (20%)

Query: 44  QRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFL 103
           Q  G  LE      G+  G   S E+  +  E++ +A+++K    S              
Sbjct: 124 QWTGRNLERGTHELGTEAGDAPSWEEAWDVREQLGKAHKRK---FSK------------T 168

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKIN 162
            RI+    + C IYG L+ NKV G+FH  A G  + + G H+         SFN SH I 
Sbjct: 169 PRIRGNP-DSCRIYGSLDGNKVQGDFHITARGHGYMEFGEHL------DHSSFNFSHIIR 221

Query: 163 KLAFGEHFPGVVNPLDGVRWTQETPSGMY---QYFIKVVPTVYTD----------VS--- 206
           +++FG ++P + NPLD       TP   +   QY++ +VPT+YTD          VS   
Sbjct: 222 EMSFGPYYPSLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPTLIPYLEAVSSTA 281

Query: 207 ------------GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
                          I++NQ++VT     S +     +PGVF  +D+ PI +   EE   
Sbjct: 282 GNHPGAASIFHGARAIKTNQYAVTSQ---SHKVPENYVPGVFVKFDIEPIMLAVVEEWSG 338

Query: 255 FLHFLTNVCAIVGGVFTVSG 274
           F   +  +  +V GV    G
Sbjct: 339 FWRLIVTLVNVVSGVMVAGG 358


>gi|255563725|ref|XP_002522864.1| thioredoxin domain-containing protein, putative [Ricinus communis]
 gi|223537948|gb|EEF39562.1| thioredoxin domain-containing protein, putative [Ricinus communis]
          Length = 478

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 76/241 (31%), Positives = 116/241 (48%), Gaps = 40/241 (16%)

Query: 79  EAYRKKGWALSNPDLIDQCKREGFLQRIKEEE--GEGCNIYGFLEVNKVAGNFHFAPGKS 136
           E+  K   +L  P  ++  K E   Q  K       GC I G++ V KV GN   +    
Sbjct: 252 ESLVKTMESLVAPIQLESLKSENATQSTKRPAPLTGGCRIEGYVRVKKVPGNLIISA--- 308

Query: 137 FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVN--------------PLDG-- 179
             +SG H     +F     N+SH I+ L+FG    P V+N               L+G  
Sbjct: 309 --RSGAH-----SFDPSQMNMSHVISHLSFGLKVSPKVMNEAKRLVPYIGGSHDKLNGRS 361

Query: 180 -VRWTQETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT---LPG 234
            V       +   ++++++V T V T  S     S +  + E +  +    L     +P 
Sbjct: 362 FVNHRDVDANVTIEHYLQIVKTEVVTRRS-----SREHKLLEEYEYTAHSSLVQSVYIPA 416

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIG 294
             F ++LSP++V  TE   SF HF+TNVCAI+GGVFTV+GI+D+ ++H  R + KK+E+G
Sbjct: 417 AKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGILDSILHHTVR-LMKKVELG 475

Query: 295 K 295
           K
Sbjct: 476 K 476


>gi|224117462|ref|XP_002317580.1| predicted protein [Populus trichocarpa]
 gi|222860645|gb|EEE98192.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 69/217 (31%), Positives = 104/217 (47%), Gaps = 30/217 (13%)

Query: 98  KREGFLQRIKEE--EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
           K E   Q +K       GC I G++ V KV GN   +       SG H     +F     
Sbjct: 277 KPENATQHVKRPAPSAGGCRIEGYVRVKKVPGNLMISA-----LSGAH-----SFDSKQM 326

Query: 156 NISHKINKLAFG-EHFPGVV--------------NPLDGVRWTQETPSGMYQYFIKVVPT 200
           N+SH I+  +FG +  P V+              + L+G  +      G        +  
Sbjct: 327 NLSHVISHFSFGMKVLPRVMSDVKRLLPYIGRSHDKLNGRSFINHRDVGANVTIEHYLQV 386

Query: 201 VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           V T+V      S +  + E+  ++     QT  +P   F ++LSP++V  TE   SF HF
Sbjct: 387 VKTEVVTRRSSSERKLIEEYEYTAHSSLSQTVYMPTAKFHFELSPMQVLITENSKSFSHF 446

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           +TNVCAI+GGVFTV+GI+D+ ++H  R + KK+E+GK
Sbjct: 447 ITNVCAIIGGVFTVAGILDSILHHTVRMM-KKVELGK 482


>gi|195130281|ref|XP_002009580.1| GI15435 [Drosophila mojavensis]
 gi|193908030|gb|EDW06897.1| GI15435 [Drosophila mojavensis]
          Length = 433

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/190 (32%), Positives = 104/190 (54%), Gaps = 4/190 (2%)

Query: 103 LQRIKEEEG--EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHK 160
           LQ+I + E   + C ++G L +NKVAG  H   G          H ++ F+R   N +H+
Sbjct: 184 LQQISQMESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHR 243

Query: 161 INKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
           IN+L+FG++   +V PL+G        +   QYFIKVVPT     +  TI + Q++VTE+
Sbjct: 244 INRLSFGQYSRRIVQPLEGDETIIREEATTVQYFIKVVPTEIRH-TFSTISTFQYAVTEN 302

Query: 221 FRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
            R  +  R     PG++F YD S +K+  + +  + + F+  +C+I+ G+  +SG ++A 
Sbjct: 303 VRKLDAERNSYGSPGIYFKYDWSALKIVVSHDRDNLVTFVIRLCSIISGIIVISGAVNAL 362

Query: 280 IYHGQRAIKK 289
           +   QR + +
Sbjct: 363 LVAIQRRLLR 372


>gi|169614774|ref|XP_001800803.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
 gi|111060809|gb|EAT81929.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
          Length = 404

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/190 (30%), Positives = 88/190 (46%), Gaps = 35/190 (18%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C I+G L+ NKV G+FH  A G  + + G    D       +FN SH I +++FG ++
Sbjct: 177 DACRIFGSLDGNKVQGDFHITARGHGYQEFGEQHLD-----HKTFNFSHIIREMSFGPYY 231

Query: 171 PGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSG-------------------- 207
           P + NPLD    T  T       +QY++ +VPT+YTD  G                    
Sbjct: 232 PSLTNPLDNTIATTPTDQDHFYKFQYYLSIVPTIYTDNPGLLPLLESVNRDPSAHPAKSI 291

Query: 208 ---HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
              H I++NQ++VT    +  +     +PGVF  +D+ PI +   EE   F   L  +  
Sbjct: 292 FSTHAIKTNQYAVTSQSHTVPE---NYVPGVFVKFDIEPIMLAVVEEWGGFWRLLVRIVN 348

Query: 265 IVGGVFTVSG 274
           +V GV    G
Sbjct: 349 VVSGVMVAGG 358


>gi|393221326|gb|EJD06811.1| DUF1692-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 537

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 77/154 (50%), Gaps = 12/154 (7%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + +G  C +YG ++  KV  N H       ++S  HV           N+SH I   +FG
Sbjct: 172 KPDGGACRVYGSIQAKKVTANLHITTAGHGYRSMHHV------DHSQMNLSHVITDFSFG 225

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR--SSE 225
            +FP +  PL         P   YQYF+ VVPT Y   +G  + ++Q+SVT + R    E
Sbjct: 226 PYFPDMAQPLKNTFELTHEPFIAYQYFLSVVPTTYIASNGKQVHTSQYSVTHYTRVLQHE 285

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
           QG     PG+FF YDL P+++T  ++  + + FL
Sbjct: 286 QG----TPGIFFKYDLEPLQMTIHQKTTTLVQFL 315


>gi|327307836|ref|XP_003238609.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
 gi|326458865|gb|EGD84318.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
          Length = 399

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 112/260 (43%), Gaps = 52/260 (20%)

Query: 44  QRHGGRLEHNETYCGSCYGAESSDEDCC--NNCEEVREAYRKK---GWALSNPDLIDQCK 98
           +R GG  E+        +  E  +ED    +   EVR + +KK      L   D +D C+
Sbjct: 135 RRSGGSPEYQTLNKEDTFRLEEQEEDLHVEHVLGEVRRSRKKKFPKAPKLKRSDAVDSCR 194

Query: 99  REGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNI 157
                            ++G LE NKV GN H  A G  + + G   +        S N 
Sbjct: 195 -----------------VFGSLEGNKVQGNLHITARGFGYFEWGRTTNP------HSLNF 231

Query: 158 SHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH--------- 208
           +H I +L+FG H+  ++NPLD    +       YQY + VVPT+YT  SGH         
Sbjct: 232 THLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTK-SGHIDPNRRSLP 290

Query: 209 ------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
                       T+ +NQ++VT  +    Q R+   PG+FF Y++ PI +  ++E  S L
Sbjct: 291 DASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIEPILLIVSQEWDSLL 349

Query: 257 HFLTNVCAIVGGVFTVSGII 276
             +  +  +V GV    G +
Sbjct: 350 ALMVRLVNVVSGVLVTGGWL 369


>gi|45190741|ref|NP_984995.1| AER136Wp [Ashbya gossypii ATCC 10895]
 gi|44983720|gb|AAS52819.1| AER136Wp [Ashbya gossypii ATCC 10895]
 gi|374108218|gb|AEY97125.1| FAER136Wp [Ashbya gossypii FDAG1]
          Length = 340

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/169 (31%), Positives = 89/169 (52%), Gaps = 10/169 (5%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           +E E  GC+IYG + VN+V G  H  P    + S   V        D  N++H  N+ +F
Sbjct: 147 EEFEFNGCHIYGSIPVNRVKGELHITPKGWRYSSRQRV------PHDEINLTHIFNEFSF 200

Query: 167 GEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           GE FP + N LD V R+ Q+  +  + YF+ V+PT+Y  + G  + +NQ+SV+ +  +  
Sbjct: 201 GEFFPYIDNTLDQVGRYAQQRLTR-FHYFVSVLPTIYRKM-GAVVDTNQYSVSHNDITYT 258

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
             RL T PG+F  Y+   + V   ++ +SF  FL  +  ++  +  ++ 
Sbjct: 259 SSRLYT-PGIFILYNFEALTVVVQDKRISFWAFLIRLVTMLSFIVYIAA 306


>gi|343473351|emb|CCD14737.1| hypothetical protein, unlikely [Trypanosoma congolense IL3000]
          Length = 141

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 53/134 (39%), Positives = 77/134 (57%), Gaps = 17/134 (12%)

Query: 179 GVRWTQETPSGMYQYFIKVVPTVY---TDVS-GHTIQSNQFSVTEHFRSS---------- 224
           GV    E   G + YF+KVVPT+Y   T +S G  ++SNQ+SVT HF +S          
Sbjct: 6   GVENPSEDLIGRFAYFVKVVPTLYQVRTLMSLGRVVESNQYSVTHHFTASWDAADQNNQT 65

Query: 225 -EQGRLQTLPGVFFFYDLSPIKVTFTEEHV--SFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
                 + +PGVF  YD+SPI+V+    H   S +H +  +CA+ GGV+TV G+ID+  +
Sbjct: 66  NRDANPRVVPGVFVSYDISPIRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVMGLIDSMFF 125

Query: 282 HGQRAIKKKIEIGK 295
           H  R +++KI  GK
Sbjct: 126 HSIRRVQEKINRGK 139


>gi|302675040|ref|XP_003027204.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
 gi|300100890|gb|EFI92301.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
          Length = 528

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 10/153 (6%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
           E  G  C ++G LEV KV  N H       + S  H  H ++       N++H I++ +F
Sbjct: 168 EPHGSACRVWGSLEVKKVTANLHITTAGHGYASREHADHKVM-------NLTHVISEFSF 220

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G HFP +V PLD      + P   YQY++ VVPT Y       + +NQ+SVT + +  E 
Sbjct: 221 GPHFPDIVQPLDYTFEVAKDPFVAYQYYLHVVPTTYIAPRSAPLSTNQYSVTHYKKVFEH 280

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
              Q  PG+FF +D+ P+ +   +   SF    
Sbjct: 281 N--QATPGIFFKFDIDPLAIQIHQRTTSFARLF 311


>gi|50303625|ref|XP_451754.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640886|emb|CAH02147.1| KLLA0B04950p [Kluyveromyces lactis]
          Length = 341

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 52/165 (31%), Positives = 86/165 (52%), Gaps = 11/165 (6%)

Query: 113 GCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           GC+I+G + VNKV G  H  A G  +  +        A  +D  N +H IN+L+FG+ +P
Sbjct: 153 GCHIFGSVPVNKVKGELHITAHGWGYRSAS-------AIPKDQINFNHVINELSFGDFYP 205

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQT 231
            + NPLD      +     Y YF  +VPT+Y  + G  + +NQ++++E     E  +   
Sbjct: 206 YIDNPLDNTAKFSDEKIKAYYYFTSIVPTLYKKM-GAEVDTNQYALSET-EYGESSKATG 263

Query: 232 LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG-VFTVSGI 275
           +PG+F  Y   P+K+  ++  + F  F+  + AI+   V+T S I
Sbjct: 264 VPGIFIRYQFEPMKIIISDMRIGFFQFIIRLVAILSFIVYTASWI 308


>gi|50293697|ref|XP_449260.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528573|emb|CAG62234.1| unnamed protein product [Candida glabrata]
          Length = 352

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 91/174 (52%), Gaps = 25/174 (14%)

Query: 113 GCNIYGFLEVNKVAGNFHFA------PGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           GC+I+G + VN+V G           PGK                ++  + +H IN+L+F
Sbjct: 162 GCHIFGSVPVNRVKGELQITASGYGYPGKRA-------------PKEEIDFAHAINELSF 208

Query: 167 GEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FR 222
           G+ +P + NPLD   R+ +E P   Y Y+I  VPT+Y  + G  I++ Q+SV ++     
Sbjct: 209 GDFYPYIDNPLDKTARFDKEHPLSAYMYYISAVPTMYKKL-GVEIETFQYSVNDYKYSMT 267

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGI 275
            ++   ++ +PG+FF Y   P+ +  T+  +SFL F+  + AI+   +F VS I
Sbjct: 268 DADPATVRKIPGIFFRYGFEPLSIEITDVRISFLQFIVRLVAILSFFMFVVSWI 321


>gi|384244593|gb|EIE18093.1| protein disulfide isomerase [Coccomyxa subellipsoidea C-169]
          Length = 479

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 62/213 (29%), Positives = 98/213 (46%), Gaps = 56/213 (26%)

Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE- 168
           GC + GF+ V KV G  HF   +PG SF                + N+SH +N L FG  
Sbjct: 291 GCALSGFVLVKKVPGALHFLAKSPGHSF-------------DYQAMNMSHVVNYLYFGNK 337

Query: 169 ------------HFPGV----VNPLDGVRWTQETPSGMYQYFIKVV----------PTVY 202
                       H  G+     + L G  +        ++++++VV          P + 
Sbjct: 338 PSPRRHQSLAKLHPAGLSDDWADKLAGQDFFSRAAKATFEHYMQVVLTTIEPSKHRPELS 397

Query: 203 TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
            D   +T+ S+ +   +            +P   F YDLSPI++  +E+  ++ HF+T  
Sbjct: 398 YDAYEYTVHSHTYDTAD------------IPAAKFTYDLSPIQILVSEKRRAWYHFVTTT 445

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           CAI+GGVFTV+GI+D  ++ G R   KK+E+GK
Sbjct: 446 CAIIGGVFTVAGIVDGLVHTGAR-FAKKVELGK 477


>gi|255082155|ref|XP_002508296.1| predicted protein [Micromonas sp. RCC299]
 gi|226523572|gb|ACO69554.1| predicted protein [Micromonas sp. RCC299]
          Length = 507

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 70/257 (27%), Positives = 118/257 (45%), Gaps = 37/257 (14%)

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           + Y  + + E      EE+  A++    A  + D     ++    Q +K+ +G GC++ G
Sbjct: 268 TSYHGDRTVEAITTFAEELLPAWK----ATDHKDTELAIRQPVETQTVKKIDGPGCSVTG 323

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-PGVVNPL 177
           F+ V KV G+         H          +F  +S N+SH ++   FG+   P     L
Sbjct: 324 FVLVKKVPGHLWVTATSKSH----------SFHAESMNMSHVVHHFYFGQQLTPQRKRYL 373

Query: 178 DGVRWTQETPSG------------------MYQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
           D     ++ P G                   ++++++ V T     SG     N +  T+
Sbjct: 374 DRFHSREKDPKGDWHDKLAGGTFTSEEDNVTHEHYLQTVLTTIKP-SGSPAPFNVYEYTQ 432

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
           H  S    +   LP   F +D SP++++ +EE   F HF+T + AIVGGV++V GI D F
Sbjct: 433 HSHSLRSEK--ELPRAKFHFDPSPVQISVSEERQKFYHFITTLMAIVGGVYSVMGIADGF 490

Query: 280 IYHGQRAIKKKIEIGKF 296
           +++  +A KKK E+GKF
Sbjct: 491 VHNSIQAWKKK-ELGKF 506


>gi|449468488|ref|XP_004151953.1| PREDICTED: protein disulfide-isomerase 5-4-like [Cucumis sativus]
          Length = 481

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 113/223 (50%), Gaps = 37/223 (16%)

Query: 93  LIDQCKRE-GFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 151
           L D+   E G ++R     G GC I G++ V KV G+   A     H          +F 
Sbjct: 274 LEDKSNNETGNVKRPAPSAG-GCRIEGYVRVKKVPGSLVIAARSESH----------SFD 322

Query: 152 RDSFNISHKINKLAFGEH--------------FPGVV-NPLDGVRWTQETPSG---MYQY 193
               N+SH I+ L+FG                + G+  + L+G  +  +   G     ++
Sbjct: 323 ASQMNMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEH 382

Query: 194 FIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 252
           ++++V T V T  SG  ++  ++  T H   S+      +P V F + LSP++V  TE  
Sbjct: 383 YLQIVKTEVLTRRSGKLLE--EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQ 437

Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            SF HF+TNVCAI+GGVFTV+GI+DA +++  R + KK+E+GK
Sbjct: 438 KSFSHFITNVCAIIGGVFTVAGILDALLHNTIR-LMKKVELGK 479


>gi|356517290|ref|XP_003527321.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 100/201 (49%), Gaps = 33/201 (16%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC + G++ V KV GN   +     H          +F     N+SH IN L+FG+   P
Sbjct: 293 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHFINNLSFGKKVTP 342

Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
             +              + L+G  +T     G     +++I++V T     +G+ +   +
Sbjct: 343 RAMSDVKLLIPYIGSSHDRLNGRSFTNTHDLGANVTIEHYIQIVKTEVVTRNGYKLI-EE 401

Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           +  T H   S       +P   F  +LSP++V  TE   SF HF+TNVCAI+GGVFTV+G
Sbjct: 402 YEYTAH---SSVAHSVDIPAAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 458

Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
           I+D+ +++  R + KK+E+GK
Sbjct: 459 ILDSILHNTIRMM-KKVELGK 478


>gi|356549839|ref|XP_003543298.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 101/204 (49%), Gaps = 39/204 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC I G++ V KV GN  F+   + H          +F     N+SH IN L+FG    P
Sbjct: 293 GCRIDGYVRVKKVPGNLIFSARSNAH----------SFDASQMNMSHVINHLSFGRKVSP 342

Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
            V+              + L+G  +      G     ++++++V T         I    
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTMEHYLQIVKT-------EVITRKD 395

Query: 215 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           + + E +  +    + Q+L  P   F  +LSP++V  TE   SF HF+TNVCAIVGG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIVGGIFT 455

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+DA +++  R + KK+E+GK
Sbjct: 456 VAGIMDAILHNTIR-LMKKVELGK 478


>gi|296086862|emb|CBI33029.3| unnamed protein product [Vitis vinifera]
          Length = 139

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 40/66 (60%), Positives = 52/66 (78%)

Query: 20  LDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVRE 79
           +D+ GN +  +QD IG P+I+K LQRHGGRLE N  YCGSCYGAE +D+DC N+C+E RE
Sbjct: 73  IDAHGNEVAVKQDEIGGPQIEKLLQRHGGRLERNGKYCGSCYGAEVTDDDCGNSCDEDRE 132

Query: 80  AYRKKG 85
            Y+K+G
Sbjct: 133 TYKKRG 138


>gi|440798302|gb|ELR19370.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 328

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 38/201 (18%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFA------------------------PGKSFHQSGVH 143
           + E  GC+I G++ V KV GNFH +                          + F+ SGV 
Sbjct: 116 DSELSGCSIAGYINVPKVPGNFHLSTHGRNVQAQDIDMQHNINSFFFTDSPRVFYPSGVS 175

Query: 144 VHDILAFQRDSFNISHKINKLA----FGEHFPGVVNPLDGV-RWTQETPSGM---YQYFI 195
           V    A++    N+  ++N  A      +   G+  PLDG+ +   +  +G+   Y+Y+I
Sbjct: 176 VP---AWRNWHSNVVAELNAQARDQDTDDDVVGLFRPLDGITKANSQRKNGVGVSYEYYI 232

Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 255
           ++VPT+     G T  + QF+   +  ++ +G+    P V+F YD+SPI V  T    S 
Sbjct: 233 QIVPTILEFPDGRTKHTYQFTYNFNDVATPEGKT---PSVYFKYDISPITVKITRGRGSL 289

Query: 256 LHFLTNVCAIVGGVFTVSGII 276
            HFL  +CAIVGG+FTVSG+I
Sbjct: 290 GHFLLQLCAIVGGIFTVSGLI 310


>gi|119928709|ref|XP_001256294.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Bos taurus]
          Length = 144

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 67/106 (63%), Gaps = 5/106 (4%)

Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVT--EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 250
           Y +K+VPTVY D SG    S Q++V   E+   S  GR+  +P ++F YDLSPI V +TE
Sbjct: 41  YILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTE 98

Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
                  F+T +CAI+GG FTV+GI+D+ I+    A  KKI++GK 
Sbjct: 99  RRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA-WKKIQLGKM 143


>gi|449489976|ref|XP_004158474.1| PREDICTED: protein disulfide-isomerase 5-3-like [Cucumis sativus]
          Length = 224

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 66/202 (32%), Positives = 104/202 (51%), Gaps = 35/202 (17%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC I G++ V KV G+   A     H          +F     N+SH I+ L+FG     
Sbjct: 37  GCRIEGYVRVKKVPGSLVIAARSESH----------SFDASQMNMSHIISHLSFGRKISP 86

Query: 170 -----------FPGVV-NPLDGVRWTQETPSG---MYQYFIKVVPT-VYTDVSGHTIQSN 213
                      + G+  + L+G  +  +   G     ++++++V T V T  SG  ++  
Sbjct: 87  KAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQIVKTEVLTRRSGKLLE-- 144

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           ++  T H   S+      +P V F + LSP++V  TE   SF HF+TNVCAI+GGVFTV+
Sbjct: 145 EYEYTAHSSVSQS---LYIPVVKFHFVLSPMQVVITENQKSFSHFITNVCAIIGGVFTVA 201

Query: 274 GIIDAFIYHGQRAIKKKIEIGK 295
           GI+DA +++  R + KK+E+GK
Sbjct: 202 GILDALLHNTIR-LMKKVELGK 222


>gi|195042004|ref|XP_001991346.1| GH12601 [Drosophila grimshawi]
 gi|193901104|gb|EDV99970.1| GH12601 [Drosophila grimshawi]
          Length = 434

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 85/157 (54%), Gaps = 2/157 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++ F+R   N +H+IN+L+FG
Sbjct: 190 EAKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFDDHWMIEFRRMPANFTHRINRLSFG 249

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G   T    +   QYFIKVVPT     +  T+ + Q++VTE+ R  +  
Sbjct: 250 QYSRRIVQPLEGDETTITEEATTVQYFIKVVPTEIQQ-TFSTVSTFQYAVTENVRKLDSE 308

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
           R     PG++F YD S +KV  + +   FL F+  +C
Sbjct: 309 RNSYGSPGIYFKYDWSALKVVISHDRDYFLTFVIRLC 345


>gi|354545468|emb|CCE42196.1| hypothetical protein CPAR2_807450 [Candida parapsilosis]
          Length = 351

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 92/197 (46%), Gaps = 18/197 (9%)

Query: 90  NPDLIDQCKREGF------LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
            PDL D+  +E        L R   E    C+I+G + VN+V G+F           G  
Sbjct: 126 TPDL-DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGDFRIT------AKGFG 178

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
             D      ++ N SH I + ++G+ +P + NPLD      E     Y Y  KVVPT+Y 
Sbjct: 179 YRDRSFVPLEALNFSHVIQEFSYGDFYPFLNNPLDATGKVTEENLQTYLYHAKVVPTLYE 238

Query: 204 DVSGHTIQSNQFSVTEHFR----SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
            + G  + + Q+S+TE+           R Q + G++F Y+  PIK+   E+ + FL F+
Sbjct: 239 KL-GLEVDTTQYSLTENHHVVKVDPHSKRPQEISGIYFAYEFEPIKLIIREKRIPFLQFI 297

Query: 260 TNVCAIVGGVFTVSGII 276
             +  I GGV   +G +
Sbjct: 298 AKLGTIAGGVVVAAGYL 314


>gi|168012320|ref|XP_001758850.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689987|gb|EDQ76356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 487

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 100/202 (49%), Gaps = 35/202 (17%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----- 167
           GC + GF+ V KV G    +       SG H     +F   S N++H +   +FG     
Sbjct: 300 GCRVEGFVRVKKVPGELMISA-----HSGSH-----SFDATSMNMTHYVGFFSFGRKTSW 349

Query: 168 -------EHFPGV---VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQS----N 213
                  E  P +   ++ L G  +  E  +  + ++++VV T    ++ H  Q      
Sbjct: 350 RSVHWVNEMLPALDSNIDRLTGQVFPSEYENITHDHYLQVVKTEV--ITLHRKQDLRVLE 407

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q+  T H   S   +   +P V F Y+LSP++V   E   SF HFLTN+CAI+GGVFTV+
Sbjct: 408 QYDYTAH---SNMIQSTKVPVVKFHYELSPMQVLVKENPKSFSHFLTNLCAIIGGVFTVA 464

Query: 274 GIIDAFIYHGQRAIKKKIEIGK 295
           GIID+ + H    I KK+E+GK
Sbjct: 465 GIIDSML-HNAMHIMKKVELGK 485


>gi|440293957|gb|ELP87004.1| hypothetical protein EIN_318630 [Entamoeba invadens IP1]
          Length = 316

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/188 (33%), Positives = 92/188 (48%), Gaps = 22/188 (11%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQ--------------SGVHVHDILAFQRDSFNIS 158
           GC ++G ++V++V+G FH A GK  ++              + +H H     +  SFN +
Sbjct: 117 GCRMHGTMKVSRVSGEFHVAFGKIAYRQQRTNQVITATQKHTQMHTHQFTMQEMKSFNPT 176

Query: 159 HKINKLAFGEHFPGVVN-----PLDGVRWTQE-TPSGMYQYFIKVVPTVYTDVSGHTIQS 212
           H IN LAF    P         PL+G  +T +   +  Y Y+I V+PT+      HT +S
Sbjct: 177 HFINNLAFSNT-PSYTTHAGETPLNGKEYTLKGYDNARYTYYINVIPTL-NKYPTHTTRS 234

Query: 213 NQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTV 272
            Q S+ E F     G   T PGVFF Y+LSP  V       SF H + +  AI+GGV+ +
Sbjct: 235 YQLSINERFVPVTYGPTFTQPGVFFKYELSPYIVINEMMDHSFAHSIASTAAIIGGVWII 294

Query: 273 SGIIDAFI 280
            G I  F+
Sbjct: 295 FGWISRFL 302


>gi|226479782|emb|CAX73187.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Schistosoma japonicum]
          Length = 410

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 51/161 (31%), Positives = 80/161 (49%), Gaps = 3/161 (1%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDSFNISHKINKLAFGEHF 170
           + C I G L V KV GN H   GK     G +H+H      + + N SH+IN  +FG+  
Sbjct: 182 DACRIVGTLFVKKVEGNIHILLGKPLEGLGNLHLHVAPFLSKTNLNFSHRINHFSFGDLV 241

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR-L 229
            G ++PL+ +       S  +QYF+ +VPT   +   H  ++ Q++ T   R+ +     
Sbjct: 242 NGQIHPLEAIESITAVASTSFQYFVTMVPTKVVN-QFHVTETYQYAATVQNRTIDHASDS 300

Query: 230 QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
             +PG+FF YD  P+ V  T +      F T + A+ GG+F
Sbjct: 301 HGIPGIFFIYDTFPLVVKITYDRELLGTFFTRLAALAGGIF 341


>gi|344250048|gb|EGW06152.1| UPF0474 protein C5orf41-like [Cricetulus griseus]
          Length = 745

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 64/186 (34%), Positives = 87/186 (46%), Gaps = 25/186 (13%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+KL
Sbjct: 125 KIPLNNGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHIIHKL 172

Query: 165 AFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT- 218
           +FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V  
Sbjct: 173 SFGDTLQVQNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVAN 232

Query: 219 -EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
            E+   S  GR+  +P ++F YDLSPI V +TE       F+T   A    VF  +G+  
Sbjct: 233 KEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTREAAEWFVFWGTGM-- 288

Query: 278 AFIYHG 283
              YHG
Sbjct: 289 --AYHG 292


>gi|67482091|ref|XP_656395.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56473591|gb|EAL51010.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449705171|gb|EMD45274.1| Hypothetical protein EHI5A_018710 [Entamoeba histolytica KU27]
          Length = 315

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 97/200 (48%), Gaps = 20/200 (10%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
           GC +YG ++V++V+G FH A GK SF Q  +             H+H     +  SFN +
Sbjct: 116 GCRMYGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175

Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           H IN L+F       V     PL+G ++T     +    Y+I V+PT++   S +T+++ 
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKKFTLSGFDNARKTYYINVIPTLFKYPS-YTLRTY 234

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q SV E       G   T PGVFF Y+LSP  V       SF H L +V AI+GGV  + 
Sbjct: 235 QLSVNERDVPVTYGASFTQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294

Query: 274 GIIDAFIYHGQRAIKKKIEI 293
           G++          +   +E+
Sbjct: 295 GLLSRLFDSKHELVTSVVEM 314


>gi|356543934|ref|XP_003540413.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 98/204 (48%), Gaps = 39/204 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC I G++ V KV GN   +   + H          +F     N+SH IN L+FG     
Sbjct: 293 GCRIDGYVRVKKVPGNLIISARSNAH----------SFDASQMNMSHVINHLSFGRKVSL 342

Query: 173 VV---------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
            V               + L+G  +      G     ++++++V T         I   +
Sbjct: 343 RVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTIEHYLQIVKT-------EVITRKE 395

Query: 215 FSVTEHFRSSEQGRL-QTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           + + E +  +    + Q+L  P   F  +LSP++V  TE   SF HF+TNVCAI+GG+FT
Sbjct: 396 YKLVEEYEYTAHSSVAQSLHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIIGGIFT 455

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+DA I+H    + KK+E+GK
Sbjct: 456 VAGIMDA-IFHNTIRLMKKVELGK 478


>gi|157865526|ref|XP_001681470.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124767|emb|CAJ02321.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 365

 Score = 89.0 bits (219), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 69/226 (30%), Positives = 103/226 (45%), Gaps = 26/226 (11%)

Query: 65  SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
           ++   CC+ CE V   Y++ G  +   + I QC  E   QR       GC + G L++ K
Sbjct: 148 AAASKCCDTCESVLGLYKELGRGVPGTEYIPQC-LEQLYQR-----ASGCAVMGSLDLKK 201

Query: 125 VAGNFHFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNP 176
           V     F P ++  F+     + D++       + SH I KL  G    E F   GV   
Sbjct: 202 VPVTVIFGPRRTGQFYS----LKDVI-----RLDTSHFIRKLRIGDETVERFSKNGVAER 252

Query: 177 LDGVRWTQETPSGMYQYFIKVVPTVY--TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
           L G + + +T S   +Y +KVVPT Y  T        + ++S     R+   G    +P 
Sbjct: 253 LSGHKSSSKTYSET-RYLVKVVPTTYRKTKTKNAKASTYEYSAQWSRRTILVGFAGAVPA 311

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           V F ++ +PI+V    E   F HFL  +C IVGG+F V G ID  +
Sbjct: 312 VLFEFEPAPIQVNNVFERQPFSHFLVQLCGIVGGLFVVLGFIDNVV 357


>gi|302800507|ref|XP_002982011.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
 gi|300150453|gb|EFJ17104.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
          Length = 476

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 82/284 (28%), Positives = 129/284 (45%), Gaps = 49/284 (17%)

Query: 35  GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
           G P I    + H  + EH      S YG   +D     +  +  EA   K   L+  D  
Sbjct: 217 GFPSIRIFHKGHDLKDEHGHHEHDSYYGERDTD-----SLVKAMEALVPKETTLALED-- 269

Query: 95  DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVA-GNFHFAPGKSFHQSGVHVHDILAFQRD 153
              K  G ++R     G GC I GF+   KV  GN   +       SG H     +F   
Sbjct: 270 ---KTNGTVKRPAPRAG-GCRIEGFIRAKKVVPGNIIISA-----HSGSH-----SFDAS 315

Query: 154 SFNISHKINKLAFGEH------------FPGVVNPLDGVR-------WTQETPSGMYQYF 194
           + N++H +++  FG              +P + +  D V        +  +  +  + ++
Sbjct: 316 AMNMTHYVSQFTFGRELNFWMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHY 375

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFR-SSEQGRLQT--LPGVFFFYDLSPIKVTFTEE 251
           ++VV T    +     +  +FS+ E +  +S    +Q   +P   F Y+LSP++V   E 
Sbjct: 376 LQVVKTEVVSLR----KRKEFSLLEQYDYTSHSNTIQNTNVPVAKFHYELSPMQVLVKEN 431

Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
             SF HF+TNVCAI+GGVFTV+GI+D+ + HG   + KKIE+GK
Sbjct: 432 PKSFSHFITNVCAIIGGVFTVAGIVDSML-HGAMRMVKKIELGK 474


>gi|224000371|ref|XP_002289858.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975066|gb|EED93395.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 338

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 103/220 (46%), Gaps = 22/220 (10%)

Query: 91  PDLIDQCKR--EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDIL 148
           P +ID  K    GF + +     +GC + G ++V +V G    +      +       IL
Sbjct: 123 PTVIDYKKAAVSGF-KDVNTARRQGCTLVGTIKVPRVGGTMSISVSPEAWRRAT---SIL 178

Query: 149 AF------QRDSF-----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSG--MYQYFI 195
           +F       +D F     N++H ++ + FG+ FP   NPL GV    +  SG  +    +
Sbjct: 179 SFGVDLGKDQDMFHGKLPNVTHYVHDITFGDPFPPGSNPLKGVHHVMDNGSGVALANVAV 238

Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ---GRLQTLPGVFFFYDLSPIKVTFTEEH 252
           K+VPT Y        ++ Q SV+ H    E     R   LPG+   YD +P+ V   E  
Sbjct: 239 KLVPTTYKRTIYSAKETYQASVSRHIVQPETLAAQRSTLLPGLMLTYDFTPLAVRHVESR 298

Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
            ++L FL+++  IVGGVF   G++   + +  +A+ KK++
Sbjct: 299 ENWLVFLSSLVGIVGGVFVTVGLVSGCLVNSAQAVAKKMD 338


>gi|302659461|ref|XP_003021421.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
 gi|291185318|gb|EFE40803.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
          Length = 427

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 60/214 (28%), Positives = 93/214 (43%), Gaps = 46/214 (21%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS---------------FHQSGVH 143
           K +  + C ++G LE NKV GN H        F  G++                H    +
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSMSLLQPIITCIHGDAKN 245

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
           + D L       N +H I +L+FG H+  ++NPLD    +       YQY + VVPT+YT
Sbjct: 246 LTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYT 305

Query: 204 DVSGH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
             SGH                     T+ +NQ++VT  +    Q R+   PG+FF Y++ 
Sbjct: 306 K-SGHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIE 363

Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           PI +  ++E  S L  +  +  +V GV    G +
Sbjct: 364 PILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397


>gi|302508773|ref|XP_003016347.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
 gi|291179916|gb|EFE35702.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
          Length = 427

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 60/214 (28%), Positives = 93/214 (43%), Gaps = 46/214 (21%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFH--------FAPGKS---------------FHQSGVH 143
           K +  + C ++G LE NKV GN H        F  G++                H    +
Sbjct: 186 KSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSMSLLQPIITCIHGDAKN 245

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
           + D L       N +H I +L+FG H+  ++NPLD    +       YQY + VVPT+YT
Sbjct: 246 LTDQLTKLFPGLNFTHLITELSFGPHYGRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYT 305

Query: 204 DVSGH---------------------TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLS 242
             SGH                     T+ +NQ++VT  +    Q R+   PG+FF Y++ 
Sbjct: 306 K-SGHIDPNRRSLPDTSTITAKDSKTTVSTNQYAVTS-YSQPIQPRIDATPGIFFKYNIE 363

Query: 243 PIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           PI +  ++E  S L  +  +  +V GV    G +
Sbjct: 364 PILLIVSQERDSLLALMVRLVNVVSGVLVTGGWL 397


>gi|356545151|ref|XP_003541008.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 453

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 33/201 (16%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC + G++ V KV GN   +     H          +F     N+SH IN L+FG+   P
Sbjct: 266 GCRVEGYVRVKKVPGNLIISARSDAH----------SFDASQMNMSHVINNLSFGKKVTP 315

Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQ 214
             +              + L+G  +      G     +++I++V T      G+ +   +
Sbjct: 316 RAMSDVKLLIPYIGSSHDRLNGRSFINTRDLGANVTIEHYIQIVKTEVVTRKGYKLI-EE 374

Query: 215 FSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           +  T H   S       +P   F  +LSP++V  TE   SF HF+TNVCAI+GGVFTV+G
Sbjct: 375 YEYTAH---SSVAHSLDIPVAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAG 431

Query: 275 IIDAFIYHGQRAIKKKIEIGK 295
           I+D+ +++  R + KKIE+GK
Sbjct: 432 ILDSILHNTIRMV-KKIELGK 451


>gi|324499844|gb|ADY39943.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Ascaris suum]
          Length = 429

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/175 (32%), Positives = 89/175 (50%), Gaps = 8/175 (4%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           +++EG  C ++G + VNKV G+      GK     G+  H  +    ++ NISH+I +L 
Sbjct: 217 QKDEGTACRVHGRVRVNKVKGDSVIITAGKGAGIDGLFAH--VDGASNAGNISHRIARLH 274

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTE-HFR 222
           FG    G++ PL G     E+    Y+YF+KVVPT   ++   G +    Q+SVT+ H R
Sbjct: 275 FGPWIGGLLTPLAGTEQISESGIDEYRYFLKVVPTRIFHSGFFGGSTMRYQYSVTKTHKR 334

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
            S  GR    P +   Y+ + + V   E   S       +C++VGGVF  S I++
Sbjct: 335 PS--GREHMHPAIAIHYEFAALVVEVRETQTSLFQLFVRLCSVVGGVFATSSILN 387


>gi|224126339|ref|XP_002319814.1| predicted protein [Populus trichocarpa]
 gi|222858190|gb|EEE95737.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 64/200 (32%), Positives = 97/200 (48%), Gaps = 28/200 (14%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP 171
           GC I G++ V KV GN   +      +SG H     +F     N+SH I+  +FG +  P
Sbjct: 294 GCRIEGYVRVKKVPGNLVISA-----RSGAH-----SFDSAQMNLSHVISHFSFGMKVLP 343

Query: 172 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
            V+              + L+G  +      G        +  V T+V      +    +
Sbjct: 344 RVMSDVKRLIPHIGRSHDKLNGRSFINHRDVGANVTIEHYLQVVKTEVVTRRSSAEHKLI 403

Query: 218 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
            E+  ++     QT  +P   F ++LSP++V  TE   SF HF+TNVCAI+GGVFTV+GI
Sbjct: 404 EEYEYTAHSSLAQTVYMPTAKFHFELSPMQVLITENPKSFSHFITNVCAIIGGVFTVAGI 463

Query: 276 IDAFIYHGQRAIKKKIEIGK 295
           +D+ I H    + KK+E+GK
Sbjct: 464 LDS-ILHNTFRMMKKVELGK 482


>gi|403371798|gb|EJY85783.1| hypothetical protein OXYTRI_16231 [Oxytricha trifallax]
          Length = 333

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 40/209 (19%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK------SFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           EGC+IYG + +N+V GNFH +            Q G H           F+ S+KI+ ++
Sbjct: 142 EGCHIYGNILINRVPGNFHISTHAFNDILMGLMQEGHH-----------FDFSYKIDHIS 190

Query: 166 FGE--HFPGV---------VNPLDG-----VRWTQETPSGMY-QYFIKVVPTVYTDVSGH 208
           FG+  +F  +         ++PLDG      R  +  P  +   +++  VP+ + DVSG 
Sbjct: 191 FGKRNNFDMIRRKFRDHQLISPLDGKSETAPRDNKNFPKSLEGNFYLIAVPSYFKDVSGG 250

Query: 209 TIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
             Q  Q +  +H        +       F Y+LSPI V F+++  S   FL ++CAI+GG
Sbjct: 251 VYQVYQLTANDHTNFGTGNNILK-----FNYELSPITVGFSQDRESIALFLVHICAIIGG 305

Query: 269 VFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           VFT   IIDA I+     + KK  IGK S
Sbjct: 306 VFTAVSIIDAIIHKSFSLLFKK-RIGKLS 333


>gi|297830752|ref|XP_002883258.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329098|gb|EFH59517.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 106/204 (51%), Gaps = 36/204 (17%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC + G++ V KV GN   +       SG H     +F     N+SH ++ L+FG    P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHLSFGRMISP 342

Query: 172 GVV--------------NPLDGVRWTQETPSG---MYQYFIKVVPT-VYTDVSG--HTIQ 211
            ++              + LDG  +  +   G     ++++++V T V T  SG  H++ 
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQIVKTEVITRRSGQEHSLI 402

Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
             ++  T H   S   +   LP   F ++LSP+++  TE   SF HF+TN+CAI+GGVFT
Sbjct: 403 -EEYEYTAH---SSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFT 458

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+D+ I+H    + KK+E+GK
Sbjct: 459 VAGILDS-IFHNTVRLIKKVELGK 481


>gi|365759132|gb|EHN00939.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|401842937|gb|EJT44934.1| ERV41-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 285

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 47/157 (29%), Positives = 87/157 (55%), Gaps = 11/157 (7%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC+I+G + VN+V+G       K F  +  H   +     +  N +H IN+ +FG+ +P 
Sbjct: 93  GCHIFGSVPVNRVSGVLQIT-AKGFGYADSHRASL-----EDLNFAHVINEFSFGDFYPY 146

Query: 173 VVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ- 230
           + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++   ++   ++ 
Sbjct: 147 IDNPLDNTAQFDQDEPLTTYLYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLNKDSSVKG 205

Query: 231 --TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
              +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 206 NRRVPGIFFKYNFEPLSIVVSDVRISFIQFLVRLVAI 242


>gi|431918151|gb|ELK17379.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Pteropus alecto]
          Length = 313

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 79/166 (47%), Gaps = 21/166 (12%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            +I    G GC   G   +NKV GNFH           V  H   A Q  + +++H I+K
Sbjct: 135 MKIPLNGGAGCRFEGQFSINKVPGNFH-----------VSTHSATA-QPQNPDMTHVIHK 182

Query: 164 LAFGEHFP-----GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVT 218
           L+FG+        G  N L G       P   + Y +K+VPTVY D SG    S Q++V 
Sbjct: 183 LSFGDTLQVRNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVA 242

Query: 219 --EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
             E+   S  GR+  +P ++F YDLSPI V +TE       F+T V
Sbjct: 243 NKEYVAYSHTGRI--IPAIWFRYDLSPITVKYTERRQPLYRFITTV 286


>gi|444732203|gb|ELW72509.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Tupaia chinensis]
          Length = 250

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/171 (37%), Positives = 87/171 (50%), Gaps = 8/171 (4%)

Query: 92  DLIDQCKR-EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAF 150
           DL  Q K  +  LQ I+    E  ++   +  +   G     P    H  G H H     
Sbjct: 63  DLSPQQKEWQRMLQVIQSRLQEEHSLQDVIFKSAFKGTTALPPRAIPHPRG-HAHLAALV 121

Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT-VYT-DVSGH 208
             DS+N SH+I+ L+FGE  PG++NPLDG        + M+QYFI VVPT ++T  +S  
Sbjct: 122 NHDSYNFSHRIDHLSFGELVPGIINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISAD 181

Query: 209 TIQSNQFSVTEHFR-SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           T   +QFSVTE  R  +       + G+F  YDLS + VT TEEH+ F  F
Sbjct: 182 T---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQF 229


>gi|367012766|ref|XP_003680883.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
 gi|359748543|emb|CCE91672.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
          Length = 348

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 52/167 (31%), Positives = 86/167 (51%), Gaps = 12/167 (7%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           +GC+IYG + VN+VAG             G    D         N SH IN+ ++G+ FP
Sbjct: 156 DGCHIYGSVPVNRVAGELQIT------AKGWGYQDFEKAPVSEINFSHVINEFSYGDFFP 209

Query: 172 GVVNPLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQG 227
            + NPLD           M Y Y   +VPTVY  + G  + +NQ++V+E     +S+++G
Sbjct: 210 YIDNPLDNTAKISIVDRLMGYLYDTSIVPTVYEKL-GAYVDTNQYAVSERQFDQKSTKRG 268

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
              T+PG+FF YD  P+ ++  +  +SF+ F+  + A++  V  ++ 
Sbjct: 269 S-TTVPGIFFRYDFEPLSISIKDRRLSFIQFIIRLVALLSFVVYIAS 314


>gi|325185550|emb|CCA20033.1| thioredoxinlike protein putative [Albugo laibachii Nc14]
          Length = 503

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 96/193 (49%), Gaps = 12/193 (6%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGK---SFHQSGVHVHDI---LAFQRDSFNISHKINKLA 165
           EGC + G L VN+V     F       SF   G++V  +   L+F + +   S K  +L+
Sbjct: 316 EGCEVSGSLNVNRVPSRLVFTARSKDLSFDLRGINVTHVVHHLSFGQVTRKQSTKSTQLS 375

Query: 166 FG-EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
              +HFP     LDG  +  E  +   ++F+ V+   + +     +   + +     RS+
Sbjct: 376 MSFDHFP-----LDGKTFRTENENITVEHFLSVIGVDHMEAKSKHMGLVERTYQIVARSN 430

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
           +      LP   F +D+SP+ +  + +   F  FLT++CAIVGG+ T+ G +DA  YH  
Sbjct: 431 QYNATDMLPAALFTFDISPLVIQMSSDSTPFYRFLTSLCAIVGGMVTIIGFVDAGAYHAM 490

Query: 285 RAIKKKIEIGKFS 297
            +IK+K ++GK +
Sbjct: 491 NSIKRKRQLGKLN 503


>gi|217072996|gb|ACJ84858.1| unknown [Medicago truncatula]
 gi|388501234|gb|AFK38683.1| unknown [Medicago truncatula]
          Length = 243

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC + G++ V KV G+   +     H          +F     N+SH IN L+FG+    
Sbjct: 56  GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 102

Query: 173 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 210
            V P   +D   W                  T++    +  +++I+VV T      G+ +
Sbjct: 103 -VTPRAMIDVKHWIPYLGINHDRLNGRSFVNTRDLEGNVTIEHYIQVVKTEVITRKGYKL 161

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
              ++  T H   S       +P   F  +LSP++V  TE   SF HF+TNVCAI+GGVF
Sbjct: 162 -IEEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 217

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
           TV+GI+D+ +++  +A+ KKIEIGK
Sbjct: 218 TVAGILDSILHNTIKAM-KKIEIGK 241


>gi|357474735|ref|XP_003607653.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355508708|gb|AES89850.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 477

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 65/205 (31%), Positives = 100/205 (48%), Gaps = 41/205 (20%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC + G++ V KV G+   +     H          +F     N+SH IN L+FG+    
Sbjct: 290 GCRVEGYVRVKKVPGSLVVSARSDAH----------SFDASQMNMSHVINHLSFGKK--- 336

Query: 173 VVNP---LDGVRW------------------TQETPSGM-YQYFIKVVPTVYTDVSGHTI 210
            V P   +D   W                  T++    +  +++I+VV T      G+ +
Sbjct: 337 -VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKL 395

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
              ++  T H   S       +P   F  +LSP++V  TE   SF HF+TNVCAI+GGVF
Sbjct: 396 I-EEYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVF 451

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
           TV+GI+D+ +++  +A+ KKIEIGK
Sbjct: 452 TVAGILDSILHNTIKAM-KKIEIGK 475


>gi|428175103|gb|EKX43995.1| hypothetical protein GUITHDRAFT_159761 [Guillardia theta CCMP2712]
          Length = 475

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 99/206 (48%), Gaps = 28/206 (13%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           +    G GC + G L V +       APG    Q+   V D   F  ++ ++SH +N L+
Sbjct: 282 VDSHNGVGCMVSGLLHVQR-------APGMLKVQA---VSDSHEFNWETMDVSHTVNHLS 331

Query: 166 FGE------------HFPGVVNPLDGVRWT--QETPSGMYQYFIKVVPTVYTDVSGHTI- 210
           FG             H    V  LD   +T  Q  P+  +++++KVV    T  S   + 
Sbjct: 332 FGPFLSETAWMVLPPHIAASVGSLDDRSFTSDQHVPT-THEHYVKVVRHEVTPPSSWKVA 390

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           Q   +    H  S+   +   +P V   YD+ PI V F E+  +F HF+TN+CAIVGGVF
Sbjct: 391 QITSYGYVVH--SNNIQKAGEVPTVRINYDILPIIVQFHEKKQAFYHFVTNLCAIVGGVF 448

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGKF 296
           TV+GII + +      ++KK E+GK 
Sbjct: 449 TVAGIIASLMDKSINLMRKKQELGKL 474


>gi|18402672|ref|NP_566664.1| protein PDI-like 5-3 [Arabidopsis thaliana]
 gi|75273652|sp|Q9LJU2.1|PDI53_ARATH RecName: Full=Protein disulfide-isomerase 5-3; Short=AtPDIL5-3;
           AltName: Full=Protein disulfide-isomerase 12;
           Short=PDI12; AltName: Full=Protein disulfide-isomerase
           8-1; Short=AtPDIL8-1; Flags: Precursor
 gi|11994143|dbj|BAB01164.1| unnamed protein product [Arabidopsis thaliana]
 gi|15215847|gb|AAK91468.1| AT3g20560/K10D20_9 [Arabidopsis thaliana]
 gi|332642877|gb|AEE76398.1| protein PDI-like 5-3 [Arabidopsis thaliana]
          Length = 483

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 97/200 (48%), Gaps = 28/200 (14%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-P 171
           GC + G++ V KV GN   +       SG H     +F     N+SH ++  +FG    P
Sbjct: 293 GCRVEGYVRVKKVPGNLVISA-----HSGAH-----SFDSSQMNMSHVVSHFSFGRMISP 342

Query: 172 GVV--------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSV 217
            ++              + LDG  +  +   G        + TV T+V           +
Sbjct: 343 RLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQTVKTEVITRRSGQEHSLI 402

Query: 218 TEHFRSSEQGRLQT--LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
            E+  ++     QT  LP   F ++LSP+++  TE   SF HF+TN+CAI+GGVFTV+GI
Sbjct: 403 EEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGVFTVAGI 462

Query: 276 IDAFIYHGQRAIKKKIEIGK 295
           +D+ I+H    + KK+E+GK
Sbjct: 463 LDS-IFHNTVRLVKKVELGK 481


>gi|123408947|ref|XP_001303296.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884664|gb|EAX90366.1| hypothetical protein TVAG_036780 [Trichomonas vaginalis G3]
          Length = 364

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/253 (29%), Positives = 115/253 (45%), Gaps = 34/253 (13%)

Query: 46  HGGRLEHNE-TYCGSCYGAESSDED--CCNNCEEVREAYRKKGWALSNPDLIDQCKREGF 102
           H  R   ++ T CG C   +   +   CCN C++V E            D I QC  +  
Sbjct: 130 HSARFNTSKVTECGFCNATKGLKDKYKCCNTCQQVLEV----AQVFRVVD-IPQCSDK-- 182

Query: 103 LQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS-FHQSGVHVHDILAFQRD--SFNISH 159
           ++ +K+ + EGC I G  E  K+   FH +PG S   + GVH HD+ +F  D    N+S+
Sbjct: 183 VKELKKMQNEGCRIKGNFETIKIKAEFHISPGYSVIDEDGVHAHDVSSFIDDVSELNLSY 242

Query: 160 KINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVT 218
           K+N   FG+      + LDG    Q+     Y         VYT DVS    ++N +S T
Sbjct: 243 KLNHCRFGDQNH---SQLDGFSTIQKQIGYFY--------AVYTIDVS----ENNDYS-T 286

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
            +    + G L  +PG+ F YD   I      +    +H  +N+ ++ GGV  +  I+D 
Sbjct: 287 AYMEQVDNGTL--VPGIVFKYDFGIITAKSFPDRPPLIHLFSNLVSMAGGVAMIFYILDY 344

Query: 279 FIYHG--QRAIKK 289
            ++    QR I K
Sbjct: 345 ALFSSIKQRKIHK 357


>gi|195402035|ref|XP_002059616.1| GJ14724 [Drosophila virilis]
 gi|194147323|gb|EDW63038.1| GJ14724 [Drosophila virilis]
          Length = 434

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 51/156 (32%), Positives = 81/156 (51%), Gaps = 3/156 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++ F+R   N +H+IN+L+FG
Sbjct: 196 ESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHRINRLSFG 255

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        S   QYF+KVVPT     +  TI + Q++VTE+  S    
Sbjct: 256 QYSRRIVQPLEGDETIIHEESTTVQYFLKVVPTEIQH-TFSTISTFQYAVTENVHSERNS 314

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
                PG++F YD S +K+  + +    L F+  +C
Sbjct: 315 YGS--PGIYFKYDWSALKIVVSHDRDYLLTFVIRLC 348


>gi|194768867|ref|XP_001966532.1| GF22223 [Drosophila ananassae]
 gi|190617296|gb|EDV32820.1| GF22223 [Drosophila ananassae]
          Length = 448

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 97/183 (53%), Gaps = 2/183 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 200 ETKYDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 259

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G     +  +   QYF+KVVPT     +  TI + Q+SVTE+ R  +  
Sbjct: 260 QYSRRIVQPLEGDETIIQEEATTVQYFLKVVPTEIRQ-TFSTINTFQYSVTENVRKLDSE 318

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           R     PG++F YD S +K+    +      F+  +C+I+ G+  +SG I++ +   QR 
Sbjct: 319 RNSYGSPGIYFKYDWSALKIVVDNDRDHLATFVIRLCSIISGIIVISGAINSLLIAIQRR 378

Query: 287 IKK 289
           + +
Sbjct: 379 LLR 381


>gi|159483443|ref|XP_001699770.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
 gi|158281712|gb|EDP07466.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 97/191 (50%), Gaps = 12/191 (6%)

Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVHV-HDILAFQ---RDSFNISHKINKLA 165
           GCN+ GF+ V KV G  HF   + G SF  + +++ H I +F    R S     ++ +L 
Sbjct: 286 GCNLAGFVMVKKVPGTVHFVARSEGHSFDHTWMNMTHMIHSFHVGTRPSPRKYQQLKRLH 345

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV-PTVYTDVSGHTIQSNQFSVTEHFRSS 224
                    + L    +  E     ++++++VV  T+    S HT   + +  T H  S 
Sbjct: 346 PAGLTADWADKLHDQLFVSEHTQSTHEHYLQVVLTTIEPRHSRHTGNYDAYEYTAHSHSY 405

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
           +     ++P   F YDLSPI++   E    +  FLT  CAI+GGVFTV+GI+DA +Y   
Sbjct: 406 QS---DSIPSARFTYDLSPIQILVHETSKPWYQFLTTSCAIIGGVFTVAGILDALLYQSF 462

Query: 285 RAIKKKIEIGK 295
           + + KK+ +GK
Sbjct: 463 KVV-KKLNLGK 472


>gi|256052432|ref|XP_002569774.1| ptx1 protein [Schistosoma mansoni]
 gi|353229921|emb|CCD76092.1| putative ptx1 protein [Schistosoma mansoni]
          Length = 460

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 85/166 (51%), Gaps = 5/166 (3%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSG-VHVHDILAFQRDSF-NISHKINKLA 165
           +   + C I G L V KV GN H   GK  +  G +H+H ++ F   S  N SH+IN  +
Sbjct: 228 DRNSDACRIVGTLFVKKVGGNIHILFGKPLNGFGNLHLH-VVPFSGQSLQNFSHRINHFS 286

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG+   G ++PL+ V    +     +QYF+ +VPT   +   H  ++ Q++ T   R+ +
Sbjct: 287 FGDLVNGQIHPLEAVESVTDIAFTSFQYFVTMVPTKVVN-HFHITETYQYAATLQNRTID 345

Query: 226 Q-GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
                  +PG+FF YD+ P+ V  T +      F T + A+ GG+F
Sbjct: 346 HDAGSHGIPGIFFVYDIFPLVVKITYDRELLGTFFTRLAALAGGIF 391


>gi|363748002|ref|XP_003644219.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356887851|gb|AET37402.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 340

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/165 (32%), Positives = 84/165 (50%), Gaps = 10/165 (6%)

Query: 112 EGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF 170
           +GC+IYG + VNKV+G     A G ++  +      +L       N SH IN+L+FG+ F
Sbjct: 152 DGCSIYGSVPVNKVSGELQITAKGWTYMSTRRTPFSVL-------NFSHVINELSFGDFF 204

Query: 171 PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
           P + N LDGV    + P   Y YF  V+PT Y  + G  + +NQ+SV    +SS    L 
Sbjct: 205 PYIDNTLDGVGRIADEPLKAYYYFTSVLPTAYKKM-GAEVHTNQYSVDAIEKSSSSHALG 263

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
              G+   Y+   +KV   +E + F  F+  + AI+  V  ++ +
Sbjct: 264 P-TGITISYNFEALKVIIKDERIGFTQFIVRLVAILSFVVYLASL 307


>gi|194911936|ref|XP_001982403.1| GG12755 [Drosophila erecta]
 gi|190648079|gb|EDV45372.1| GG12755 [Drosophila erecta]
          Length = 441

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 58/183 (31%), Positives = 96/183 (52%), Gaps = 2/183 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 194 ESKYDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTIQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           R     PG++F YD S +K+    +    L F   +C+I+ G+  +SG I+A +   QR 
Sbjct: 313 RNSYGSPGIYFKYDWSALKIVVDNDRDHLLTFAIRLCSIISGIIVISGAINALLLGIQRR 372

Query: 287 IKK 289
           + +
Sbjct: 373 LLR 375


>gi|339244785|ref|XP_003378318.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Trichinella spiralis]
 gi|316972786|gb|EFV56437.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Trichinella spiralis]
          Length = 334

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 46/131 (35%), Positives = 73/131 (55%), Gaps = 6/131 (4%)

Query: 169 HFPGVVNPLDGVRWTQETPSG----MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
           + PG  NPL       ++P       Y Y +K+VPTVY +++G+   + Q++        
Sbjct: 130 NLPGNFNPLMNAE-VLDSPVDNFPFSYDYILKIVPTVYENIAGNMKHAYQYTYARKTYIE 188

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQ 284
                QT P ++F YD +PI V + E       FLT++CAI+GG FTV+G+ID+F +   
Sbjct: 189 MSFTGQTNPTLWFRYDFTPITVKYHERRQPLYIFLTSICAIIGGTFTVAGLIDSFFFTAS 248

Query: 285 RAIKKKIEIGK 295
           + + KK+E+GK
Sbjct: 249 Q-LYKKVELGK 258


>gi|254579156|ref|XP_002495564.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
 gi|238938454|emb|CAR26631.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
          Length = 353

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 96/185 (51%), Gaps = 26/185 (14%)

Query: 92  DLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ 151
           ++ D+ +R+ F           C+I+G ++VN+VAG           Q     H   +F 
Sbjct: 144 NMFDEEERDAF---------NSCHIFGSVQVNRVAGEL---------QITAKGHGYSSFM 185

Query: 152 R---DSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSG 207
           R   +  + SH IN+L++GE +P + NPLD   ++  + P   + Y   +VPT+Y  + G
Sbjct: 186 RAPPEEIDFSHVINELSYGEFYPYIDNPLDSTAKFVPDAPRTTFVYDTAIVPTIYEKL-G 244

Query: 208 HTIQSNQFSVTEHFRSSE--QGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCA 264
             I +NQ++V+E+  + E  QG+     PG+F  YD  P+ +  ++  +SF+ F+  + A
Sbjct: 245 AKIDTNQYAVSEYHINPEAQQGKGPIRFPGIFLRYDFEPLSIHISDVRLSFIQFVVRLVA 304

Query: 265 IVGGV 269
           I+  V
Sbjct: 305 ILSFV 309


>gi|323303637|gb|EGA57425.1| Erv41p [Saccharomyces cerevisiae FostersB]
          Length = 284

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G       KS          +     +    +H IN+ +FG+ 
Sbjct: 90  EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDXRLSFIQFLVRLVAI 241


>gi|123435131|ref|XP_001308935.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121890639|gb|EAX96005.1| hypothetical protein TVAG_369150 [Trichomonas vaginalis G3]
          Length = 353

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 58/224 (25%), Positives = 100/224 (44%), Gaps = 15/224 (6%)

Query: 57  CGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNI 116
           C  C+  +  +  CCN C+ ++E Y+        P+   QC+      R      E C +
Sbjct: 127 CYPCFKVQFHNYTCCNGCDRLKENYKLNNLT-PEPEKWPQCQTNA---RPDINSSEKCLV 182

Query: 117 YGFLEVNKVAGNFHFAPGKSFH-QSGVHVHDIL-AFQRDSFNISHKINKLAFGEHFPGVV 174
            G + VN+V G+FH A G++ +   G H+H++L  F   +F  SH I  + FG       
Sbjct: 183 KGKVSVNRVRGSFHIAAGRNIYLNDGSHIHELLDDFPNLAF--SHAIEHIRFGPRIITAK 240

Query: 175 NPLDG-VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLP 233
            PL   V   +E  +  + Y + V P ++   +    +S +++V  H    +       P
Sbjct: 241 QPLQNLVMRAKENLTVTHDYSLLVTPVIFVADNQFIEKSFEYTVYLHPVQDKD------P 294

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           G++F Y  +P  +  T    SF  FL +      G++ ++ IID
Sbjct: 295 GIYFDYQFTPYTIQITWISRSFRGFLISTAGFTAGLYAIASIID 338


>gi|409048375|gb|EKM57853.1| hypothetical protein PHACADRAFT_116248 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 546

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/153 (31%), Positives = 75/153 (49%), Gaps = 10/153 (6%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV-HDILAFQRDSFNISHKINKLAF 166
           +  G  C +YG + V KV  N H       + S  HV H+++       N+SH I + +F
Sbjct: 174 KPSGSACRVYGSVAVKKVTANLHVTTLGHGYASRQHVDHNLM-------NLSHVITEFSF 226

Query: 167 GEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
           G +FP +  PLD      E     YQY++ VVPT Y       + ++Q+SVT + R  + 
Sbjct: 227 GPYFPDITQPLDNSFELTEDSFVSYQYYLHVVPTTYIAPRSRPLHTHQYSVTHYTRVLKH 286

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFL 259
                +PG+FF +D+ P+ +T  +   S L  L
Sbjct: 287 N--NGIPGIFFKFDVDPMSLTIHQRTTSLLQLL 317


>gi|171693749|ref|XP_001911799.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946823|emb|CAP73627.1| unnamed protein product [Podospora anserina S mat+]
          Length = 180

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 48/126 (38%), Positives = 69/126 (54%), Gaps = 8/126 (6%)

Query: 154 SFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGM--YQYFIKVVPTVYT-----DVS 206
           SFN SH IN+L+FG + P ++NPLD    +    S    +QYF+ +VPTVY+       S
Sbjct: 15  SFNFSHIINELSFGPYLPSLINPLDQTVNSAPEHSHFHRFQYFLSIVPTVYSLGHPDSYS 74

Query: 207 GHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
             +I +NQ++VTE      E   +Q +PG+F  YD+ PI +   E+  SF  FL  V  I
Sbjct: 75  SRSIFTNQYAVTEQSAPIPENMEMQMIPGIFVKYDIEPILLNIVEDRDSFFVFLIKVVNI 134

Query: 266 VGGVFT 271
           + G   
Sbjct: 135 LSGAMV 140


>gi|357452761|ref|XP_003596657.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355485705|gb|AES66908.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 482

 Score = 84.3 bits (207), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 73/257 (28%), Positives = 117/257 (45%), Gaps = 40/257 (15%)

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           S YG   +D       E +  ++  + + L+  D ++  +     +R     G GC I G
Sbjct: 244 SYYGDRDTDS-LVKTMENILASFPSEYYKLALEDKLNVTEDS---KRPAPSSG-GCRIEG 298

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-------- 170
           ++ V KV GN   +     H          +F     N+SH ++ L+FG+          
Sbjct: 299 YVRVKKVPGNLIISARSDAH----------SFDASQMNMSHAVHHLSFGKKLSPKLMSDV 348

Query: 171 ----PGVVNP---LDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH 220
               P V N    LDG+ +      G     ++++++V T      G+ +   ++  T H
Sbjct: 349 QRLIPYVGNSHDRLDGLSFINSHDFGANVTLEHYLQIVKTEVITRQGYQL-VEEYEYTAH 407

Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKV--TFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
              S       +P   F   LSP++V    TE+H SF HF+TNVCAIVGGVFTV+GI ++
Sbjct: 408 ---SSLAHSLHVPVARFHLQLSPMQVCVLITEDHKSFSHFITNVCAIVGGVFTVAGITES 464

Query: 279 FIYHGQRAIKKKIEIGK 295
            I H    + +K+E+GK
Sbjct: 465 -ILHNTIRLMRKVELGK 480


>gi|145510182|ref|XP_001441024.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408263|emb|CAK73627.1| unnamed protein product [Paramecium tetraurelia]
          Length = 320

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 51/194 (26%), Positives = 90/194 (46%), Gaps = 14/194 (7%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKS---------FHQSGVHVHDILAFQRDSF 155
           R    E +GC + G L++N+V G   F P +S          H    + H  ++F     
Sbjct: 130 RTAVAEKQGCEVVGSLKINRVKGKISFGPHRSHTYIGAVGNLHLPLDYSHKFVSFTFGDE 189

Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           N   K+  +        +       ++   + S  +++FI ++PT YT ++  T     +
Sbjct: 190 NALKKVKSMFKQGQLESLAGSQRIKKYELASQSMQHEHFIHIIPTHYTLLNKQT-----Y 244

Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           SV ++  +  + R      V   YD +P  VT+ +     LHFL  +CA++GG+FTVS +
Sbjct: 245 SVYQYTANHNEVRSHNYANVQLRYDFAPTTVTYWQTKEDILHFLVQICAVIGGIFTVSSM 304

Query: 276 IDAFIYHGQRAIKK 289
           I+A +Y   R++ K
Sbjct: 305 IEASVYKVMRSVLK 318


>gi|167382848|ref|XP_001736294.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165901464|gb|EDR27547.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 315

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 64/200 (32%), Positives = 96/200 (48%), Gaps = 20/200 (10%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
           GC ++G ++V++V+G FH A GK SF Q  +             H+H     +  SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175

Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           H IN L+F       V     PL+G  +T     +    Y+I V+PT++   S +T+++ 
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q SV+E       G     PGVFF Y+LSP  V       SF H L +V AIVGGV  + 
Sbjct: 235 QLSVSERDIPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIVGGVLIII 294

Query: 274 GIIDAFIYHGQRAIKKKIEI 293
           G +       +  +   +E+
Sbjct: 295 GWLSKLFDSNRELVTSVVEM 314


>gi|295663046|ref|XP_002792076.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279251|gb|EEH34817.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 392

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 88/190 (46%), Gaps = 39/190 (20%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E  + C IYG LE NKV G+FH  A G  + + G H+                 ++L+FG
Sbjct: 188 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEYGEHLDH---------------HELSFG 232

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
            H+  ++NPLD    T       YQY++ +VPT+YT                        
Sbjct: 233 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRTGTIDPYSQVLPDPSTISPSQRK 292

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
           +TI +NQ++VT   RS E   +Q  +PG+FF Y + PI +  +EE  S L  L  +  ++
Sbjct: 293 NTIFTNQYAVTS--RSHELPDVQFYVPGIFFKYSIEPILLIISEERGSLLALLVRLVNVM 350

Query: 267 GGVFTVSGII 276
            GV    G +
Sbjct: 351 AGVVVAGGWL 360


>gi|323307814|gb|EGA61076.1| Erv41p [Saccharomyces cerevisiae FostersO]
          Length = 284

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 46/159 (28%), Positives = 82/159 (51%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G       KS          +     +    +H IN+ +FG+ 
Sbjct: 90  EFNGCHIFGSIPVNRVSGELQIT-AKSLXYVASRKAPL-----EELKFNHVINEFSFGDF 143

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDIRLSFIQFLVRLVAI 241


>gi|226294628|gb|EEH50048.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb18]
          Length = 392

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 59/190 (31%), Positives = 89/190 (46%), Gaps = 39/190 (20%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E  + C IYG LE NKV G+FH  A G  + + G H+                 ++L+FG
Sbjct: 188 EMPDSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHLDH---------------HELSFG 232

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG-------------------- 207
            H+  ++NPLD    T       YQY++ +VPT+YT                        
Sbjct: 233 PHYSTLLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTVDPYSQVLPDPSTISPSQRK 292

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQ-TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
           +TI +NQ++VT   RS E   +Q  +PG+FF Y++ PI +  +EE  S L  L  +  ++
Sbjct: 293 NTIFTNQYAVTS--RSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLALLVRLVNVM 350

Query: 267 GGVFTVSGII 276
            GV    G +
Sbjct: 351 AGVVVAGGWL 360


>gi|195639434|gb|ACG39185.1| PDIL5-4 - Zea mays protein disulfide isomerase [Zea mays]
          Length = 485

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 79/279 (28%), Positives = 125/279 (44%), Gaps = 51/279 (18%)

Query: 45  RHGGRLEHNETYCG-SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPD----LIDQCKR 99
           R G  ++ N+ +     Y  E   E      E       K+  AL+  D     +D  KR
Sbjct: 228 RKGSDIKENQGHHDHESYYGERDTESLVAAMETYVANIPKEAHALALEDKSNKTVDPAKR 287

Query: 100 EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISH 159
              +         GC I GF+ V +V G+   +      +SG H     +F     N+SH
Sbjct: 288 PAPM-------ASGCRIEGFVRVKRVPGSVVISA-----RSGSH-----SFDPSQINVSH 330

Query: 160 KINKLAFGE---------------HFPGVVNPLDGVRWT----QETPSGMYQYFIKVVPT 200
            + + +FG+               +  G  + L G  +T    +   +   +++++VV T
Sbjct: 331 YVTQFSFGKRLSPRMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKT 390

Query: 201 -VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFL 256
            + T  S     S +  V E +  +    L     +P V F ++ SP++V  TE   SF 
Sbjct: 391 ELVTQRS-----SKELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEVPKSFS 445

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           HF+TNVCAI+GGVFTV+GI+D+ I+H    + KKIE+GK
Sbjct: 446 HFITNVCAIIGGVFTVAGILDS-IFHNTLRMVKKIELGK 483


>gi|207342541|gb|EDZ70277.1| YML067Cp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|323336174|gb|EGA77445.1| Erv41p [Saccharomyces cerevisiae Vin13]
 gi|323347070|gb|EGA81345.1| Erv41p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 284

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 90  EFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241


>gi|151946097|gb|EDN64328.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
 gi|190408176|gb|EDV11441.1| hypothetical protein SCRG_01831 [Saccharomyces cerevisiae RM11-1a]
 gi|259148509|emb|CAY81754.1| Erv41p [Saccharomyces cerevisiae EC1118]
          Length = 352

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 158 EFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309


>gi|407037175|gb|EKE38536.1| hypothetical protein ENU1_163530 [Entamoeba nuttalli P19]
          Length = 315

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 95/200 (47%), Gaps = 20/200 (10%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGK-SFHQSGV-------------HVHDILAFQRDSFNIS 158
           GC ++G ++V++V+G FH A GK SF Q  +             H+H     +  SFN +
Sbjct: 116 GCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175

Query: 159 HKINKLAFGEHFPGVV----NPLDGVRWTQET-PSGMYQYFIKVVPTVYTDVSGHTIQSN 213
           H IN L+F       V     PL+G  +T     +    Y+I V+PT++   S +T+++ 
Sbjct: 176 HYINHLSFSNILGSTVHSGETPLNGKEFTLNGFDNARKTYYINVIPTLFKYPS-YTLRTY 234

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           Q SV E       G     PGVFF Y+LSP  V       SF H L +V AI+GGV  + 
Sbjct: 235 QLSVNERDVPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLIIM 294

Query: 274 GIIDAFIYHGQRAIKKKIEI 293
           G++          +   +E+
Sbjct: 295 GLLSRLFDSKHELVTSVVEM 314


>gi|195469521|ref|XP_002099686.1| GE16580 [Drosophila yakuba]
 gi|194187210|gb|EDX00794.1| GE16580 [Drosophila yakuba]
          Length = 430

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 57/183 (31%), Positives = 96/183 (52%), Gaps = 2/183 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRA 286
           R     PG++F YD S +K+    +    + F   +C+I+ G+  +SG I+A +   QR 
Sbjct: 313 RNSYGSPGIYFKYDWSALKIMVDNDRDHLVTFAIRLCSIISGIIVISGAINALLLGIQRR 372

Query: 287 IKK 289
           + +
Sbjct: 373 LLR 375


>gi|123437985|ref|XP_001309782.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121891523|gb|EAX96852.1| hypothetical protein TVAG_470170 [Trichomonas vaginalis G3]
          Length = 344

 Score = 83.6 bits (205), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 70/242 (28%), Positives = 111/242 (45%), Gaps = 21/242 (8%)

Query: 57  CGSCYGAESSD-EDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCN 115
           CGSCYG E ++   CCN CE+V   + K G  L+N     QC  E +    KE+    C 
Sbjct: 119 CGSCYGTEFAEGSRCCNTCEDVVSHHIKAGRPLTNVTTWQQCINEKYDFTGKEK----CQ 174

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVN 175
           I+G   V+ + G     P  S ++          F +   N++H I+ + FG  F     
Sbjct: 175 IFGNHHVSAIDGGIRILPRFSSNEE--------PFTK-LLNLTHYIDHITFGTSFGP--Q 223

Query: 176 PLDGVRWTQETPSGM-YQYFIKVVPTVYTDVSGHTIQSNQFSV-TEHFRSSEQGRLQTLP 233
           PLD     Q  P    Y+Y +K VPTV  +  G      Q++V +     +++ RL    
Sbjct: 224 PLDDALIVQSEPGQFHYRYDLKAVPTVMHNQDGSITHGFQYAVDSAKIPITDRTRLGE-- 281

Query: 234 GVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEI 293
           G+FF Y  + + V    +  +    ++ +  I GG F ++ +ID+F Y     ++ K+ I
Sbjct: 282 GIFFNYYFATVAVVGKPDRFTIYILISRLFCIFGGGFFLARLIDSFGYR-IHTMEGKMRI 340

Query: 294 GK 295
           GK
Sbjct: 341 GK 342


>gi|162462518|ref|NP_001105762.1| protein disulfide isomerase12 [Zea mays]
 gi|59861281|gb|AAX09970.1| protein disulfide isomerase [Zea mays]
 gi|414590455|tpg|DAA41026.1| TPA: putative thioredoxin superfamily protein [Zea mays]
          Length = 483

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)

Query: 15  IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 72
           I   ++D    V   R++ I G P I   + R G  ++ N+ +     Y  E   E    
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256

Query: 73  NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 130
             E       K+  AL +     +D  KR   +         GC I GF+ V +V G+  
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309

Query: 131 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 175
            +      +SG H     +F     N+SH + + +FG+               +  G  +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359

Query: 176 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
            L G  +T    +   +   +++++VV T + T  S     S +  V E +  +    L 
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414

Query: 231 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
               +P V F ++ SP++V  TE   SF HF+TNVCAI+GGVFTV+GI+D+ I+H    +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473

Query: 288 KKKIEIGK 295
            KKIE+GK
Sbjct: 474 VKKIELGK 481


>gi|224030141|gb|ACN34146.1| unknown [Zea mays]
          Length = 483

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 86/308 (27%), Positives = 137/308 (44%), Gaps = 52/308 (16%)

Query: 15  IFKKRLDSQGNVIESRQDGI-GAPKIDKPLQRHGGRLEHNETYCG-SCYGAESSDEDCCN 72
           I   ++D    V   R++ I G P I   + R G  ++ N+ +     Y  E   E    
Sbjct: 199 ILLGKVDCTEEVELCRRNHIQGYPSIR--VFRKGSDIKENQGHHDHESYYGERDTESLVA 256

Query: 73  NCEEVREAYRKKGWALSNPD--LIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFH 130
             E       K+  AL +     +D  KR   +         GC I GF+ V +V G+  
Sbjct: 257 AMETYVANIPKEAHALEDKSNKTVDPAKRPAPM-------ASGCRIEGFVRVKRVPGSVV 309

Query: 131 FAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE---------------HFPGVVN 175
            +      +SG H     +F     N+SH + + +FG+               +  G  +
Sbjct: 310 ISA-----RSGSH-----SFDPSQINVSHYVTQFSFGKRLSPRMLHEFIRLTPYLRGYHD 359

Query: 176 PLDGVRWT----QETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ 230
            L G  +T    +   +   +++++VV T + T  S     S +  V E +  +    L 
Sbjct: 360 RLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRS-----SKELKVLEEYEYTAHSSLV 414

Query: 231 ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAI 287
               +P V F ++ SP++V  TE   SF HF+TNVCAI+GGVFTV+GI+D+ I+H    +
Sbjct: 415 HSFYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRM 473

Query: 288 KKKIEIGK 295
            KKIE+GK
Sbjct: 474 VKKIELGK 481


>gi|392297516|gb|EIW08616.1| Erv41p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 352

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 158 EFNGCHIFGSIPVNRVSGELQII-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309


>gi|323332255|gb|EGA73665.1| Erv41p [Saccharomyces cerevisiae AWRI796]
 gi|323352959|gb|EGA85259.1| Erv41p [Saccharomyces cerevisiae VL3]
 gi|365763687|gb|EHN05213.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 250

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 50/165 (30%), Positives = 84/165 (50%), Gaps = 10/165 (6%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
            R    E  GC+I+G + VN+V+G       KS    G         +   FN  H IN+
Sbjct: 50  NRAHLPEFNGCHIFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINE 103

Query: 164 LAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH-- 220
            +FG+ +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++  
Sbjct: 104 FSFGDFYPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRY 162

Query: 221 FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
                  +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 163 LYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 207


>gi|256269733|gb|EEU05000.1| Erv41p [Saccharomyces cerevisiae JAY291]
          Length = 353

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 47/159 (29%), Positives = 82/159 (51%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+I+G + VN+V+G          +  G         +   FN  H IN+ +FG+ 
Sbjct: 159 EFNGCHIFGSIPVNRVSGELQITA----NSLGYVASRKAPLEELKFN--HVINEFSFGDF 212

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 213 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 271

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 272 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 310


>gi|6323573|ref|NP_013644.1| Erv41p [Saccharomyces cerevisiae S288c]
 gi|2497084|sp|Q04651.1|ERV41_YEAST RecName: Full=ER-derived vesicles protein ERV41
 gi|558408|emb|CAA86254.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|285813935|tpg|DAA09830.1| TPA: Erv41p [Saccharomyces cerevisiae S288c]
          Length = 352

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+++G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 158 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309


>gi|558407|emb|CAA86253.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 284

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+++G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 90  EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 143

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 144 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 202

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 241


>gi|349580221|dbj|GAA25381.1| K7_Erv41p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 352

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 83/159 (52%), Gaps = 10/159 (6%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC+++G + VN+V+G       KS    G         +   FN  H IN+ +FG+ 
Sbjct: 158 EFNGCHVFGSIPVNRVSGELQIT-AKSL---GYVASRKAPLEELKFN--HVINEFSFGDF 211

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH--FRSSEQ 226
           +P + NPLD   ++ Q+ P   Y Y+  VVPT++  + G  + +NQ+SV ++        
Sbjct: 212 YPYIDNPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKL-GAEVDTNQYSVNDYRYLYKDVA 270

Query: 227 GRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAI 265
            +   +PG+FF Y+  P+ +  ++  +SF+ FL  + AI
Sbjct: 271 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVAI 309


>gi|442614645|ref|NP_001259099.1| CG4293, isoform E [Drosophila melanogaster]
 gi|440216271|gb|AGB94945.1| CG4293, isoform E [Drosophila melanogaster]
          Length = 439

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 81/156 (51%), Gaps = 2/156 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  E+ 
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTIYAFQYAVTENVRKLERN 312

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
              + PG++F YD S +K+    +    + F   +C
Sbjct: 313 SYGS-PGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 347


>gi|115472445|ref|NP_001059821.1| Os07g0524100 [Oryza sativa Japonica Group]
 gi|75118816|sp|Q69SA9.1|PDI54_ORYSJ RecName: Full=Protein disulfide isomerase-like 5-4;
           Short=OsPDIL5-4; AltName: Full=Protein disulfide
           isomerase-like 8-1; Short=OsPDIL8-1; Flags: Precursor
 gi|50508559|dbj|BAD30858.1| thioredoxin family-like protein [Oryza sativa Japonica Group]
 gi|113611357|dbj|BAF21735.1| Os07g0524100 [Oryza sativa Japonica Group]
 gi|215704615|dbj|BAG94243.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218199742|gb|EEC82169.1| hypothetical protein OsI_26259 [Oryza sativa Indica Group]
 gi|222637167|gb|EEE67299.1| hypothetical protein OsJ_24505 [Oryza sativa Japonica Group]
          Length = 485

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 103/224 (45%), Gaps = 44/224 (19%)

Query: 94  IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
           +D  KR   L         GC I GF+ V KV G+   +      +SG H     +F   
Sbjct: 282 VDPAKRPAPLT-------SGCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPS 324

Query: 154 SFNISHKINKLAFGEHFPGV-------VNPLDG------------VRWTQETPSGMYQYF 194
             N+SH + + +FG+            + P  G            V+      +   +++
Sbjct: 325 QINVSHYVTQFSFGKRLSAKMFNELKRLTPYVGGHHDRLAGQSYIVKHGDVNANVTIEHY 384

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 251
           +++V T    +      S +  + E +  +    L     +P V F ++ SP++V  TE 
Sbjct: 385 LQIVKTELVTLRS----SKELKLVEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440

Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
             SF HF+TNVCAI+GGVFTV+GI+D+ I+H    + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDS-IFHNTLRLVKKVELGK 483


>gi|393908149|gb|EJD74928.1| hypothetical protein LOAG_17836 [Loa loa]
          Length = 430

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 86/175 (49%), Gaps = 6/175 (3%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           ++ EG  C I+G + VNKV G+ F  + GK     G+  H          NISH+I +  
Sbjct: 222 EKNEGTACRIHGRMRVNKVKGDSFIISTGKGLDVDGIFAH--FGGVSSPSNISHRIERFN 279

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
           FG    G+V PL G+    ET    ++YF+K+VPT   ++ + G +  + Q+SVT   + 
Sbjct: 280 FGPRIYGLVTPLAGIEQISETGVDEFRYFLKIVPTRIYHSGLFGGSTLTYQYSVT-FMKK 338

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
           + +  +     +   Y+ +   +       S L  L  +C+ VGGVF  S ++++
Sbjct: 339 TPKKDVHKHTAIIIHYEFAATVIEVRHVQSSLLQMLVRLCSAVGGVFATSILLNS 393


>gi|402595088|gb|EJW89014.1| hypothetical protein WUBG_00081 [Wuchereria bancrofti]
          Length = 578

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 6/176 (3%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           ++ EG  C I+G + VNKV G+ F  + GK     G+  H       +  N+SH+I +  
Sbjct: 372 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGLSNPGNVSHRIERFN 429

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
           FG    G+V PL G+    ET    ++YF+KVVPT   ++ + G +  + Q+SVT   + 
Sbjct: 430 FGPTIYGLVTPLAGIEQISETGMDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 488

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
           + +  +     +   Y+ +   +       S L  L  +C+ VGGVF  S ++++ 
Sbjct: 489 TPKKDVHKHAAIIIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNSI 544


>gi|366987569|ref|XP_003673551.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
 gi|342299414|emb|CCC67168.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
          Length = 355

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 48/168 (28%), Positives = 83/168 (49%), Gaps = 11/168 (6%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC+I+G L VN+VAG             G    D      D    +H IN+ +FG+ +P 
Sbjct: 164 GCHIFGSLPVNRVAGELQIT------AKGYGYADRERTPMDQIKFNHVINEFSFGDFYPY 217

Query: 173 VVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHF---RSSEQGR 228
           + NPLD   ++  ETP   Y Y + V+PT +  + G  + + Q+SV E+    + S   R
Sbjct: 218 IDNPLDKSAKFDLETPKTAYSYDLSVIPTTFRKL-GTEVNTFQYSVAEYHYKGKDSPVPR 276

Query: 229 LQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
              +PG+FF Y+   + +  ++  ++F+ F+  + AI+     ++  I
Sbjct: 277 SGRVPGIFFDYNFESLSIIVSDSRLNFIQFIIRLIAILSFALYIASWI 324


>gi|301089326|ref|XP_002894975.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262104295|gb|EEY62347.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 102

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 44/86 (51%), Positives = 54/86 (62%)

Query: 196 KVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSF 255
           +VVPT YT +S   I +NQFS TEHFR       + LP V F Y  SPI     +  V F
Sbjct: 5   QVVPTEYTFLSASRIITNQFSATEHFRQLTPVSDKGLPMVSFSYTFSPIMFRIEQYRVGF 64

Query: 256 LHFLTNVCAIVGGVFTVSGIIDAFIY 281
           L FLT+VCAIVGGVFT+ GI+D+  +
Sbjct: 65  LQFLTSVCAIVGGVFTILGIMDSLAF 90


>gi|303275141|ref|XP_003056869.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461221|gb|EEH58514.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 604

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 99/209 (47%), Gaps = 40/209 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC I G + VN+V G F+     + H  G   H+I     D  N++H +  L+FG+  PG
Sbjct: 402 GCIIEGSVRVNRVPGAFYV----TAHSKG---HNI---NVDVVNMTHVLRHLSFGKTVPG 451

Query: 173 VVN-------------PLD-----GVRWTQET-----PSGMYQYFIKVVPTVYTDVSGHT 209
             +             P D      V   +ET     P  ++++++KVV   +  + G  
Sbjct: 452 RPSYVPRHMRRVWSKIPKDMGGRFAVAGAEETFASAEPYTVHEHYLKVVSHAFEPIDGDA 511

Query: 210 IQ-------SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
           +Q       SN+F +       E       P + F YD+SP++V   EE    L +   +
Sbjct: 512 VQLYEYTFNSNRFKLAPAAYGDEDDAHVDGPMIKFSYDVSPMRVVLREETKPVLDWTLGM 571

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           CA++GGV+T SG+++AFI +G   +K+++
Sbjct: 572 CALMGGVYTCSGLLEAFISNGVSVVKRRV 600


>gi|154335780|ref|XP_001564126.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061160|emb|CAM38182.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 309

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 98/224 (43%), Gaps = 22/224 (9%)

Query: 65  SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNK 124
           ++   CC++C+ V E Y+         +   QC  + +      E   GCN+ G L++ K
Sbjct: 89  AAASKCCDSCDSVFELYKDLEKEFPGIEYFPQCLEQLY------ERARGCNVIGSLDLKK 142

Query: 125 VAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG----EHFP--GVVNPLD 178
           V     F P ++  +  +   D++       + SH I KL  G    E F   GV  PL 
Sbjct: 143 VPVTVIFGPRRTGRRYSLK--DVI-----RLDTSHVIKKLRIGDEAVERFSKHGVAEPLC 195

Query: 179 GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE--QGRLQTLPGVF 236
           G     +T S   +Y +KVVPT Y        +++ +  +    S     G    +P V 
Sbjct: 196 GHERFSKTYSET-RYLVKVVPTTYRKTRTRDAKASTYEYSAQCSSQAIVVGFSGVVPAVL 254

Query: 237 FFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           F ++ + I+V    E     HFL  +C IVGG+F V G ID+ +
Sbjct: 255 FAFEPAAIQVNNVFERQPVSHFLVQLCGIVGGLFVVLGFIDSTV 298


>gi|255714272|ref|XP_002553418.1| KLTH0D16324p [Lachancea thermotolerans]
 gi|238934798|emb|CAR22980.1| KLTH0D16324p [Lachancea thermotolerans CBS 6340]
          Length = 340

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 48/164 (29%), Positives = 84/164 (51%), Gaps = 10/164 (6%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAF 166
           + +E  GC+++G + VN V G+    P          V D      D+ N+SH IN+ +F
Sbjct: 147 ESKEFNGCHVFGTITVNMVKGDLIIIPRSQ------SVRDFGRMPPDAINLSHVINEFSF 200

Query: 167 GEHFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           G+ +P + NPLD   R T E  +  + Y   VVPT++  + G  + +NQ+S++E    + 
Sbjct: 201 GDFYPYIDNPLDRSARITAEHTTS-FHYHTSVVPTIFQKL-GAEVNTNQYSLSETKHETP 258

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
              L+ +P + F Y    + +T  +E +SF  F+  + AI+  +
Sbjct: 259 PSGLR-VPAIIFSYSFEALTITIRDERISFWQFIVRLVAILSFI 301


>gi|195564437|ref|XP_002105825.1| GD16474 [Drosophila simulans]
 gi|194203186|gb|EDX16762.1| GD16474 [Drosophila simulans]
          Length = 441

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 2/174 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 312

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           R     PG++F YD S +K+    +    + F   +C+I+ G+  +SG I+A +
Sbjct: 313 RNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGIIVISGAINALL 366


>gi|195165324|ref|XP_002023489.1| GL20164 [Drosophila persimilis]
 gi|194105594|gb|EDW27637.1| GL20164 [Drosophila persimilis]
          Length = 445

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 2/157 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 199 ETKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG 258

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 259 QYSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQ-TFTTINTFQYAVTENVRKLDSE 317

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
           R     PG++F YD S +K+  + +    + F   +C
Sbjct: 318 RNSYGSPGIYFKYDWSALKIVVSNDRDHLVTFAIRLC 354


>gi|198468706|ref|XP_001354796.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
 gi|198146533|gb|EAL31851.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
          Length = 445

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 81/157 (51%), Gaps = 2/157 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 199 ETKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG 258

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 259 QYSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQ-TFTTINTFQYAVTENVRKLDSE 317

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
           R     PG++F YD S +K+  + +    + F   +C
Sbjct: 318 RNSYGSPGIYFKYDWSALKIVVSNDRDHLVTFAIRLC 354


>gi|308487907|ref|XP_003106148.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
 gi|308254138|gb|EFO98090.1| hypothetical protein CRE_15417 [Caenorhabditis remanei]
          Length = 427

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 53/175 (30%), Positives = 88/175 (50%), Gaps = 14/175 (8%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQ---RDSFNISHKINKLA 165
           E+G+ C ++G  +V K         GK         + +L F+   +   NISH+I K  
Sbjct: 221 EDGKACRLHGKFKVRK---------GKEEKIVMSISNPLLMFEHQEKQPGNISHRIEKFN 271

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSE 225
           FG   PG+V PL G     E+   +Y+YFIK+VPT       HT+ + Q+SVT   +  +
Sbjct: 272 FGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTHTL-AYQYSVTFLKKQLK 330

Query: 226 QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           +G   +  G+ F Y+ +   +   +  V+   +L  +C+I+GGV+  S II+  +
Sbjct: 331 EGE-HSHGGILFEYEFTANVIEVHKTSVTLFSYLIRICSILGGVYATSTIINNVV 384


>gi|366997520|ref|XP_003678522.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
 gi|342304394|emb|CCC72184.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
          Length = 347

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 50/159 (31%), Positives = 81/159 (50%), Gaps = 14/159 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E   C+I+G + VN+VAG F        HQ   +V D           +H IN+ +FG+ 
Sbjct: 158 EYSACHIFGSIPVNRVAGEFQITTIDR-HQPIENVVDF----------THVINEFSFGDF 206

Query: 170 FPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTE-HFRSSEQG 227
           FP V NPLD   ++  +     YQY + VVPT+Y  + G  I +NQ+S++E H+++    
Sbjct: 207 FPYVDNPLDSTAKYVPDEKLTSYQYHLSVVPTIYNKM-GVLINTNQYSLSEYHYKNITNA 265

Query: 228 RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
             +  PG+F  Y+   + +   +  + F  FL  + AI+
Sbjct: 266 NDKNSPGIFIKYNFESLTIIVNDRRLGFTQFLIRLIAIL 304


>gi|195347402|ref|XP_002040242.1| GM19035 [Drosophila sechellia]
 gi|194121670|gb|EDW43713.1| GM19035 [Drosophila sechellia]
          Length = 437

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 55/174 (31%), Positives = 92/174 (52%), Gaps = 2/174 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 190 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 249

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 250 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTINAFQYAVTENVRKLDSE 308

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           R     PG++F YD S +K+    +    + F   +C+I+ G+  +SG I+A +
Sbjct: 309 RNSYGSPGIYFKYDWSALKIMVRNDRDHLVTFAIRLCSIISGIIVISGAINALL 362


>gi|323448816|gb|EGB04710.1| hypothetical protein AURANDRAFT_55105 [Aureococcus anophagefferens]
          Length = 324

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 97/194 (50%), Gaps = 27/194 (13%)

Query: 104 QRIKE-EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKIN 162
           QR+ E +E  GC + G + VN+V GNFH    +S H      H++ A      N+SH +N
Sbjct: 133 QRMLEIKEHPGCMVSGHVLVNRVPGNFHIE-ARSIH------HNLNAAMT---NLSHVVN 182

Query: 163 KLAFG-----------EHFPGV--VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHT 209
            L+FG             +P    V+PLDG  +       ++ ++ KVV T + +V G  
Sbjct: 183 HLSFGTPLAKDMQRKVSKYPQFQSVHPLDGGIFVSRDYHQVHHHYSKVVSTHF-EVGGMM 241

Query: 210 IQSNQFSVTEHFRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
            +S +    +    S+      +  P   F YDLSP+ V  + +   +  F+T+VCAI+G
Sbjct: 242 TKSREIVGYQMLAQSQIMHYNEMDVPEAKFSYDLSPMAVLVSSKGRRWYDFVTSVCAIIG 301

Query: 268 GVFTVSGIIDAFIY 281
           G FTV GI+DA +Y
Sbjct: 302 GTFTVVGIVDAVLY 315


>gi|343476464|emb|CCD12449.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 224

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 44/139 (31%), Positives = 65/139 (46%), Gaps = 10/139 (7%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  GE   D+  D  K R+DS         D +      +PL     +   +   C SC
Sbjct: 90  IDAFGEYVEDMGRDTVKMRVDS---------DTLAPLGEARPLVNMNKKATSDTHDCPSC 140

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDL-IDQCKREGFLQRIKEEEGEGCNIYGF 119
           YGAE +  DCC+ C++VR A+ ++ W     D+ I QC +E           EGCN++  
Sbjct: 141 YGAEKNPGDCCHTCDDVRRAFAERQWEFHEDDVSIMQCAKERLQMAASTASREGCNLHSS 200

Query: 120 LEVNKVAGNFHFAPGKSFH 138
             V +V  N HF PG+ F+
Sbjct: 201 FRVPRVTENIHFVPGRMFY 219


>gi|170588701|ref|XP_001899112.1| hypothetical protein [Brugia malayi]
 gi|158593325|gb|EDP31920.1| conserved hypothetical protein [Brugia malayi]
          Length = 430

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 51/175 (29%), Positives = 87/175 (49%), Gaps = 6/175 (3%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           ++ EG  C I+G + VNKV G+ F  + GK     G+  H       +  N+SH+I +  
Sbjct: 223 EKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLGVDGIFAH--FGGVSNPGNLSHRIERFN 280

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVTEHFRS 223
           FG    G+V PL G+    ET    ++YF+KVVPT   ++ + G +  + Q+SVT   + 
Sbjct: 281 FGPTIYGLVTPLAGIEQISETGIDEFRYFLKVVPTRIYHSGLFGGSTLTYQYSVT-FMKK 339

Query: 224 SEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
           + +  +     +   Y+ +   +       S L  L  +C+ VGGVF  S ++++
Sbjct: 340 TPKKDVHKHAAIVIHYEFAATVIEVRRIQSSLLQMLIRLCSAVGGVFATSVLLNS 394


>gi|357122608|ref|XP_003563007.1| PREDICTED: protein disulfide isomerase-like 5-4-like [Brachypodium
           distachyon]
          Length = 485

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 104/224 (46%), Gaps = 44/224 (19%)

Query: 94  IDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRD 153
           +D  KR   +         GC + GF+ V KV G+   +      +SG H     +F   
Sbjct: 282 VDPAKRPAPMT-------SGCRVEGFVRVKKVPGSVIISA-----RSGSH-----SFDPS 324

Query: 154 SFNISHKINKLAFGEHF-PGVVNPLDG------------------VRWTQETPSGMYQYF 194
             N+SH + + +FG    P + + L                    V+      +   +++
Sbjct: 325 QINVSHYVTQFSFGNRLSPNMFSELKRLIPYVGGHHDRLAGQSYIVKHGDNNANVTIEHY 384

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEE 251
           +++V T    +      S +  V E +  +    L     +P V F ++ SP++V  TE 
Sbjct: 385 LQIVKTELVTLRS----SKELKVFEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTEL 440

Query: 252 HVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
             SF HF+TNVCAI+GGVFTV+GI+D+ +++  R + KK+E+GK
Sbjct: 441 PKSFSHFITNVCAIIGGVFTVAGILDSILHNTLRLV-KKVELGK 483


>gi|413953324|gb|AFW85973.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1070

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
           +KVVPT Y  +S   + +NQ SVTE+F S      +  P V+F YDLSPI  T  EE  +
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 573

Query: 255 FLHFLTNVCAIVGGVFTVSG 274
           FLHF+T +CA++GG F ++G
Sbjct: 574 FLHFITRLCAVLGGTFAMTG 593


>gi|18921097|ref|NP_569847.1| CG4293, isoform A [Drosophila melanogaster]
 gi|24638890|ref|NP_726677.1| CG4293, isoform B [Drosophila melanogaster]
 gi|85724768|ref|NP_001033816.1| CG4293, isoform D [Drosophila melanogaster]
 gi|85724770|ref|NP_001033817.1| CG4293, isoform C [Drosophila melanogaster]
 gi|2961397|emb|CAA18090.1| EG:65F1.1 [Drosophila melanogaster]
 gi|7290051|gb|AAF45518.1| CG4293, isoform A [Drosophila melanogaster]
 gi|7290052|gb|AAF45519.1| CG4293, isoform B [Drosophila melanogaster]
 gi|15292011|gb|AAK93274.1| LD35174p [Drosophila melanogaster]
 gi|84798360|gb|ABC67159.1| CG4293, isoform C [Drosophila melanogaster]
 gi|84798361|gb|ABC67160.1| CG4293, isoform D [Drosophila melanogaster]
 gi|220955778|gb|ACL90432.1| CG4293-PA [synthetic construct]
          Length = 441

 Score = 80.5 bits (197), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 49/157 (31%), Positives = 80/157 (50%), Gaps = 2/157 (1%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E + + C ++G L +NKVAG  H   G          H ++  +R   N +H+IN+L+FG
Sbjct: 194 ESKFDACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG 253

Query: 168 EHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
           ++   +V PL+G        +   QYF+KVVPT     +  TI + Q++VTE+ R  +  
Sbjct: 254 QYSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQ-TFTTIYAFQYAVTENVRKLDSE 312

Query: 228 RLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVC 263
           R     PG++F YD S +K+    +    + F   +C
Sbjct: 313 RNSYGSPGIYFKYDWSALKIIVRNDRDHLVTFAIRLC 349


>gi|413949740|gb|AFW82389.1| putative DUF1692 domain containing protein [Zea mays]
          Length = 1061

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
           +KVVPT Y  +S   + +NQ SVTE+F S      +  P V+F YDLSPI  T  EE  +
Sbjct: 501 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 559

Query: 255 FLHFLTNVCAIVGGVFTVSG 274
           FLHF+T +CA++GG F ++G
Sbjct: 560 FLHFITRLCAVLGGTFAMTG 579


>gi|413951106|gb|AFW83755.1| hypothetical protein ZEAMMB73_317062 [Zea mays]
          Length = 1594

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 52/80 (65%), Gaps = 1/80 (1%)

Query: 195 IKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVS 254
           +KVVPT Y  +S   + +NQ SVTE+F S      +  P V+F YDLSPI  T  EE  +
Sbjct: 515 LKVVPTEYKYLSKKILPTNQGSVTEYFLSIRPTE-RAWPAVYFLYDLSPITFTIKEERRN 573

Query: 255 FLHFLTNVCAIVGGVFTVSG 274
           FLHF+T +CA++GG F ++G
Sbjct: 574 FLHFITRLCAVLGGTFAMTG 593


>gi|326503558|dbj|BAJ86285.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC I GF+ V KV G+   +      +SG H     +F     N+SH +   +FG+    
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343

Query: 170 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
                    FP   G  + L G    V+      +   ++++++V T    +      S 
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYVVKHGDVNANVTIEHYLQIVKTELVTLR----YSK 399

Query: 214 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           +  V E +  +    L     +P V F ++ SP++V  TE   SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
           TV+GI+D+ +++  R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483


>gi|146163751|ref|XP_001012240.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila]
 gi|146145943|gb|EAR91995.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila
           SB210]
          Length = 331

 Score = 79.7 bits (195), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 98/191 (51%), Gaps = 32/191 (16%)

Query: 112 EGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF---NISHKINKLAFGE 168
           EGC I G++ + KV GNFH     S+H     ++ I + + D++   N+++KIN L FGE
Sbjct: 138 EGCRINGYINLKKVPGNFHI----SYHAKMDVMNRIASTKPDTYSKINLNYKINHLGFGE 193

Query: 169 H--FPGVVNPLDGVRWTQETPSGMYQY---------------FIKVVPTVYTDVSGH-TI 210
           +      +  + G    QET +  Y +               ++K++P  Y     H ++
Sbjct: 194 NTNHMATIFKIMGRTLFQETNTNDYPHDDTKYINPGKNDYDNYLKILPCRYDSNKLHMSV 253

Query: 211 QSNQFSV--TEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
              ++++  T   +SS +     +P +FF Y++SPI V ++ +  SF HFL  + AIVGG
Sbjct: 254 SRYKYAMYSTHTPKSSTE-----IPTIFFRYEISPINVYYSTKSKSFYHFLVQIFAIVGG 308

Query: 269 VFTVSGIIDAF 279
           +F V GI ++ 
Sbjct: 309 IFAVMGIFNSL 319


>gi|50545267|ref|XP_500171.1| YALI0A17600p [Yarrowia lipolytica]
 gi|49646036|emb|CAG84103.1| YALI0A17600p [Yarrowia lipolytica CLIB122]
          Length = 337

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)

Query: 114 CNIYGFLEVNKVAGNFHF--APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFP 171
           C I G + +N V G       P   +  + +          D  N++H I++L+FG++FP
Sbjct: 151 CRISGSVPINHVEGALQIFNLPDNQYFINPMKA-------SDGLNLTHAIHELSFGDYFP 203

Query: 172 GVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGH-TIQSNQFSVTEHFRSSEQGRLQ 230
            V+NPLDGV    + P   YQYF+  VP  Y+  SG   I + Q++V +   ++ Q    
Sbjct: 204 KVLNPLDGVSTVTDEPLMSYQYFLSAVPVEYS--SGRKKIHTYQYAVKKQ-TTNLQEHFV 260

Query: 231 TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           T P +FF Y   P+ +   +   +   F+  + +I+GG F V G   ++I  G     +K
Sbjct: 261 TRPAIFFHYKYEPVTLKIQDSRETLTVFVVKLLSILGG-FVVCG---SWIVRGGEKAYEK 316

Query: 291 IEIGK 295
           I +GK
Sbjct: 317 I-VGK 320


>gi|255074657|ref|XP_002501003.1| predicted protein [Micromonas sp. RCC299]
 gi|226516266|gb|ACO62261.1| predicted protein [Micromonas sp. RCC299]
          Length = 515

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 97/215 (45%), Gaps = 42/215 (19%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC I G   VN+V G F+  P    H              D  N++H +  L+FG+H PG
Sbjct: 313 GCIIDGSFRVNRVPGAFYVTPHSMGHN----------LNPDVINMTHTVKHLSFGKHVPG 362

Query: 173 -----------VVNPL-----------DGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
                      V N +           D   +  E P+ ++++++K+V   +  + G  +
Sbjct: 363 RPSYVPRNLRRVWNRVPKDLGGRFAAGDEATFYSEEPNTVHEHYLKIVSRTFEPLEGQAV 422

Query: 211 Q-------SNQFSVTEHFRSS-EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNV 262
           Q       SN+F +     +  +  +    P + F YD+SP+ V   E     L ++  +
Sbjct: 423 QLYEYTFNSNRFRLNPPLAADGDPDQHVDGPMIKFSYDVSPMSVVLKEVKKPLLDWILGM 482

Query: 263 CAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKFS 297
           CA++GGV+T +G+++ F+     A+K++  +GK S
Sbjct: 483 CALLGGVYTCAGLLETFLQSSVCAVKRR--VGKIS 515


>gi|12321801|gb|AAG50943.1|AC079284_18 hypothetical protein [Arabidopsis thaliana]
          Length = 451

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)

Query: 35  GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
           G P I    +  G R +H      S YG   +D       EE+ +  +K+   L+    +
Sbjct: 188 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 242

Query: 95  DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
           D           K     GC I G++   KV G    +       SG H     +F    
Sbjct: 243 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 292

Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 196
            N+SH +  L FG                +     + L+G  +  E     +   +++++
Sbjct: 293 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 352

Query: 197 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
           ++ T V +  SG  H++   ++  T H   S   R    P   F ++LSP++V  +E   
Sbjct: 353 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 408

Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           SF HF+TNVCAI+GGVFTV+GI+D+   +  R + KKIE+GK
Sbjct: 409 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 449


>gi|299469370|emb|CBG91903.1| putative PDI-like protein [Triticum aestivum]
 gi|299469398|emb|CBG91917.1| putative PDI-like protein [Triticum aestivum]
          Length = 485

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 62/205 (30%), Positives = 101/205 (49%), Gaps = 37/205 (18%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH--- 169
           GC I GF+ V KV G+   +      +SG H     +F     N+SH +   +FG+    
Sbjct: 294 GCRIEGFVRVKKVPGSVVISA-----RSGSH-----SFDPSQINVSHYVTTFSFGKRLSS 343

Query: 170 ---------FP---GVVNPLDG----VRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSN 213
                    FP   G  + L G    V+      +   ++++++V T    +      + 
Sbjct: 344 KMFNELKRLFPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLR----YAK 399

Query: 214 QFSVTEHFRSSEQGRLQ---TLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           +  V E +  +    L     +P V F ++ SP++V  TE   SF HF+TNVCAI+GGVF
Sbjct: 400 ELKVLEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459

Query: 271 TVSGIIDAFIYHGQRAIKKKIEIGK 295
           TV+GI+D+ +++  R + KK+E+GK
Sbjct: 460 TVAGILDSILHNTLRLV-KKVELGK 483


>gi|303279378|ref|XP_003058982.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460142|gb|EEH57437.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 486

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 95/208 (45%), Gaps = 30/208 (14%)

Query: 104 QRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINK 163
           + ++  +G GC++ GF+   KV G+       + H          +F  +  N++H +N 
Sbjct: 291 ESVRAVKGPGCSVTGFVLAKKVPGHVWITANSNSH----------SFHPEEMNMTHTVNH 340

Query: 164 LAFGEHF----------------PGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
           L FG                       + L GV +     +  ++++++ V T     +G
Sbjct: 341 LFFGNQLGRNKLKALERRERGASSNWHDKLAGVTFRSLQTNVTHEHYLQTVLTTLRP-AG 399

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
             +  + +  T+H  +    R   LP   F ++ SP++V  TEE   F HF+T + AIVG
Sbjct: 400 SYVAYHAYEYTQHSHALVTTR--ELPRAKFHFNPSPVQVVVTEEREPFYHFITTLMAIVG 457

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           GV++V GI D F+ H    + +K E+GK
Sbjct: 458 GVYSVCGIADGFV-HNTLNMMRKFELGK 484


>gi|145350046|ref|XP_001419434.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579665|gb|ABO97727.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 513

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 58/228 (25%), Positives = 109/228 (47%), Gaps = 28/228 (12%)

Query: 79  EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 135
           EA +++   L  P  +D  KR           G GC I GF+ V KV G+   +   P  
Sbjct: 301 EAAQEENMKLRLPASVDMQKRI---------IGPGCAITGFVLVKKVPGHLWISASSPDH 351

Query: 136 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 187
           SFH   +++  ++    + F   H+++        K   GE      + L   R+     
Sbjct: 352 SFHGETMNMTHVV----NHFYFGHQLSDERRRYLEKFHAGEKAGDWHDRLASERFVSNAA 407

Query: 188 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
              ++++++ V T  T    +T+  + +  T+H  +  +     LP   F Y  SP+++ 
Sbjct: 408 HVSHEHYLQTVLTTITPRGRYTLPFSVYEYTQHSHAVHE----PLPKAKFHYQPSPMQIV 463

Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            +EE ++F  F+T++ AI+GGV++V GI D  +++    +++K+E+GK
Sbjct: 464 VSEEKMAFYSFITSLMAIIGGVYSVMGIADGVLFNSLALVRRKLELGK 511


>gi|308807242|ref|XP_003080932.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
 gi|116059393|emb|CAL55100.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
          Length = 533

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 112/228 (49%), Gaps = 28/228 (12%)

Query: 79  EAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFA---PGK 135
           EA R+  + L  P  +D  +R           G GC I GF+ V KV G+   +   P  
Sbjct: 321 EAAREANFNLQLPASVDVQRRI---------MGPGCAITGFVLVKKVPGHLWISASSPDH 371

Query: 136 SFHQSGVHVHDILAFQRDSFNISHKIN--------KLAFGEHFPGVVNPLDGVRWTQETP 187
           SFH   +++  ++    + F   H+++        K   GE      + L G  +  E+ 
Sbjct: 372 SFHGQNMNMTHVV----NHFYFGHQLSDDRRRYLEKFHAGEKAGDWHDRLAGQTFVSESA 427

Query: 188 SGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVT 247
              ++++++   TV T ++     +  FSV E+ + +     + LP   F Y  SP+++ 
Sbjct: 428 HISHEHYLQ---TVLTSIAPRGRFALPFSVYEYTQHAHAVH-EPLPKAKFHYQPSPMQIA 483

Query: 248 FTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            +EE ++F  F+T++ AI+GGV++V GI D  +++    ++KK+E+GK
Sbjct: 484 VSEERMAFYSFITSLMAIIGGVYSVMGIADGVLFNSIALVRKKLELGK 531


>gi|32566449|ref|NP_510494.2| Protein C18B12.6 [Caenorhabditis elegans]
 gi|25809204|emb|CAA20929.2| Protein C18B12.6 [Caenorhabditis elegans]
          Length = 428

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 41/130 (31%), Positives = 73/130 (56%), Gaps = 2/130 (1%)

Query: 151 QRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTI 210
           ++ S NISH+I K  FG   PG+V PL G     E+   +Y+YFIK+VPT       +T+
Sbjct: 257 EKQSGNISHRIEKFNFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFSYTM 316

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
            + Q+SVT   +  ++G   +  G+ F Y+ +   +   +  ++ + +L  +C+I+GGV+
Sbjct: 317 -AYQYSVTFLKKQLKEGE-HSHGGILFEYEFTANVIEVHKTSITLISYLIRICSILGGVY 374

Query: 271 TVSGIIDAFI 280
             S I++  +
Sbjct: 375 ATSTIVNNIL 384


>gi|299115405|emb|CBN74236.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 447

 Score = 78.6 bits (192), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 59/199 (29%), Positives = 92/199 (46%), Gaps = 30/199 (15%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           R+K++   GC + GF+ VN+V GNFH     + H          +    + NISH +  L
Sbjct: 264 RLKQDY-PGCQLSGFIMVNRVPGNFHIEARSALH----------SIDPTAANISHVVKTL 312

Query: 165 AFGEHFP---------GV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
            FG   P         GV    +  L+   ++ ++      ++IKVV T    ++     
Sbjct: 313 KFGTQVPVRGRRVIESGVELEGLPALEDRVYSIDSLHTAPHHYIKVVSTFVGGLAKTDNL 372

Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
             Q  V+      EQ ++   P   F YDLSP+ V   +    +  FLT+V AIVGG FT
Sbjct: 373 QYQMMVSSQTMPYEQDQV---PEAKFSYDLSPMSVHIKQRRRKWYDFLTSVLAIVGGTFT 429

Query: 272 VSGIIDAFIYHGQRAIKKK 290
           V G++D  ++   R +K+K
Sbjct: 430 VVGVLDNILF---RVVKQK 445


>gi|302841900|ref|XP_002952494.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
           nagariensis]
 gi|300262133|gb|EFJ46341.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
           nagariensis]
          Length = 478

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/197 (29%), Positives = 93/197 (47%), Gaps = 23/197 (11%)

Query: 113 GCNIYGFLEVNKVAGNFHF---APGKSFHQSGVH----VHDILAFQRDSFNISHKINKLA 165
           GCN+ GF+ V KV G       + G SF  + ++    VH      R S     ++ +L 
Sbjct: 289 GCNLAGFVMVKKVPGTLTVVARSEGHSFDHTWMNMTHLVHTFHVGTRPSPRKYQQLKRL- 347

Query: 166 FGEHFPGVVNPLDGVRWTQ------ETPSGMYQYFIKVVPT-VYTDVSGHTIQSNQFSVT 218
                P      D   W +      E P   +++++++V T +    S H+   + +  T
Sbjct: 348 ----HPAGEGEGDLFWWREKREKRGEHPQSTHEHYLQIVLTSIEPRRSRHSGNYDAYEYT 403

Query: 219 EHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDA 278
            H   S   +   +P   F YDLSPI++   E    +  FLT  CAI+GGVFTV+GI+DA
Sbjct: 404 AH---SHTYQSDAIPSARFTYDLSPIQILVQETARPWYQFLTTSCAIIGGVFTVAGILDA 460

Query: 279 FIYHGQRAIKKKIEIGK 295
            +Y   + + KK+ +GK
Sbjct: 461 LLYQSFKVV-KKLNLGK 476


>gi|42562656|ref|NP_175508.2| protein Disulfide Isomerase (PDIa) family, redox active TRX
           domain-containing protein [Arabidopsis thaliana]
 gi|332194483|gb|AEE32604.1| protein Disulfide Isomerase (PDIa) family, redox active TRX
           domain-containing protein [Arabidopsis thaliana]
          Length = 484

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 78/282 (27%), Positives = 123/282 (43%), Gaps = 41/282 (14%)

Query: 35  GAPKIDKPLQRHGGRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLI 94
           G P I    +  G R +H      S YG   +D       EE+ +  +K+   L+    +
Sbjct: 221 GYPSIRIFRRGSGLREDHGNHEHESYYGDRDTDS-LVKMVEELLKPIKKEDHKLA----L 275

Query: 95  DQCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS 154
           D           K     GC I G++   KV G    +       SG H     +F    
Sbjct: 276 DGKSDNAASTFKKAPVSGGCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQ 325

Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDGVRWTQET---PSGMYQYFIK 196
            N+SH +  L FG                +     + L+G  +  E     +   +++++
Sbjct: 326 MNMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQ 385

Query: 197 VVPT-VYTDVSG--HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHV 253
           ++ T V +  SG  H++   ++  T H   S   R    P   F ++LSP++V  +E   
Sbjct: 386 IIKTEVISRRSGQEHSLI-EEYEYTAH---SSVARSYHYPEAKFHFELSPMQVLISENPK 441

Query: 254 SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           SF HF+TNVCAI+GGVFTV+GI+D+   +  R + KKIE+GK
Sbjct: 442 SFSHFITNVCAIIGGVFTVAGILDSIFQNTVRMV-KKIELGK 482


>gi|397568633|gb|EJK46248.1| hypothetical protein THAOC_35093 [Thalassiosira oceanica]
          Length = 601

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 97/220 (44%), Gaps = 60/220 (27%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           E+E  GC I GFL V++  GNFH       H    H+           N+SH IN L+FG
Sbjct: 403 EDEHPGCQISGFLLVDRAPGNFHIQAQSKNHDLAAHM----------TNVSHIINHLSFG 452

Query: 168 EHFP------GVVN----------PLDGVRWTQETPSGMYQYFIKVVPTVY--------- 202
           + F       G+ N          P DG  +        + +++KV+ T +         
Sbjct: 453 KPFSKYFIKEGLKNTPAGFLDTTRPFDGNVYVTHNEHEAHHHYLKVITTEFEPQRDTKKQ 512

Query: 203 ------------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTE 250
                          +   +QS+Q S+          R   +P   F YDLSPI V++++
Sbjct: 513 YGKKKGFYKPPEPQRAYQILQSSQLSLY---------RNDIVPEAKFTYDLSPIAVSYSK 563

Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           ++ ++  + T++ AI+GG FTV G++++ +Y    A+ KK
Sbjct: 564 KYRAWYDYFTSLMAIIGGTFTVVGMVESSLY----AVSKK 599


>gi|123499008|ref|XP_001327531.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121910461|gb|EAY15308.1| hypothetical protein TVAG_394520 [Trichomonas vaginalis G3]
          Length = 357

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 58/229 (25%), Positives = 103/229 (44%), Gaps = 31/229 (13%)

Query: 59  SCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYG 118
           SCY A ++    C  C++V +A++ +         I QC     +  I+E + EGC +  
Sbjct: 143 SCYAANNTK--VCKTCKDVVQAHKNQELLPPPLSTIAQCASTAAI--IQEMKDEGCKLTS 198

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHD--ILAFQRDSFNISHKINKLAFGE---HFPGV 173
             +  ++A  FH APG ++   G H H+  IL  +    N++H I    F      F   
Sbjct: 199 AFQTVRLASEFHVAPGYNYLYKGWHSHNTTILGSESKDLNLTHIIRSFRFNRVDGKF--- 255

Query: 174 VNPLDGVRWTQETPSGMYQYFIKVVPTVYT-DVSGHTIQSNQFSVTEHFRSSEQGRLQTL 232
             PLD V   Q T  G ++        VY+ D+  +T  +N++ + +  + S        
Sbjct: 256 --PLDNVTSIQ-TGKGSWR-------VVYSADIMDNTYTANKYELMDPPKFSS------- 298

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
            GV+F Y ++P+      +   FLH  T +  ++G V     ++D+F++
Sbjct: 299 -GVYFRYAINPVSAIDYYDTEPFLHLCTRLLTVIGAVLAAFRLLDSFLF 346


>gi|223995687|ref|XP_002287517.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976633|gb|EED94960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 457

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 56/193 (29%), Positives = 92/193 (47%), Gaps = 36/193 (18%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHF-- 170
           GC I GFL V++  GNFH       H    H+           N+SH IN L+FG+ F  
Sbjct: 277 GCQISGFLLVDRAPGNFHIQAQSKGHDLAAHMT----------NVSHIINHLSFGKPFSK 326

Query: 171 -----------PG---VVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFS 216
                      PG      P DG  +  +     + +++KV+ T +    G   Q+++++
Sbjct: 327 YFLKDGLKNTPPGFLETTKPFDGNVYITQNEHEAHHHYLKVITTEFEPEKG--AQNSKYN 384

Query: 217 VTEHFRS-----SEQGRL---QTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
             E  R+     S Q  L     +P   F YDLSPI V++ +++  +  + T++ AI+GG
Sbjct: 385 KKEPSRAYQILQSSQLSLYRSDIVPEAKFTYDLSPIAVSYNKKYRHWYDYFTSLMAIIGG 444

Query: 269 VFTVSGIIDAFIY 281
            FTV G++++ I+
Sbjct: 445 TFTVVGMLESGIH 457


>gi|145549492|ref|XP_001460425.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428255|emb|CAK93028.1| unnamed protein product [Paramecium tetraurelia]
          Length = 320

 Score = 77.4 bits (189), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 92/200 (46%), Gaps = 26/200 (13%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           R    E +GC + G L+VN+V G   F   +S+   G      +       + SHK    
Sbjct: 130 RTAINEKQGCEVIGNLKVNRVRGKISFGAHRSYSYIGA-----VGNLNLPLDYSHKFVSF 184

Query: 165 AFGEHFP----------GVVNPLDGVRWTQE----TPSGMYQYFIKVVPTVYTDVSGHTI 210
           +FG+             G ++   G +  ++    + S  +++FI ++PT YT ++    
Sbjct: 185 SFGDEDALKKVKSLFQQGQLDSFAGTQRIKKPELASQSMQHEHFISIIPTHYTLLNKQVY 244

Query: 211 QSNQFSVTEH-FRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
              Q++   +  RS+  G +Q        YD +P  VT+ +     LHF   +CA++GG+
Sbjct: 245 SVYQYTANHNEVRSNNYGNVQ------LRYDFAPTTVTYWQTKEDILHFYVQICAVIGGI 298

Query: 270 FTVSGIIDAFIYHGQRAIKK 289
           FTVS +I+A +Y   R + K
Sbjct: 299 FTVSSMIEACVYKVMRMLLK 318


>gi|341884627|gb|EGT40562.1| hypothetical protein CAEBREN_07459 [Caenorhabditis brenneri]
          Length = 428

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 87/176 (49%), Gaps = 15/176 (8%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKL 164
           E+G+ C ++G  +V K         GK         + +L F   + N    ISH+I K 
Sbjct: 221 EDGKACRLHGKFKVRK---------GKEEKIVMSISNPLLMFDHQAENQPGNISHRIEKF 271

Query: 165 AFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSS 224
            FG   PG+V PL G     E+   +Y+YFIK+VPT       +T+ + Q+SVT   +  
Sbjct: 272 NFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTYTM-AYQYSVTFLKKQL 330

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
           ++G   +  G+ F Y+ +   +   +  V+   +L  +C+I+GGV+  S I++  +
Sbjct: 331 KEGE-HSHGGILFEYEFNANVIEVHKTSVTLFSYLIRICSILGGVYATSTIVNNIV 385


>gi|145540599|ref|XP_001455989.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423798|emb|CAK88592.1| unnamed protein product [Paramecium tetraurelia]
          Length = 322

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 97/199 (48%), Gaps = 35/199 (17%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAP-GKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +GE C + GF +VNKV GNFH +     +    +H  D+  F++    + H I +L FGE
Sbjct: 138 QGEQCQLKGFFQVNKVPGNFHVSYHAHHYLLQRIHQRDLSVFRK--MKLDHSIYELRFGE 195

Query: 169 HFPGVVNPLDGVR------------WTQ---ETPSGM---YQYFIKVVPTVYTDVSGHTI 210
                +     +R            W Q     P G    Y+Y+I  +P  + D +    
Sbjct: 196 -----ITTTSKMRKYSKSLQKFQNSWKQIVKSAPEGEKQDYEYYIDALPVRFYDENERNY 250

Query: 211 QS-NQFSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 268
           Q+  ++S+ E    ++  R  T +  ++F Y +SP+ + ++ +  S  HF+  + AI+GG
Sbjct: 251 QTLYKYSINE----AQMPRTFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQLLAIIGG 306

Query: 269 VFTVSGIIDAFIYHGQRAI 287
           VF V GI+++ +   Q+AI
Sbjct: 307 VFAVIGILNSIV---QKAI 322


>gi|397641928|gb|EJK74922.1| hypothetical protein THAOC_03372 [Thalassiosira oceanica]
          Length = 583

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/203 (28%), Positives = 91/203 (44%), Gaps = 43/203 (21%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC + G L VN+V GNFH    KS +      H++ A      N++H++N ++FGE 
Sbjct: 385 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHISFGEP 434

Query: 170 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
              +                           NP+D   +        + ++IKVV T   
Sbjct: 435 ITKLPYHMENTPFMRKVKRVLKQVPEEHKQFNPMDDQEYITTQFHQAFHHYIKVVSTHLN 494

Query: 204 DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
             S  T+  N  +    ++  EQ ++       +P   F YD+SP+ V   +E   +  +
Sbjct: 495 MGSSSTV--NDVNSITVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDY 552

Query: 259 LTNVCAIVGGVFTVSGIIDAFIY 281
           LT++CAI+GG FT  G+IDA +Y
Sbjct: 553 LTSLCAIIGGTFTTLGLIDATLY 575


>gi|268581819|ref|XP_002645893.1| Hypothetical protein CBG07646 [Caenorhabditis briggsae]
          Length = 426

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 68/125 (54%), Gaps = 2/125 (1%)

Query: 156 NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQF 215
           NISH+I K  FG   PG+V PL G     E+   +Y+YFIK+VPT       +T+ + Q+
Sbjct: 262 NISHRIEKFNFGPRIPGLVTPLAGAEHISESGQDIYRYFIKIVPTKIYGYFTYTL-AYQY 320

Query: 216 SVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGI 275
           SVT   +  ++G   +  G+ F Y+ +   +   +   +   +L  +C+I+GGV+  S I
Sbjct: 321 SVTFLKKQLKEGE-HSHGGILFEYEFTANVIEVHKTSTTLFSYLIRICSILGGVYATSTI 379

Query: 276 IDAFI 280
           I+  +
Sbjct: 380 INNIV 384


>gi|224013158|ref|XP_002295231.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969193|gb|EED87535.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 492

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/205 (27%), Positives = 89/205 (43%), Gaps = 43/205 (20%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           E  GC + G L VN+V GNFH    KS +      H++ A      N++H++N L+FGE 
Sbjct: 290 EHPGCQVSGHLMVNRVPGNFHIE-AKSVN------HNLNAAMT---NLTHRVNHLSFGEP 339

Query: 170 FPGV--------------------------VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
              +                           NP+D   +        + ++IKVV T   
Sbjct: 340 ITKLPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIKVVSTHLN 399

Query: 204 --DVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFYDLSPIKVTFTEEHVSFL 256
               S      N  +    ++  EQ ++       +P   F YD+SP+ V   +E   + 
Sbjct: 400 MGSSSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWY 459

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIY 281
            +LT++CAI+GG FT  G+IDA +Y
Sbjct: 460 DYLTSLCAIIGGTFTTLGLIDATLY 484


>gi|328700149|ref|XP_003241164.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Acyrthosiphon pisum]
 gi|328700151|ref|XP_001951220.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Acyrthosiphon pisum]
 gi|328700153|ref|XP_003241165.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 3 [Acyrthosiphon pisum]
          Length = 289

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 62/199 (31%), Positives = 93/199 (46%), Gaps = 9/199 (4%)

Query: 27  IESRQDGIGAPKIDKPLQRHG--GRLEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKK 84
           + S  D IGA  +D   Q     G L+ ++T+       +   E        +RE Y   
Sbjct: 84  VASTCDSIGADIVDTTGQNMMLFGELKTDDTWWEMTKEQQQHFEKMRKFNAYLREEYHSM 143

Query: 85  GWALSNPDLIDQCKREGFLQRIKEEE-GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
              L   D  +  K + F++  K     + C I+G L +NKV GNFH  PGKS    G H
Sbjct: 144 KDILWMFDDYNTLKNKIFVRTDKPNTLPDACRIHGSLILNKVIGNFHITPGKSLIVPGGH 203

Query: 144 VHDILA-FQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVY 202
           VH     F  ++ N SH+IN+ +FG    G++ PL+G  +     +  Y+YFI VV    
Sbjct: 204 VHLTGPFFGSEATNFSHRINQFSFGVPTKGIIYPLEGELYETNENAVSYKYFIDVVA--- 260

Query: 203 TDVSGHT--IQSNQFSVTE 219
           TDV   +  I++ Q+S  +
Sbjct: 261 TDVKSRSNEIKTYQYSAKD 279


>gi|145543941|ref|XP_001457656.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425473|emb|CAK90259.1| unnamed protein product [Paramecium tetraurelia]
          Length = 322

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 88/187 (47%), Gaps = 22/187 (11%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ-SGVHVHDILAFQRDSFNISHKINKLAFGE 168
           +GE C   GF  VNKV GNFH +     H    +H  D+  +++    + H I +L FG+
Sbjct: 138 QGEQCQFKGFFSVNKVPGNFHISYHAHHHLIQRIHQRDLSTYRK--LKLDHTIYELRFGD 195

Query: 169 H--------FPGVVNPLDGVRW---TQETPSGM---YQYFIKVVPTVYTDVSGHTIQS-N 213
           +        +P  +       W    +  P G    Y+Y+I  +P  + D      Q+  
Sbjct: 196 NSSSFKMKKYPKSLQKFQS-SWNSIAKTAPEGEKQDYEYYINALPVRFYDDKERNYQTLY 254

Query: 214 QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
           ++S+ E   +        +  ++F Y +SP+ + ++ +  S  HF+  + AIVGGVF V 
Sbjct: 255 KYSINE---AQMTRSFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQLLAIVGGVFAVI 311

Query: 274 GIIDAFI 280
           GI+++ I
Sbjct: 312 GIVNSII 318


>gi|154415829|ref|XP_001580938.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121915161|gb|EAY19952.1| hypothetical protein TVAG_402060 [Trichomonas vaginalis G3]
          Length = 359

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 71/276 (25%), Positives = 113/276 (40%), Gaps = 35/276 (12%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRHGGRLEHNETYCGSC 60
           +D  G + LDV +DI  KR+      I+   + +                   +  C  C
Sbjct: 89  LDSIGVEMLDVSNDIKFKRMSVDNRFIDYSNESL-------------------KDICLPC 129

Query: 61  YGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGCNIYGFL 120
           +G +   E CCN C+EV+  +  +G    NP   DQC         K++  E C I G +
Sbjct: 130 HGLKPEGE-CCNTCDEVKAIFEARGEDF-NPLPFDQCMGN---VNFKKDMSESCLIEGTI 184

Query: 121 EVNKVAGNFHFAPGKS--FHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVVNPLD 178
              K  G FH APG++  F ++G H HD       S    H I++   G+ +  V +P+ 
Sbjct: 185 HTFKSPGQFHIAPGRNTKFRRTG-HQHDTGLSPEAS--CPHTIHEFYVGQKYDNVRSPIR 241

Query: 179 G--VRWTQETPS-GMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGV 235
           G   R     P   +Y  FI  V   + D   +T  S ++S     +    G     PG+
Sbjct: 242 GKIFRDRDSLPRIYLYDLFITKVLHTFNDALQYT--SYEYSYNLGAKIFNPGSFYQ-PGI 298

Query: 236 FFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
           +F Y  SP+ +       + + FL     ++ G+F 
Sbjct: 299 YFKYMFSPMTIVERSISKNPMRFLVTSVGVLAGIFA 334


>gi|219125194|ref|XP_002182871.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405665|gb|EEC45607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 467

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 94/222 (42%), Gaps = 40/222 (18%)

Query: 84  KGWALSNPDLIDQCKREGFLQRIKEEEGE--GCNIYGFLEVNKVAGNFHFAPGKSFHQSG 141
           K W     D  D  + E   Q  ++   +  GC + G L VN+V GNFH       H   
Sbjct: 254 KEWHSKASDSADPAEVEKKRQLYQQNRPDHPGCQVSGHLMVNRVPGNFHLEAKSKSHNLN 313

Query: 142 VHVHDILAFQRDSFNISHKINKLAFGE--------------HFP---GVVNPLDGVRWTQ 184
             +           N+SH +N L+FGE                P       P+DG  +  
Sbjct: 314 AAMT----------NLSHVVNHLSFGEPIDENNRKSKRILKQVPEEHRQFAPMDGQAFLT 363

Query: 185 ETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQ-----TLPGVFFFY 239
           +     + ++IKVV T         + S+  +    ++  EQ ++       +P   F Y
Sbjct: 364 KAFHQAFHHYIKVVSTHLN------MGSSDANSMLTYQFLEQSQIVFYDDVNVPEARFSY 417

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           DLSP+ V   +E   +  +LT++CAI+GG FT  G+IDA +Y
Sbjct: 418 DLSPMSVVVEKEGRKWYDYLTSLCAIIGGTFTTLGLIDATLY 459


>gi|223646904|gb|ACN10210.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
 gi|223672767|gb|ACN12565.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
          Length = 238

 Score = 75.1 bits (183), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 33/67 (49%), Positives = 42/67 (62%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            C I+G L VNKVAGNFH   GK+      H H       D++N SH+I+ L+FGE  PG
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFGEEIPG 228

Query: 173 VVNPLDG 179
           ++NPLDG
Sbjct: 229 IINPLDG 235


>gi|428185569|gb|EKX54421.1| hypothetical protein GUITHDRAFT_99900 [Guillardia theta CCMP2712]
          Length = 475

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 69/237 (29%), Positives = 103/237 (43%), Gaps = 55/237 (23%)

Query: 93  LIDQCKREGFLQRI--KEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVH 143
           L+ Q   +    R+  KE++GE       GC + G L V +       APG    Q+   
Sbjct: 260 LMKQVNLQAPKSRVVDKEQDGEKESHNGVGCMVAGMLHVQR-------APGSIILQA--- 309

Query: 144 VHDILAFQRDSFNISHKINKLAFGEHF---PGVVNP---------LDGVRWTQE--TPSG 189
           V D   F   + ++SH +N L+FG        VV P         LD  ++  E  TP+ 
Sbjct: 310 VSDGHEFNWATMDVSHTVNHLSFGPFLSETAWVVMPPDIAQAVGSLDDKKFLSEERTPT- 368

Query: 190 MYQYFIKVVPTVY----------TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFY 239
           ++++++KVV  V            +  G+ + +N+             R   +P     Y
Sbjct: 369 VWEHYVKVVKNVVELPRSWGIPPVEAHGYVVHTNKVQ-----------RYAEVPTARINY 417

Query: 240 DLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGKF 296
           D+ PI V       S  HFLT +CAIVGGVFTVSGI  + +  G  ++  K  IGK 
Sbjct: 418 DILPIIVHVKTSRESNYHFLTKLCAIVGGVFTVSGIFASMVEGGIASLTHKETIGKL 474


>gi|357474783|ref|XP_003607677.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355508732|gb|AES89874.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 156

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/163 (34%), Positives = 85/163 (52%), Gaps = 31/163 (19%)

Query: 155 FNISHKINKLAFGEHFPGVVNP---LDGVRW------------------TQETPSGM-YQ 192
            N+SH IN L+FG+     V P   +D   W                  T++    +  +
Sbjct: 1   MNMSHVINHLSFGKK----VTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIE 56

Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEH 252
           ++I+VV T      G+ +   ++  T H   S       +P   F  +LSP++V  TE  
Sbjct: 57  HYIQVVKTEVITRKGYKLIE-EYEYTAH---SSVAHSVNIPVARFHLELSPMQVLITENQ 112

Query: 253 VSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
            SF HF+TNVCAI+GGVFTV+GI+D+ +++  +A+ KKIEIGK
Sbjct: 113 KSFSHFITNVCAIIGGVFTVAGILDSILHNTIKAM-KKIEIGK 154


>gi|444316650|ref|XP_004178982.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
 gi|387512022|emb|CCH59463.1| hypothetical protein TBLA_0B06400 [Tetrapisispora blattae CBS 6284]
          Length = 355

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 48/166 (28%), Positives = 83/166 (50%), Gaps = 14/166 (8%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           E +GC+++G + VN+V G   F A G  +       ++++       N  H IN+ +FG 
Sbjct: 160 ELDGCHVFGQIPVNRVQGELQFTAKGYGYMNWERTPYELI-------NFDHVINEFSFGN 212

Query: 169 HFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQG 227
            FP + NPLD   +   + P   + Y   VVP+ Y  + G  + + Q+SV+++  +    
Sbjct: 213 FFPYIDNPLDNTAKINLDDPVTSWIYDTSVVPSYYRKL-GAEVDTFQYSVSQYSYNGTSL 271

Query: 228 RLQT----LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
           +  T    +PG+FF YD   + +  T+  +SF  FL  + AI+  V
Sbjct: 272 QKMTSSTSVPGIFFKYDFEALSLVLTDHRISFFQFLIRLVAILSFV 317


>gi|297847442|ref|XP_002891602.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337444|gb|EFH67861.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 484

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 61/204 (29%), Positives = 98/204 (48%), Gaps = 36/204 (17%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
           GC I G++   KV G    +       SG H     +F     N+SH +  L+FG     
Sbjct: 294 GCRIEGYVRAKKVPGELVISA-----HSGAH-----SFDASQMNMSHIVTHLSFGTMVSE 343

Query: 173 VV---------------NPLDGVRWTQETP---SGMYQYFIKVVPT-VYTDVSG--HTIQ 211
            +               + L+G  +  +     +   ++++++V T V +  SG  H++ 
Sbjct: 344 RLWTDMKRLLPYLGQSHDRLNGKSFINQRKFDVNVTIEHYLQIVKTEVISRRSGKEHSLI 403

Query: 212 SNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 271
             ++  T H   S        P   F ++LSP++V  +E   SF HF+TNVCAI+GGVFT
Sbjct: 404 -EEYEYTAH---SSVAHSYHYPEAKFHFELSPMQVLISENPKSFSHFITNVCAIIGGVFT 459

Query: 272 VSGIIDAFIYHGQRAIKKKIEIGK 295
           V+GI+D+   +  R + KKIE+GK
Sbjct: 460 VAGILDSIFQNTVRMV-KKIELGK 482


>gi|443921357|gb|ELU41041.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
           solani AG-1 IA]
          Length = 579

 Score = 73.9 bits (180), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 61/195 (31%), Positives = 90/195 (46%), Gaps = 47/195 (24%)

Query: 119 FLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDS--FNISHKINKLAF---------- 166
           FL +NKV GNFHF+PG+SF     H +D++ + +D    +  H I++  F          
Sbjct: 317 FLRINKVTGNFHFSPGRSFLSQRGHAYDLVPYLKDGNHHDFGHYIHEFHFEGDREIEDRW 376

Query: 167 -----GEHFPGVV----NPLDGVRWTQETPSG-MYQYFIKVVPTVYTDVSGHTIQSNQFS 216
                G  +   V     PLDG+    E PS  M QYF+KVV T    + G  ++++Q+S
Sbjct: 377 REGNRGTEWRARVGSDKQPLDGL----EQPSNWMIQYFLKVVSTEVRHLDGDLVRAHQYS 432

Query: 217 VTEHFRSSEQGRLQTLPGVFF--FYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSG 274
           VT + R          PG  F    D + IK T              +CAIVGGV T++ 
Sbjct: 433 VTNYERDIR-------PGHEFDPLRDANGIKTTH------------GLCAIVGGVLTLAS 473

Query: 275 IIDAFIYHGQRAIKK 289
           I D+  +     I++
Sbjct: 474 IADSVAFASLNKIEE 488


>gi|388517493|gb|AFK46808.1| unknown [Lotus japonicus]
          Length = 156

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 82/159 (51%), Gaps = 23/159 (14%)

Query: 155 FNISHKINKLAFGE---------------HFPGVVNPLDG---VRWTQETPSGMYQYFIK 196
            N+SH +N L FG+               H     + L+G   V       +   +++I+
Sbjct: 1   MNMSHVVNHLTFGKKVTPRAISDMQRLIPHIGSSHDRLNGRSFVNTHNLEANVTIEHYIQ 60

Query: 197 VVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFL 256
           +V T     +G+ +  + +  T H   S       +P   F  +LSP++V  TE   SF 
Sbjct: 61  IVKTEVVTRNGYKLIED-YEYTAH---SSVAHSLDIPVAKFHLELSPMQVLITENQKSFS 116

Query: 257 HFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           HF+TNVCAI+GGVFTV+GI+D+ +++  R I KK+E+GK
Sbjct: 117 HFITNVCAIIGGVFTVAGIVDSILHNTIRMI-KKVELGK 154


>gi|384486505|gb|EIE78685.1| hypothetical protein RO3G_03389 [Rhizopus delemar RA 99-880]
          Length = 188

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 43/120 (35%), Positives = 67/120 (55%), Gaps = 13/120 (10%)

Query: 84  KGWALSNPDLIDQCKR----EGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQ 139
           K  A+ +P  I++  R    + +  +I ++ G  C IYG L+VNKVA N H       + 
Sbjct: 72  KYQAIEDPKYINEIIRAANGKSYDHQIAKDMG-ACRIYGSLKVNKVASNLHITSDGHGYA 130

Query: 140 SGVHV-HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVV 198
           S VH  H++L       N +H+I++L+FGE +P ++NPLD      ET   M+QY++ VV
Sbjct: 131 SRVHTSHEVL-------NFTHRIDELSFGEFYPNLINPLDNSMEIAETHFEMFQYYLSVV 183


>gi|71409118|ref|XP_806922.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870803|gb|EAN85071.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 310

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 91/193 (47%), Gaps = 31/193 (16%)

Query: 1   MDISGEQHLDVKHDIFKKRLDSQGNV--IESRQDGIGAPKIDKPLQRHGGRLEHNETYCG 58
           +D++G  +L+V  ++FK  +D+QGN   I +RQ G+G        +        +  +CG
Sbjct: 109 LDVTGTVNLNVTRNLFKTPVDAQGNFAFIGTRQ-GVGE---YGSFREQSKDDPSSPQFCG 164

Query: 59  SCYGAE------SSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGE 112
            C+  E       +   CCN C +V  AY ++G      + ++QC  E  L RI      
Sbjct: 165 RCFINEHQVSMMENKNRCCNTCNDVLNAYDQQGLPRPQKNEVEQCIYE--LSRI----NP 218

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG-EHFP 171
           GCN  G L V K  G   FAP +     G  + D++      F+ SH INKL+ G EH  
Sbjct: 219 GCNYKGTLIVKKFGGRLVFAPKRV--PGGFLIRDVM-----RFDSSHIINKLSIGDEHVT 271

Query: 172 -----GVVNPLDG 179
                GV +PL+G
Sbjct: 272 RFSRRGVQHPLNG 284


>gi|365986066|ref|XP_003669865.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
 gi|343768634|emb|CCD24622.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
          Length = 353

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/180 (27%), Positives = 87/180 (48%), Gaps = 23/180 (12%)

Query: 113 GCNIYGFLEVNKVAGNFHFAPG----KSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           GC+I+G + VN+VAG             +H++ +          +  N +H IN+ +FGE
Sbjct: 162 GCHIFGSVNVNQVAGELQVTAKGHGYADYHRAPL----------EKVNFAHVINEFSFGE 211

Query: 169 HFPGVVNPLD-GVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEH---FRSS 224
            FP + NPLD   ++  + P   Y Y   V+P +Y  + G  + + Q+SV EH    + S
Sbjct: 212 FFPYIDNPLDNSAKFNMDDPLTAYVYDTSVIPMIYRKM-GAEVDTFQYSVAEHQYKSKES 270

Query: 225 EQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG-GVFTVSGII---DAFI 280
                  +PG+FF Y+   + +  ++  + F+ F+  + AI+   V+  S +    D FI
Sbjct: 271 SSSNSFRVPGIFFQYNFENLSIVVSDRRLGFIQFIVRLVAILSFAVYIASWLFILADMFI 330


>gi|219130117|ref|XP_002185219.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403398|gb|EEC43351.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 421

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/212 (27%), Positives = 97/212 (45%), Gaps = 24/212 (11%)

Query: 105 RIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQR------------ 152
           + + ++G+GC I G + V  VAG F     K   Q    + +     +            
Sbjct: 210 KFETKKGQGCTIEGHIRVPVVAGKFEITLNKRTWQQAASILNRQMLMQVLGATSEHTSSN 269

Query: 153 ----DSFNISHKINKLAFGEHFP-GVVNPLDGVRWTQETPSG---MYQYFIKVVPT-VYT 203
               D +N +H I+ + FG+ FP  +  PL+  R       G   + +  I++VPT   T
Sbjct: 270 DELGDRYNSTHFIHYIRFGDSFPLNIEKPLEKRRHIFRNKYGAMAVQEMKIELVPTYTST 329

Query: 204 DVSGHTIQSNQFSVTEHFRSSE---QGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
            +   + Q+ Q SV +     E   Q    +LPG+   YD SP+ V  T    + L FL+
Sbjct: 330 WLPTSSRQTYQASVVDSTIEPEHMAQAGASSLPGLAVQYDFSPLTVYHTGGRDNILVFLS 389

Query: 261 NVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIE 292
           ++ +IVGGVF   G++   + H  +A+ KKI+
Sbjct: 390 SLVSIVGGVFVTVGLVSGCLVHSAQAVAKKID 421


>gi|300123978|emb|CBK25249.2| unnamed protein product [Blastocystis hominis]
          Length = 109

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 61/92 (66%), Gaps = 2/92 (2%)

Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTE 250
           YF+K++P  +  + G T +S ++SVTE+ +  ++     +T PGV+F Y ++PI++T  E
Sbjct: 10  YFLKLIPVEHISLFGGTSRSYEYSVTEYTQLLDKPSYFSRTSPGVYFKYQITPIRLTKRE 69

Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
             + FL + T +C+IVGGV T+SGII + + H
Sbjct: 70  SRIGFLQYYTTLCSIVGGVITISGIIQSLLTH 101


>gi|219111363|ref|XP_002177433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411968|gb|EEC51896.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 520

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 94/201 (46%), Gaps = 32/201 (15%)

Query: 108 EEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFG 167
           + E  GCNI G L +++V GNFH    +S H      HD++       N+SH ++ L+ G
Sbjct: 333 DAEHPGCNIAGHLLLDRVPGNFHIQ-ARSPH------HDLVPHMT---NVSHVVHHLSIG 382

Query: 168 EH------------FPGVVN----PLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
           E              P  V     P++G  +  +     Y +++KV+ T   +V G    
Sbjct: 383 EPVAERLIEQEKVILPEDVKRKLKPMNGNAYVTKELHEAYHHYLKVITT---NVDGLKFG 439

Query: 212 SNQFSVTEHFRSSEQG--RLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
                  +  +SS+    R   +P   F +DLSP+ V++      +  + T++ AI+GG 
Sbjct: 440 KRDLRAYQILQSSQLSFYRNDIIPEAKFVFDLSPVAVSYRTTSRRWYDYFTSILAIIGGT 499

Query: 270 FTVSGIIDAFIYHGQRAIKKK 290
           FTV G++++ I H   A K++
Sbjct: 500 FTVVGLLESTI-HATVARKRR 519


>gi|301101702|ref|XP_002899939.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262102514|gb|EEY60566.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 101

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 52/75 (69%), Gaps = 1/75 (1%)

Query: 223 SSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
           S+ Q   QT P   F +D+SP+ V  T +++ F HF+T++CA++GGVFT+  ++D+ ++H
Sbjct: 28  STTQYEDQT-PSALFTFDISPLVVQITTDNIPFYHFITHLCAVIGGVFTILSLVDSGVFH 86

Query: 283 GQRAIKKKIEIGKFS 297
              +IKKK ++GK S
Sbjct: 87  AMNSIKKKQQLGKLS 101


>gi|327354451|gb|EGE83308.1| hypothetical protein BDDG_06252 [Ajellomyces dermatitidis ATCC
           18188]
          Length = 113

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 41/97 (42%), Positives = 57/97 (58%), Gaps = 13/97 (13%)

Query: 206 SGHTIQSNQFSVTEHFRSSEQG---------RLQT---LPGVFFFYDLSPIKVTFTEEHV 253
           SG +I+++Q+SVT H RS + G         RL +   +PGVF  YD+SP+KV   E   
Sbjct: 13  SGGSIETHQYSVTSHKRSVDGGNDAEEGHKERLHSQGGIPGVFVNYDISPMKVINREART 72

Query: 254 -SFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKK 289
            +F  FLT VCA++GG  TV+  ID  +Y G   +KK
Sbjct: 73  KTFSGFLTGVCAVIGGTLTVAAAIDRALYEGSVRVKK 109


>gi|300122875|emb|CBK23882.2| unnamed protein product [Blastocystis hominis]
          Length = 109

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/92 (39%), Positives = 60/92 (65%), Gaps = 2/92 (2%)

Query: 193 YFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRL--QTLPGVFFFYDLSPIKVTFTE 250
           YF+K++P     + G T +S ++SVTE+ +  ++     +T PGV+F Y ++PI++T  E
Sbjct: 10  YFLKLIPVEQISLFGGTSRSYEYSVTEYTQLLDKPSYFSRTSPGVYFKYQITPIRLTKRE 69

Query: 251 EHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYH 282
             + FL + T +C+IVGGV T+SGII + + H
Sbjct: 70  SRIGFLQYYTTLCSIVGGVITISGIIQSLLTH 101


>gi|403330686|gb|EJY64240.1| hypothetical protein OXYTRI_24846 [Oxytricha trifallax]
          Length = 345

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/187 (29%), Positives = 92/187 (49%), Gaps = 23/187 (12%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
           ++ EGC + G + +NKV GNFH     S H  G  V  I    +   + +H +N L+FG+
Sbjct: 153 DDQEGCMVEGTVIINKVPGNFHL----STHSFGEVVQKIYMNGK-KLDFTHTVNHLSFGD 207

Query: 169 ----------HFPGVVNPLDG--VRWTQETPSG--MYQYFIKVVPTVYTDVSGHTIQSNQ 214
                     +       +DG  V   Q    G  +  Y++ +    Y D +G   +  Q
Sbjct: 208 DKQMKSIQSKYNEKYTFDMDGTYVDQNQHLYQGQLLANYYLDINQVDYLDATGIFYKLLQ 267

Query: 215 FSVTEHFRSSEQGRLQT-LPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVS 273
                 ++SS+    Q  LP +FF Y+LSP+K+ +T  + S+  F   + AI+GG++ V+
Sbjct: 268 ---GFKYKSSKSIMAQMGLPAIFFRYELSPVKLQYTMTYKSWSEFFIEISAIIGGMYVVA 324

Query: 274 GIIDAFI 280
           GII++F+
Sbjct: 325 GIIESFL 331


>gi|118357982|ref|XP_001012239.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila]
 gi|89294006|gb|EAR91994.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila
           SB210]
          Length = 323

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 101/209 (48%), Gaps = 27/209 (12%)

Query: 96  QCKREGFLQRIKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF 155
           Q K E  L++IK +E   C I+G L +N + G+F F   +     G+    +        
Sbjct: 128 QQKIEEVLEQIKNKEQ--CRIHGQLLLNTIPGSFKF---RILQMKGLDEQLL-----KQL 177

Query: 156 NISHKINKLAFG--------EHFPGV----VNPLDGVRWTQETPSGMYQYFIKVVPTVYT 203
           NI+HKINKL+FG        E   G+        D  R+  E     Y  +IK++P    
Sbjct: 178 NINHKINKLSFGDTIKTKKIEKVLGLDKSDSEAFDESRYNYEYRCS-YDNYIKILPLNAE 236

Query: 204 DVS--GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTN 261
           ++   G+ I++N F  T + +   + +   +  V F Y +SPI + +  ++ SF  F+  
Sbjct: 237 NIKELGY-IRTNSFRFTMYQQVIPKEQTDIIE-VSFNYQVSPINIVYQTKNKSFYSFVVQ 294

Query: 262 VCAIVGGVFTVSGIIDAFIYHGQRAIKKK 290
           VCAI+GG+F V G+I+  + +   +I  K
Sbjct: 295 VCAIIGGIFCVFGVINTLVLNIISSINSK 323


>gi|361132020|gb|EHL03635.1| hypothetical protein M7I_0279 [Glarea lozoyensis 74030]
          Length = 235

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 57/180 (31%), Positives = 76/180 (42%), Gaps = 56/180 (31%)

Query: 116 IYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFN----ISHKINKLAFGEHFP 171
           I G L VNKV GNFH APG+SF    +HVHD+  +           SH I+ L FG   P
Sbjct: 38  IEGALRVNKVIGNFHIAPGRSFSNGNMHVHDLNNYFDTPVEGGHVFSHTIHHLRFGPQLP 97

Query: 172 -------GV---------VNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVS--------- 206
                  G          +NPLD  + T   P+  + YF+KVV T Y  +          
Sbjct: 98  EELTKKLGTKTNLWTNHHLNPLDDTKQTTTEPAYNFMYFVKVVSTSYLPLGWETQAYKSQ 157

Query: 207 -----------GH----TIQSNQFSVTEHFRSSEQGRLQT------------LPGVFFFY 239
                      GH    +++++Q+SVT H RS   G   +            +PGVFF Y
Sbjct: 158 LGSEWVGIGSYGHQHDGSVETHQYSVTSHRRSLNGGDDASEGHKEKVHARGGIPGVFFSY 217


>gi|412994089|emb|CCO14600.1| predicted protein [Bathycoccus prasinos]
          Length = 528

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 93/208 (44%), Gaps = 29/208 (13%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHF-APGKSFHQSGVHVHDILAFQRDSFNISHKINKL 164
           ++     GC+I GF+ V KV G+  F A  K+ H          +F  D  N++H+++  
Sbjct: 330 VQTRASTGCSITGFVLVKKVPGHVFFTADAKNGH----------SFDVDKLNVTHQVHHF 379

Query: 165 AFGEHFPGVV-----------------NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSG 207
            FG+                       + L       + P   ++++++ V T    +  
Sbjct: 380 YFGQQLSASRQKYMARFHRGEKEGDWHDKLANDFVVSKNPRTSHEHYLQTVLTTMQPLGP 439

Query: 208 HTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVG 267
                N +  T+H  S +    +T P   F +  SP+++   E+   F  F+T + AIVG
Sbjct: 440 FAQPFNVYEYTQHTHSVKTPDGET-PRAKFHFTPSPVQILGVEKRREFYQFITTLMAIVG 498

Query: 268 GVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           GV++V GIID  +++     K+K+++GK
Sbjct: 499 GVYSVVGIIDGLMHNTSLMFKRKMQLGK 526


>gi|299116076|emb|CBN74492.1| DEAD box helicase [Ectocarpus siliculosus]
          Length = 865

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 53/214 (24%), Positives = 95/214 (44%), Gaps = 39/214 (18%)

Query: 99  REGFLQR-IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNI 157
           R+GF +  + +++  GC + G + VN+V GNFH       H           F   + N+
Sbjct: 669 RKGFPEVGLHDDKWPGCMVTGHIMVNRVPGNFHIEAASKSH----------TFHGATTNL 718

Query: 158 SHKINKLAFGEHFPGVVN--------------PLDGVRWTQETPSGMYQYFIKVVPTVY- 202
           SH ++ ++FG   P                  PLDG  +          ++++VV ++Y 
Sbjct: 719 SHIVHHMSFGNDPPRRTQTKINRLTEDLRQNAPLDGNVYVANAYHQAPHHYLRVVGSMYH 778

Query: 203 -----TDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLH 257
                T   G+ I +N    ++     E+     +P   F Y++SP+ V    E   +  
Sbjct: 779 LSPMKTPWHGYQIVAN----SQMMLYDEE----EVPEARFSYNISPMSVLVRSEKRPWYD 830

Query: 258 FLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
           F+T V AIVGG F++ G++DA ++   R   +++
Sbjct: 831 FVTKVLAIVGGTFSMVGLVDAAVFRASRKAGRQL 864


>gi|312374049|gb|EFR21698.1| hypothetical protein AND_16520 [Anopheles darlingi]
          Length = 252

 Score = 70.5 bits (171), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 31/74 (41%), Positives = 46/74 (62%)

Query: 106 IKEEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           I +   + C I+G L +NKVAGNFH   GK+ H +  H+H    F     N SH+IN+ +
Sbjct: 163 IPQRPHDACRIHGVLTLNKVAGNFHITVGKTIHFARGHIHLNSIFANTQTNFSHRINRFS 222

Query: 166 FGEHFPGVVNPLDG 179
           FG+H  G+++PL+G
Sbjct: 223 FGDHTAGIIHPLEG 236


>gi|313227239|emb|CBY22386.1| unnamed protein product [Oikopleura dioica]
          Length = 380

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 104/247 (42%), Gaps = 60/247 (24%)

Query: 78  REAYRKKGWALSNPDLIDQCKREG--------------------FLQRIKE-----EEGE 112
           R    K   ++++P++ DQ  REG                     + ++ +      E +
Sbjct: 138 RAKLLKMKESMTDPNMRDQLLREGHDVKHLEFSRKKNKKMMEQGMMHKVVQINLDPNEPQ 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----------------- 155
           GC ++G +E+ K+AG       ++    G+     L+   D+                  
Sbjct: 198 GCRVWGSVELQKIAGTIKI---QAGGFGGMGGIPGLSGGLDAIMGMFMMPMMGMGAQIQD 254

Query: 156 ----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
               N SH+I+  +FG+   G+V  LDG    QE  +    Y +KVVP   TD+     Q
Sbjct: 255 GKKANFSHRIDHFSFGDPSSGLVYGLDGDIQIQEKENDDTTYVVKVVP---TDLKTFKFQ 311

Query: 212 SN--QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
               Q++VT+H   S++      P V   YD S + V+ TE   SF+  LT +  I+GG+
Sbjct: 312 QKAYQYAVTQHVGKSDK------PAVTIKYDFSGLGVSITEYRESFVGLLTRLAGILGGI 365

Query: 270 FTVSGII 276
              SGI+
Sbjct: 366 AASSGIL 372


>gi|313241668|emb|CBY33893.1| unnamed protein product [Oikopleura dioica]
          Length = 380

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 63/247 (25%), Positives = 104/247 (42%), Gaps = 60/247 (24%)

Query: 78  REAYRKKGWALSNPDLIDQCKREG--------------------FLQRIKE-----EEGE 112
           R    K   ++++P++ DQ  REG                     + ++ +      E +
Sbjct: 138 RAKLLKMKESMTDPNMRDQLLREGHDVKHLEFSRKKNKKMMEQGMMHKVVQINLDPNEPQ 197

Query: 113 GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF----------------- 155
           GC ++G +E+ K+AG       ++    G+     L+   D+                  
Sbjct: 198 GCRVWGSVELQKIAGTIKI---QAGGFGGMGGIPGLSGGLDAIMGMFMMPMMGMGAQIQD 254

Query: 156 ----NISHKINKLAFGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 211
               N SH+I+  +FG+   G+V  LDG    QE  +    Y +KVVP   TD+     Q
Sbjct: 255 GKKANFSHRIDHFSFGDPSSGLVYGLDGDIQIQEKENDDTTYVVKVVP---TDLKTFKFQ 311

Query: 212 SN--QFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGV 269
               Q++VT+H   S++      P V   YD S + V+ TE   SF+  LT +  I+GG+
Sbjct: 312 QKAYQYAVTQHVGKSDK------PAVTIKYDFSGLGVSITEYRESFVGLLTRLAGILGGI 365

Query: 270 FTVSGII 276
              SGI+
Sbjct: 366 AASSGIL 372


>gi|303278158|ref|XP_003058372.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459532|gb|EEH56827.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 399

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 40/101 (39%), Positives = 56/101 (55%), Gaps = 18/101 (17%)

Query: 111 GEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSF------NISHKINKL 164
           GEGC ++G L+V +VAGNFH +         VH  D     R +F      N+SH +++L
Sbjct: 158 GEGCRVHGRLKVQRVAGNFHVS---------VHGEDARTL-RATFEHPRNVNMSHAVHRL 207

Query: 165 AFGEHFPGVVNPLDGVRWTQE--TPSGMYQYFIKVVPTVYT 203
           +FG+ FP   +PL G   T      +G Y+YF+KVVP  YT
Sbjct: 208 SFGKSFPRKEDPLSGFTRTTRHANETGTYKYFLKVVPVTYT 248



 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 31/72 (43%), Positives = 46/72 (63%)

Query: 211 QSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVF 270
           ++N +SVTE +  ++     +LP V+F YDLSPI VT ++   SF HFL    A VGG +
Sbjct: 318 RTNLYSVTETYIPTKNWNGGSLPAVYFIYDLSPIAVTISDARKSFGHFLARTVAGVGGAY 377

Query: 271 TVSGIIDAFIYH 282
            ++G+ID  I+H
Sbjct: 378 AIAGLIDRMIHH 389


>gi|298714834|emb|CBJ25733.1| similar to Endoplasmic reticulum-Golgi intermediate compartment
           protein 1 (ER-Golgi intermediate compartment 32 kDa
           protein) (ERGIC-32) [Ectocarpus siliculosus]
          Length = 320

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 46/129 (35%), Positives = 66/129 (51%), Gaps = 9/129 (6%)

Query: 156 NISHKINKLAFGEHFPGVV----NPLDGVRWTQETPSGMYQYFIKVVPTVY-----TDVS 206
           N++HKI+   FG    G V    N L    +  E  SG+ +Y +KVVP  +      +V+
Sbjct: 178 NMTHKIHDFGFGPPVKGPVGVGRNSLARSTFVSEEGSGLVKYSLKVVPISHRRMHGAEVN 237

Query: 207 GHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIV 266
            HT  SN   V E     +      L GV F YD + + V +T+   S    +T+VCAIV
Sbjct: 238 THTYSSNVAFVPEAAVLQDLSSSSLLLGVEFSYDFTSVMVKYTDARRSMFELITSVCAIV 297

Query: 267 GGVFTVSGI 275
           GG++TVSG+
Sbjct: 298 GGIYTVSGL 306


>gi|414879928|tpg|DAA57059.1| TPA: hypothetical protein ZEAMMB73_408305, partial [Zea mays]
          Length = 75

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 26/49 (53%), Positives = 38/49 (77%)

Query: 233 PGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIY 281
           P V+F YDLSPI VT  EE  +FLHF+T +CA++GG F ++G++D ++Y
Sbjct: 11  PAVYFLYDLSPITVTIKEERRNFLHFITRLCAVLGGTFAMTGMLDRWMY 59


>gi|443920575|gb|ELU40475.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
           solani AG-1 IA]
          Length = 506

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 60/129 (46%), Gaps = 8/129 (6%)

Query: 109 EEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGE 168
            +   C ++G + V KV  N H       ++S  H    L       N++H IN+ +FG 
Sbjct: 168 PDASACRVFGTVAVKKVTANLHITTLGHGYRSAEHTDHTL------MNLTHVINEFSFGP 221

Query: 169 HFPGVVNPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGR 228
             P +  PLD            +QYFI VVPT Y       + +NQ+SVT + R+ E GR
Sbjct: 222 FIPDLSQPLDYSFEVTHEHFTAFQYFITVVPTTYQVPGQDPLHTNQYSVTHYTRNIEHGR 281

Query: 229 LQTLPGVFF 237
               PG+FF
Sbjct: 282 --GTPGIFF 288


>gi|123407515|ref|XP_001303026.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884369|gb|EAX90096.1| hypothetical protein TVAG_396530 [Trichomonas vaginalis G3]
          Length = 234

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 54/223 (24%), Positives = 98/223 (43%), Gaps = 15/223 (6%)

Query: 55  TYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQCKREGFLQRIKEEEGEGC 114
           T CGSCYGA +    CCN+C+EV +A++K   +     +I QC+             + C
Sbjct: 13  TECGSCYGASNG---CCNSCKEVLDAFQKIEKSHPPTAMIQQCRNT--FSDADSLINDSC 67

Query: 115 NIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPGVV 174
            +   L V    G+F    G++   +     D L   +++ N +H  +  + G  +    
Sbjct: 68  TLGITLTVPHTHGSFFITIGQNTTNTSA---DYLGVPKENLNFTHSFDFFSMGGGYHPAQ 124

Query: 175 NPLDGVRWTQETPSGMYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPG 234
              + ++  +E       Y+I+      T +          SVT + R  ++     LPG
Sbjct: 125 ILQNYMKVQKEYGRYKAMYYIRA-----TRILNDYDTQYSLSVTSYDRYRDESS-DKLPG 178

Query: 235 VFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIID 277
           VF  YD+SP+ + +  +   +   + ++ AI+GG+F    +ID
Sbjct: 179 VFINYDISPLILQYVLDRPIY-QIIIDMMAIIGGIFAFGLLID 220


>gi|365991164|ref|XP_003672411.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
 gi|343771186|emb|CCD27168.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
          Length = 341

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 95/200 (47%), Gaps = 23/200 (11%)

Query: 92  DLIDQCKREGFLQRIKEEEGE-------GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHV 144
           +++ Q     F  RI E   E        C+++G ++VN++ G    +       S  ++
Sbjct: 128 EVLTQAIPYEFGMRIDERPPEDDMPNINACHLFGSVDVNRLPGILEISTN-----STGNI 182

Query: 145 HDILAFQRDSFNISHKINKLAFGEHFPGVVNPLDGV-RWTQETPSGMYQYFIKVVPTVYT 203
           +D      +  + +H IN+L+FGE FP + NPLD   +   + P   Y Y++ V+PT+Y 
Sbjct: 183 ND------NGKSFAHVINELSFGEFFPFIDNPLDNTAKVLPDQPLTTYSYYLTVIPTIYE 236

Query: 204 DVSGHTIQSNQFSVTEH-FRSSEQGRLQTL--PGVFFFYDLSPIKVTFTEEHVSFLHFLT 260
            + G  + +NQ+S+ E  F+     + QT     +   YD   + +   +  + F+ FL 
Sbjct: 237 KL-GKRVNTNQYSLNEFIFKHIYNVKSQTQYDEAIRIHYDFDALSIFMHDTRLDFIQFLV 295

Query: 261 NVCAIVGGVFTVSGIIDAFI 280
            + AI+  V  ++  +  FI
Sbjct: 296 RLVAILSFVVYIASWVFRFI 315


>gi|410046954|ref|XP_003952285.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Pan troglodytes]
          Length = 333

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 40/90 (44%), Positives = 56/90 (62%), Gaps = 6/90 (6%)

Query: 190 MYQYFIKVVPT-VYT-DVSGHTIQSNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPIKV 246
           M+QYFI VVPT ++T  +S  T   +QFSVTE  R  +       + G+F  YDLS + V
Sbjct: 202 MFQYFITVVPTKLHTYKISADT---HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMV 258

Query: 247 TFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
           T TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 259 TVTEEHMPFWQFFVRLCGIVGGIFSTTGML 288


>gi|118386954|ref|XP_001026594.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila]
 gi|89308361|gb|EAS06349.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila
           SB210]
          Length = 712

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 60/181 (33%), Positives = 79/181 (43%), Gaps = 25/181 (13%)

Query: 110 EGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEH 169
           + E C IYG   V KV GNFH     SFH  G+    +L      FN+ H I+ L F   
Sbjct: 545 QREKCQIYGHFYVKKVPGNFHV----SFHNEGL----LLMNSNLIFNLRHTIHTLEFTTE 596

Query: 170 --------FPGVVNPLDGVRWTQETPS-GM-YQYFIKVVPTVYTDVSGHTIQSNQFSVTE 219
                   +    NPLD    T   P  GM   Y++KVV TV+ ++      +N +S T 
Sbjct: 597 DGSLTLGKYTKSSNPLDK---TIHNPGHGMDTDYYLKVVNTVFENMLSE--HNNIYSFTS 651

Query: 220 HFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAF 279
              S    R   LP V F Y+  PI V    +  S   F+  +CAIVGG   +S  I   
Sbjct: 652 LETSG--VRDFRLPSVNFRYEFDPITVLHYRKSRSLTQFIVTLCAIVGGSIAISKYIYTL 709

Query: 280 I 280
           +
Sbjct: 710 L 710



 Score = 38.5 bits (88), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 16/108 (14%)

Query: 2   DISGEQHLDVKHDIFKKRLDSQGNVIESRQDGIGAPKIDKPLQRH------------GGR 49
           D+SG    D+   + K RLD  G  I        A  I K  Q+               +
Sbjct: 89  DVSGAHLEDMHWTVHKIRLDQFGKFINYD----SANDIKKQEQKFYPGNPFFEAVKTNNQ 144

Query: 50  LEHNETYCGSCYGAESSDEDCCNNCEEVREAYRKKGWALSNPDLIDQC 97
           +++  +   SCYGAE  +   C  C +V  A+ ++GW     + I QC
Sbjct: 145 VQNQFSNSVSCYGAELYEGQICLTCSDVLIAFAQRGWPQPMKEQISQC 192


>gi|393908150|gb|EJD74929.1| hypothetical protein, variant [Loa loa]
          Length = 368

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 40/115 (34%), Positives = 61/115 (53%), Gaps = 5/115 (4%)

Query: 107 KEEEGEGCNIYGFLEVNKVAGN-FHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLA 165
           ++ EG  C I+G + VNKV G+ F  + GK     G+  H          NISH+I +  
Sbjct: 222 EKNEGTACRIHGRMRVNKVKGDSFIISTGKGLDVDGIFAH--FGGVSSPSNISHRIERFN 279

Query: 166 FGEHFPGVVNPLDGVRWTQETPSGMYQYFIKVVPT--VYTDVSGHTIQSNQFSVT 218
           FG    G+V PL G+    ET    ++YF+K+VPT   ++ + G +  + Q+SVT
Sbjct: 280 FGPRIYGLVTPLAGIEQISETGVDEFRYFLKIVPTRIYHSGLFGGSTLTYQYSVT 334


>gi|307110923|gb|EFN59158.1| hypothetical protein CHLNCDRAFT_138016 [Chlorella variabilis]
          Length = 360

 Score = 65.9 bits (159), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/97 (34%), Positives = 55/97 (56%), Gaps = 13/97 (13%)

Query: 199 PTVYTDVSGHTIQSNQFSVTEHFRSSEQGRLQTLPGVFFFYDLSPIKVTFTEEHVSFLHF 258
           P +  D   +T+QS++++  +H  +             F Y +SPI++  TE+      F
Sbjct: 275 PELQFDAYEYTVQSHKYNAEDHASAK------------FTYKMSPIQIVVTEQPKQLYKF 322

Query: 259 LTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKIEIGK 295
           LT +CA++GGVFTV+GI+D  + H    I KK+++GK
Sbjct: 323 LTAICAVIGGVFTVAGILDGMV-HQVNKIAKKVDLGK 358


>gi|308804553|ref|XP_003079589.1| acyl-CoA thioester hydrolase-like (ISS) [Ostreococcus tauri]
 gi|116058044|emb|CAL54247.1| acyl-CoA thioester hydrolase-like (ISS) [Ostreococcus tauri]
          Length = 1155

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 53/230 (23%), Positives = 92/230 (40%), Gaps = 63/230 (27%)

Query: 113  GCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDILAFQRDSFNISHKINKLAFGEHFPG 172
            GC+I G   +N+V G F+F P    H  G              +++H +  L+FG H PG
Sbjct: 934  GCSINGQFSINRVPGAFYFHPRSRSHTIG------------DVDMTHVVKHLSFGTHAPG 981

Query: 173  --------------VVNPLD-GVRWTQETPSGM-----------YQYFIKVVPTVYTDVS 206
                           + P D G R+  +    M           + +++ V+P  Y  V 
Sbjct: 982  GPRRFVPRHLRKAWKLIPKDAGGRFAGKLSKPMQFDADTSGRTVFDHYVHVIPRTYHPVG 1041

Query: 207  GHTIQSNQFSVTEH----------------FRS---------SEQGRLQTLPGVFFFYDL 241
               I   +++ + H                +R+         ++  R    P + F YD+
Sbjct: 1042 DEPIHIYEYTFSSHAFKLRDDAAERELSRNYRTGGEIDREFGTDDFRRPDGPSIRFSYDI 1101

Query: 242  SPIKVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFIYHGQRAIKKKI 291
            S + V   E H + L ++    AI+GG+ T S  ++ F+Y   RA+K++I
Sbjct: 1102 SAMGVVTREVHKNLLEWILGCSAILGGLVTCSVGLERFVYASSRAVKRRI 1151


>gi|260826492|ref|XP_002608199.1| hypothetical protein BRAFLDRAFT_90361 [Branchiostoma floridae]
 gi|229293550|gb|EEN64209.1| hypothetical protein BRAFLDRAFT_90361 [Branchiostoma floridae]
          Length = 336

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 38/96 (39%), Positives = 52/96 (54%), Gaps = 10/96 (10%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFR-----SSEQGRLQTLPGVFFFYDLSPI 244
           M+QYFI++VPT   +       + QF+VTE  R     S   G    + G+FF YDL+ I
Sbjct: 188 MFQYFIQIVPT-RVNTRQAQADTGQFAVTERERVINHDSGSHG----VAGIFFKYDLTSI 242

Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGIIDAFI 280
            V  TEE   F   L  +C IVGG+F  SG++  F+
Sbjct: 243 MVKVTEERQPFSQLLIRLCGIVGGIFATSGMLHGFV 278


>gi|30268567|emb|CAD89902.1| hypothetical protein [Homo sapiens]
          Length = 132

 Score = 65.5 bits (158), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 54/92 (58%), Gaps = 10/92 (10%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQ----SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPI 244
           M+QYFI VVPT       HT +    ++QFSVTE  R  +       + G+F  YDLS +
Sbjct: 1   MFQYFITVVPT-----KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSL 55

Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            VT TEEH+ F  F   +C IVGG+F+ +G++
Sbjct: 56  MVTVTEEHMPFWQFFVRLCGIVGGIFSTTGML 87


>gi|354507876|ref|XP_003515980.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Cricetulus griseus]
 gi|344235439|gb|EGV91542.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Cricetulus griseus]
          Length = 132

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 38/92 (41%), Positives = 54/92 (58%), Gaps = 10/92 (10%)

Query: 190 MYQYFIKVVPTVYTDVSGHTIQ----SNQFSVTEHFRS-SEQGRLQTLPGVFFFYDLSPI 244
           M+QYFI VVPT       HT +    ++QFSVTE  R  +       + G+F  YDLS +
Sbjct: 1   MFQYFITVVPT-----KLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSL 55

Query: 245 KVTFTEEHVSFLHFLTNVCAIVGGVFTVSGII 276
            VT TEEH+ F  F   +C I+GG+F+ +G++
Sbjct: 56  MVTVTEEHMPFWQFFVRLCGIIGGIFSTTGML 87


>gi|11907610|gb|AAG41243.1|AF210626_1 Fun9 [Eremothecium gossypii]
          Length = 138

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 63/135 (46%), Gaps = 16/135 (11%)

Query: 170 FPGVVNPLDGVRWTQETPSG---MYQYFIKVVPTVYTDVSGHTIQSNQFSVTEHFRSSEQ 226
            P    PL+G     + P+G    + YF KVVP  Y  ++G   +S +FSVT H R    
Sbjct: 3   LPANPGPLNG--RAMKVPNGHSHFFSYFAKVVPIRYETLAGTITESAEFSVTAHDRPVHG 60

Query: 227 GRLQTLPGVFFF----------YDLSPIKVTFTEEHVS-FLHFLTNVCAIVGGVFTVSGI 275
           GR    P    F          +++SP+KV   E++ S +  F+ N    +GGV  V  +
Sbjct: 61  GRDADHPNTVHFRGGMAGMTINFEMSPLKVIQREQYASTWTAFVLNAITSIGGVLAVGTV 120

Query: 276 IDAFIYHGQRAIKKK 290
           +D   YH QR +  K
Sbjct: 121 LDRVTYHTQRTLMGK 135


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.139    0.428 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,987,611,959
Number of Sequences: 23463169
Number of extensions: 219448077
Number of successful extensions: 433412
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1023
Number of HSP's successfully gapped in prelim test: 68
Number of HSP's that attempted gapping in prelim test: 429642
Number of HSP's gapped (non-prelim): 1189
length of query: 297
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 156
effective length of database: 9,050,888,538
effective search space: 1411938611928
effective search space used: 1411938611928
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)