BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy13967
         (379 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|270007946|gb|EFA04394.1| hypothetical protein TcasGA2_TC014693 [Tribolium castaneum]
          Length = 385

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 207/386 (53%), Positives = 261/386 (67%), Gaps = 27/386 (6%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E+L+  DA+ K  ED   KT  G  VTI+    ++ L  V++ DY   + +EELFVD+SR
Sbjct: 6   EKLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELFVDTSR 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
              + I+LDI+VPTISCD+LALDA+DSSGEQHL ++HNIYKRRLDL G+PI+EP+KE + 
Sbjct: 66  SPSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPKKEDIT 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE-LD 183
              K+K +TE  T      +  +CGSCYGA  + ++CCNTC +V+EAYR ++WA PE  +
Sbjct: 126 I--KRKNSTEVATV-----NKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPE 178

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            I QCK E  +EKLK  F +GCQIYG L VNRVSGSFHIAPG S+SINHVHVHD+QP++S
Sbjct: 179 NITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSS 238

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
             FNTTH IRHLSFG  +  D +   PL  TV  AEEGASMF Y+IKI+PT Y +LDG  
Sbjct: 239 TEFNTTHKIRHLSFGASI--DSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQF 296

Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                           L  G+ GMPGIFF YELSPLMVK TE+S+S GH  T +   I G
Sbjct: 297 ISANQFSVTKHRRVISLMSGESGMPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGG 356

Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
            Y    L+D +L+  VK I  K+E+G
Sbjct: 357 VYTVAGLIDTMLYHSVKLIQKKIELG 382


>gi|189237821|ref|XP_974331.2| PREDICTED: similar to AGAP012144-PA [Tribolium castaneum]
          Length = 395

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 206/385 (53%), Positives = 260/385 (67%), Gaps = 25/385 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+  DA+ K  ED   KT  G  VTI+    ++ L  V++ DY   + +EELFVD+SR 
Sbjct: 15  KLRRFDAYPKTLEDVRIKTYGGAVVTIISLTIMTLLFWVELVDYLTPNVSEELFVDTSRS 74

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + I+LDI+VPTISCD+LALDA+DSSGEQHL ++HNIYKRRLDL G+PI+EP+KE +  
Sbjct: 75  PSIQINLDIIVPTISCDFLALDAMDSSGEQHLQIDHNIYKRRLDLQGQPIEEPKKEDITI 134

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE-LDT 184
             K+K +TE    T    +  +CGSCYGA  + ++CCNTC +V+EAYR ++WA PE  + 
Sbjct: 135 --KRKNSTEVSVATV---NKTECGSCYGASFDPKRCCNTCEDVREAYRERRWAFPENPEN 189

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E  +EKLK  F +GCQIYG L VNRVSGSFHIAPG S+SINHVHVHD+QP++S 
Sbjct: 190 ITQCKEERFSEKLKTAFAQGCQIYGSLTVNRVSGSFHIAPGKSFSINHVHVHDVQPFSST 249

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FNTTH IRHLSFG  +  D +   PL  TV  AEEGASMF Y+IKI+PT Y +LDG   
Sbjct: 250 EFNTTHKIRHLSFGASI--DSDTHNPLKDTVGLAEEGASMFQYHIKIVPTAYVKLDGQFI 307

Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L  G+ GMPGIFF YELSPLMVK TE+S+S GH  T +   I G 
Sbjct: 308 SANQFSVTKHRRVISLMSGESGMPGIFFQYELSPLMVKYTEQSRSFGHFATNVCAIIGGV 367

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           Y    L+D +L+  VK I  K+E+G
Sbjct: 368 YTVAGLIDTMLYHSVKLIQKKIELG 392


>gi|307179776|gb|EFN67966.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Camponotus floridanus]
          Length = 385

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 202/385 (52%), Positives = 256/385 (66%), Gaps = 25/385 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VT++  + +  L+  ++  Y   S +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAIVTVISTIIMGILLMSEINYYLTPSMSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LDI+VP ISCD L++DA+D++GEQHLH+EHNI+KRRLDL+GKPI++PQ+  + 
Sbjct: 64  GSKLRINLDIIVPVISCDLLSIDAMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTNIT 123

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K    T E      E+     CG CYGA TET +CCNTC EV+EAY+ KKWA P+   
Sbjct: 124 DSKAVNKTAEKAL---EIGSTESCGDCYGAATETLRCCNTCEEVREAYKLKKWAPPDPAN 180

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK++ S EK+K+ FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS 
Sbjct: 181 IKQCKDDKSMEKIKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTST 240

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS  
Sbjct: 241 HFNMTHKIRHLSFGLNIPG---KTNPMDDTTVIATEGAMMFYHYIKIVPTTYVRTDGSTL 297

Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L  G+ GMPGIFFSYELSPLMVK TEK+KS GH  T     I G 
Sbjct: 298 FTNQFSVTRHAKQVSLFTGESGMPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGV 357

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+LL+  V+ I  K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382


>gi|380016121|ref|XP_003692037.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Apis florea]
          Length = 385

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 202/386 (52%), Positives = 255/386 (66%), Gaps = 27/386 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + +  L   +V  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+  + 
Sbjct: 64  GSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123

Query: 125 AVKK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
             K   K T +   +TTE      CG CYGA +E  KCCNTC +V+EAYR K WA P L 
Sbjct: 124 DTKALSKTTAKTLESTTE----KICGDCYGAASEIIKCCNTCEDVREAYRLKNWAPPVLG 179

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            I QC+N+ S EK+K  FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS
Sbjct: 180 NIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTS 239

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
             FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS 
Sbjct: 240 TQFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGST 296

Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                           L  G+ GMPGIFF+YELSPLMVK TEK+KS GH  T     I G
Sbjct: 297 LLTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 356

Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
            +    L+D+LL+  ++ I  K+E+G
Sbjct: 357 VFTVAGLIDSLLYHSLRAIQKKIELG 382


>gi|193627365|ref|XP_001948436.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Acyrthosiphon pisum]
          Length = 404

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 198/385 (51%), Positives = 262/385 (68%), Gaps = 28/385 (7%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF KP E+   KTV+GG V++VC+L I +L+  ++ +Y   + TEELFVD+SR  
Sbjct: 13  LKQFDAFAKPLEEVQIKTVWGGIVSLVCFLTIVFLMVSNLVEYLDNTPTEELFVDTSRNK 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I+ DIVVP ISCD+L LDAVD+SGE HL V+HNIYKRRL+L+G+PI +P+K   +  
Sbjct: 73  KLQINFDIVVPKISCDFLVLDAVDNSGETHLQVDHNIYKRRLNLEGQPISDPEKS-DDVG 131

Query: 127 KKKKVTTENGTTTTELEDPNK----CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
            KK +   +   + E +D N     CGSCYGAE+ T  CCNTC++VK AY+ K W     
Sbjct: 132 SKKTLNPPSMLKSNETDDANNTEDICGSCYGAESSTIPCCNTCDDVKRAYKMKNWDF-RP 190

Query: 183 DTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            +I QCKN+ S  ++ +  F EGCQ+YG L VNRVSGSFHIAPG+S+S NH+HVHD+ P+
Sbjct: 191 SSIEQCKNQSSQNEMYDKAFKEGCQLYGTLLVNRVSGSFHIAPGMSFSFNHMHVHDVHPF 250

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           +S++FNTTH IRHLSFG KL+  +      PLD T + A EGA+MF YYIKI+PT+Y+R 
Sbjct: 251 SSSSFNTTHTIRHLSFGQKLESINTSHGGNPLDSTESIAGEGATMFQYYIKIVPTLYQRR 310

Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           D S                   G  G PGIFFSYE SP+M+K+TEK + LGHL+T+ +CN
Sbjct: 311 DLSIFSTNQFSVTKHKVQAFDKGPSGAPGIFFSYEFSPIMIKLTEKPRLLGHLFTQFLCN 370

Query: 345 ISGTYITFMLVDALLHSCVKKISKV 369
           ISG +I F ++D  ++    K+SKV
Sbjct: 371 ISGVFICFWIIDIFMY----KVSKV 391


>gi|383864675|ref|XP_003707803.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Megachile rotundata]
          Length = 385

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 200/385 (51%), Positives = 253/385 (65%), Gaps = 25/385 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + ++ L   ++  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMAILFLTELNYYLTPTLSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LDIVVPTISCD L++DA+D++GEQHL +EHNIYKRRLDL GKPI++PQK  + 
Sbjct: 64  GSKLRINLDIVVPTISCDLLSIDAMDTTGEQHLQIEHNIYKRRLDLQGKPIEDPQKTDIT 123

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K    TT     +T +E    CG CYGA +E  KCCNTC +V++AY  K WA P+  +
Sbjct: 124 DTKALSKTTAKSVESTTVE---TCGDCYGAASEKIKCCNTCEDVRKAYSDKNWAPPDPGS 180

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+N+ S EK+K  FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPY S 
Sbjct: 181 IKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGNSFSVNHVHVHDVQPYMST 240

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS  
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPIDDTTMVAMEGAMMFYHYIKIVPTTYVRADGSTL 297

Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L  G+ GMPGIFFSYELSPLMVK TEK+KS GH  T +   I G 
Sbjct: 298 LTNQFSVTRHARQVSLLSGESGMPGIFFSYELSPLMVKYTEKAKSFGHFATNMCAIIGGV 357

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+ L+  V+ I  K+E+G
Sbjct: 358 FTVAGLIDSFLYHSVRAIQKKIELG 382


>gi|332024433|gb|EGI64631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Acromyrmex echinatior]
          Length = 386

 Score =  393 bits (1010), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 200/390 (51%), Positives = 257/390 (65%), Gaps = 30/390 (7%)

Query: 5   ERLKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           + L+ LD   K  E  D   +T  G  VTI+  + +  L   ++  Y   + +EELFVD+
Sbjct: 2   QMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEINYYLTPTMSEELFVDT 61

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRGSKL I+LDI+VP+ISCD L+LDA+D++GEQHLH+EHNI+KRRLDL+G PI++PQ+  
Sbjct: 62  SRGSKLRINLDIIVPSISCDLLSLDAMDTTGEQHLHIEHNIFKRRLDLNGNPIEDPQRTN 121

Query: 123 VNAVKKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           +   K    TTE      +TTEL     CG CYGA T+T KCCNTC +V EAYR KKWA 
Sbjct: 122 ITDAKAMSKTTEKAVEIGSTTEL-----CGDCYGATTDTMKCCNTCEDVWEAYRRKKWAP 176

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
           P+   + QC+N+ S +KLK+ FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+Q
Sbjct: 177 PDPADVKQCQNDKSMDKLKHAFTQGCQIYGYMEVNRVGGSFHIAPGASFSVNHVHVHDVQ 236

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           PYTS+ FN TH IRHLSFG+ +     +  P+DG      + A MF +YIKI+PT Y R 
Sbjct: 237 PYTSSHFNMTHKIRHLSFGLNIPG---KTNPMDGMTVVDMDAAMMFYHYIKIVPTTYVRA 293

Query: 300 DGS----------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DGS                 L  G+ GMPGIFF+YELSPLMVK TEK+ S GH  T    
Sbjct: 294 DGSTLLTNQFSVTRHSKKVSLLTGESGMPGIFFNYELSPLMVKYTEKANSFGHFATNTCA 353

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    L+D+LL+  V+ I  K+E+G
Sbjct: 354 IIGGVFTVAGLIDSLLYHSVRAIQRKIELG 383


>gi|307193219|gb|EFN76110.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Harpegnathos saltator]
          Length = 386

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 199/385 (51%), Positives = 252/385 (65%), Gaps = 24/385 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + +  L   ++  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFMSEINYYLTPTMSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LD++VPTISCD L++DA+D++G Q+L +EHNI++RRLDL+GKPI++PQ+   N
Sbjct: 64  GSKLRINLDVIVPTISCDLLSVDAMDTTGVQYLQIEHNIFQRRLDLNGKPIEDPQR--TN 121

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K K V       T        CG CYGA TET +CCNTC++V+ AYR KKWA+P+L  
Sbjct: 122 ITKTKAVVKPTDEETQISSTTKVCGDCYGAATETLECCNTCDDVQMAYRLKKWAMPDLAK 181

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+N+ S +K K+ FT+GCQIYGY+EVNRV GSFHIAPG SYS+NHVHVHD+QPY S 
Sbjct: 182 IKQCQNDKSADKYKHAFTQGCQIYGYMEVNRVGGSFHIAPGDSYSVNHVHVHDVQPYNSN 241

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
            FN TH IRHLSFG+ +     +  P+D T   A EGA MF YYIKI+PT Y R DGS L
Sbjct: 242 HFNMTHKIRHLSFGLNIPG---KTNPMDDTTTVATEGAMMFYYYIKIVPTTYVRADGSTL 298

Query: 305 GGG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                              D GMPGIFFSYELSPLMVK TEK+KS GH  T     I G 
Sbjct: 299 LTNQFSVTRHSKRMPLYMSDSGMPGIFFSYELSPLMVKYTEKAKSFGHFATNTCAIIGGV 358

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+LL+  V+ I  K+E+G
Sbjct: 359 FTVAGLIDSLLYHSVRAIQKKIELG 383


>gi|350404831|ref|XP_003487234.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Bombus impatiens]
          Length = 385

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 200/385 (51%), Positives = 253/385 (65%), Gaps = 25/385 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + +  L   +V  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEVNYYLTPTLSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+  + 
Sbjct: 64  GSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K +  TT     +T  +    CG CYGA  +  KCCNTC +V+EAYR K WALP L  
Sbjct: 124 DTKARSKTTTKTVESTTEK---ACGDCYGAAGDIIKCCNTCEDVREAYRLKNWALPALGM 180

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCKN+ S EK+K  F +GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD++PYTS 
Sbjct: 181 IKQCKNDKSVEKMKTAFIQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS  
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTL 297

Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L  G+ GMPGIFF+YELSPLMVK TEK+KS GH  T     I G 
Sbjct: 298 LTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGV 357

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+LL+  V+ I  K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382


>gi|328786822|ref|XP_393819.4| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Apis mellifera]
          Length = 383

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 200/386 (51%), Positives = 255/386 (66%), Gaps = 29/386 (7%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + +  L   ++  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMGILFLSEMNYYLTPTLSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           GSKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+  + 
Sbjct: 64  GSKLRINLDIIVPTISCDLLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123

Query: 125 AVKK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
             K   K T +   +TTE      CG CYGA +E  KCCNTC +V+EAYR K WA+  L 
Sbjct: 124 DTKALSKTTAKTLESTTE----KICGDCYGAASEIIKCCNTCEDVREAYRLKNWAV--LG 177

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            I QC+N+ S EK+K  FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD+QPYTS
Sbjct: 178 NIKQCQNDKSVEKMKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVQPYTS 237

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS- 302
             FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS 
Sbjct: 238 TQFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGST 294

Query: 303 ---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                           L  G+ GMPGIFF+YELSPLMVK TEK+KS GH  T     I G
Sbjct: 295 LLTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGG 354

Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
            +    L+D+LL+  ++ I  K+E+G
Sbjct: 355 VFTVAGLIDSLLYHSLRAIQKKIELG 380


>gi|340721521|ref|XP_003399168.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Bombus terrestris]
          Length = 385

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 202/385 (52%), Positives = 253/385 (65%), Gaps = 25/385 (6%)

Query: 7   LKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ LD   K  E  D   +T  G  VTI+  + +S L   +V  Y   + +EELFVD+SR
Sbjct: 4   LRQLDVHPKVREEADILVRTFSGAVVTIISTIIMSILFLSEVNYYLTPTLSEELFVDTSR 63

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
            SKL I+LDI+VPTISCD L++DA+D++GEQHL +EHNI+KRRLDL+GKPI++PQ+  + 
Sbjct: 64  DSKLRINLDIIVPTISCDVLSIDAMDTTGEQHLQIEHNIFKRRLDLNGKPIEDPQRTDIT 123

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K +  TTE    T E      CG CYGA  +  KCCNTC +V+EAYR K WA P L  
Sbjct: 124 DTKARSKTTEK---TVESTTEKACGDCYGAAGDIIKCCNTCEDVREAYRLKNWAPPALGM 180

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCKN+ S EK+K  FT+GCQIYGY+EVNRV GSFHIAPG S+S+NHVHVHD++PYTS 
Sbjct: 181 IKQCKNDKSVEKIKTAFTQGCQIYGYMEVNRVGGSFHIAPGDSFSVNHVHVHDVKPYTST 240

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FN TH IRHLSFG+ +     +  P+D T   A EGA MF +YIKI+PT Y R DGS  
Sbjct: 241 QFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAMEGAMMFYHYIKIVPTTYVRADGSTL 297

Query: 303 --------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L  G+ GMPGIFF+YELSPLMVK TEK+KS GH  T     I G 
Sbjct: 298 LTNQFSVTRHARQVSLFSGESGMPGIFFNYELSPLMVKYTEKAKSFGHFATNACAIIGGV 357

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+LL+  V+ I  K+E+G
Sbjct: 358 FTVAGLIDSLLYHSVRAIQKKIELG 382


>gi|357612408|gb|EHJ67977.1| hypothetical protein KGM_08440 [Danaus plexippus]
          Length = 385

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/386 (49%), Positives = 246/386 (63%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           + K LDA+ K  EDF  KT  G  +T+     +  LI +++  Y   + +EELFVD+SRG
Sbjct: 8   KFKQLDAYAKTLEDFRVKTATGAIITVTGAFVMILLIVLELHTYMSPNISEELFVDTSRG 67

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ DIVVP ISCDYL LDA+DSSGEQHL ++HN++KRRLDLDG PI+EP KE ++ 
Sbjct: 68  HKLRINFDIVVPRISCDYLVLDAMDSSGEQHLQMDHNVHKRRLDLDGVPIKEPIKEDISL 127

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
               K  +      T       CGSCYGA     +CCNTC +VKEAYR ++WALP+L T+
Sbjct: 128 SSTVKQNSSEIAIVT-------CGSCYGAAFNDSQCCNTCEDVKEAYRLRRWALPDLATV 180

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK++ S E+      EGCQIYGY+EVNRV GSFHIAPG S++INHVHVHD+QP++S+ 
Sbjct: 181 EQCKDDDSLERTNLALKEGCQIYGYMEVNRVGGSFHIAPGKSFTINHVHVHDVQPFSSSV 240

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FNTTH IRHLSFG  ++  +    PLDG    A+EGA MF YY+KI+PT+Y +LDG+ L 
Sbjct: 241 FNTTHIIRHLSFGSDIESANT--APLDGITGLAKEGAVMFQYYLKIVPTMYVKLDGTILH 298

Query: 306 GG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                             + GMPG FFSYELSPLMVK T K +S+GH  T +   + G +
Sbjct: 299 TNQFSVTRHQKSVSNINVESGMPGAFFSYELSPLMVKYTAKGRSIGHFATNVCAIVGGVF 358

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               + D LL+  +       + GK 
Sbjct: 359 TVAGIFDTLLYHSLNAFQNKVVLGKA 384


>gi|158300475|ref|XP_320382.3| AGAP012144-PA [Anopheles gambiae str. PEST]
 gi|157013177|gb|EAA00591.3| AGAP012144-PA [Anopheles gambiae str. PEST]
          Length = 386

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 190/389 (48%), Positives = 247/389 (63%), Gaps = 23/389 (5%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F + L+ LDA+ K   +F  +TV G A+T++  + I  L+  ++  Y   + +EELFV
Sbjct: 1   MRFLDSLRRLDAYPKIDNEFSIRTVSGAALTLISSIVIVTLVIGEINAYLSPNVSEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D++RG KL I+LD  +P ISCDY++LDA DS+GEQHLH+EHNIYKRRLDL G  I+EP+K
Sbjct: 61  DTTRGHKLKINLDFTIPRISCDYVSLDAQDSTGEQHLHIEHNIYKRRLDLQGNQIEEPKK 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           E + A  K+  +TE   TTT       CGSCYGA     +CCNTC EV +AYR +KW  P
Sbjct: 121 EDIQASTKRISSTEAPATTTV---KPACGSCYGAAKNASQCCNTCQEVIDAYRERKWN-P 176

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            ++   QCKN          F+EGC IYG +EVNRV G FHIAPG S+SINH+HVHD+QP
Sbjct: 177 NVEDFEQCKNGNGGSVEGKAFSEGCHIYGTMEVNRVEGRFHIAPGKSFSINHIHVHDVQP 236

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           Y+S+ FNTTH I  LSFG +      R  PLDG + +A EGA MF YYIKI+PT++  L+
Sbjct: 237 YSSSRFNTTHRINTLSFGEQFGFGTTR--PLDGLMVEATEGAMMFQYYIKIVPTMFVPLN 294

Query: 301 GSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  G+ GMPGIF +YELSPLMVK TEK  SLGH  T +   
Sbjct: 295 GPTLYTNQFSVTKHQKSVTAMSGETGMPGIFVNYELSPLMVKFTEKRNSLGHFATNVCAI 354

Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIG 372
           I G +    ++D+LL + +  I  K+E+G
Sbjct: 355 IGGIFTVAGIIDSLLFTSIHVIKRKIELG 383


>gi|170031960|ref|XP_001843851.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Culex quinquefasciatus]
 gi|167871431|gb|EDS34814.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Culex quinquefasciatus]
          Length = 391

 Score =  364 bits (934), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 190/396 (47%), Positives = 248/396 (62%), Gaps = 32/396 (8%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   +  + LDA+ K  ++F  KT+ G A+T +    I +LI  +   +   +  ++LFV
Sbjct: 1   MRLIDSFRRLDAYPKIDKEFSIKTIGGAALTTISGTIIVFLIYSEFVAFLTPTIEDQLFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D++RG KL I+LD VVP +SCDY++LDA D++GEQHLH++HNI+KRRLDL G PI+ P+K
Sbjct: 61  DATRGQKLRINLDFVVPRVSCDYVSLDAQDATGEQHLHIDHNIFKRRLDLKGNPIEAPKK 120

Query: 121 EVVNAVKKKKVTTE----NGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
           E + A K +K  TE    N +TT      N CGSCYGA+  +  CCNTC +V +AYR K+
Sbjct: 121 EDIQAPKPRKDATEAPVVNSSTTA-----NPCGSCYGAQKNSSHCCNTCQDVIDAYREKQ 175

Query: 177 WALPELDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
           W  P L+   QCK E +  KL      F EGCQIYGY+EVNRV GSFHIAPG S+SI+H+
Sbjct: 176 WN-PTLEEFEQCKTEVAIGKLSLEAKAFNEGCQIYGYMEVNRVGGSFHIAPGKSFSISHI 234

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           HVHD+QP++S+ FN THHI  LSFG +      +  PLDGT   AEEGA MF YYIKI+P
Sbjct: 235 HVHDVQPFSSSRFNMTHHINTLSFGEEFGFG--QTSPLDGTDVIAEEGAMMFQYYIKIVP 292

Query: 294 TIYERLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           T +  L G KL                  GD GMPGIF +YELSPLMVK TEK  S  H 
Sbjct: 293 TEFVPLSGPKLHTNQFSVTTHRKSVSLMSGDSGMPGIFVNYELSPLMVKFTEKRSSFSHF 352

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            T +   I G +    +VD LL + +  +  K+E+G
Sbjct: 353 ATNLCAIIGGIFTVSGIVDTLLFTSIHALKRKIELG 388


>gi|156552683|ref|XP_001599365.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Nasonia vitripennis]
          Length = 328

 Score =  352 bits (902), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 179/337 (53%), Positives = 229/337 (67%), Gaps = 34/337 (10%)

Query: 56  EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
           EELFVD+SRGSKL I+LDIV+ +I+CD L++DA+D++GE HL ++HNI+KRRLDLDGKPI
Sbjct: 3   EELFVDTSRGSKLKINLDIVISSIACDMLSIDAMDTTGETHLEIQHNIFKRRLDLDGKPI 62

Query: 116 QEPQKE-VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAY 172
           ++P+K  + +  K  +   EN T         KCG CYGA +E    KCCNTC EVKEAY
Sbjct: 63  EDPKKTGIADPKKTTEKPAENATA--------KCGDCYGAASEELGIKCCNTCEEVKEAY 114

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
           R +KWA+ +     QCKN+ S E    TF EGCQIYG++EVNRV GSFHIAPG S +I+H
Sbjct: 115 RKRKWAVHDTSRFAQCKNDKSREM---TFKEGCQIYGFMEVNRVGGSFHIAPGDSITIDH 171

Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
           +HVHD+QPY+S+ FN TH IRHLSFG  +     +  P+D T   A EGA+MF++YIKI+
Sbjct: 172 LHVHDVQPYSSSQFNLTHRIRHLSFGTNIPG---KTNPIDNTTVIASEGATMFHHYIKIV 228

Query: 293 PTIYERLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           PT + RLDGS L                  G+ GMPG+FFSYELSPLMVK T+  KSLGH
Sbjct: 229 PTTFMRLDGSILHTNQFSLTKHSRSIKQYSGESGMPGLFFSYELSPLMVKYTQTVKSLGH 288

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
           L T     I GT+    ++DA L+  V+ I  K+E+G
Sbjct: 289 LMTNTCAIIGGTFTVASIIDAFLYHSVRAIQKKMELG 325


>gi|242007856|ref|XP_002424735.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
 gi|212508228|gb|EEB11997.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
          Length = 376

 Score =  350 bits (899), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 190/388 (48%), Positives = 241/388 (62%), Gaps = 46/388 (11%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K  +D+  +T+ GGAVT+V ++ ++ L   ++  Y     +EELFVD++R  
Sbjct: 10  LKDFDGYPKTLDDYRIRTLGGGAVTVVSYIIMTLLFISELNTYLTPDISEELFVDTTREP 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV---- 122
           KL I+L+I VP ISC YL+LDA+DSSGEQHL +EHNIYK  LD +G PI+EP+KE     
Sbjct: 70  KLQINLNITVPEISCKYLSLDAMDSSGEQHLQIEHNIYKVSLDKNGIPIKEPEKETFVKP 129

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALP 180
           VN  K+K                 KCGSCYGAE+ET    CCNTC +VK+AY  + W L 
Sbjct: 130 VNETKEK-----------------KCGSCYGAESETLNITCCNTCADVKDAYMKRGWGLN 172

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            L+ I QCKN        N F EGC IYG +EVNRV GSFHIAPG S+SINHVHVHD+QP
Sbjct: 173 NLELIEQCKN----LSQNNIFNEGCFIYGTMEVNRVGGSFHIAPGQSFSINHVHVHDVQP 228

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           ++S AFNT+H I HLSFG  +     +  PLDG VA   EGA+MF YYIKI+PTIY   D
Sbjct: 229 FSSKAFNTSHKIDHLSFGYNIPG---KTNPLDGIVALTHEGATMFQYYIKIVPTIYYYYD 285

Query: 301 GS--------------KLGGGDGGM-PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
            S              K G    G+ PGIFF+YEL+P+MVK TE+ +S GH  T +   I
Sbjct: 286 KSGTILTNQFSVTRHQKSGSETIGVPPGIFFNYELAPIMVKYTERKRSFGHFATNVCAII 345

Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
            G +    L+DA L+  V+    K+EIG
Sbjct: 346 GGVFTVASLIDAFLYRSVQAFKKKIEIG 373


>gi|225708964|gb|ACO10328.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Caligus rogercresseyi]
          Length = 385

 Score =  340 bits (873), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 176/381 (46%), Positives = 232/381 (60%), Gaps = 30/381 (7%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           +SE L+ LDA+ K  EDF  +T+ GGA+T++  + + +L   ++ +Y      EELFVD+
Sbjct: 4   WSEALRRLDAYPKTLEDFRIQTLSGGAITLLSGVLMVFLFASEIREYLTPRVQEELFVDT 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           S+G KL I+LD+V  ++SCD+L LDA+D SGE H+ + HNIYKRRL L+G P++EP++E 
Sbjct: 64  SKGGKLKINLDVVFNSVSCDFLVLDAMDVSGESHVDIVHNIYKRRLSLEGSPMEEPRRET 123

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
               KK   TT   +   E   P  CGSCYGAET    CCN+C EVKEAYR K W     
Sbjct: 124 EVGQKK---TTHAPSPKNETSTP-PCGSCYGAETPGSPCCNSCGEVKEAYRRKGW----- 174

Query: 183 DTIVQCKN---EYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            TIV  K    E  TE ++  + EGCQIYG L VNRV GSFHI PG S+++NH+H+HD+Q
Sbjct: 175 -TIVAAKFEQCEMDTEGIERVYKEGCQIYGSLLVNRVGGSFHIVPGKSFTLNHLHIHDLQ 233

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           P++S  FNT+H IRHLSFG K   D      LD   A + +G  M+ YY+KI+PT Y R 
Sbjct: 234 PFSSGEFNTSHRIRHLSFGSKTALDPGGNA-LDAVSALSPKGGLMYQYYLKIVPTTYSRS 292

Query: 300 DGSKLGGGD----------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG    G                  GGMPG+FF+YEL+PLMVK +EK KS GH  T +  
Sbjct: 293 DGGTFTGNQYSVTRLEKDVSSSLDSGGMPGVFFNYELAPLMVKYSEKEKSFGHFATGLCA 352

Query: 344 NISGTYITFMLVDALLHSCVK 364
            I G +      D  ++S  K
Sbjct: 353 IIGGVFTLASAFDKFIYSSSK 373


>gi|321463520|gb|EFX74535.1| hypothetical protein DAPPUDRAFT_226626 [Daphnia pulex]
          Length = 381

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 173/387 (44%), Positives = 241/387 (62%), Gaps = 32/387 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            K +DA+ K  EDF  +T  G  VT+   + +++L  ++  D+  ++ +E+L+VD++R  
Sbjct: 9   FKTIDAYPKTLEDFTIRTATGAMVTVFSSIIMAFLFVIEFRDFLSINVSEQLYVDTTRIP 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+ D+  PTISC YL++DAVDSSGEQ   VEHNI+K+RL+L G+P+Q  + E +N  
Sbjct: 69  NMKINFDVTFPTISCSYLSVDAVDSSGEQQFGVEHNIFKQRLNLLGEPLQAAELEEINKT 128

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL-PELDTI 185
             K   TE   T+TE      C SCYGA+     CC TC EV+EAYR K WA  PE    
Sbjct: 129 HNK---TE---TSTEESASKPCNSCYGAK---EGCCETCAEVREAYRQKNWAFRPE--EF 177

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+NE +  +  + F EGC++YGYLEVNRVSGSFHIAPG SY+INHVHVHD+QPY+S  
Sbjct: 178 EQCRNEKNLTRDYSAFKEGCKLYGYLEVNRVSGSFHIAPGKSYAINHVHVHDVQPYSSED 237

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN THHI  LSFG  L     +  PLDG +  A++GA MF YYIK++PT Y +LDG +  
Sbjct: 238 FNVTHHINSLSFGTSLIG---KENPLDGFLTTADKGAMMFQYYIKVVPTWYVKLDGEEFH 294

Query: 306 ----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                           GG+ G+PG+FF+YE+SPL +   E  +S+GH  T +   I G +
Sbjct: 295 TNQYSVTRHQKVVSSYGGESGVPGVFFTYEMSPLQISYKESKRSIGHFATDVCTIIGGVF 354

Query: 350 ITFMLVDALLHSCVKKI-SKVEIGGKT 375
               ++D+LL+   K +  K+++G  T
Sbjct: 355 TVAGIIDSLLYRSSKLLQQKLQLGKAT 381


>gi|157118753|ref|XP_001653244.1| ptx1 protein [Aedes aegypti]
 gi|108875623|gb|EAT39848.1| AAEL008391-PA [Aedes aegypti]
          Length = 384

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 179/392 (45%), Positives = 238/392 (60%), Gaps = 31/392 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + L+  DA+ K  ++F  +TV G  +T +    I  LI  ++  Y     T+ELFV
Sbjct: 1   MTLLDSLRRFDAYPKIDKEFSIRTVGGATLTFISGTIIVVLIYSELIAYLTPVVTDELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           DS+RG KL I+LD  +P ISCDY++LDA D++GEQHLH+EH IYKRR+DL G PI+E +K
Sbjct: 61  DSTRGQKLKINLDFYIPRISCDYVSLDAQDATGEQHLHIEHTIYKRRMDLQGNPIEEAKK 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           E ++A K +    E        E+  KC SCYGAE  +  CC TC +V +AYR K+W  P
Sbjct: 121 EDISAPKPRLEKKE--------ENVKKCRSCYGAEKNSTHCCETCQDVIDAYREKQWN-P 171

Query: 181 ELDTIVQCKNEYSTEKL---KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
            LD   QC+NE    K       F+EGCQIYG ++VNRV GSFHIAPG S+SI+H+HVHD
Sbjct: 172 NLDDFEQCQNEVLLGKKSLESKAFSEGCQIYGSMQVNRVGGSFHIAPGKSFSISHIHVHD 231

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           +QP++S+ FNT+H I  LSFG +      R  PLD T   A EGA MF YYIKI+PT + 
Sbjct: 232 VQPFSSSRFNTSHRINTLSFGEEFGYGQTR--PLDFTEKTAHEGAIMFQYYIKIVPTEFV 289

Query: 298 RLDGSKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
            L+G  L                  G+ GMPGIF +YELSPLMV+ TEK  S  H  T +
Sbjct: 290 PLNGPTLHTNQFSVTKHQKSVSVMSGESGMPGIFVNYELSPLMVRFTEKRNSFSHFATNL 349

Query: 342 MCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
              I G +    ++D+LL + +  +  K+E+G
Sbjct: 350 CAIIGGIFTVAGIIDSLLFTSIHALKRKIELG 381


>gi|260815243|ref|XP_002602383.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
 gi|229287692|gb|EEN58395.1| hypothetical protein BRAFLDRAFT_63528 [Branchiostoma floridae]
          Length = 397

 Score =  333 bits (854), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 170/403 (42%), Positives = 243/403 (60%), Gaps = 47/403 (11%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           ++ +L+  DA+ K  +DF  KT  G AVTI+   F+  L   ++  Y  +  TEELFVD+
Sbjct: 6   WAAKLRRFDAYPKTLDDFRVKTFGGAAVTIISGFFMILLFVSELQYYLTLEVTEELFVDT 65

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE- 121
           SRG K+ I++DI+   + C YL++DA+D +GEQ + V+HN++KRR+DL G  + EP+KE 
Sbjct: 66  SRGEKMRINIDILFHKVPCAYLSIDAMDIAGEQQIDVDHNLFKRRMDLQGNILDEPEKED 125

Query: 122 -------VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
                   + A+KK     EN T          C SCYGAETE  KCCNTC +V+EAYR 
Sbjct: 126 LGDPSDEFMQAIKK----LENKTADV-------CESCYGAETEDLKCCNTCEDVREAYRR 174

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           K WA    DTI QCK E  +EKLK    EGCQ+YGYLEVN+V+G+FH APG S+  +HVH
Sbjct: 175 KGWAFNNPDTIEQCKREGWSEKLKQQKNEGCQVYGYLEVNKVAGNFHFAPGKSFQQHHVH 234

Query: 235 --------VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN 286
                   VHD+QP+    FN +HH+ HLSFG    D   R  PLDG +  A++G+ M+ 
Sbjct: 235 VSCFYHPIVHDLQPFGGEKFNLSHHVNHLSFGT---DIPGRVNPLDGHMVAAKQGSMMYQ 291

Query: 287 YYIKIIPTIYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEK 330
           Y++KI+PTIY+++ G ++                  G+ G+PG+F  YELSP+MV+ TEK
Sbjct: 292 YFVKIVPTIYKKISGQEVRTNQFSVTKHQKQVTASSGEQGLPGVFVLYELSPMMVQFTEK 351

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
            +S  H  T +   + G +    L+D+L++   + I  K+++G
Sbjct: 352 QRSFMHFLTGVCAIVGGVFTVAGLIDSLIYHSARAIQQKIDLG 394


>gi|148222292|ref|NP_001091124.1| ERGIC and golgi 3 [Xenopus laevis]
 gi|120538715|gb|AAI29573.1| LOC100036873 protein [Xenopus laevis]
          Length = 384

 Score =  325 bits (834), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 168/388 (43%), Positives = 234/388 (60%), Gaps = 27/388 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+  DA+ K  EDF  KT  G  VT++  L +  L   ++  Y       ELFVD SRG
Sbjct: 6   RLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEIYPELFVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD D KP+  E  K  + 
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            ++      E+     +  DPN+C SCYGAETE   CCN+C++V+EAYR K WA    D+
Sbjct: 126 KLE------EHVVLDPKTLDPNRCESCYGAETEDFSCCNSCDDVREAYRRKGWAFKTPDS 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E  ++K++    EGCQIYG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 180 IEQCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH I+HLSFG   +D      PLDGT   A + + MF Y++KI+PT+Y ++DG  L
Sbjct: 240 NINMTHEIKHLSFG---RDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVDGEVL 296

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 297 RTNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGV 356

Query: 349 YITFMLVDALLHSCVKKIS-KVEIGGKT 375
           +    L+DAL++   + I  K+E+G  T
Sbjct: 357 FTVASLIDALIYHSTRAIQKKIELGKAT 384


>gi|327271489|ref|XP_003220520.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Anolis carolinensis]
          Length = 383

 Score =  324 bits (830), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 167/387 (43%), Positives = 239/387 (61%), Gaps = 26/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT++  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK +  P+ E    
Sbjct: 66  DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K+++   +  +      DP++C SCYGAE++  KCCNTC++V+EAYR + WA    DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGC++YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
            N TH I+HLSFG   +D      PLDGTV  A++ + MF Y++K++PTIY ++DG    
Sbjct: 240 INMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVDGEVVR 296

Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     K+     GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+E+G  T
Sbjct: 357 TVAGLIDSLIYHSARVIQKKIELGKTT 383


>gi|405966014|gb|EKC31342.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Crassostrea gigas]
          Length = 397

 Score =  323 bits (828), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 174/398 (43%), Positives = 233/398 (58%), Gaps = 31/398 (7%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F ERL+  DA+ K  EDF  KT  G  VT++  L +  L   ++  Y       ELFVD+
Sbjct: 6   FYERLRQFDAYPKTLEDFRVKTFGGALVTVISSLLMVILFISELNYYLTKDVQPELFVDT 65

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ--EPQK 120
           +RG KL I++DI  P + C YL++DA+D SGEQ L V+H+++K+RL+ DG+ I+  EP+K
Sbjct: 66  TRGQKLRINIDIDFPKVPCAYLSIDAMDVSGEQQLDVDHHLFKQRLNADGEKIKDTEPEK 125

Query: 121 E------VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
           E      +     K K   E     T+  DP++C SCYGAET   KCCNTC +V+EAYR 
Sbjct: 126 EGTMYEPIFELGDKSKDAVE---AVTKKLDPDRCESCYGAETGDLKCCNTCEDVREAYRK 182

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           K WA    + I QC  E  T K+K    EGCQ+YGYLEVN+V G+FH APG S+  +HVH
Sbjct: 183 KGWAFNSPEGIEQCNREGWTAKMKAQQKEGCQVYGYLEVNKVQGNFHFAPGKSFQQHHVH 242

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           VHD+Q +    FN +H IRHLSFG   QD      PLD T   +E+  +MF YY+K++PT
Sbjct: 243 VHDLQAFGGQKFNLSHAIRHLSFG---QDYPGIINPLDQTSQISEDEQTMFQYYVKVVPT 299

Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
            Y  + G  L                G GD G+PG+FF YELSP+MVK TEK +S  H  
Sbjct: 300 TYVDVKGKTLYTNQYSVNKHSKTVGNGMGDSGLPGVFFIYELSPMMVKYTEKQRSFMHFL 359

Query: 339 TKIMCNISGTYITFMLVDALL-HSCVKKISKVEIGGKT 375
           T +   I G +    L+D+++ HS      K+E+G  T
Sbjct: 360 TGVCAIIGGIFTVAGLIDSMIYHSSRALQKKIELGKAT 397


>gi|41055991|ref|NP_957309.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform 2 [Danio rerio]
 gi|82210123|sp|Q803I2.1|ERGI3_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|28278376|gb|AAH44474.1| ERGIC and golgi 3 [Danio rerio]
 gi|182890166|gb|AAI64701.1| Ergic3 protein [Danio rerio]
          Length = 383

 Score =  322 bits (825), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 166/384 (43%), Positives = 230/384 (59%), Gaps = 26/384 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       ELFVD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG+P+         A
Sbjct: 66  DKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPV------TTEA 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K      E G       DP++C SCYGAET+  KCCNTC++V+EAYR + WA    DTI
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDRCESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
            N TH I+HLSFG   +D      PLD T   A + + M+ Y++KI+PTIY + DG    
Sbjct: 240 INMTHFIKHLSFG---KDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGDGEVVK 296

Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     K+     GD G+PG+F  YELSP+MVK TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAIIGGVF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
               L+D+L++   + I  K+E+G
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELG 380


>gi|47575764|ref|NP_001001226.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Xenopus (Silurana) tropicalis]
 gi|82185697|sp|Q6NVS2.1|ERGI3_XENTR RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|45708932|gb|AAH67932.1| ERGIC and golgi 3 [Xenopus (Silurana) tropicalis]
          Length = 384

 Score =  321 bits (822), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 163/387 (42%), Positives = 231/387 (59%), Gaps = 25/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+  DA+ K  EDF  KT  G  VT++  L +  L   ++  Y       ELFVD SRG
Sbjct: 6   RLRQFDAYPKTLEDFRVKTCGGALVTVISGLIMLILFFSELQYYLTKEIYPELFVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD D KP+          
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDKKPVTSEADRHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             ++ V  +  +      DPN+C SCYGAET+   CCNTC++V+EAYR + WA    D+I
Sbjct: 126 KSEEHVVFDPKSL-----DPNRCESCYGAETDDFSCCNTCDDVREAYRRRGWAFKTPDSI 180

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 181 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 240

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH IRHLSFG   +D      PLDG+   A + + MF Y++KI+PT+Y ++DG  L 
Sbjct: 241 INMTHEIRHLSFG---RDYPGLVNPLDGSSVAAMQSSMMFQYFVKIVPTVYVKVDGEVLR 297

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 298 TNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 357

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+E+G  T
Sbjct: 358 TVAGLIDSLVYYSTRAIQKKIELGKAT 384


>gi|348521804|ref|XP_003448416.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Oreochromis niloticus]
          Length = 384

 Score =  320 bits (821), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 166/385 (43%), Positives = 233/385 (60%), Gaps = 27/385 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  Y       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
            KL I++DI+ P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+ QE +K  + 
Sbjct: 66  DKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                +V   +        DP++C SCYGAETE  KCCNTC++V+EAYR + WA    DT
Sbjct: 126 KADDGEVFDPSTL------DPDRCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADT 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E  T+K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH I+HLSFG   +D      PLDGT   A + + M+ Y++KI+PTIY + DG  +
Sbjct: 240 NINMTHLIKHLSFG---KDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKTDGEVV 296

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK TEK +S  H  T +   I G 
Sbjct: 297 KTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGV 356

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+L++   + I  K+E+G
Sbjct: 357 FTVAGLIDSLIYHSARVIQKKIELG 381


>gi|148225661|ref|NP_001087591.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Xenopus laevis]
 gi|82181499|sp|Q66KH2.1|ERGI3_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|51513379|gb|AAH80394.1| MGC83277 protein [Xenopus laevis]
          Length = 389

 Score =  318 bits (814), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 165/392 (42%), Positives = 232/392 (59%), Gaps = 30/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+  DA+ K  EDF  KT  G  VT++  L +  L   ++  Y       ELFVD SRG
Sbjct: 6   RLRQFDAYPKTLEDFRVKTCGGAVVTVISGLIMLILFFSELQYYLTKEVYPELFVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLDLD KP+          
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDLDKKPVTSEADRHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +++V  +  T      DPN+C SCYGAET+   CCN+C++V+EAYR K WA    D+I
Sbjct: 126 KSEEQVVFDPKTL-----DPNRCESCYGAETDDFSCCNSCDDVREAYRRKGWAFKTPDSI 180

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 181 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 240

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH I+HLSFG   +D      PLDGT   A + + MF Y++KI+PT+Y ++D
Sbjct: 241 FGLDNINMTHEIKHLSFG---KDYPGLVNPLDGTSIVAMQSSMMFQYFVKIVPTVYVKVD 297

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK TEK +S  H  T +   
Sbjct: 298 GEVLRTNQFSVTRHEKMTNGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAI 357

Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           I G +    L+D+L++   + I  K+E+G  T
Sbjct: 358 IGGVFTVAGLIDSLIYYSTRAIQKKIELGKAT 389


>gi|327271491|ref|XP_003220521.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Anolis carolinensis]
          Length = 388

 Score =  318 bits (814), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 167/392 (42%), Positives = 239/392 (60%), Gaps = 31/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT++  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK +  P+ E    
Sbjct: 66  DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K+++   +  +      DP++C SCYGAE++  KCCNTC++V+EAYR + WA    DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QCK E  ++K++    EGC++YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH I+HLSFG   +D      PLDGTV  A++ + MF Y++K++PTIY ++D
Sbjct: 240 FGLDNINMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPTIYMKVD 296

Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G              K+     GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           I G +    L+D+L++   + I  K+E+G  T
Sbjct: 357 IGGVFTVAGLIDSLIYHSARVIQKKIELGKTT 388


>gi|390359988|ref|XP_792057.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Strongylocentrotus purpuratus]
          Length = 400

 Score =  317 bits (812), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 169/392 (43%), Positives = 234/392 (59%), Gaps = 28/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+  DA+ K  EDF  KT  G AVTI+  + +  L   ++  Y       EL+VD++RG
Sbjct: 9   RLREFDAYPKTLEDFRVKTFGGAAVTIISSIIMITLFISELNFYLTKEVIPELYVDATRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+++IV P + C YL++DA+D SGEQ L V+HNIYKRR+D  G PI EP+KE +  
Sbjct: 69  EKLKINMEIVFPKMPCAYLSIDAMDISGEQQLDVDHNIYKRRIDKTGTPISEPEKEELGK 128

Query: 126 VKKKKVTTENGTTTT------ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
            + ++   E  +         E+ DPN+C SCYGAET   KCCN C  V+EAYR K WA 
Sbjct: 129 KEDQEKKEEEDSEQEDEKKKMEVLDPNRCESCYGAETPGLKCCNDCEGVQEAYRRKGWAF 188

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            +  +I QCK E  +EK+++   EGC++YGYLEVN+V+G+FH APG S+  +HVHVHD+Q
Sbjct: 189 SDPTSIEQCKREGFSEKMQSQKEEGCELYGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQ 248

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
               A FN THH++ LSFG++         PLD       +G+SMF Y++KI+PT Y +L
Sbjct: 249 AIAGAKFNMTHHVKTLSFGMEYPG---MENPLDNMKTIDVKGSSMFQYFVKIVPTTYTKL 305

Query: 300 DGS------------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
           D S                      G+ G+PG+F  YELSPLMVK TEK +S  H  T +
Sbjct: 306 DKSITRTNQYSVTKHEKQVTTSFSTGEHGLPGVFVLYELSPLMVKFTEKHRSFMHFLTGV 365

Query: 342 MCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
              I G +    L+D+L++   K I  K+++G
Sbjct: 366 CAIIGGVFTVAGLIDSLIYHSAKAIQKKIDLG 397


>gi|259155256|ref|NP_001158869.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Salmo salar]
 gi|223647782|gb|ACN10649.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Salmo salar]
          Length = 388

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 167/390 (42%), Positives = 230/390 (58%), Gaps = 33/390 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       ELFVD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P       V   
Sbjct: 66  DKLKININVIFPNMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGNP-------VTTE 118

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            +K  +  E G      + DP +C SCYGAETE  KCCNTC++V+EAYR + WA    DT
Sbjct: 119 AEKHDLGQEEGEIFDPSKLDPERCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QCK E  ++K++    EGCQIYG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCKREGFSQKMQEQKNEGCQIYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH I+HLSFG   +D      PLDGT   A + + M+ Y++KI+PTIY + 
Sbjct: 239 SFGLDNINMTHLIKHLSFG---RDYPGIVNPLDGTDVAAPQASMMYQYFVKIVPTIYVKW 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  +                  GD G+PG+F  YELSP+MVK TEK +S  H  T +  
Sbjct: 296 DGEVVKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            + G +    L+D+L++   K I  K+E+G
Sbjct: 356 IVGGVFTVAGLIDSLIYHSAKAIQKKIELG 385


>gi|387015776|gb|AFJ50007.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3-like
           [Crotalus adamanteus]
          Length = 372

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 164/387 (42%), Positives = 228/387 (58%), Gaps = 37/387 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT++  L + +L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGAFVTVISGLIMFFLFFSELQYYLTKEIHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI  P + C YL++DA+D +GEQ L VEHN++K+RLD D    +E      N+
Sbjct: 66  DKLRINIDIAFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDELGKEEELFFNPNS 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +                 DP +C SCYGAE+E  KCCN C++V+EAYR + WA    DTI
Sbjct: 126 L-----------------DPERCESCYGAESEDIKCCNNCDDVREAYRRRGWAFKNPDTI 168

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  +EK++    EGC++YG+LEVN+V+G+FH APG S+  +HVHVHD+Q Y    
Sbjct: 169 EQCKREGFSEKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSYGLDN 228

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH IRHLSFG   +D      PLDGT+  A + + MF Y++K++PT+Y ++DG  + 
Sbjct: 229 INITHFIRHLSFG---KDYPGLVNPLDGTIVTAHQASMMFQYFVKVVPTVYMKVDGEMVR 285

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 286 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGVF 345

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+E+G  T
Sbjct: 346 TVAGLIDSLIYHSARAIQKKIELGKTT 372


>gi|417399979|gb|JAA46966.1| Putative copii vesicle protein [Desmodus rotundus]
          Length = 383

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 167/387 (43%), Positives = 228/387 (58%), Gaps = 25/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +     
Sbjct: 66  DKLKINIDVFFPRMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             + KV   N        DP +C SCYGAETE  KCCNTC +V+EAYR + WA    DTI
Sbjct: 126 KAEMKVFDPNSL------DPERCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y +LDG  L 
Sbjct: 240 INMTHYIRHLSFG---EDYPGIVNPLDHTNVTALQASMMFQYFVKVVPTVYMKLDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKTV 376
               L+D+L++   + I K    GKTV
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKTV 383


>gi|74315943|ref|NP_001028277.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform 1 [Danio rerio]
 gi|72679324|gb|AAI00126.1| ERGIC and golgi 3 [Danio rerio]
          Length = 388

 Score =  315 bits (808), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 166/389 (42%), Positives = 230/389 (59%), Gaps = 31/389 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       ELFVD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRIKTCGGATVTIISGLIMLILFFSELQYYLTKEVHPELFVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG+P+         A
Sbjct: 66  DKLRINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGQPV------TTEA 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K      E G       DP++C SCYGAET+  KCCNTC++V+EAYR + WA    DTI
Sbjct: 120 EKHDLGKEEEGVFDPSTLDPDRCESCYGAETDDLKCCNTCDDVREAYRRRGWAFKTPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH I+HLSFG   +D      PLD T   A + + M+ Y++KI+PTIY + D
Sbjct: 240 FGLDNINMTHFIKHLSFG---KDYPGIVNPLDDTNVAAPQASMMYQYFVKIVPTIYVKGD 296

Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G              K+     GD G+PG+F  YELSP+MVK TEK +S  H  T +   
Sbjct: 297 GEVVKTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKFTEKQRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIG 372
           I G +    L+D+L++   + I  K+E+G
Sbjct: 357 IGGVFTVAGLIDSLIYHSARAIQKKIELG 385


>gi|224077228|ref|XP_002191084.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Taeniopygia guttata]
          Length = 383

 Score =  315 bits (807), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 160/387 (41%), Positives = 232/387 (59%), Gaps = 26/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT V  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTAVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+LD++ P + C YL++DA+D +G+Q L VEHN++K+RLD  G  +    +     
Sbjct: 66  DKLKINLDVIFPHMPCAYLSIDAMDVAGDQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            +++KV   N        D ++C SCYGAE+E  +CCNTC++V+EAYR + WA    D+I
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDSI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
            N TH+I+HLSFG   +D      PLDGT   A++ + MF Y++K++PT+Y ++DG    
Sbjct: 240 INMTHYIKHLSFG---RDYPGIVNPLDGTAVTAQQASMMFQYFVKVVPTVYRKVDGEVVR 296

Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     K+     GD G+PG+F  YELSP+MVK+TEK +S  H  T +   + G +
Sbjct: 297 TNQFSVTQHEKIANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFVTGVCAIVGGIF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
                +D+L++   + I  K+E+G  T
Sbjct: 357 TVAGFIDSLIYHSARAIQKKIELGKTT 383


>gi|431894341|gb|ELK04141.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pteropus alecto]
          Length = 383

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 167/387 (43%), Positives = 232/387 (59%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  T+K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y +LDG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKLDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP++VK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMVVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|344279905|ref|XP_003411726.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Loxodonta africana]
          Length = 386

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 166/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 9   KLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 69  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 128

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 129 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 181

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 182 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 241

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 242 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 298

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 299 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 358

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 359 FTVAGLIDSLIYHSARAIQKKIDLGKT 385


>gi|301762088|ref|XP_002916455.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Ailuropoda melanoleuca]
          Length = 383

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 166/388 (42%), Positives = 233/388 (60%), Gaps = 27/388 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKTV 376
           +    L+D+L++   + I K    GKT+
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKTM 383


>gi|363741418|ref|XP_003642491.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Gallus gallus]
 gi|363741445|ref|XP_003642499.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Gallus gallus]
          Length = 383

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 161/387 (41%), Positives = 231/387 (59%), Gaps = 25/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT+V  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD  G  +    +     
Sbjct: 66  DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            +++KV   N        D ++C SCYGAE+E  +CCNTC++V+EAYR + WA    DTI
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
            N TH+I+HLSFG   +D      PLDGT   A++ + MF Y++K++PT+Y ++DG    
Sbjct: 240 INMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVR 296

Query: 302 ---------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     K+     GD G+PG+F  YELSP+MVK+TEK +   H  T +   + G +
Sbjct: 297 TNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKTV 376
                +D+L++   + I K    GKT+
Sbjct: 357 TVAGFIDSLIYHSARAIQKKIELGKTI 383


>gi|348521802|ref|XP_003448415.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Oreochromis niloticus]
          Length = 389

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 166/390 (42%), Positives = 233/390 (59%), Gaps = 32/390 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  Y       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
            KL I++DI+ P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+ QE +K  + 
Sbjct: 66  DKLKINIDIIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKEFKPVTQEAEKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                +V   +        DP++C SCYGAETE  KCCNTC++V+EAYR + WA    DT
Sbjct: 126 KADDGEVFDPSTL------DPDRCESCYGAETEDLKCCNTCDDVREAYRRRGWAFKSADT 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH-----DIQ 239
           I QCK E  T+K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH     D+Q
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 239

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH I+HLSFG   +D      PLDGT   A + + M+ Y++KI+PTIY + 
Sbjct: 240 SFGLDNINMTHLIKHLSFG---KDYPGLVNPLDGTDVTAPQASMMYQYFVKIVPTIYMKT 296

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  +                  GD G+PG+F  YELSP+MVK TEK +S  H  T +  
Sbjct: 297 DGEVVKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCA 356

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    L+D+L++   + I  K+E+G
Sbjct: 357 IIGGVFTVAGLIDSLIYHSARVIQKKIELG 386


>gi|410953936|ref|XP_003983624.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Felis catus]
          Length = 383

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 166/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|327271493|ref|XP_003220522.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 3 [Anolis carolinensis]
          Length = 394

 Score =  314 bits (805), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 166/398 (41%), Positives = 240/398 (60%), Gaps = 37/398 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT++  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTVISGLIMFLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD DGK +  P+ E    
Sbjct: 66  DKLRINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGKHVT-PEAERHEL 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K+++   +  +      DP++C SCYGAE++  KCCNTC++V+EAYR + WA    DTI
Sbjct: 125 GKEEETIFDPNSL-----DPDRCESCYGAESDDIKCCNTCDDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E  ++K++    EGC++YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   +
Sbjct: 180 EQCKREGFSQKMQEQKNEGCKVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 246 F-----------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           F           N TH I+HLSFG   +D      PLDGTV  A++ + MF Y++K++PT
Sbjct: 240 FGLDNVSILGKINMTHIIKHLSFG---RDYPGIVNPLDGTVVSAQQASMMFQYFVKVVPT 296

Query: 295 IYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           IY ++DG              K+     GD G+PG+F  YELSP+MVK+TEK +S  H  
Sbjct: 297 IYMKVDGEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 356

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           T +   I G +    L+D+L++   + I  K+E+G  T
Sbjct: 357 TGVCAIIGGVFTVAGLIDSLIYHSARVIQKKIELGKTT 394


>gi|440797665|gb|ELR18746.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 383

 Score =  314 bits (804), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 154/388 (39%), Positives = 236/388 (60%), Gaps = 30/388 (7%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F ++LK  DA+ K  EDF  +TV G AV+I+  L I++L   ++  Y       ELFVD+
Sbjct: 5   FFKKLKSFDAYPKTLEDFRVRTVSGAAVSIISGLIITWLFFSELSFYLSTDVQPELFVDT 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG KL I++D+  P + C YL++DA+D SGE  L VEHNI+K+RL  DG+P+       
Sbjct: 65  SRGEKLRINMDVTFPDLPCGYLSVDAMDVSGEHQLDVEHNIFKKRLAADGRPL------- 117

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
              ++K ++      +  +  +P +CGSCYG+E E  +CCNTC EV+E+YR K WA    
Sbjct: 118 --GIEKGELEAAATPSPGQELEPIECGSCYGSEQEPGQCCNTCAEVRESYRKKGWAFAHP 175

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           ++I QC  E  +E L+    EGCQ+YG++ VN+V+G+FH APG S+  +H+HVHD+QP+ 
Sbjct: 176 ESIEQCAREGFSENLEKQKGEGCQVYGHILVNKVAGNFHFAPGKSFQAHHMHVHDLQPFR 235

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA--SMFNYYIKIIPTIYERLD 300
            +++N +H I  +SFG +         PLDG     + GA  +M+ Y++KI+PTIYE LD
Sbjct: 236 MSSWNISHRINRISFGKEFPG---VINPLDGVEKTTDPGAGSAMYQYFVKIVPTIYESLD 292

Query: 301 GSKLG---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           G+ +                G   G+PG+F  Y+LSP+MVK TE++KS  H  T +   I
Sbjct: 293 GNVINTNQFSVTEHTRMLPPGDKSGLPGLFVMYDLSPIMVKFTERTKSFAHFLTGVCAII 352

Query: 346 SGTYITFMLVDALLHSCVKKIS-KVEIG 372
            G +    ++D+L+++ ++ +  K+E+G
Sbjct: 353 GGVFTVAGIIDSLIYNSLRTLGKKMELG 380


>gi|284004911|ref|NP_001164802.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Oryctolagus cuniculus]
 gi|217038333|gb|ACJ76626.1| serologically defined breast cancer antigen 84 isoform b
           (predicted) [Oryctolagus cuniculus]
          Length = 383

 Score =  313 bits (803), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 165/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  N  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|348564091|ref|XP_003467839.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cavia porcellus]
          Length = 383

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 164/387 (42%), Positives = 235/387 (60%), Gaps = 26/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDLKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+E+G  T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELGKTT 383


>gi|13384938|ref|NP_079792.1| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Mus
           musculus]
 gi|37999778|sp|Q9CQE7.1|ERGI3_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3; AltName: Full=Serologically defined breast
           cancer antigen NY-BR-84 homolog
 gi|12844094|dbj|BAB26233.1| unnamed protein product [Mus musculus]
 gi|12851518|dbj|BAB29073.1| unnamed protein product [Mus musculus]
 gi|26341008|dbj|BAC34166.1| unnamed protein product [Mus musculus]
 gi|27882157|gb|AAH43720.1| ERGIC and golgi 3 [Mus musculus]
 gi|148674217|gb|EDL06164.1| ERGIC and golgi 3, isoform CRA_d [Mus musculus]
          Length = 383

 Score =  313 bits (802), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DPN+C SCYGAE+E  KCCN+C +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|395830112|ref|XP_003788179.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Otolemur garnettii]
          Length = 383

 Score =  313 bits (802), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  N  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|157820783|ref|NP_001100003.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Rattus norvegicus]
 gi|149030853|gb|EDL85880.1| ERGIC and golgi 3 (predicted) [Rattus norvegicus]
          Length = 383

 Score =  313 bits (801), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DPN+C SCYGAE+E  KCCN+C +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|126291179|ref|XP_001371602.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Monodelphis domestica]
          Length = 383

 Score =  312 bits (800), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 163/387 (42%), Positives = 228/387 (58%), Gaps = 26/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+         A
Sbjct: 66  DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + +    E         DP +C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+IR LSFG   +D      PLD T   A + + MF Y++K++PT+Y ++ G  L 
Sbjct: 240 INMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVSGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 SNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+E+G  T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIELGKTT 383


>gi|359322740|ref|XP_864582.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Canis lupus familiaris]
          Length = 383

 Score =  312 bits (800), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 164/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        +P++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LNPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   + G 
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCAIVGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|426241390|ref|XP_004014574.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Ovis aries]
          Length = 383

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 164/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|194044515|ref|XP_001929457.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Sus scrofa]
 gi|350594868|ref|XP_003483992.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Sus scrofa]
          Length = 383

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/387 (42%), Positives = 233/387 (60%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|296199725|ref|XP_002747286.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Callithrix jacchus]
 gi|403281165|ref|XP_003932068.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Saimiri boliviensis boliviensis]
          Length = 383

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 233/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|156389237|ref|XP_001634898.1| predicted protein [Nematostella vectensis]
 gi|156221986|gb|EDO42835.1| predicted protein [Nematostella vectensis]
          Length = 386

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 163/383 (42%), Positives = 222/383 (57%), Gaps = 26/383 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DA+ K  EDF  KT  G AVT +    +  L   ++  Y       ELFVD++R  
Sbjct: 10  LRRFDAYPKTLEDFRIKTFGGAAVTFISGFLMFILFVSELNYYLTTEVNPELFVDTTRAQ 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I+++IV P + C YL++DA+D SGEQ + V  NI KRR+DLDGK I E      NA 
Sbjct: 70  KLRINVEIVFPKLPCVYLSIDAMDVSGEQQIDVSSNILKRRVDLDGKIIDE------NAE 123

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K       +        DPN+C SCYGAET  +KCCNTC++V+EAYR K WAL  +D + 
Sbjct: 124 KGDLGDKSHEAKELLDLDPNRCESCYGAETPDKKCCNTCDDVREAYRRKGWALSNVDDVK 183

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC  E   +KL+    EGC++ GYLEVN+V+G+FH APG S+  +HVHVHD+QP+ S  F
Sbjct: 184 QCMREGWKDKLQEQKNEGCEVTGYLEVNKVAGNFHFAPGKSFQQHHVHVHDLQPFGSTQF 243

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG- 305
           N TH+I+HLSFG    D   +  PLD T   A E  SM+ Y++KI+PT Y +L G  L  
Sbjct: 244 NLTHNIKHLSFG---HDYPGKTYPLDNTFVPAMEAGSMYQYFVKIVPTTYRKLSGEILHT 300

Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                           G+ G+PG+F  YE SP+MV+ TE  +S  H  T +   + G + 
Sbjct: 301 HQFSVTKHKRVIRQMSGEHGLPGVFVLYEFSPMMVQYTESRRSFMHFLTGVCAIVGGIFT 360

Query: 351 TFMLVDALL-HSCVKKISKVEIG 372
              LVD+++ HS      K+++G
Sbjct: 361 VAGLVDSMIYHSSRALQKKIDLG 383


>gi|354477966|ref|XP_003501188.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Cricetulus griseus]
 gi|344246673|gb|EGW02777.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Cricetulus griseus]
          Length = 383

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 162/386 (41%), Positives = 232/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +V   +  +     DPN+C SCYGAE++  KCCN+C +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVAVFDPNSL----DPNRCESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|410262554|gb|JAA19243.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 163/387 (42%), Positives = 234/387 (60%), Gaps = 26/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKIS-KVEIGGKT 375
               L+D+L++   + I  K+++G  T
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKAT 383


>gi|7706278|ref|NP_057050.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Homo sapiens]
 gi|332858219|ref|XP_003316930.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Pan troglodytes]
 gi|397523795|ref|XP_003831904.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Pan paniscus]
 gi|37999823|sp|Q9Y282.1|ERGI3_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3; AltName: Full=Serologically defined breast
           cancer antigen NY-BR-84
 gi|4689108|gb|AAD27763.1|AF077030_1 hypothetical 43.2 kDa protein [Homo sapiens]
 gi|4929577|gb|AAD34049.1|AF151812_1 CGI-54 protein [Homo sapiens]
 gi|7671663|emb|CAB89412.1| ERGIC and golgi 3 [Homo sapiens]
 gi|14602515|gb|AAH09765.1| ERGIC and golgi 3 [Homo sapiens]
 gi|15559308|gb|AAH14014.1| ERGIC and golgi 3 [Homo sapiens]
 gi|119596605|gb|EAW76199.1| ERGIC and golgi 3, isoform CRA_a [Homo sapiens]
 gi|124249802|gb|ABM92879.1| endoplasmic reticulum-localized protein ERp43 [Homo sapiens]
 gi|312152490|gb|ADQ32757.1| ERGIC and golgi 3 [synthetic construct]
 gi|380785591|gb|AFE64671.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|383419067|gb|AFH32747.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|384947602|gb|AFI37406.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform b [Macaca mulatta]
 gi|410342895|gb|JAA40394.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 232/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|109092202|ref|XP_001098982.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Macaca mulatta]
          Length = 383

 Score =  311 bits (797), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 164/386 (42%), Positives = 232/386 (60%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLK 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|410926566|ref|XP_003976749.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Takifugu rubripes]
          Length = 384

 Score =  311 bits (796), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 165/385 (42%), Positives = 231/385 (60%), Gaps = 27/385 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  Y       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++IV P + C YL++DA+D +GEQ L VEHN++K+RLD + +P+  E +K  + 
Sbjct: 66  DKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
              +  V   + +T     DP +C SCYGAET+  KCCN+C++V+EAYR + WA    DT
Sbjct: 126 G--EDDVPVFDPSTL----DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E  T+K++    EGCQ+YG LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 239

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH IRHLSFG   QD      PLD T   A + + M+ Y++KI+PTIY + DG  L
Sbjct: 240 NINMTHLIRHLSFG---QDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKTDGEVL 296

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK TEK +S  H  T +   I G 
Sbjct: 297 KTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCAIIGGV 356

Query: 349 YITFMLVDALLHSCVKKIS-KVEIG 372
           +    L+D+L++   + I  K+E+G
Sbjct: 357 FTVAGLIDSLIYHSARVIQKKIELG 381


>gi|164448602|ref|NP_001029525.2| endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
           taurus]
 gi|75057944|sp|Q5EAE0.1|ERGI3_BOVIN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|59857621|gb|AAX08645.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|59857623|gb|AAX08646.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|59857741|gb|AAX08705.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
 gi|110665562|gb|ABG81427.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 383

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 163/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 296 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 355

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 356 FTVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|190402265|gb|ACE77675.1| ERGIC and golgi 3 (predicted) [Sorex araneus]
          Length = 388

 Score =  310 bits (793), Expect = 9e-82,   Method: Compositional matrix adjust.
 Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGVPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            ++ K    ++        DPN+C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KIEVKVFDPDS-------LDPNRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCQREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|95767625|gb|ABF57320.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 380

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 163/387 (42%), Positives = 232/387 (59%), Gaps = 27/387 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 3   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 62

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 63  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 123 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 175

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 176 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 235

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 236 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 292

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 293 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 352

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 353 FTVAGLIDSLIYHSARAIQKKIDLGKT 379


>gi|410218732|gb|JAA06585.1| ERGIC and golgi 3 [Pan troglodytes]
          Length = 383

 Score =  310 bits (793), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++ +RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFNQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|197100234|ref|NP_001126130.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pongo abelii]
 gi|75041559|sp|Q5R8G3.1|ERGI3_PONAB RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|55730450|emb|CAH91947.1| hypothetical protein [Pongo abelii]
          Length = 383

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 25/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+E YR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVRETYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 296

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 297 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 356

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 357 TVAGLIDSLIYHSARAIQKKIDLGKT 382


>gi|184185558|gb|ACC68956.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Rhinolophus ferrumequinum]
          Length = 388

 Score =  309 bits (792), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 167/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y +L
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKL 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|344279907|ref|XP_003411727.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Loxodonta africana]
          Length = 391

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 9   KLKQFDAYPKTLEDFRIKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 69  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 128

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 129 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 181

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 182 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 241

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 242 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 298

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 299 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 358

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 359 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 390


>gi|432101449|gb|ELK29631.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Myotis davidii]
          Length = 391

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 167/395 (42%), Positives = 232/395 (58%), Gaps = 35/395 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEMKVFDPDS-------LDPHRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY--- 241
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 242 -----TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
                     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y
Sbjct: 239 NVCTRCCLQINMTHYIRHLSFG---EDYPGIVNPLDRTNVTALQASMMFQYFVKVVPTVY 295

Query: 297 ERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
            +LDG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T 
Sbjct: 296 MKLDGQVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTG 355

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           +   I G +    L+D+L++   + I K    GKT
Sbjct: 356 VCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 390


>gi|194751543|ref|XP_001958085.1| GF10736 [Drosophila ananassae]
 gi|190625367|gb|EDV40891.1| GF10736 [Drosophila ananassae]
          Length = 372

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 172/390 (44%), Positives = 236/390 (60%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++  +Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMQPTMNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  + C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLGCNYVSLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE  +  CCNTC EV +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNSTHCCNTCEEVLDAYRLRKWNV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG   + EE  S MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGMHVEVEEKKSEMFNYYLKIVPTLYMR 279

Query: 299 LDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
               K               L   + GMPGIFFSYELSPLMVK  EK  S GH  T    
Sbjct: 280 DSDGKPIYTNQFSVTRHRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    ++  LL++ ++ I  K+E+G
Sbjct: 340 IIGGVFTVAGILAVLLNNSLEAIQRKLEVG 369


>gi|363741420|ref|XP_003642492.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Gallus gallus]
 gi|363741447|ref|XP_003642500.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Gallus gallus]
          Length = 388

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 161/392 (41%), Positives = 231/392 (58%), Gaps = 30/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK  DAF K  EDF  KT  G  VT+V  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   RLKRFDAFPKTLEDFRVKTCGGALVTVVSGLIMVLLFFSELQYYLTKEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+V P + C YL++DA+D +GEQ L VEHN++K+RLD  G  +    +     
Sbjct: 66  DKLKINIDVVFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAGNRVTPEAERHELG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            +++KV   N        D ++C SCYGAE+E  +CCNTC++V+EAYR + WA    DTI
Sbjct: 126 KEEEKVFDPNSL------DADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLDGT   A++ + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVD 296

Query: 301 G-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G              K+     GD G+PG+F  YELSP+MVK+TEK +   H  T +   
Sbjct: 297 GEVVRTNQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           + G +     +D+L++   + I K    GKT+
Sbjct: 357 VGGIFTVAGFIDSLIYHSARAIQKKIELGKTI 388


>gi|229368723|gb|ACQ63006.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Dasypus novemcinctus]
          Length = 388

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 166/392 (42%), Positives = 233/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|301762086|ref|XP_002916454.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 1 [Ailuropoda melanoleuca]
          Length = 388

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 166/393 (42%), Positives = 233/393 (59%), Gaps = 32/393 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
            I G +    L+D+L++   + I K    GKT+
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTM 388


>gi|410953938|ref|XP_003983625.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Felis catus]
          Length = 388

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 166/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|281346059|gb|EFB21643.1| hypothetical protein PANDA_004535 [Ailuropoda melanoleuca]
          Length = 387

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 166/392 (42%), Positives = 232/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|95767501|gb|ABF57305.1| serologically defined breast cancer antigen 84 [Bos taurus]
          Length = 376

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 162/385 (42%), Positives = 230/385 (59%), Gaps = 27/385 (7%)

Query: 8   KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
           K  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG K
Sbjct: 1   KQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDK 60

Query: 68  LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNAV 126
           L I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  +  V
Sbjct: 61  LKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELGKV 120

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DTI 
Sbjct: 121 EVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDTIE 173

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +     
Sbjct: 174 QCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNI 233

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
           N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L  
Sbjct: 234 NMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRT 290

Query: 307 ----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                           GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G + 
Sbjct: 291 NQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFT 350

Query: 351 TFMLVDALLHSCVKKISKVEIGGKT 375
              L+D+L++   + I K    GKT
Sbjct: 351 VAGLIDSLIYHSARAIQKKIDLGKT 375


>gi|194872681|ref|XP_001973062.1| GG13555 [Drosophila erecta]
 gi|190654845|gb|EDV52088.1| GG13555 [Drosophila erecta]
          Length = 373

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 172/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++V +Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC EV +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEEVLDAYRLRKWNV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
             +D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG  V  AE  + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVEVAETKSEMFNYYLKIVPTLYMR 279

Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              DG                L   + GMPGIFFSYELSPLMVK  EK  S GH  T   
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKRSSFGHFATNCC 339

Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
             I G +    ++  LL++  + +  K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370


>gi|395830114|ref|XP_003788180.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Otolemur garnettii]
 gi|197215642|gb|ACH53034.1| ERGIC and golgi 3 (predicted) [Otolemur garnettii]
          Length = 388

 Score =  307 bits (786), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 164/391 (41%), Positives = 233/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  N  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFNPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|126291176|ref|XP_001371575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Monodelphis domestica]
          Length = 388

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 163/392 (41%), Positives = 228/392 (58%), Gaps = 31/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+         A
Sbjct: 66  DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + +    E         DP +C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+IR LSFG   +D      PLD T   A + + MF Y++K++PT+Y ++ 
Sbjct: 240 FGLDNINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVVPTVYMKVS 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           I G +    L+D+L++   + I  K+E+G  T
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 388


>gi|195327731|ref|XP_002030571.1| GM24497 [Drosophila sechellia]
 gi|195590409|ref|XP_002084938.1| GD12569 [Drosophila simulans]
 gi|194119514|gb|EDW41557.1| GM24497 [Drosophila sechellia]
 gi|194196947|gb|EDX10523.1| GD12569 [Drosophila simulans]
          Length = 373

 Score =  306 bits (785), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 172/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++V +Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWTV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
             +D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG  V  AE  + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279

Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              DG                L   + GMPGIFFSYELSPLMVK  EK  S GH  T   
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCC 339

Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
             I G +    ++  LL++  + I  K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370


>gi|359322742|ref|XP_851879.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Canis lupus familiaris]
          Length = 388

 Score =  306 bits (784), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 164/392 (41%), Positives = 232/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        +P++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LNPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTSVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            + G +    L+D+L++   + I K    GKT
Sbjct: 356 IVGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|426241392|ref|XP_004014575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Ovis aries]
          Length = 388

 Score =  306 bits (783), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 164/392 (41%), Positives = 233/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|194044517|ref|XP_001929458.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Sus scrofa]
 gi|350594870|ref|XP_003483993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Sus scrofa]
          Length = 388

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 164/392 (41%), Positives = 233/392 (59%), Gaps = 32/392 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 239 SFGLDNINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKV 295

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +  
Sbjct: 296 DGEVLRTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCA 355

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            I G +    L+D+L++   + I K    GKT
Sbjct: 356 IIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|296199723|ref|XP_002747285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Callithrix jacchus]
 gi|403281167|ref|XP_003932069.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Saimiri boliviensis boliviensis]
 gi|166831592|gb|ABY90117.1| serologically defined breast cancer antigen 84 isoform a
           (predicted) [Callithrix jacchus]
          Length = 388

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 164/391 (41%), Positives = 233/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|195495133|ref|XP_002095138.1| GE19855 [Drosophila yakuba]
 gi|194181239|gb|EDW94850.1| GE19855 [Drosophila yakuba]
          Length = 373

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 171/391 (43%), Positives = 237/391 (60%), Gaps = 40/391 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++V +Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVINYMQPTLNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWNV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
             +D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG  V  AE  + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279

Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              DG                L   + GMPGIFFSYELSPLMVK  EK  S GH  T   
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAEKHSSFGHFATNCC 339

Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
             I G +    ++  LL++  + +  K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEALQRKLEVG 370


>gi|109092200|ref|XP_001098885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Macaca mulatta]
          Length = 388

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLKTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|38327615|ref|NP_938408.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform a [Homo sapiens]
 gi|281182526|ref|NP_001162565.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Papio anubis]
 gi|397523797|ref|XP_003831905.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Pan paniscus]
 gi|410055053|ref|XP_003953764.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Pan troglodytes]
 gi|57208593|emb|CAI42842.1| ERGIC and golgi 3 [Homo sapiens]
 gi|164623746|gb|ABY64672.1| ERGIC and golgi 3, isoform 1 (predicted) [Papio anubis]
 gi|380785589|gb|AFE64670.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           isoform a [Macaca mulatta]
          Length = 388

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|410926568|ref|XP_003976750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Takifugu rubripes]
          Length = 389

 Score =  305 bits (781), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 165/390 (42%), Positives = 231/390 (59%), Gaps = 32/390 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  Y       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVLMLILFVSELQYYLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++IV P + C YL++DA+D +GEQ L VEHN++K+RLD + +P+  E +K  + 
Sbjct: 66  DKLKININIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLQPVSTEAEKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
              +  V   + +T     DP +C SCYGAET+  KCCN+C++V+EAYR + WA    DT
Sbjct: 126 G--EDDVPVFDPSTL----DPERCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQ 239
           I QCK E  T+K++    EGCQ+YG LEVN+V+G+FH APG S+  +HVHV     HD+Q
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 239

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH IRHLSFG   QD      PLD T   A + + M+ Y++KI+PTIY + 
Sbjct: 240 SFGLDNINMTHLIRHLSFG---QDYPGLINPLDDTNITAPQASMMYQYFVKIVPTIYVKT 296

Query: 300 DGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           DG  L                  GD G+PG+F  YELSP+MVK TEK +S  H  T +  
Sbjct: 297 DGEVLKTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKFTEKHRSFTHFLTGVCA 356

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    L+D+L++   + I  K+E+G
Sbjct: 357 IIGGVFTVAGLIDSLIYHSARVIQKKIELG 386


>gi|22760064|dbj|BAC11054.1| unnamed protein product [Homo sapiens]
          Length = 388

 Score =  305 bits (781), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 164/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDDINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|354477968|ref|XP_003501189.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Cricetulus griseus]
          Length = 388

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 162/391 (41%), Positives = 232/391 (59%), Gaps = 30/391 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +V   +  +     DPN+C SCYGAE++  KCCN+C +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVAVFDPNSL----DPNRCESCYGAESDDIKCCNSCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQP 240
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q 
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++D
Sbjct: 240 FGLDNINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVD 296

Query: 301 GSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +   
Sbjct: 297 GEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAI 356

Query: 345 ISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           I G +    L+D+L++   + I K    GKT
Sbjct: 357 IGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 387


>gi|195378906|ref|XP_002048222.1| GJ11466 [Drosophila virilis]
 gi|194155380|gb|EDW70564.1| GJ11466 [Drosophila virilis]
          Length = 372

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 167/390 (42%), Positives = 241/390 (61%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS L+ ++  +Y +   +EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLVFLEFLNYMKPMLSEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLEGQPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K +T              CGSCYGAE     CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNATHCCNTCEDVLDAYRVRKWNM 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
            +T+     +H I HLSFG K++    +  PLDG   + +E  S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGLRVEVQESKSEMFNYYLKIVPTLYER 279

Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             DG                L   + GMPGIFFSYELSPLMVK  E+  S GH  T    
Sbjct: 280 HSDGQPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            + G +    ++  LL++  + +  K+E+G
Sbjct: 340 IVGGVFTVAGILAVLLNNSWEALQRKLEVG 369


>gi|34849462|gb|AAH57130.1| Ergic3 protein [Mus musculus]
          Length = 394

 Score =  304 bits (778), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 163/397 (41%), Positives = 234/397 (58%), Gaps = 36/397 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DPN+C SCYGAE+E  KCCN+C +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 246 F-----------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           F           N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT
Sbjct: 240 FGLDNPSDCLQINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPT 296

Query: 295 IYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  
Sbjct: 297 VYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 356

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           T +   I G +    L+D+L++   + I K    GKT
Sbjct: 357 TGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 393


>gi|75077200|sp|Q4R8X1.1|ERGI3_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 3
 gi|67967936|dbj|BAE00450.1| unnamed protein product [Macaca fascicularis]
          Length = 382

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 163/386 (42%), Positives = 231/386 (59%), Gaps = 26/386 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGTPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT     +     DP++C SCYGAE E  KCCNTC +V+EAYR ++ A    DTI
Sbjct: 124 LGKVEVTVFGPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYR-RRGAFKNPDTI 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 179 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 238

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 239 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 295

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 296 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 355

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 356 TVAGLIDSLIYHSARAIQKKIDLGKT 381


>gi|195441336|ref|XP_002068468.1| GK20487 [Drosophila willistoni]
 gi|194164553|gb|EDW79454.1| GK20487 [Drosophila willistoni]
          Length = 372

 Score =  303 bits (777), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 169/390 (43%), Positives = 236/390 (60%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++  +Y + +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLNYMRPTLNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++R  KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL G+P++E P 
Sbjct: 61  DTTRNHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLKGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K +T              CGSCYGAE     CCNTC +V +AY  KKW++
Sbjct: 121 KEIVAVSPANKNST--------------CGSCYGAEHNATHCCNTCEDVLDAYHLKKWSV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D + QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKLEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG     EE  S MFNYYIKI+PT+YER
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVNVEESKSEMFNYYIKIVPTLYER 279

Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             DG                L   + GMPGIFFSYELSPLMVK  E+  S GH  T    
Sbjct: 280 NSDGQPIYTNQFSVTRYRKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    ++  LL++  + I  K+E+G
Sbjct: 340 IIGGVFTVAGILAVLLNNSWEAIQRKLEVG 369


>gi|21357439|ref|NP_648758.1| CG7011 [Drosophila melanogaster]
 gi|7294304|gb|AAF49653.1| CG7011 [Drosophila melanogaster]
 gi|16768234|gb|AAL28336.1| GH25868p [Drosophila melanogaster]
 gi|220946650|gb|ACL85868.1| CG7011-PA [synthetic construct]
          Length = 373

 Score =  303 bits (776), Expect = 9e-80,   Method: Compositional matrix adjust.
 Identities = 170/391 (43%), Positives = 236/391 (60%), Gaps = 40/391 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++V +Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEVLNYMQPTLNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++R  KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL+G+P++E P 
Sbjct: 61  DTTRDHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLNGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLRKWTV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
             +D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 A-VDKIEQCKGKYKRSD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG  V  AE  + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279

Query: 299 --LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              DG                L   + GMPGIFFSYELSPLMVK  E+  S GH  T   
Sbjct: 280 GNSDGEPIYTNQFSVTRYRKDLSDRERGMPGIFFSYELSPLMVKYAERHSSFGHFATNCC 339

Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
             I G +    ++  LL++  + I  K+E+G
Sbjct: 340 SIIGGVFTVAGILAVLLNNSWEAIQRKLEVG 370


>gi|410953940|ref|XP_003983626.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 3 [Felis catus]
          Length = 399

 Score =  301 bits (772), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 165/403 (40%), Positives = 233/403 (57%), Gaps = 43/403 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 245 AF----------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
           +F                N TH+IRHLSFG   +D      PLD T   A + + MF Y+
Sbjct: 239 SFGLDNRSRLRCWYCLQINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYF 295

Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
           +K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +
Sbjct: 296 VKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHR 355

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 356 SFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 398


>gi|334310895|ref|XP_003339551.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Monodelphis domestica]
          Length = 396

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 162/400 (40%), Positives = 229/400 (57%), Gaps = 39/400 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTAEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI+ P + C YL++DA+D +GEQ L VEHN+YK+RLD DG+P+         A
Sbjct: 66  DKLKINIDILFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGRPV------TTEA 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + +    E         DP +C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 120 ERHELGKEEEKAFDPSSLDPERCESCYGAESEDSKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   +
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 239

Query: 246 F-------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
           F             N TH+IR LSFG   +D      PLD T   A + + MF Y++K++
Sbjct: 240 FGLDNVVLCWYLQINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKVV 296

Query: 293 PTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           PT+Y ++ G  L                  GD G+PG+F  YELSP+MVK+TEK +S  H
Sbjct: 297 PTVYMKVSGEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 356

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
             T +   I G +    L+D+L++   + I  K+E+G  T
Sbjct: 357 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 396


>gi|125978263|ref|XP_001353164.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
 gi|54641917|gb|EAL30666.1| GA20029 [Drosophila pseudoobscura pseudoobscura]
          Length = 372

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 171/390 (43%), Positives = 234/390 (60%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++   Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC +V +AYR  KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLHKWNV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYER 298
               +    +H I HLSFG K++    +  PLDG  V  AE  + MFNYY+KI+PT+Y R
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKSEMFNYYLKIVPTLYMR 279

Query: 299 L-DGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             DG                L   + GMPGIFFSYELSPLMVK  EK  S GH  T    
Sbjct: 280 QSDGQPIYTNQFSVTRYRKDLTDRERGMPGIFFSYELSPLMVKYAEKHNSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    ++  LL++  + I  K+++G
Sbjct: 340 IIGGVFTVAGILAVLLNNSWEAIQRKLDVG 369


>gi|57208594|emb|CAI42843.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 396

 Score =  301 bits (770), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 164/401 (40%), Positives = 232/401 (57%), Gaps = 40/401 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 4   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 64  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 121

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 122 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 177

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH----------- 234
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVH           
Sbjct: 178 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCVCRLKMIAR 237

Query: 235 ----VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIK 290
               VHD+Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K
Sbjct: 238 SLACVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVK 294

Query: 291 IIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSL 334
           ++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S 
Sbjct: 295 VVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSF 354

Query: 335 GHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 355 THFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 395


>gi|335304738|ref|XP_003360010.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Sus scrofa]
 gi|350594872|ref|XP_003134465.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like isoform 2 [Sus scrofa]
          Length = 398

 Score =  300 bits (767), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 163/402 (40%), Positives = 234/402 (58%), Gaps = 42/402 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEIKVFDPDS-------LDPDRCESCYGAETEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQ 238

Query: 245 AF---------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
           +F               N TH+I+HLSFG   +D      PLD T   A + + MF Y++
Sbjct: 239 SFGLDNVSTGHRCCLQINMTHYIQHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFV 295

Query: 290 KIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKS 333
           K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S
Sbjct: 296 KVVPTVYMKVDGEVLRTNQFSVTRHEKVASGLMGDQGLPGVFVLYELSPMMVKLTEKHRS 355

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
             H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 356 FTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 397


>gi|195126511|ref|XP_002007714.1| GI12235 [Drosophila mojavensis]
 gi|193919323|gb|EDW18190.1| GI12235 [Drosophila mojavensis]
          Length = 372

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 171/390 (43%), Positives = 241/390 (61%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++  +Y + + TEELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLIFLECLNYMRPTLTEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDLDG P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLDGNPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K +T              CGSCYGAE  +  CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
            +T+     +H I HLSFG K++    +  PLDG     EE  S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGLRVDVEESKSEMFNYYLKIVPTLYER 279

Query: 299 LDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
               K               L   + GMPGIFFSYELSPLMVK  E+  S GH  T    
Sbjct: 280 HSDGKPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHVSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            I G +    ++  +L++ ++ I  K+E+G
Sbjct: 340 IIGGVFTVAGILAVVLNNSLEAIQRKLEVG 369


>gi|389612123|dbj|BAM19583.1| ptx1 protein [Papilio xuthus]
          Length = 285

 Score =  298 bits (762), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 151/293 (51%), Positives = 187/293 (63%), Gaps = 25/293 (8%)

Query: 99  VEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET 158
           ++HNI+KRRLDLDG PI+EP+KE +      K  T    T T       CGSCYGA    
Sbjct: 1   MDHNIHKRRLDLDGNPIEEPKKEEIAISSTVKQNTSELATVT-------CGSCYGAAFND 53

Query: 159 RKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG 218
            +CCNTC +VKEAYR ++WALP+L TIVQCK++ S EK      EGCQIYGY+EVNRV G
Sbjct: 54  SQCCNTCEDVKEAYRIRRWALPDLATIVQCKDDESLEKANLALKEGCQIYGYMEVNRVGG 113

Query: 219 SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
           SFHIAPG S++INHVHVHD+QPY+S+AFNTTH I+HLSFG  ++  +    PLDG    A
Sbjct: 114 SFHIAPGKSFTINHVHVHDVQPYSSSAFNTTHXIQHLSFGSDIKSAN--TAPLDGVKGIA 171

Query: 279 EEGASMFNYYIKIIPTIYERLDGSKLG----------------GGDGGMPGIFFSYELSP 322
           +EGA MF YYIKI PT+Y +LD + L                   + GMPG FFSYELSP
Sbjct: 172 QEGAVMFQYYIKIGPTMYVKLDKTVLHTNQFSVTRHQKSVSNINSESGMPGAFFSYELSP 231

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           LMVK TEK +S+GH  T I   I G +    ++D LL+  +       + GK 
Sbjct: 232 LMVKYTEKERSIGHFATNICAIIGGVFTVAGILDTLLYHSLNAFHNKIVLGKA 284


>gi|335774962|gb|AEH58414.1| endoplasmic reticulum-golgi intermediat compartment protein 3-like
           protein [Equus caballus]
          Length = 354

 Score =  296 bits (759), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 149/344 (43%), Positives = 210/344 (61%), Gaps = 27/344 (7%)

Query: 49  YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           Y       EL+VD SRG KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 20  YLTTEVHPELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRL 79

Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
           D DG P+  E ++  +  V+ K    ++        DP++C SCYGAETE  KCCNTC +
Sbjct: 80  DKDGIPVSSEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCED 132

Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
           V+EAYR + WA    DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S
Sbjct: 133 VREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKS 192

Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
           +  +HVHVHD+Q +     N TH+IRHLSFG   +D      PLD T   A + + MF Y
Sbjct: 193 FQQSHVHVHDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQY 249

Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
           ++K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK 
Sbjct: 250 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKH 309

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           +S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 310 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 353


>gi|302790744|ref|XP_002977139.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
 gi|302820940|ref|XP_002992135.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
 gi|300140061|gb|EFJ06790.1| hypothetical protein SELMODRAFT_162191 [Selaginella moellendorffii]
 gi|300155115|gb|EFJ21748.1| hypothetical protein SELMODRAFT_271242 [Selaginella moellendorffii]
          Length = 386

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 158/390 (40%), Positives = 223/390 (57%), Gaps = 25/390 (6%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   ++L+ LDA+ K  EDFH +T+ GG +T+V  +F++ L   ++  +    TT EL V
Sbjct: 1   MQMLKKLQQLDAYPKINEDFHSRTLSGGVITVVSSIFMAILFITELKLFLLPGTTSELLV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG  L I+ DI  P ++C  ++LDA+D SGEQHL V+HNI+K+RLD  GK +Q P +
Sbjct: 61  DTSRGETLQINFDITFPALACSVISLDAMDVSGEQHLDVKHNIFKKRLDPSGKVVQPPVQ 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           E +   K  K   ++G      E    CGSC+GAE    +CCN+C EV+EAYR + WA+ 
Sbjct: 121 EDIGGPKIDKPLQKHGGRLEHNE--TYCGSCFGAEQSDDECCNSCEEVREAYRKRGWAIH 178

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
             D I QCK E    K+K    EGC IYG LEVN+V+G+FH APG S+S  HVHVHD+Q 
Sbjct: 179 NADLIDQCKREGWLTKIKEEEGEGCNIYGSLEVNKVAGNFHFAPGKSFSQQHVHVHDVQS 238

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
                FN +H+I  LSFG +         PLD      +  ++M+ Y+IK++PT Y  + 
Sbjct: 239 LHKEKFNVSHYINELSFGARFPG---VVNPLDKEKRIQKFPSAMYQYFIKVVPTAYTDMT 295

Query: 301 GSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           G K+               G     +PG+FF YELSP+ V  TE+  S  H  T +   I
Sbjct: 296 GHKIVTNQFSVTDHFKAVEGLNGRSLPGVFFFYELSPIKVLFTERKTSFLHFLTNVCAII 355

Query: 346 SGTYITFMLVDALL---HSCVKKISKVEIG 372
            G +    ++D+ +   H  +KK  K+EIG
Sbjct: 356 GGVFTVSGIIDSFIYHGHRAIKK--KMEIG 383


>gi|195021391|ref|XP_001985385.1| GH17030 [Drosophila grimshawi]
 gi|193898867|gb|EDV97733.1| GH17030 [Drosophila grimshawi]
          Length = 372

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 170/390 (43%), Positives = 241/390 (61%), Gaps = 39/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS L+ ++  +Y + + TEELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISSSIISLLVLLEFLNYMKPTMTEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+++K RLDL G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDVFKHRLDLQGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K +T              CGSCYGAE  +  CCNTC +V +AYR +KW +
Sbjct: 121 KEIVAVSPPNKNST--------------CGSCYGAEHNSTHCCNTCEDVLDAYRIRKWNM 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIPTIYER 298
            +T+     +H I HLSFG K++    +  PLDG     EE  S MFNYY+KI+PT+YER
Sbjct: 225 -FTNVKL--SHTINHLSFGEKIE--FAKTHPLDGIRVDVEESKSEMFNYYLKIVPTLYER 279

Query: 299 -LDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             DG                L   + GMPGIFFSYELSPLMVK  E+  S GH  T    
Sbjct: 280 HSDGEPIYTNQFSVTRHRKDLTDRERGMPGIFFSYELSPLMVKYAERHNSFGHFATNCCS 339

Query: 344 NISGTYITFMLVDALLHSCVKKIS-KVEIG 372
            + G +    ++  LL++  + I  K+E+G
Sbjct: 340 IVGGVFTVAGILAVLLNNSWEAIQRKLEVG 369


>gi|351702542|gb|EHB05461.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Heterocephalus glaber]
          Length = 378

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 161/386 (41%), Positives = 228/386 (59%), Gaps = 30/386 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +     E  DP++C SCYGAE+E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFD----PESLDPDRCESCYGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVH      +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH-----GWCCLQ 234

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L 
Sbjct: 235 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLR 291

Query: 306 G----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                            GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +
Sbjct: 292 TNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMF 351

Query: 350 ITFMLVDALLHSCVKKISKVEIGGKT 375
               L+D+L++   + I K    GKT
Sbjct: 352 TVAGLIDSLIYHSARAIQKKIDLGKT 377


>gi|326434226|gb|EGD79796.1| intermediate compartment protein 3 [Salpingoeca sp. ATCC 50818]
          Length = 396

 Score =  296 bits (757), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 155/389 (39%), Positives = 226/389 (58%), Gaps = 23/389 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF  KT  G A++IV  L I  L   ++  Y       ELFVD+SR 
Sbjct: 9   KLRNLDAYPKTLEDFRVKTFSGAAISIVAILLIVVLFTSELVYYLSTEVEPELFVDTSRD 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            K+ I++D+    ++C +L LD +D SGE  L VEH+I+K+RL   G PI E  +EV + 
Sbjct: 69  EKMRINVDVTFHKMACAFLHLDIMDVSGENELDVEHDIFKQRLTETGTPIYEEPEEVDDL 128

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +            E  DPN+C SCYGAE+E  KCCNTC  V+EAYR K WAL ++  I
Sbjct: 129 GDESDSAVGALKMMKEGLDPNRCESCYGAESEQNKCCNTCEAVREAYRRKGWALTDIQGI 188

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  TEKLK    EGC+IYG+LEVN+V+G+FHIAPG S+  + +H HD+  +   A
Sbjct: 189 EQCEREGWTEKLKAQAKEGCRIYGHLEVNKVAGNFHIAPGKSFQQHSIHFHDLNSFGREA 248

Query: 246 ---FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE-GASMFNYYIKIIPTIYERLDG 301
              FN +H I HLSFGI+         PLDG    A++ GA+M+ YY+KI+PT Y +  G
Sbjct: 249 LGKFNMSHTINHLSFGIEYPG---VVNPLDGHSETADKLGATMYQYYVKIVPTRYRKARG 305

Query: 302 SKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
            +L                  G  G+PG+F  +E+SP++V+++E++ S  H  T ++  I
Sbjct: 306 QELNTNQYSVTMHQRHIDHKAGQTGLPGMFVMFEISPILVQLSERTHSFFHFLTGVLAII 365

Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGK 374
            G +    ++D+ ++  ++ + K +  GK
Sbjct: 366 GGIFSVAGMIDSFVYHGLRSLKKKQELGK 394


>gi|291231388|ref|XP_002735646.1| PREDICTED: serologically defined breast cancer antigen 84-like,
           partial [Saccoglossus kowalevskii]
          Length = 358

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 147/366 (40%), Positives = 216/366 (59%), Gaps = 29/366 (7%)

Query: 31  TIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVD 90
           TI+  + +  L   ++  Y     T EL+VD++RG K+ I+LDI  PT+ C YL++DA+D
Sbjct: 1   TIISGILMFILFISELNYYLTKEVTPELYVDTTRGEKMRINLDITFPTLPCGYLSIDAMD 60

Query: 91  SSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGS 150
            +GEQ L V+HNI K R+D +GKP+  P+KE +        + E         DP++C S
Sbjct: 61  VAGEQQLDVDHNIMKSRIDKNGKPVATPEKEDIG-----DKSEEAKDFDVNKLDPDRCES 115

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
           CYGAE++  KCCNTC +V+EAYR K WA    D I QC  E  ++KLK+   EGCQ+YG+
Sbjct: 116 CYGAESKDLKCCNTCEDVREAYRRKGWAFNNADGIAQCSREGWSDKLKSQSGEGCQVYGH 175

Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKP 270
           LEVN+V+G+FH APG S+  +HVHVHD+Q ++   FN +H I HLSFG K         P
Sbjct: 176 LEVNKVAGNFHFAPGKSFQQHHVHVHDLQAFSGEKFNLSHRINHLSFGHKYPG---MENP 232

Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------------GGGDGG 310
           LD +   +++ + M+ Y++KI+PT Y +L+G+                        G+ G
Sbjct: 233 LDNSKVTSQKASIMYQYFVKIVPTTYTKLNGATTRSNQYSVTKHEKVVSTSLASAAGEHG 292

Query: 311 MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKV 369
           +PG+F  YE +PLMVK TEK +S  H  T +   I G +    L+D++++   K I  K+
Sbjct: 293 LPGVFILYEFAPLMVKYTEKHRSFMHFMTGVCAIIGGVFTVAGLIDSMIYHSSKAIKKKI 352

Query: 370 EIGGKT 375
           ++G  T
Sbjct: 353 DLGKAT 358


>gi|168004517|ref|XP_001754958.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694062|gb|EDQ80412.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 158/390 (40%), Positives = 225/390 (57%), Gaps = 26/390 (6%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +LK LDA  K  EDF+ +T+ GG +T+V  +F+  L   +   Y    T  +L V
Sbjct: 1   MAIFNKLKQLDAHPKISEDFYSRTLSGGVITLVSSIFMFLLFVTEFRIYLSAQTQNQLVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG  L I+LDI  P ++C  ++LDA+D SGE HL V HNIYK+RLD+ GK +  P+ 
Sbjct: 61  DTSRGETLQINLDITFPALACSVVSLDAMDISGELHLDVRHNIYKKRLDVHGKAVDAPKP 120

Query: 121 EVVNAVKKKKVTTENGTTTTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           + +NA K +K   ++G     LED    CGSC+GAE+   +CCN+C EV+EAYR K WAL
Sbjct: 121 DAINAPKVQKPLQKHGG---RLEDHETYCGSCFGAESSDDQCCNSCEEVREAYRKKGWAL 177

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
              D I QC  E   E++K    EGC IYG LEVN+V+G+F IAPG S+  + +H+ D+ 
Sbjct: 178 TNTDLIDQCHREGFIERIKEEAGEGCNIYGKLEVNKVAGNFQIAPGKSFQQSAMHLLDLM 237

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            + + +FN +H I  LSFG           PLD   +  ++   MF Y+IK++PT+Y  +
Sbjct: 238 GFVTDSFNVSHTINELSFGAYFPG---AVNPLDKVTSIQKDQNGMFQYFIKVVPTVYTDI 294

Query: 300 DGSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
            G K+             GD G   +PG+FF Y+L+P+ VK TE+  S  H  T +   I
Sbjct: 295 KGRKISTNQFSVMEHYTAGDHGPRVIPGVFFFYDLTPIKVKFTEERPSFLHFLTNVCAII 354

Query: 346 SGTYITFMLVDALL---HSCVKKISKVEIG 372
            G Y    +VD+ +   H  +KK  K+E+G
Sbjct: 355 GGIYTIAGIVDSFIYHGHRAIKK--KMELG 382


>gi|426391505|ref|XP_004062113.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Gorilla gorilla gorilla]
 gi|7959731|gb|AAF71038.1|AF116721_14 PRO0989 [Homo sapiens]
          Length = 346

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 147/343 (42%), Positives = 210/343 (61%), Gaps = 25/343 (7%)

Query: 49  YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           Y       EL+VD SRG KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 12  YLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRL 71

Query: 109 DLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV 168
           D DG P+    +   + + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V
Sbjct: 72  DKDGIPVSSEAER--HELGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDV 125

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           +EAYR + WA    DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+
Sbjct: 126 REAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSF 185

Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
             +HVHVHD+Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y+
Sbjct: 186 QQSHVHVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYF 242

Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
           +K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +
Sbjct: 243 VKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHR 302

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 303 SFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 345


>gi|168024878|ref|XP_001764962.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683771|gb|EDQ70178.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  293 bits (751), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 153/389 (39%), Positives = 225/389 (57%), Gaps = 24/389 (6%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +LK LDA+ K  EDF+ +T+ GG +T+V  +F+  L   ++  Y    T  +L V
Sbjct: 1   MAVFNKLKQLDAYPKISEDFYSRTLSGGVITLVSTVFMFVLFVTEISLYLSAQTQNQLVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG  L I+LDI  P ++C  ++LDA+D SGEQHL+V HNI+K+RLD+ GK +  P+ 
Sbjct: 61  DTSRGETLQINLDITFPALACSMVSLDAMDISGEQHLNVRHNIFKKRLDVHGKVVNAPKP 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + +NA K +K   ++G      E    CGSC+GAE+   +CCN C EV+EAYR K WAL 
Sbjct: 121 DAINAPKVQKPLQKHGGRLEHNE--TYCGSCFGAESSDDECCNNCEEVREAYRKKGWALT 178

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
             D I QC  E   E++K    EGC IYG LEVN+V+G+FH APG S+  + +H+ D+  
Sbjct: 179 NADLIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHFAPGKSFQQSAMHLLDLMG 238

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           + + +FN +H I  LSFG           PLD      ++   M+ Y+IK++PT+Y  + 
Sbjct: 239 FITDSFNVSHTINELSFGAHFPG---AVNPLDKVTNIQKDLNGMYQYFIKVVPTVYTDIK 295

Query: 301 GSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
           G K+             GD G   +PG+FF Y+LSP+ VK +E+  S  H  T +   + 
Sbjct: 296 GRKISTNQFSVTEHYTAGDHGPRFVPGVFFFYDLSPIKVKFSEERPSFLHFLTNVCAIVG 355

Query: 347 GTYITFMLVDALL---HSCVKKISKVEIG 372
           G Y    ++D+ +   H  +KK  K+E+G
Sbjct: 356 GVYSIAGIIDSFVYHGHRAIKK--KMELG 382


>gi|355563183|gb|EHH19745.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
           mulatta]
 gi|355784539|gb|EHH65390.1| Serologically defined breast cancer antigen NY-BR-84 [Macaca
           fascicularis]
          Length = 401

 Score =  293 bits (751), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 162/404 (40%), Positives = 230/404 (56%), Gaps = 43/404 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH------------- 232
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +H             
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHGTYLTGCVCRLKM 239

Query: 233 -----VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
                  VHD+Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y
Sbjct: 240 IARSLACVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 296

Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
           ++K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK 
Sbjct: 297 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKH 356

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           +S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 357 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 400


>gi|61555014|gb|AAX46646.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
          Length = 346

 Score =  292 bits (748), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 146/344 (42%), Positives = 210/344 (61%), Gaps = 27/344 (7%)

Query: 49  YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           Y       EL+VD SRG KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RL
Sbjct: 12  YLTTEVHPELYVDKSRGDKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRL 71

Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
           D DG P+  E ++  +  V+ K    ++        DP++C SCYGAE E  KCCN+C +
Sbjct: 72  DKDGFPVSSEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCED 124

Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
           V+EAYR + WA    DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S
Sbjct: 125 VREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKS 184

Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
           +  +HVHVHD+Q +     N TH+IRHLSFG   +D      PLD T   A + + MF Y
Sbjct: 185 FQQSHVHVHDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 241

Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
           ++K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK 
Sbjct: 242 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKH 301

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           +S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 302 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 345


>gi|440902508|gb|ELR53293.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3 [Bos
           grunniens mutus]
          Length = 395

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 160/399 (40%), Positives = 229/399 (57%), Gaps = 39/399 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH---------- 234
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVH          
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHGCREEVRVTG 238

Query: 235 --VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
               + Q +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++
Sbjct: 239 ARCSEAQGWCCLQINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVV 295

Query: 293 PTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H
Sbjct: 296 PTVYMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTH 355

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
             T +   I G +    L+D+L++   + I K    GKT
Sbjct: 356 FLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 394


>gi|449265747|gb|EMC76893.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3,
           partial [Columba livia]
          Length = 330

 Score =  290 bits (742), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 209/341 (61%), Gaps = 35/341 (10%)

Query: 57  ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
           EL+VD SRG KL I+LD++ P + C YL++DA+D +GEQ L VEHN++K+RLD  G    
Sbjct: 4   ELYVDKSRGDKLKINLDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKAG---- 59

Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPN-----KCGSCYGAETETRKCCNTCNEVKEA 171
                  N V  +    E G    ++ DPN     +C SCYGAE+E  +CCNTC++V+EA
Sbjct: 60  -------NRVTPEAERHELGKEEEKVFDPNSLDADRCESCYGAESEDIRCCNTCDDVREA 112

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           YR + WA    DTI QCK E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +
Sbjct: 113 YRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQS 172

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
           HVHVHD+Q +     N TH+I+HLSFG   +D      PLDGT   A++ + MF Y++K+
Sbjct: 173 HVHVHDLQSFGLDNINMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKV 229

Query: 292 IPTIYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           +PT+Y ++DG              K+     GD G+PG+F  YELSP+MVK+TEK +S  
Sbjct: 230 VPTVYMKVDGEVVRTNQFSVTRHEKIANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFT 289

Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           H  T +   + G +     +D+L++   + I K    GKT+
Sbjct: 290 HFLTGVCAIVGGIFTVAGFIDSLIYHSARAIQKKIELGKTI 330


>gi|168019656|ref|XP_001762360.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686438|gb|EDQ72827.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 380

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 158/389 (40%), Positives = 228/389 (58%), Gaps = 29/389 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F  +LK LDA+ K  EDF+ +T+ GG +T+V  +F++ L   +   Y    T  +L V
Sbjct: 1   MSFFNKLKHLDAYPKISEDFYSRTLSGGLITLVSSVFMTLLFITEFRIYLSAQTQNQLVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG  L I+LDI    ++C  ++LDA+D SGEQHL+V HNI+K+RLD+ GK I  P+ 
Sbjct: 61  DTSRGETLQINLDITFSALACSVVSLDAMDISGEQHLNVRHNIFKKRLDVHGKAIDAPKP 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + +NA K ++   ++G      E    CGSC+GA +   +CCN+C EV+EAYR K WAL 
Sbjct: 121 DAINAPKVQRPLQKHGGRLEHNE--TYCGSCFGAASSDDECCNSCEEVREAYRKKGWALI 178

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            +D I QC  E   E++K    EGC IYG LEVN+V+G+FHIAPG  +  + +H+ D+  
Sbjct: 179 NIDIIDQCHREGFIERVKEEAGEGCNIYGKLEVNKVAGNFHIAPGKLFQQSAMHLLDLLG 238

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
             S +FN +H +  LSFG        R  PLD   +  ++   M+ Y+IK++PT+Y  + 
Sbjct: 239 IRSDSFNVSHIVNELSFGAHFPG---RVNPLDKITSIQKDQNGMYQYFIKVVPTVYTDIR 295

Query: 301 GSKLG-----------GGDGG---MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
           GS++             GD G   +PG+FF Y+LSP+ VK TEK  S  H  T + C I 
Sbjct: 296 GSEIATNQFSVTEHYTAGDHGPRVVPGVFFFYDLSPIKVKFTEKRPSFLHFLTTV-CAIV 354

Query: 347 GTYITFMLVDALL---HSCVKKISKVEIG 372
           G  I    +D+ +   H  VKK  K+E+G
Sbjct: 355 GASI----IDSFIYHGHRAVKK--KMELG 377


>gi|196008679|ref|XP_002114205.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
 gi|190583224|gb|EDV23295.1| hypothetical protein TRIADDRAFT_37998 [Trichoplax adhaerens]
          Length = 369

 Score =  289 bits (739), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 149/383 (38%), Positives = 222/383 (57%), Gaps = 43/383 (11%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  EDF  +T  G  +TIV  + +  L   ++  Y  V  T ELFVD+SRG 
Sbjct: 10  LRRYDAFPKTLEDFRIRTFGGATITIVSAVIMLLLFVSEMNYYLSVEVTSELFVDTSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ I++++  P ++C  L++D +D +G Q L ++ N+ KRR+D +GKP         +AV
Sbjct: 70  KIKIYMNVTFPKMACAILSVDTMDVAGMQQLDIKQNLMKRRIDENGKPTG-------DAV 122

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +K K                KCGSCYGAE    KCCN+C +V+EAYR K WAL   + I 
Sbjct: 123 QKNK---------------TKCGSCYGAENAEMKCCNSCEDVREAYRKKGWALTSPEGIE 167

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNR-VSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           QC+ E   + LK    EGC ++GYLEVN+ V+G+FH APG S+  + VHVHD+Q + S  
Sbjct: 168 QCQEEGWAQMLKEQEKEGCNVFGYLEVNKVVAGNFHFAPGKSFQQHRVHVHDLQSFGSRK 227

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--- 302
           FNT+H I  LSFG   ++      PLDG    +++ ++M+ Y+IK++PT+Y++L G    
Sbjct: 228 FNTSHTIHKLSFG---EEFPGIINPLDGHRMSSDQDSAMYQYFIKVVPTVYKKLKGEEVK 284

Query: 303 -------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                        KL  G+ G+PG+F SYELSP++++  E+ KS  H  T +   I G +
Sbjct: 285 SNQYSVTKHLKYIKLSMGEQGLPGVFISYELSPMIIRYAERRKSFAHFLTGVCAIIGGVF 344

Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
               L+DA+++   K + K+E+G
Sbjct: 345 TVASLIDAMVYHSAKML-KIELG 366


>gi|168014180|ref|XP_001759631.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689170|gb|EDQ75543.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 382

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 147/379 (38%), Positives = 216/379 (56%), Gaps = 20/379 (5%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           ++LK LDA+ K  EDF+ +T+ GG +TI+   F+  L   ++  Y       +L VD+ R
Sbjct: 3   QKLKSLDAYPKINEDFYSRTLSGGIITIISATFMVLLFFSELKLYLAAQVANDLVVDTER 62

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  + I+LD+  P ++C  ++LDA+D SGE HL V+HNI+K+RLD++GK I+  ++E +N
Sbjct: 63  GGTIQINLDVTFPALACSVVSLDAMDISGEAHLDVKHNIFKKRLDVNGKVIEPARQESIN 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             K  K   ++G      E    CGSC+GAETE   CCN C EV+EAYR K WAL   D 
Sbjct: 123 QPKLDKPLQKHGGRLEHNE--TYCGSCFGAETEEDHCCNNCEEVREAYRKKGWALNNPDL 180

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E   +K+K+   EGC +YG LE N+V+G+FH APG S+   ++HVHD+  +   
Sbjct: 181 IDQCKREGFLQKIKDEDGEGCNVYGTLEANKVAGNFHFAPGKSFQQANMHVHDLMAFGKD 240

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
           +FN +H I  +SFG++         PLD           M+ Y+IK++PT+Y    G K+
Sbjct: 241 SFNVSHKINEISFGVRYPG---AVNPLDKLERIQTTTHGMYQYFIKVVPTVYTDTRGRKI 297

Query: 305 G---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                           G D  +PG+FF Y+LSP+ VK TEK  S  H  T +   + G +
Sbjct: 298 STNQFAVTDHFKGVGPGEDHALPGVFFFYDLSPIKVKFTEKRMSFFHFLTNVCAIVGGVF 357

Query: 350 ITFMLVDALLHSCVKKISK 368
               ++DA ++   K+I K
Sbjct: 358 SVSGIIDAFVYHGQKQIKK 376


>gi|330790779|ref|XP_003283473.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
 gi|325086583|gb|EGC39970.1| hypothetical protein DICPUDRAFT_52316 [Dictyostelium purpureum]
          Length = 383

 Score =  288 bits (736), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 156/389 (40%), Positives = 215/389 (55%), Gaps = 35/389 (8%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+   +LK  DA+ K  +DF  KT  G  V+IV  +FI +L    V  YF      ELFV
Sbjct: 1   MLMVSQLKKFDAYPKTVDDFRVKTFTGAIVSIVGGIFILWLFFSQVTLYFSTDIHHELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI--QEP 118
           D++RG KL I++DI    + C YL+LDA+D SGE    V HNI+K+RL   G+PI  Q P
Sbjct: 61  DTTRGEKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSSTGQPIIEQPP 120

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKK 176
            +E    + KK V  EN        D   CGSCYGAE   R   CCNTC EV+ AY  K 
Sbjct: 121 IRE--EEINKKIVKNEN--------DVQGCGSCYGAEDPARGIPCCNTCEEVRNAYSKKG 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W L +  T+ QC  E  T+ +     EGCQ+YG++ VN+V+G+FH APG S+  +H+HVH
Sbjct: 171 WGL-DPSTVSQCLREGFTKNIVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVH 229

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
           D+QP+    FN +H I  L+ G +       + PLD        G  MF Y+IKI+PTIY
Sbjct: 230 DLQPFKDGQFNMSHTINKLAVGNEFPG---IKNPLDEVTKTEVAGVGMFQYFIKIVPTIY 286

Query: 297 ERLDGSKL-----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           E L+G+++                 G    G+PG+FF Y+LSP+M+K++EK KS     T
Sbjct: 287 EGLNGNRIATNQYSVTEHYRLLAKKGEEPTGLPGLFFMYDLSPIMMKVSEKGKSFASFLT 346

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
            +   I G +  F + D+ ++   K + K
Sbjct: 347 NVCAIIGGVFTVFGIFDSFIYYSTKNLKK 375


>gi|395510083|ref|XP_003759313.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3, partial [Sarcophilus harrisii]
          Length = 335

 Score =  286 bits (733), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 31/341 (9%)

Query: 57  ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
           EL+VD SRG KL I++DI  P + C YL++DA+D +GEQ L VEHN+YK+RLD DG P+ 
Sbjct: 4   ELYVDKSRGDKLKINIDIFFPHMPCAYLSIDAMDVAGEQQLDVEHNLYKQRLDKDGHPVT 63

Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
              +      +++KV   +        DP +C SCYGAE+E  KCCNTC +V+EAYR + 
Sbjct: 64  TEAERHELGKEEEKVFDPSSL------DPERCESCYGAESEDSKCCNTCEDVREAYRRRG 117

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV- 235
           WA    DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV 
Sbjct: 118 WAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVH 177

Query: 236 ----HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
               HD+Q +     N TH+IR LSFG   +D      PLD T   A + + MF Y++K+
Sbjct: 178 AVEIHDLQSFGLDNINMTHYIRRLSFG---EDYPGIVNPLDDTNITAPQASMMFQYFVKV 234

Query: 292 IPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           +PT+Y +++G  L                  GD G+PG+F  YELSP+MVK+TEK +S  
Sbjct: 235 VPTVYMKVNGEVLRSNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMMVKLTEKHRSFT 294

Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           H  T +   I G +    L+D+L++   + I  K+E+G  T
Sbjct: 295 HFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIELGKTT 335


>gi|355686517|gb|AER98082.1| ERGIC and golgi 3 [Mustela putorius furo]
          Length = 304

 Score =  284 bits (727), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 140/310 (45%), Positives = 196/310 (63%), Gaps = 27/310 (8%)

Query: 57  ELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
           EL+VD SRG KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+ 
Sbjct: 4   ELYVDKSRGDKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVS 63

Query: 117 -EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
            E ++  +  V+ K    ++        DP++C SCYGAETE  KCCNTC +V+EAYR +
Sbjct: 64  SEAERHELGKVEVKVFDPDS-------LDPDRCESCYGAETEDIKCCNTCEDVREAYRRR 116

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA    DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV
Sbjct: 117 GWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHV 176

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
           HD+Q +     N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+
Sbjct: 177 HDLQSFGLDNINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTV 233

Query: 296 YERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T
Sbjct: 234 YMKVDGEVLRTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 293

Query: 340 KIMCNISGTY 349
            +   I G +
Sbjct: 294 GVCAIIGGMF 303


>gi|66801671|ref|XP_629760.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium discoideum AX4]
 gi|74851212|sp|Q54DW2.1|ERGI3_DICDI RecName: Full=Probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3
 gi|60463164|gb|EAL61357.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium discoideum AX4]
          Length = 383

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 153/389 (39%), Positives = 215/389 (55%), Gaps = 30/389 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  +DF  KT  G  V+I+  +FI +L    V  YF      ELFVD++RG
Sbjct: 5   QLKKFDAYPKTVDDFRVKTYTGAIVSIIGGVFILWLFFSQVTLYFSTDIHHELFVDTTRG 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI    + C YL+LDA+D SGE    V HNI+K+RL   G+PI E        
Sbjct: 65  EKLKINMDITFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKKRLSPTGQPIIEAPPIREEE 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKKWALPELD 183
           + KK+   +N        D   CGSCYGAE  ++   CCNTC EV+ AY  K W L +  
Sbjct: 125 INKKESVKDN-------NDVVGCGSCYGAEDPSKGIGCCNTCEEVRVAYSKKGWGL-DPS 176

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            I QC  E  T+ L     EGCQ+YG++ VN+V+G+FH APG S+  +H+HVHD+QP+  
Sbjct: 177 GIPQCIREGFTKNLVEQNGEGCQVYGFILVNKVAGNFHFAPGKSFQQHHMHVHDLQPFKD 236

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
            +FN +H I  LSFG    D    + PLD        G  MF Y++K++PTIYE L+G++
Sbjct: 237 GSFNVSHTINRLSFG---NDFPGIKNPLDDVTKTEMVGVGMFQYFVKVVPTIYEGLNGNR 293

Query: 304 L-----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
           +                 G    G+PG+FF Y+LSP+M+K++E+ KS     T +   I 
Sbjct: 294 IATNQYSVTEHYRLLAKKGEEPSGLPGLFFMYDLSPIMMKVSERGKSFASFLTNVCAIIG 353

Query: 347 GTYITFMLVDALLHSCVKKISKVEIGGKT 375
           G +  F + D+ ++   K + K    GKT
Sbjct: 354 GVFTVFGIFDSFIYYSTKNLQKKIDLGKT 382


>gi|302834369|ref|XP_002948747.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
           nagariensis]
 gi|300265938|gb|EFJ50127.1| hypothetical protein VOLCADRAFT_80399 [Volvox carteri f.
           nagariensis]
          Length = 392

 Score =  280 bits (716), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 153/389 (39%), Positives = 216/389 (55%), Gaps = 26/389 (6%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F  +LK LDA+ K  EDF  KT+ GG +TIV  + +  L   ++  Y    +  EL VD 
Sbjct: 8   FLSKLKALDAYPKINEDFFTKTMSGGIITIVASVVMVLLFLSELRLYMTTQSVHELSVDV 67

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
            RG K+ IH D+  P + C +L+LDA+D SGE HL ++H++YK+RL  +G P++E +K  
Sbjct: 68  GRGEKIQIHFDLTFPKVPCSWLSLDAMDISGELHLDLDHDVYKQRLSANGSPVKEVEKHN 127

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           V A KK  V   NGT  +       CGSCYGAE     CCNTC+EV+ AYR K WAL  +
Sbjct: 128 VEATKK--VVPVNGTENSTATP--VCGSCYGAEDRQGDCCNTCDEVRAAYRRKGWALANV 183

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D I QC ++  TE +K    EGC ++G LEVN+V+G+FH APG SY    +HVHDI P+ 
Sbjct: 184 DHIEQCAHDLYTESIKEQTGEGCHMWGMLEVNKVAGNFHFAPGRSYQQGSMHVHDIAPFG 243

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA--KAEEGASMFNYYIKIIPTIYERLD 300
            A  +  H +  LSFG         + PLD   A  K+     M+ Y++K++PT Y  +D
Sbjct: 244 DAVIDFRHTVNKLSFGAPYPG---MKNPLDNAKAGYKSAAATGMYQYFLKVVPTSYTGID 300

Query: 301 GSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
              L                GG    +PG+FF Y+LSP+ V+I E S S     T +   
Sbjct: 301 NKTLATNQFSVTENFRESSQGGAGKTLPGVFFFYDLSPIKVRIVEHSSSFLSFLTSVCAI 360

Query: 345 ISGTYITFMLVDALLHSCVKKI-SKVEIG 372
           + G +    +VDA +++  + I  K+E+G
Sbjct: 361 VGGVFTVSGIVDAFIYTSTRLIRKKMELG 389


>gi|340373749|ref|XP_003385402.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Amphimedon queenslandica]
          Length = 386

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 154/388 (39%), Positives = 221/388 (56%), Gaps = 32/388 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK LDA++K  EDF  KT  G  +T+V  + I  L   ++  +      +EL+VD+SRG
Sbjct: 7   RLKNLDAYSKTLEDFKIKTFSGATITLVSSIIILLLFLSELLYFLSTDVKQELYVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI+     C YL++D +D SGE  L VEH +YK+RL LDG        EV+N 
Sbjct: 67  EKLQINVDIIFHRAPCLYLSIDVMDVSGEHQLDVEHTMYKQRLTLDG--------EVINE 118

Query: 126 VKKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
              K V   + T   +    NK CGSCYGAET    CCNTC +V+EAYR K WA  +  +
Sbjct: 119 SPTKSVLARDETQDGKAGAANKTCGSCYGAETPELSCCNTCEQVREAYRKKGWAFSDPSS 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  T ++K    EGC++YG ++V++V+G+FH APG S+  + VHVHD+QP+   
Sbjct: 179 IEQCEKEGWTTQIKEQMNEGCRVYGLIDVSKVAGNFHFAPGKSFQQHSVHVHDLQPFGVK 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYERLDG 301
            FN +H +  LSFG   Q+      PLDG  A   +   G  M+ Y+IK++PT+Y RL+ 
Sbjct: 239 HFNMSHTVLKLSFG---QEYPGIINPLDGHKAFDVETTHGGIMYQYFIKVVPTLYRRLNN 295

Query: 302 SKLG----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
             +G                 G+ G+PG+FF Y++SP++V +TE   SL H  T +   +
Sbjct: 296 ETMGTNQFAVTKHQRPVRSASGEHGLPGVFFIYDISPILVYLTEYRHSLTHFLTSVCAIV 355

Query: 346 SGTYITFMLVDALL-HSCVKKISKVEIG 372
            G +    ++D LL HS      K+E+G
Sbjct: 356 GGVFTVAGMIDKLLYHSGRVLKKKMELG 383


>gi|320167013|gb|EFW43912.1| Ergic3 protein [Capsaspora owczarzaki ATCC 30864]
          Length = 392

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 147/392 (37%), Positives = 224/392 (57%), Gaps = 30/392 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLK LDA+ K  ED   KT  G  V+IVC L ++ L   ++  +    T  EL VD++R 
Sbjct: 9   RLKQLDAYAKTTEDVRIKTYGGAIVSIVCALIMAALFVSELNYFLTTETHHELLVDTTRA 68

Query: 66  --SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
              KL I++++  P + C Y+++D +D +GE  L V H + K RL   G+ ++EP    V
Sbjct: 69  GEQKLRININVTFPRLPCAYMSIDVMDVAGEHQLDVLHTLVKTRLSASGEVVREPTP--V 126

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
            A+ ++  +  +     +L D +KCG CYGA+TE R CCN+C EV+ AYR K W + + D
Sbjct: 127 EALGQQPPS--DAAERRDL-DNSKCGDCYGAQTEKRPCCNSCEEVQAAYREKGWGMMDPD 183

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
           +I QC+ E  +E++++   EGC++ G++ VN+V+G+FH APG S    HVHVHD+Q + +
Sbjct: 184 SIEQCRQEGFSERMRSIANEGCKVQGFMYVNKVAGNFHFAPGKSSQHQHVHVHDLQQFKT 243

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE---GASMFNYYIKIIPTIYERLD 300
             F+ TH I  LSFG +      +  PLD       E   G++MF Y+IK++PT Y +L+
Sbjct: 244 TTFDMTHTIHLLSFGTEYPG---QVNPLDAVSKVPPENTPGSAMFQYFIKVVPTEYVKLN 300

Query: 301 GS----------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           G                     G+ G+PG+FF YE SP++VKITE+ KS  H  T +   
Sbjct: 301 GETEQTSQFSATSHVKMINHAAGENGLPGVFFMYEPSPMLVKITERRKSFMHFLTGVCAI 360

Query: 345 ISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
           + G +    LVDA ++   + I  K+E+G +T
Sbjct: 361 VGGVFTVAGLVDATIYHSYRSIKKKMELGKQT 392


>gi|56753075|gb|AAW24747.1| SJCHGC09363 protein [Schistosoma japonicum]
 gi|226486460|emb|CAX74359.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
 gi|226486464|emb|CAX74361.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
          Length = 379

 Score =  275 bits (703), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 148/379 (39%), Positives = 206/379 (54%), Gaps = 33/379 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF KP +DF  KT+ G  V+I+    I  L   +   + +    +E+ VD +RG 
Sbjct: 8   LRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEIIVDINRGE 67

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ I+LDI +  I C +L LD +D++G Q L+V H +YK  + + G P+    +  VN  
Sbjct: 68  KMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRHTVN-- 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                  ++  TTT   DPN CGSCYGA++ TRKCCNTC EV+ AY   +W         
Sbjct: 126 ------DDSALTTTR--DPNYCGSCYGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFE 177

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+NE      +N   EGC+I+G L VNRV G FHIAPG SY+ NH HVH I+      F
Sbjct: 178 QCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQF 237

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------ 300
           N +H I  L FG        +   LDGT    ++ + MFNYY+K++PT+Y  +       
Sbjct: 238 NVSHSITELRFGDAYPG---QINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVSNNESTL 294

Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                       GS L G   G+PG+FF+YE++PL+VKITE+ KS  H  T     I G 
Sbjct: 295 ITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTCAIIGGV 354

Query: 349 YITFMLVDALLH--SCVKK 365
           +    L+DA ++  SCV +
Sbjct: 355 FTVASLLDAFIYQSSCVLR 373


>gi|226486462|emb|CAX74360.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Schistosoma japonicum]
          Length = 379

 Score =  275 bits (703), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 148/379 (39%), Positives = 206/379 (54%), Gaps = 33/379 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF KP +DF  KT+ G  V+I+    I  L   +   + +    +E+ VD +RG 
Sbjct: 8   LRNFDAFAKPLKDFRIKTMSGAMVSIISSFIIGILFTSEFISFMRTQNKQEIIVDINRGE 67

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ I+LDI +  I C +L LD +D++G Q L+V H +YK  + + G P+    +  VN  
Sbjct: 68  KMSIYLDITINFIPCAFLRLDTMDTTGAQQLNVMHEVYKTSVSISGNPLSNSVRHTVN-- 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                  ++  TTT   DPN CGSCYGA++ TRKCCNTC EV+ AY   +W         
Sbjct: 126 ------DDSALTTTR--DPNYCGSCYGADSPTRKCCNTCEEVQMAYHEMQWVFGNASEFE 177

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+NE      +N   EGC+I+G L VNRV G FHIAPG SY+ NH HVH I+      F
Sbjct: 178 QCRNENWDGMKRNIGNEGCRIHGSLTVNRVGGGFHIAPGHSYTENHAHVHSIRSLGHVQF 237

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------ 300
           N +H I  L FG        +   LDGT    ++ + MFNYY+K++PT+Y  +       
Sbjct: 238 NVSHSITELRFGDAYPG---QINSLDGTKMTVDKPSQMFNYYLKLVPTMYTSVSNNESTL 294

Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                       GS L G   G+PG+FF+YE++PL+VKITE+ KS  H  T     I G 
Sbjct: 295 ITNQYSATWHSRGSPLSGDGQGLPGVFFNYEIAPLLVKITEERKSFVHFLTNTCAIIGGV 354

Query: 349 YITFMLVDALLH--SCVKK 365
           +    L+DA ++  SCV +
Sbjct: 355 FTVASLLDAFIYQSSCVLR 373


>gi|357489473|ref|XP_003615024.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355516359|gb|AES97982.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 386

 Score =  270 bits (690), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 142/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +TIV  + +  L   ++  Y   +T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGLITIVSSILMLLLFFSELRLYLHAATETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P ++C  ++LDA+D SGEQHL V H+I K+R+D  G  I+  Q  + + 
Sbjct: 67  ETLRINFDVTFPALACSIVSLDAMDISGEQHLDVRHDIIKKRIDSHGNVIETRQDGIGSP 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +K +    G       +   CGSCYGAE    +CCN+C EV+EAYR K WAL   D+I
Sbjct: 127 NIEKPLQRHGGRLE---HNETYCGSCYGAEASDEECCNSCEEVREAYRKKGWALSSPDSI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   E++K    EGC +YG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLERIKEEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKES 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +HHI  ++FG           PLD      E  + M+ Y+IK++PT+Y  + G+ + 
Sbjct: 244 FNLSHHINRIAFGDYFPG---VVNPLDRVHWTQETPSGMYQYFIKVVPTMYTDVSGNTIQ 300

Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRTADVGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGIFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   K I  K+E+G
Sbjct: 361 VSGILDSFIYHGQKAIKKKMELG 383


>gi|224082148|ref|XP_002306582.1| predicted protein [Populus trichocarpa]
 gi|222856031|gb|EEE93578.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 148/383 (38%), Positives = 215/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLASSVVMFLLFFSELRLYLHAVTETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V+H+I K+RLD  G  I E +++ + A
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDFHGNVI-EARQDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K    +G      E    CGSCYGAE     CCN+C +V+EAYR K WA+   D +
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGAEASDEDCCNSCEDVREAYRKKGWAVTNPDLM 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +K+K+   EGC IYG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQKIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN TH I  L+FG           PLDG     E  + M+ Y+IK++PT+Y  + G  + 
Sbjct: 244 FNITHKINRLTFGEYFPG---VVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300

Query: 306 -----------GGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      G D G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRGTDIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D  ++   K I  K+EIG
Sbjct: 361 VSGILDTFIYHGQKAIKKKMEIG 383


>gi|225459342|ref|XP_002285801.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|302141938|emb|CBI19141.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  270 bits (689), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 214/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   +F+  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLASSIFMLLLFISELRLYLHAVTETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V H+I K+R+D  G  I E +++ + +
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVRHDIIKKRIDAHGSVI-EARQDGIGS 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G      E    CGSCYGAE     CCN C EV+EAYR K WA+   D I
Sbjct: 126 PKIEKPLQKHGGRLEHNE--TYCGSCYGAEASDDDCCNNCEEVREAYRKKGWAMSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  +++HVHD+  +   +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNIHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H I  L+FG           PLDG        + M+ Y+IK++PT+Y  + G  + 
Sbjct: 244 FNISHKINRLAFGDYFPG---VVNPLDGVQWIQATPSGMYQYFIKVVPTVYTHVSGHTIS 300

Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 TNQFSVTEHFRNAELGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   K I  K+EIG
Sbjct: 361 VSGILDSFIYHSQKAIKKKIEIG 383


>gi|255545672|ref|XP_002513896.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223546982|gb|EEF48479.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 386

 Score =  269 bits (687), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 146/383 (38%), Positives = 214/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFISELRLYIHAVTETKLAVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V+H+I K+RLD  G  I E +++ + A
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVI-EARQDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +     +G      E    CGSCYGAE     CCN+C +V+EAYR K WAL   D I
Sbjct: 126 PKIENPLQRHGGRLEHNE--TYCGSCYGAEASDEDCCNSCEDVREAYRKKGWALSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  ++VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           FN +H I  L+FG           PLDG     E  + M+ Y+IK++PT+Y  + G    
Sbjct: 244 FNISHKINRLAFGDYFPG---VVNPLDGVHWTQETPSGMYQYFIKVVPTVYTDVSGYTIQ 300

Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRSAEAGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   K I  K+EIG
Sbjct: 361 VSGILDSFIYHGQKAIKKKMEIG 383


>gi|356552872|ref|XP_003544786.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 386

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 145/383 (37%), Positives = 216/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SR 
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFFSELRLYLHAVTETKLVVDTSRA 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V+H+I K+RLD  G  I E ++E + A
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVI-ETRQEGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K    +G      E    CGSCYGAE     CCN+C +V+EAYR K WAL   D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGAEESDDDCCNSCEDVREAYRKKGWALSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC +YG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +HHI  L+FG           PLD      E  + M+ Y+IK++PT+Y  + G  + 
Sbjct: 244 FNLSHHINRLAFGEYFPG---VVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300

Query: 306 G-----------GDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                       GD G    +PG+FF Y+LSP+ V  TE++ S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRTGDVGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+E+G
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELG 383


>gi|167535515|ref|XP_001749431.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163772059|gb|EDQ85716.1| predicted protein [Monosiga brevicollis MX1]
          Length = 394

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 147/396 (37%), Positives = 221/396 (55%), Gaps = 26/396 (6%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + LK  DA+ K  +DF  KT  G AV+I+  + +  L   ++  +      EELFV
Sbjct: 1   MAIFDNLKRFDAYPKTLDDFRVKTFSGAAVSIIAIIIMVILFSSELVYFLSTDVHEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D++R  KL I+LDI  P + C YL+LD +D SGE   +++H+++++RLD  G  I   Q+
Sbjct: 61  DTARNEKLRINLDITFPKMPCVYLSLDVMDISGENEQNIDHDVFRQRLDASGNKIYNGQE 120

Query: 121 EV--VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
           E+  +       V  +      +L DPN+C SCYGAE    +CCNTC +V+EAYR K WA
Sbjct: 121 EIDELGESHADNVADKALDGLKDL-DPNRCESCYGAEDTEGQCCNTCAQVQEAYRKKGWA 179

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
                 I QC+ E     ++    EGCQ+YG+LEVN+V+G+FHIAPG S+  +++H+HD+
Sbjct: 180 FRSGQGIAQCEREGYDAMMEAQEREGCQLYGHLEVNKVAGNFHIAPGRSFEQHNMHIHDM 239

Query: 239 QPYTS---AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE-GASMFNYYIKIIPT 294
           Q +     A FN TH I HLSFGI   D   R   LDG V    E GA M+ Y++K++PT
Sbjct: 240 QSFGREKLAKFNLTHVINHLSFGIDYPD---RVNSLDGHVEVPNEYGAIMYQYFLKVVPT 296

Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
            Y  L  +++                  G  G+PG+FF Y++SP+ +++T+ S+S  H  
Sbjct: 297 RYRFLSQTEIDTNQYSVTMHQREIRPDQGTSGLPGLFFMYDISPMKIQLTQSSRSFFHFL 356

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
           T +   I G Y    ++D  L+  ++ +   +  GK
Sbjct: 357 TGLCAIIGGVYTVAGMIDGFLYHGIRTLKAKQNMGK 392


>gi|281211641|gb|EFA85803.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Polysphondylium pallidum PN500]
          Length = 388

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 156/398 (39%), Positives = 218/398 (54%), Gaps = 45/398 (11%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           ++LK  DA+ K  +DF  KT  G  V+IV  +FI +L    +  Y    T  ELFVD++R
Sbjct: 3   QKLKSFDAYPKTVDDFRVKTYAGAIVSIVSSIFIIWLFLSQISIYMTTETHHELFVDTNR 62

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KL I++D+V   + C YL+LDA+D SGE    V HNI+KRRL   G+ I +  K   N
Sbjct: 63  AEKLKINIDVVFHHLPCAYLSLDAMDVSGEHQFDVAHNIFKRRLSPTGEFIPDAPKREDN 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEVKEAYRYKKWALPEL 182
              K KV  EN        D  +CGSC GAE  ++   CCNTC EV+ AY+   W     
Sbjct: 123 VNIKPKV-NEN--------DRPECGSCMGAENPSKGINCCNTCEEVRVAYQKMGWGFDPS 173

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           DT  QC  E  T+ +     EGCQ+YG+L VN+V+G+FH APG S+  +H+HVHD+Q + 
Sbjct: 174 DT-PQCVREGFTKNVVEQNGEGCQVYGFLLVNKVAGNFHFAPGKSFQQHHMHVHDLQSF- 231

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE----------GASMFNYYIKII 292
              FN +H I  LSFG    D    + PLDG V+K E           G+ MF YY+KI+
Sbjct: 232 KGQFNLSHTISRLSFG---NDFPGIKNPLDG-VSKTEANQYQYHNLVVGSGMFQYYVKIV 287

Query: 293 PTIYERLDG-----------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           PTIYE L+G                 +K G    G+PG+FF Y+LSP+M+K+ E+SKS  
Sbjct: 288 PTIYEGLNGNLINTNQYSVTEHYRLLAKKGEEMTGLPGLFFMYDLSPIMMKVVERSKSFA 347

Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
              T +   + G +    + D+ ++   K +  K+++G
Sbjct: 348 SFITSVCAIVGGVFTVAGIFDSFIYQTTKSLKRKIDLG 385


>gi|332248939|ref|XP_003273622.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 1 [Nomascus leucogenys]
          Length = 380

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 156/394 (39%), Positives = 219/394 (55%), Gaps = 44/394 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HD 237
            QC        L+ T  E    C +       +V+G+FH APG S+  +HVHV     HD
Sbjct: 180 EQC----PARGLQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHD 228

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           +Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y 
Sbjct: 229 LQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYM 285

Query: 298 RLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
           ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T +
Sbjct: 286 KVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGV 345

Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
              I G +    L+D+L++   + I K    GKT
Sbjct: 346 CAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 379


>gi|297846654|ref|XP_002891208.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337050|gb|EFH67467.1| hypothetical protein ARALYDRAFT_891247 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 386

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 144/383 (37%), Positives = 216/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T++  + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ DI  P ++C  L++DA+D SGE HL V+H+I KRRLD +G  I E +++ + A
Sbjct: 67  ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G      E    CGSCYGAE E   CCN+C +V+EAYR K W +   D I
Sbjct: 126 TKIEKPLQKHGGRLEHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           FN +H I  L++G           PLD      +   +M+ Y+IK++PT+Y  + G    
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300

Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++DA ++   K I  K+EIG
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIG 383


>gi|356548103|ref|XP_003542443.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 386

 Score =  264 bits (674), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SR 
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLASSILMLLLFYSELRLYLHAVTETKLVVDTSRA 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQ L V+H+I K+RLD  G  I E ++E + A
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQRLDVKHDIIKKRLDSRGNVI-ETRQEGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K    +G      E    CGSCYG+E     CCN+C +V+EAYR K WAL   D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCYGSEVSDDDCCNSCEDVREAYRKKGWALSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC +YG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRIKDEEGEGCNVYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +HHI  L+FG           PLD      E  + M+ Y+IK++PT+Y  + G  + 
Sbjct: 244 FNLSHHINRLTFGEYFPG---VVNPLDNVHWTQETPSGMYQYFIKVVPTVYTDVSGHTIQ 300

Query: 306 G-----------GDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                       GD G    +PG+FF Y+LSP+ V  TE++ S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRTGDMGRLQSLPGVFFFYDLSPIKVTFTEENVSFLHFLTNVCAIVGGIFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+E+G
Sbjct: 361 VSGILDSFIYHGQRAIKKKMELG 383


>gi|38347102|emb|CAE02574.2| OSJNBa0006M15.17 [Oryza sativa Japonica Group]
 gi|116309990|emb|CAH67017.1| H0523F07.5 [Oryza sativa Indica Group]
 gi|218194960|gb|EEC77387.1| hypothetical protein OsI_16129 [Oryza sativa Indica Group]
          Length = 386

 Score =  263 bits (673), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 139/383 (36%), Positives = 212/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+I+K+R+D+ G  I   Q + V  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ-DAVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K ++    +G      E    CGSCYGAE    +CCN+C +V+EAYR K W +   D I
Sbjct: 126 MKVEQPLQRHGGRLEHNE--TYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG+LEVN+V+G+FH APG S+   +VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
           FN +H I  LSFG +         PLDG          M+ Y+IK++PT+Y  ++     
Sbjct: 244 FNVSHKINKLSFGQRFPG---VVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIIL 300

Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+EIG
Sbjct: 361 VSGIIDSFVYHGQRAIKKKMEIG 383


>gi|238478737|ref|NP_001154394.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|12324714|gb|AAG52317.1|AC021666_6 unknown protein; 24499-21911 [Arabidopsis thaliana]
 gi|27808598|gb|AAO24579.1| At1g36050 [Arabidopsis thaliana]
 gi|110736190|dbj|BAF00066.1| hypothetical protein [Arabidopsis thaliana]
 gi|332193720|gb|AEE31841.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 215/383 (56%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T++  + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ DI  P ++C  L++DA+D SGE HL V+H+I KRRLD +G  I E +++ + A
Sbjct: 67  ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +    ++G      E    CGSCYGAE E   CCN+C +V+EAYR K W +   D I
Sbjct: 126 TKIENPLQKHGGRLGHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           FN +H I  L++G           PLD      +   +M+ Y+IK++PT+Y  + G    
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300

Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++DA ++   K I  K+EIG
Sbjct: 361 VSGIIDAFIYHGQKAIKKKMEIG 383


>gi|296481082|tpg|DAA23197.1| TPA: endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Bos taurus]
          Length = 306

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 133/300 (44%), Positives = 190/300 (63%), Gaps = 11/300 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +   
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLD 238

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 239 NINMTHYIRHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 295


>gi|194374867|dbj|BAG62548.1| unnamed protein product [Homo sapiens]
          Length = 321

 Score =  261 bits (668), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 133/296 (44%), Positives = 189/296 (63%), Gaps = 9/296 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSGAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +    
Sbjct: 180 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDN 239

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
            N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG
Sbjct: 240 INMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDG 292


>gi|240254210|ref|NP_564467.5| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|332193719|gb|AEE31840.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 489

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 140/378 (37%), Positives = 211/378 (55%), Gaps = 21/378 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T++  + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRLYLHTVTETKLIVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ DI  P ++C  L++DA+D SGE HL V+H+I KRRLD +G  I E +++ + A
Sbjct: 67  ETLRINFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +    ++G      E    CGSCYGAE E   CCN+C +V+EAYR K W +   D I
Sbjct: 126 TKIENPLQKHGGRLGHNE--TYCGSCYGAEAEEHDCCNSCEDVREAYRKKGWGVTNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  + VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRVKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFHQSGVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           FN +H I  L++G           PLD      +   +M+ Y+IK++PT+Y  + G    
Sbjct: 244 FNISHKINRLTYGDYFPG---VVNPLDKVEWSQDTPNAMYQYFIKVVPTVYTDIRGHTIQ 300

Query: 302 -----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHVKSSEAGQLQSLPGVFFFYDLSPIKVTFTEEHISFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKISK 368
              ++DA ++   K I K
Sbjct: 361 VSGIIDAFIYHGQKAIKK 378


>gi|226494692|ref|NP_001148795.1| LOC100282412 [Zea mays]
 gi|194696974|gb|ACF82571.1| unknown [Zea mays]
 gi|195622210|gb|ACG32935.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|414586929|tpg|DAA37500.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 386

 Score =  261 bits (666), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 138/383 (36%), Positives = 212/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+V    +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+++K+R+D  G  I   Q +VV  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K +     +G      E    CGSCYGA+    +CCNTC +V+EAYR K W +   D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG++EVN+V+G+FH APG S+  ++VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
           FN +H I  LSFG           PLDG          M+ Y+IK++PT+Y  ++     
Sbjct: 244 FNVSHKINRLSFGEYFPG---VVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300

Query: 301 ------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                       G+ G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+EIG
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIG 383


>gi|449465886|ref|XP_004150658.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
 gi|449518819|ref|XP_004166433.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 386

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 144/383 (37%), Positives = 214/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLSSSILMLLLFISELRLYLHAVTETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V+H+I K+RLD  G  I E + + + A
Sbjct: 67  ETLRINFDVTFPALPCSLLSLDAMDISGEQHLDVKHDIIKKRLDSHGNAI-EARPDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K    +G      E    CGSC+GAE+    CCN+C EV+EAYR K WAL   D I
Sbjct: 126 PKIEKPLQRHGGRLEHNE--TYCGSCFGAESADDDCCNSCEEVREAYRKKGWALSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  ++VHVHD+  +   +
Sbjct: 184 DQCKREGFLQRIKDEDGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSNVHVHDLLAFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H I  L+FG           PLD    K E  ++ + Y+IK++PT+Y  + G  + 
Sbjct: 244 FNISHKINRLAFGEYFPG---VVNPLDSVQWKQETPSATYQYFIKVVPTVYNSVSGYTIQ 300

Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G    +P +FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHVRTAEVGRLQSLPAVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   K I  K+EIG
Sbjct: 361 VSGILDSFIYHGQKVIKKKMEIG 383


>gi|441638772|ref|XP_004090166.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 isoform 2 [Nomascus leucogenys]
          Length = 393

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 152/404 (37%), Positives = 219/404 (54%), Gaps = 51/404 (12%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 124 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC      ++ +      C +       +V+G+FH APG S+  +HVHVH ++ +   +
Sbjct: 180 EQCPAR-GLQRTQPENERECSL-------QVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 231

Query: 246 F------------------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
           F                  N TH+I+HLSFG   +D      PLD T   A + + MF Y
Sbjct: 232 FGLDNVQLWMSSGWCCLQINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQY 288

Query: 288 YIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKS 331
           ++K++PT+Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK 
Sbjct: 289 FVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKH 348

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           +S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 349 RSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 392


>gi|198425065|ref|XP_002127888.1| PREDICTED: similar to ERGIC and golgi 3 [Ciona intestinalis]
          Length = 385

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 143/386 (37%), Positives = 208/386 (53%), Gaps = 28/386 (7%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS ++K  DA+ K  EDF  KT+ G  VT++    +  L   ++  Y       ELFVD 
Sbjct: 9   FSSKVKDFDAYPKTLEDFRIKTISGATVTLISGTIMLLLFLSELKYYLTTEVNSELFVDM 68

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG+KL I++++  P + C++L+LD +D SG++ + V+H + K+ L+ DG  + E   E 
Sbjct: 69  SRGNKLSINMNVTFPLVPCEFLSLDMIDVSGQRDIDVQHTLVKQPLNSDGSWVAE-AAEK 127

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           V+ V  K V        TE    + CGSC+GAET+   CCNTC+++KEAYR K WA P  
Sbjct: 128 VDLVGTKPV-----LNATEPPPADYCGSCFGAETKDMTCCNTCSDIKEAYRRKGWAFPRD 182

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
            +I  C  E   +  K     GC ++G+LEVNRV+G+FHI+PG SY + H+HVHD+    
Sbjct: 183 GSITPCIGE---DDDKEPVGSGCYLHGHLEVNRVAGNFHISPGKSYEVGHMHVHDMARMG 239

Query: 243 S-AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
                N +H   HLSFG        +  PLD     A E +  F YY+KI+PT YE+L G
Sbjct: 240 KYKESNVSHVFNHLSFGSTYPG---QVHPLDNLEVIASESSVAFQYYVKIVPTTYEKLSG 296

Query: 302 SKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                                  +PG+F SYELSP+MV+  E+ +S  H  T +   I G
Sbjct: 297 DTFHTNQFSVTRHQKRNKDSRESLPGMFVSYELSPMMVRYVERRRSFVHFLTSVCAIIGG 356

Query: 348 TYITFMLVDALLHSCVKKIS-KVEIG 372
            +    L D+ ++   K +  K+E+G
Sbjct: 357 IFTVAGLFDSFIYHGSKALQKKIELG 382


>gi|242076030|ref|XP_002447951.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
 gi|241939134|gb|EES12279.1| hypothetical protein SORBIDRAFT_06g018670 [Sorghum bicolor]
          Length = 386

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 211/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGVITLASSVIMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+++K+R+D  G  I   Q + V  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DAVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K +     +G      E    CGSCYGA+    +CCN+C +V+EAYR K W +   D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDGQCCNSCEDVREAYRKKGWGVSNPDLL 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG++EVN+V+G+FH APG S+  ++VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
           FN +H I  LSFG           PLDG          M+ Y+IK++PT+Y  ++     
Sbjct: 244 FNVSHKINRLSFGEYFPG---VVNPLDGASWVQHSSYGMYQYFIKVVPTVYTDINEHIIL 300

Query: 301 ------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                       G+ G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+EIG
Sbjct: 361 VSGIIDSFVYHSQRAIKKKMEIG 383


>gi|357163897|ref|XP_003579883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 386

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 134/383 (34%), Positives = 211/383 (55%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+     +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRNLDAYPKVNEDFYSRTLSGGVITLASSFVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ DI  P + C  +++D +D SG++HL V+H+++K+R+D +G  I   Q + V  
Sbjct: 67  EKLRINFDITFPALQCSIISIDVMDISGQEHLDVKHDVFKQRIDANGNVIATKQ-DAVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K +K    +G      E    CGSCYGAE    +CCN+C +V+EAYR K W +   D+I
Sbjct: 126 MKVEKPLQMHGGRLEHNE--TYCGSCYGAEEPGEQCCNSCEDVREAYRKKGWGVSNPDSI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG++E+N+V+G+FH APG S+  ++VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQTIKDEEGEGCNIYGFVEINKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H I  LSFG           PLDG          M+ Y++K++PT+Y  ++   + 
Sbjct: 244 FNVSHKINKLSFGEPFPG---VVNPLDGAHWFQHSPYGMYQYFVKVVPTVYSHINEQIIL 300

Query: 306 GGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                               +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 301 SNQFSVTEHARSSESVRMQALPGVFFFYDLSPIKVTFTERHVSFLHFLTNVCAIVGGVFT 360

Query: 351 TFMLVDALLHSCVKKIS-KVEIG 372
              ++D+ ++   + I+ K EIG
Sbjct: 361 VSGIIDSFVYHGQRAITKKREIG 383


>gi|443732120|gb|ELU16969.1| hypothetical protein CAPTEDRAFT_192533 [Capitella teleta]
          Length = 304

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 135/303 (44%), Positives = 186/303 (61%), Gaps = 29/303 (9%)

Query: 49  YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           Y       ELFVD++RG KL I++D+  PT+ C +L LDA+D SGEQ + V H+I+K+RL
Sbjct: 12  YLTTEVHPELFVDTARGQKLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRL 71

Query: 109 DLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE 167
           DLDG  ++ EP KE     +  +    N   ++ L       SCYGAE+E  KCCNTCNE
Sbjct: 72  DLDGIEVKAEPSKEG----QSSESCALNHALSSFLFSRF---SCYGAESEAHKCCNTCNE 124

Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
           V+EAYR K WA  +   I QC  E    +L+    EGC+IYG+LEVN+V+G+FH+APG S
Sbjct: 125 VREAYRQKGWAFVDAQNIEQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRS 184

Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFN 286
           +S +H H+HD+Q      FN +H I+HLSFG    D   +  PLD +    E+    MF+
Sbjct: 185 FSQHHAHIHDMQALQGMKFNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQADFVMFS 241

Query: 287 YYIKIIPTIYERLDGS--------------KLGG---GDGGMPGIFFSYELSPLMVKITE 329
           YY+K++PT Y R +G               K+GG   G+ G+PG+F +YELSP+MVK TE
Sbjct: 242 YYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVKYTE 301

Query: 330 KSK 332
           K++
Sbjct: 302 KNR 304


>gi|224032113|gb|ACN35132.1| unknown [Zea mays]
 gi|414586931|tpg|DAA37502.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 391

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 138/388 (35%), Positives = 212/388 (54%), Gaps = 27/388 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+V    +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+++K+R+D  G  I   Q +VV  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K +     +G      E    CGSCYGA+    +CCNTC +V+EAYR K W +   D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183

Query: 186 VQ-----CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            Q     CK E   + +K+   EGC IYG++EVN+V+G+FH APG S+  ++VHVHD+ P
Sbjct: 184 DQVEPSDCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLP 243

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +   +FN +H I  LSFG           PLDG          M+ Y+IK++PT+Y  ++
Sbjct: 244 FQKDSFNVSHKINRLSFGEYFPG---VVNPLDGANWVQHSSYGMYQYFIKVVPTVYTDIN 300

Query: 301 -----------GSKLGGGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
                            G+ G    +PG+FF Y+LSP+ V  TE+  S  H  T +   +
Sbjct: 301 EHIILSNQFSVTEHFRSGESGRMQALPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIV 360

Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
            G +    ++D+ ++   + I  K+EIG
Sbjct: 361 GGVFTVSGIIDSFVYHSQRAIKKKMEIG 388


>gi|328868763|gb|EGG17141.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Dictyostelium fasciculatum]
          Length = 335

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 133/345 (38%), Positives = 200/345 (57%), Gaps = 37/345 (10%)

Query: 54  TTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
           T  ELFVD++RG KL I++D+V   + C +L+LDA+D SG+    V HNI+K+RL   G 
Sbjct: 5   THHELFVDTTRGEKLRINMDVVFHHLPCAFLSLDAMDVSGDHQFDVAHNIFKKRLSPTGM 64

Query: 114 PIQE--PQKE-VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR--KCCNTCNEV 168
           PI +  PQ+E  +N  K+     EN        D   CGSCYGAE  +R   CC+TC EV
Sbjct: 65  PIADASPQREDTIN--KRVPAGNEN--------DKVDCGSCYGAEDPSRGISCCSTCEEV 114

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           + AY+ K W++ E   I QC  E  T+ +     EGCQ+YG++ VN+V+G+FH APG S+
Sbjct: 115 RTAYQKKGWSIQEYSGIAQCVREGFTKNIVEQNGEGCQVYGFINVNKVAGNFHFAPGKSF 174

Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
             +H+HVHD+Q +   +FN +H I  LSFG    D    + PLDG       G+ MF YY
Sbjct: 175 QQHHMHVHDLQAF-KGSFNLSHSINRLSFG---NDFPGIKNPLDGVTKTEMVGSGMFQYY 230

Query: 289 IKIIPTIYERLDGSKLGGGD-----------------GGMPGIFFSYELSPLMVKITEKS 331
           IK++PT+YE L+G+++                      G+PG+FF Y+LSP+M+K++E+ 
Sbjct: 231 IKVVPTLYEGLNGNRISTNQFSVTEHYRLLAKKDEEPSGLPGLFFMYDLSPIMMKVSEQG 290

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
           KS     T +   + G +    ++D++++   K +  K+++G  T
Sbjct: 291 KSFASFLTSVCAIVGGVFTVAGILDSMIYKTTKNLKKKIDLGKNT 335


>gi|326510689|dbj|BAJ87561.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514988|dbj|BAJ99855.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326533080|dbj|BAJ93512.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 383

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 140/382 (36%), Positives = 213/382 (55%), Gaps = 23/382 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LKGLDA+ K  EDF+++T+ GG VT++    +  L   +   YF  +T  +L VD+SRG
Sbjct: 7   KLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L ++ DI  P+I C  L++D  D SGEQH  + H+I K+RLD  G  I E +KE +  
Sbjct: 67  ERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVI-ESRKEGIGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G    + E+   CG+CYGAE    +CCN+C EV+EAY+ K WAL   D I
Sbjct: 126 TKIEKPLQKHGGRLGKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   E++K    EGC ++G+L+V++V+G+FH APG  Y  ++V + ++       
Sbjct: 184 DQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELS--AEGG 241

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN TH I  LSFG +         PLDG           + Y+IK++PTIY  + G K+ 
Sbjct: 242 FNITHKINKLSFGTEFPG---AVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298

Query: 306 GG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                      DG +     PG+FF Y+ SP+ V  TE+++S  H  T +   + G +  
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQPGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIFTV 358

Query: 352 FMLVDALLHSCVKKI-SKVEIG 372
             ++D+ ++   K +  K+EIG
Sbjct: 359 AGIIDSFIYHGQKALKKKMEIG 380


>gi|449684240|ref|XP_002157414.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Hydra magnipapillata]
          Length = 311

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 133/317 (41%), Positives = 183/317 (57%), Gaps = 23/317 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M  S RLK  DA+ K  EDF  KT  G  +T +  + +  L   +   Y       ELFV
Sbjct: 1   MDISTRLKQFDAYPKTLEDFRVKTYGGALITGISSIIMFALFLSEFNYYLTTEVHPELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++R  KL I++D+  P I C YL++DA+D SGEQ   +EHNI+K+R D  G PI    +
Sbjct: 61  DTTRHQKLRINIDVYFPNIGCAYLSIDAMDVSGEQQTDLEHNIFKKRYDEKGNPIDTVEK 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE +    ++ V   N T    L+D  KC SCYGAET    CCNTC +V+ AYR K W  
Sbjct: 121 KEELGDKSEEAVKVLNST----LDDKPKCESCYGAETTDHPCCNTCEDVRVAYRKKGWGF 176

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---- 235
            + D+I QCK E+  +  +    EGCQIYGY+EV++V+G+FHIAPG S+   H+HV    
Sbjct: 177 HDPDSIEQCKREHWKDTFQQQSNEGCQIYGYIEVSKVAGNFHIAPGKSFQQQHIHVQTIR 236

Query: 236 -----------HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM 284
                      HD+QP+ +  FN +H+I  LSFG  +        PLDGT   AE G+ M
Sbjct: 237 FGKDGTISLNMHDLQPFGAKQFNVSHNIWSLSFGEPIPG---VENPLDGTNVSAEAGSLM 293

Query: 285 FNYYIKIIPTIYERLDG 301
           + Y++KI+PT+Y++L G
Sbjct: 294 YQYFVKIVPTVYKKLSG 310


>gi|444729170|gb|ELW69597.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Tupaia chinensis]
          Length = 393

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 148/420 (35%), Positives = 218/420 (51%), Gaps = 83/420 (19%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIISGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVV- 123
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSTEAERHELG 125

Query: 124 ---------NAVKKKKVTTENGTTTTELE-----------------------DPNKCGSC 151
                    N++   +  +  G  + +++                       DP++C SC
Sbjct: 126 KIEVKVFDPNSLDPDRCESCYGAESEDIKPCLEAADLELGKIEVKVFDPNSLDPDRCESC 185

Query: 152 YGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYL 211
           YGAE+E  KCCNTC +V+EAYR + WA    DTI QC+ E  ++K++    EGCQ+YG+L
Sbjct: 186 YGAESEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGFL 245

Query: 212 EVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPL 271
           EVN++                              N TH+I+HLSFG   +D      PL
Sbjct: 246 EVNKI------------------------------NMTHYIQHLSFG---EDYPGIVNPL 272

Query: 272 DGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGIF 315
           D T   A + + MF Y++K++PT+Y ++DG  L                  GD G+PG+F
Sbjct: 273 DHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVF 332

Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
             YELSP+MVK+TEK +S  H  T +   I G +    L+D+L++   + I K    GKT
Sbjct: 333 VLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 392


>gi|242088319|ref|XP_002439992.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
 gi|241945277|gb|EES18422.1| hypothetical protein SORBIDRAFT_09g023960 [Sorghum bicolor]
          Length = 384

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 140/386 (36%), Positives = 215/386 (55%), Gaps = 22/386 (5%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F +RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  +T  +L VD
Sbjct: 3   AFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI  P+I C  L++D +D SGEQH  + H+I KRRLD  G  I E +KE
Sbjct: 63  TSRGERLRVNFDITFPSIPCTLLSVDTMDISGEQHHDIRHDIEKRRLDSHGNVI-EARKE 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C EV+EAY+ K WAL  
Sbjct: 122 GIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QC  E   E++K    EGC ++G+L+V++V+G+FH APG  +  +++ V ++   
Sbjct: 180 PDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-V 238

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
               FN TH I  LSFG +         PLDG           + Y+IK++PTIY  + G
Sbjct: 239 LEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWIQPASDGTYQYFIKVVPTIYTDIRG 295

Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             +            DG +     PG+FF Y+ SP+ V  TE+++SL H  T +   + G
Sbjct: 296 HNIHSNQFSVTEHFRDGNILPKPQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGG 355

Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
            +    ++D+ ++   K +  K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381


>gi|256078219|ref|XP_002575394.1| serologically defined breast cancer antigen ny-br-84-related
           [Schistosoma mansoni]
 gi|353230384|emb|CCD76555.1| serologically defined breast cancer antigen ny-br-84-related
           [Schistosoma mansoni]
          Length = 338

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 133/344 (38%), Positives = 190/344 (55%), Gaps = 31/344 (9%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF KP +DF  KT+ G  V+I+  L I  L   ++  +      +E+ VD +RG 
Sbjct: 8   LQNFDAFAKPLKDFRIKTLSGALVSIISSLIIGILFTSELLSFTHTQNKQEIIVDVNRGE 67

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ I++DI +  I C +L+LD +D++G Q L+V H +YK  + +DG P+ +  +  VN  
Sbjct: 68  KMSIYMDITLNFIPCRFLSLDTMDTTGAQQLNVMHEVYKTSVSVDGTPVSDSVRHAVN-- 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   + +  T   DPN CGSCYGAE+ +RKCCNTC EV+ AY   +W    +    
Sbjct: 126 --------DASALTTTRDPNYCGSCYGAESPSRKCCNTCEEVQMAYNEMRWIFVNISAFE 177

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+ E   E  +    EGC+I+G L VNRV G+FHIAPG SY+ NH H H  Q      F
Sbjct: 178 QCRKENWNEIKQKIGNEGCRIHGNLTVNRVGGAFHIAPGHSYTENHAHFHSFQSLGPVQF 237

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL------- 299
           N +H I  L FG   +    +  PLDGT    +  + M  YY+K++PT+Y  L       
Sbjct: 238 NVSHSIGELRFG---ESYPGQVNPLDGTKLAVQTHSQMVIYYLKLVPTMYISLRRNESTV 294

Query: 300 -----------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
                       G+ L G   G+PG+FF+YE++PL+VKITE+ K
Sbjct: 295 ITNQYSATWHSKGTPLTGDGQGLPGVFFNYEIAPLLVKITEEKK 338


>gi|150036309|emb|CAO03349.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 325

 Score =  251 bits (640), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 136/332 (40%), Positives = 194/332 (58%), Gaps = 37/332 (11%)

Query: 12  AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
           A+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG KL I+
Sbjct: 1   AYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKIN 60

Query: 72  LDIVVPTISCDY------------LALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           +D++ P + C +            L++DA+D +GEQ L VEHN++K+RLD DG P+    
Sbjct: 61  IDVLFPHMPCAWSQYLSLIFLLPDLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEA 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           +   + + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA 
Sbjct: 121 ER--HELGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAF 174

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
              DTI QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q
Sbjct: 175 KNPDTIEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQ 234

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++
Sbjct: 235 SFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKV 291

Query: 300 DGSKLGG----------------GDGGMPGIF 315
           DG  L                  GD G+PG+F
Sbjct: 292 DGEVLRTNQFSVTRHEKVANGLLGDQGLPGVF 323


>gi|326931697|ref|XP_003211962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Meleagris gallopavo]
          Length = 411

 Score =  250 bits (639), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 131/326 (40%), Positives = 190/326 (58%), Gaps = 40/326 (12%)

Query: 77  PTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENG 136
           P +    L++DA+D +GEQ L VEHN++K+RLD  G           N V  +    E G
Sbjct: 100 PHLLVSDLSIDAMDVAGEQQLDVEHNLFKQRLDKAG-----------NRVTPEAERHELG 148

Query: 137 TTTTELEDPN-----KCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
               ++ DPN     +C SCYGAE+E  +CCNTC++V+EAYR + WA    DTI QCK E
Sbjct: 149 KEEEKVFDPNSLDADRCESCYGAESEDIRCCNTCDDVREAYRRRGWAFKNPDTIEQCKRE 208

Query: 192 YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-----HDIQPYTSAAF 246
             ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHV     HD+Q +     
Sbjct: 209 GFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQSFGLDNI 268

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----- 301
           N TH+I+HLSFG   +D      PLDGT   A++ + MF Y++K++PT+Y ++DG     
Sbjct: 269 NMTHYIKHLSFG---RDYPGIVNPLDGTDVTAQQASMMFQYFVKVVPTVYMKVDGEVVRT 325

Query: 302 --------SKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                    K+     GD G+PG+F  YELSP+MVK+TEK +   H  T +   + G + 
Sbjct: 326 NQFSVTRHEKIANGLIGDQGLPGVFVLYELSPMMVKLTEKHRPFTHFLTGVCAIVGGIFT 385

Query: 351 TFMLVDALLHSCVKKISKVEIGGKTV 376
               +D+L++   + I K    GKT+
Sbjct: 386 VAGFIDSLIYHSARAIQKKIELGKTI 411


>gi|225448309|ref|XP_002264644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|296085664|emb|CBI29463.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 137/387 (35%), Positives = 208/387 (53%), Gaps = 28/387 (7%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           +RL+ LDA+ K  EDF+ +T  GG +T++  + + +L   ++  Y    T  +L VD+SR
Sbjct: 6   QRLRNLDAYPKINEDFYSRTFSGGLITLISSIVMLFLFFSELRLYLHTVTETKLVVDTSR 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  L I+ D+  P + C  L LDA+D SGEQH  ++H+I K+R+D  G  +   Q  +  
Sbjct: 66  GGTLRINFDVTFPAVPCSVLTLDAMDISGEQHHDIKHDIVKKRIDAHGNVVAVRQDGIGG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
              +K +    G     LE   K CGSCYGAE     CCN+C+EV+EAYR K W +   D
Sbjct: 126 PQIEKPLQRHGG----RLEHNEKYCGSCYGAEVTDDDCCNSCDEVREAYRKKGWGMTNPD 181

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            I QCK E   +K+K    EGC +YG+LEVN+V+G+FH +PG  +  +++HV+D+   + 
Sbjct: 182 LIDQCKREGFVQKVKEEEGEGCNVYGFLEVNKVAGNFHFSPGKGFYQSNIHVNDLLAISK 241

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG-- 301
             +N +H I  L+FG           PLDG     +    M+ Y+IK++PTIY  + G  
Sbjct: 242 DGYNISHRINKLAFGDHFPG---VVNPLDGAQWFQDAPDGMYQYFIKVVPTIYTDIRGHT 298

Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                        S   G    +PG++F Y+LSP+ V   E+  S  H  T I   + G 
Sbjct: 299 IQSNQFSVTEHFRSAEPGRPHSLPGVYFFYDLSPIKVTSKEEHSSFLHFMTNICAIVGGI 358

Query: 349 YITFMLVDALL---HSCVKKISKVEIG 372
           +    ++D+ +   H  +KK  K+E+G
Sbjct: 359 FTVSGIIDSFVYHGHRAIKK--KMELG 383


>gi|357133202|ref|XP_003568216.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 384

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 139/385 (36%), Positives = 213/385 (55%), Gaps = 22/385 (5%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F ++LKGLDA+ K  EDF+++T+ GG VT+V  + +  L   +   Y   +T  +L VD+
Sbjct: 4   FLQKLKGLDAYPKVNEDFYKRTLSGGVVTLVSAVVMLLLFISETSSYLNSATETKLVVDT 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG +L ++ DI  P+I C  L++D  D SGEQH  + H+I K+RL+  G  I E +KE 
Sbjct: 64  SRGERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLNSHGNVI-ESRKEG 122

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +   K ++   ++G    + E    CG+CYGAE    +CCN+C+EV+EAY+ K WAL   
Sbjct: 123 IGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCDEVREAYKKKGWALTNP 180

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D I QC  E   E++K    EGC ++G+L+V++V+G+FH APG  +  ++V V ++    
Sbjct: 181 DLIDQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGRGFYESNVDVPELSSL- 239

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
              FN TH I  LSFG +         PLDG           + Y+IK++PT Y    G 
Sbjct: 240 EGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTNYTDTRGR 296

Query: 303 KLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
           K+            DG +     PG+FF Y+ SP+ V  TE++KS  H  T +   + G 
Sbjct: 297 KIDSNQFSVTEHFRDGNVHPRPQPGVFFFYDFSPIKVIFTEENKSFLHYLTNLCAIVGGI 356

Query: 349 YITFMLVDALLHSCVKKI-SKVEIG 372
           +    ++D+ ++   K +  K+EIG
Sbjct: 357 FTVSGIIDSFIYHGQKALKKKMEIG 381


>gi|449449715|ref|XP_004142610.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 385

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 138/385 (35%), Positives = 215/385 (55%), Gaps = 24/385 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++ LDA+ K  EDF+ +T+ GG +TI   + +  L   ++  Y   +T  +L VD+SRG
Sbjct: 7   KIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLIVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+L A+D SGEQHL V+H+I K+R+D  G  I + + + + +
Sbjct: 67  EHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVI-DSRPDGIGS 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + ++   ++G    + E    CGSCYGA  E   CCN+C +V+EAY  K WAL   D I
Sbjct: 126 TEIERPLQKHGGRLKQNE--TYCGSCYGASGE--DCCNSCQDVREAYHRKGWALSHPDLI 181

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD-IQPYTSA 244
            QCK E   +++KN   EGC IYG+LEVN+V+G+FH APG  + +++  +H+ +  +   
Sbjct: 182 DQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWD 241

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
           AFN +H I  L+FG    D      PLDG        + MF Y+IK++PT+Y+ ++G   
Sbjct: 242 AFNISHRINRLTFG---DDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298

Query: 302 --------SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     L G DG     + G+FF Y+LSP+ V  TE+  S  H  T +   + G +
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGVFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358

Query: 350 ITFMLVDALLHSCVKKISKVEIGGK 374
               ++D++++   K I K    GK
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGK 383


>gi|357112459|ref|XP_003558026.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Brachypodium distachyon]
          Length = 387

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/392 (35%), Positives = 216/392 (55%), Gaps = 28/392 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L+ LDA+ K  EDF+ +T+ GG +TI   L I  L   ++  Y   +T  +L V
Sbjct: 1   MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFFSEIRLYLYSATESKLTV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG +L I+ D+  P + C  +A+D +D SGEQH  + H+I+K+R+D  G  I E +K
Sbjct: 61  DTSRGERLHINFDVTFPALPCSLVAIDTMDVSGEQHYDIRHDIFKKRIDHLGNVI-ESRK 119

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + V + K ++    +G      E    CGSCYG+E    +CCN+C EV++AYR K WAL 
Sbjct: 120 DGVGSPKIERPLQNHGGRLDHNE--AYCGSCYGSEESDDQCCNSCEEVRDAYRKKGWALT 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            +++I QCK E   ++LK+   EGC I+G+++VN+V+G+FH APG     +   + D+  
Sbjct: 178 NVESIDQCKREGFVQRLKDEQGEGCNIHGFVDVNKVAGNFHFAPGKHLDQSFNFLQDMLN 237

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG---ASMFNYYIKIIPTIYE 297
           +    +N +H I  LSFG +         PLDG   K E+      M+ Y++K++PTIY 
Sbjct: 238 FQPENYNISHKINKLSFGKEFPG---VVNPLDGVEWKQEQATGLTGMYQYFVKVVPTIYT 294

Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            + G K+      +              PG++F YE SP+ V  TE++ SL H  T I  
Sbjct: 295 DIRGRKIHSNQFSVTEHFREAIGFPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    ++D+ +   H  +KK  K+EIG
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 384


>gi|428183328|gb|EKX52186.1| hypothetical protein GUITHDRAFT_65491 [Guillardia theta CCMP2712]
          Length = 425

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 139/386 (36%), Positives = 210/386 (54%), Gaps = 39/386 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+  D + K  +DF  +T+ G  V+I+ +L +  LI  ++  Y  + T  EL VD+SRG
Sbjct: 39  RLREFDIYPKTIQDFQVRTLAGAVVSILGFLIMFVLILGEINLYLTIQTDHELSVDTSRG 98

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ +I    + C  ++LD +D SGEQH+ V H +YK+RLD+DG  I    +  +N 
Sbjct: 99  EKLQINFNITFHAMPCTIISLDTMDISGEQHIDVHHEVYKQRLDVDGNVILLLSRACLN- 157

Query: 126 VKKKKVTTENGTTTT-----ELEDP---NKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
                VT  +G  TT       + P    +CGSCYGAE    +CCNTC+ V+EAYR + W
Sbjct: 158 -----VTNGSGDFTTLRAHAGFDAPLTGGECGSCYGAEESPDECCNTCDSVREAYRRRGW 212

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYL-------EVNRVSGSFHIAPGLSYSI 230
           A    D IVQCK E    K++    EGC++ G L       +VN+V+G+FH +PG S+S 
Sbjct: 213 AFVNSDGIVQCKTEGFLLKMQEERHEGCRVVGTLQARLTREQVNKVAGNFHFSPGKSFSQ 272

Query: 231 N-HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
              VH  D+       +N +H I HLSFG K      R  PLDG V   E  ++M+ Y++
Sbjct: 273 QVGVHFQDLLVLRKTDYNVSHAINHLSFGRKYPG---RVNPLDGVVRICEFRSAMYQYFV 329

Query: 290 KIIPTIYERLDGS--------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           K++PT Y+  +G+              +L G   G+PG+FF Y+LSP+   + E++ S  
Sbjct: 330 KVVPTQYQYRNGTILSTNQFSTTENTRQLEGFTRGLPGVFFFYDLSPIKATLAERNNSFL 389

Query: 336 HLWTKIMCNISGTYITFMLVDALLHS 361
           H  T +   I G +    ++D+ +++
Sbjct: 390 HFLTGLCAIIGGVFTVMGIIDSTIYT 415


>gi|326506194|dbj|BAJ86415.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 363

 Score =  248 bits (632), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 135/365 (36%), Positives = 203/365 (55%), Gaps = 22/365 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LKGLDA+ K  EDF+++T+ GG VT++    +  L   +   YF  +T  +L VD+SRG
Sbjct: 7   KLKGLDAYPKVNEDFYKRTLSGGVVTLLSAFVMLLLFVSETKSYFYSATETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L ++ DI  P+I C  L++D  D SGEQH  + H+I K+RLD  G  I E +KE +  
Sbjct: 67  ERLRVNFDITFPSIPCTLLSVDTRDISGEQHQDIRHDIEKKRLDSHGNVI-ESRKEGIGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G    + E+   CG+CYGAE    +CCN+C EV+EAY+ K WAL   D I
Sbjct: 126 TKIEKPLQKHGGRLGKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   E++K    EGC ++G+L+V++V+G+FH APG  Y  ++V + ++       
Sbjct: 184 DQCAREDFVERVKTQHGEGCSVHGFLDVSKVAGNFHFAPGKGYYESNVDMPELS--AEGG 241

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN TH I  LSFG +         PLDG           + Y+IK++PTIY  + G K+ 
Sbjct: 242 FNITHKINKLSFGTEFPG---AVNPLDGAQWTQPASDGTYQYFIKVVPTIYNDIRGRKID 298

Query: 306 GG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                      DG +     PG+FF Y+ SP+ V  TE+++S  H  T +   + G +  
Sbjct: 299 SNQFSVTEHFRDGNVQPRPQPGVFFFYDFSPIKVIFTEENRSFLHYLTNLCAIVGGIFTV 358

Query: 352 FMLVD 356
             ++D
Sbjct: 359 AGIID 363


>gi|449510462|ref|XP_004163672.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 3-like [Cucumis
           sativus]
          Length = 385

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 138/385 (35%), Positives = 214/385 (55%), Gaps = 24/385 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++ LDA+ K  EDF+ +T+ GG +TI   + +  L   ++  Y   +T  +L VD+SRG
Sbjct: 7   KIRKLDAYPKISEDFYNRTLSGGFITIASSIIMFLLFFSELRLYVHTATETKLIVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+L A+D SGEQHL V+H+I K+R+D  G  I + + + + +
Sbjct: 67  EHLRINFDVTFPALPCSVLSLHAMDISGEQHLDVKHDIVKKRIDYQGNVI-DSRPDGIGS 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + ++   ++G    + E    CGSCYGA  E   CCN+C +V+EAY  K WAL   D I
Sbjct: 126 TEIERPLQKHGGRLKQNE--TYCGSCYGASGE--DCCNSCQDVREAYHRKGWALSHPDLI 181

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD-IQPYTSA 244
            QCK E   +++KN   EGC IYG+LEVN+V+G+FH APG  + +++  +H+ +  +   
Sbjct: 182 DQCKREGFFQRVKNEEGEGCNIYGFLEVNKVAGNFHFAPGRGFQLSYFQIHNPLASFQWD 241

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
           AFN +H I  L+FG    D      PLDG        + MF Y+IK++PT+Y+ ++G   
Sbjct: 242 AFNISHRINRLTFG---DDFPGVVNPLDGVQWNQGTLSGMFQYFIKVVPTVYKAVNGKAI 298

Query: 302 --------SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                     L G DG     + G FF Y+LSP+ V  TE+  S  H  T +   + G +
Sbjct: 299 KSNQFSVTQHLRGIDGESFQALHGXFFFYDLSPIKVTFTEEHISFFHFLTNVCAIVGGVF 358

Query: 350 ITFMLVDALLHSCVKKISKVEIGGK 374
               ++D++++   K I K    GK
Sbjct: 359 TISGILDSIIYHGQKAIKKKMALGK 383


>gi|356512071|ref|XP_003524744.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 431

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 137/384 (35%), Positives = 208/384 (54%), Gaps = 26/384 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG VT+V    + +L   ++  Y    T  +L VD+SRG
Sbjct: 54  KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLYLYTVTESKLLVDTSRG 113

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL + HNI K+R+D +G  I+E +K+ + A
Sbjct: 114 DTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEE-RKDGIGA 172

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K ++   ++G       D   CGSC+GAE     CCN+C EV+EAYR K WA+  +D I
Sbjct: 173 PKIERPLQKHGGRLGH--DEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLI 230

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E   +++K+   EGC + G LEVN+V+G+FH A G S+  + + + D+       
Sbjct: 231 DQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADLLALQDNH 290

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
           +N +H I  LSFG           PLDG          M+ Y+IK++PTIY         
Sbjct: 291 YNISHRINKLSFGHHFPG---LVNPLDGVKWVQGPAHGMYQYFIKVVPTIYTDIRGRVIH 347

Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                  E    S+LG     +PG+FF Y++SP+ V   E+     H  T I   I G +
Sbjct: 348 SNQYSVTEHFKSSELG---VAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGVF 404

Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
               ++D+ ++   + I  K+E+G
Sbjct: 405 TVAGIIDSSIYYGQRTIKRKMELG 428


>gi|212721670|ref|NP_001132255.1| uncharacterized protein LOC100193691 [Zea mays]
 gi|194693892|gb|ACF81030.1| unknown [Zea mays]
 gi|223949235|gb|ACN28701.1| unknown [Zea mays]
 gi|413949703|gb|AFW82352.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 384

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 138/386 (35%), Positives = 212/386 (54%), Gaps = 22/386 (5%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F  RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  ST  +L VD
Sbjct: 3   AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI  P+I C  L++D  D SGEQH  + H+I KRRL+  G  I E +KE
Sbjct: 63  TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C EV+EAY+ K WAL  
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QC  E   +++K    EGC + G+L+V++V+G+FH APG  +  +++ V ++   
Sbjct: 180 PDLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
               FN +H I  LSFG +         PLDG           + Y+IK++PTIY  + G
Sbjct: 239 LEGGFNISHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295

Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             +            DG +     PG+FF Y+ SP+ V  TE+++SL H  T +   + G
Sbjct: 296 RGIHSNQFSVTEHFRDGNVRPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTNLCAIVGG 355

Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
            +    ++D+ ++   K +  K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381


>gi|195162746|ref|XP_002022215.1| GL25735 [Drosophila persimilis]
 gi|194104176|gb|EDW26219.1| GL25735 [Drosophila persimilis]
          Length = 313

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 133/293 (45%), Positives = 184/293 (62%), Gaps = 23/293 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F++ L+ LDA+ +  +DF  +TV G AVTI+    IS LI ++   Y Q +  EELFV
Sbjct: 1   MKFADVLRRLDAYPRTLDDFSVRTVGGAAVTIISTSIISLLIFLEFLSYMQPALNEELFV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-PQ 119
           D++RG KL I+LD+ +  ++C+Y++LDA+DSSG+ HL V+H+I+K RLDL G+P++E P 
Sbjct: 61  DTTRGHKLRINLDVTLHNLACNYISLDAMDSSGDTHLRVDHDIFKHRLDLKGEPLKETPI 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           KE+V      K  T              CGSCYGAE     CCNTC +V +AYR  KW +
Sbjct: 121 KEIVAVSPPNKNVT--------------CGSCYGAEHNATHCCNTCEDVLDAYRLHKWNV 166

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            ++D I QCK +Y     ++ F EGC+I G+LEVNR++GSFH APG S+SI   H+HD Q
Sbjct: 167 -QVDKIEQCKGKYKRTD-EDAFKEGCRIQGHLEVNRMAGSFHFAPGKSFSIRQFHIHDFQ 224

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKI 291
               +    +H I HLSFG K++    +  PLDG  V  AE    MFN+Y+KI
Sbjct: 225 ---FSNVKLSHTINHLSFGEKIE--FAKTHPLDGLRVDVAETKTEMFNHYLKI 272


>gi|108707873|gb|ABF95668.1| Serologically defined breast cancer antigen NY-BR-84, putative,
           expressed [Oryza sativa Japonica Group]
          Length = 387

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 139/392 (35%), Positives = 217/392 (55%), Gaps = 28/392 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L+ LDA+ K  EDF+ +T+ GG +TI   L I  L   ++  Y   +T  +L V
Sbjct: 1   MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG +L I+ D+  P + C  +A+D +D SGEQH  + H+I K+R+D  G  I E +K
Sbjct: 61  DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVI-ESRK 119

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + V A K ++   ++G      E    CGSCYG+E    +CCN+C +V++AYR K WAL 
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALT 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            ++ I QCK E   ++LK+   EGC I+G++ VN+V+G+FH APG S   +   + D+  
Sbjct: 178 NIEEIDQCKREGFVQRLKDEQGEGCSIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
           +    +N +H I  LSFG++         PLDG   + +   G + M+ Y++K++PTIY 
Sbjct: 238 FQQENYNISHKINKLSFGVEFPG---VVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYT 294

Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            + G K+      +              PG++F YE SP+ V  TE++ SL H  T I  
Sbjct: 295 DIRGRKINSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    ++D+ +   H  +KK  K+EIG
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 384


>gi|291000812|ref|XP_002682973.1| predicted protein [Naegleria gruberi]
 gi|284096601|gb|EFC50229.1| predicted protein [Naegleria gruberi]
          Length = 416

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 147/408 (36%), Positives = 216/408 (52%), Gaps = 49/408 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K  +DF  KT+ GG ++I+  L I  L+  +   Y QV   ++L+VD+ +  
Sbjct: 3   LKSFDFYPKTQDDFRVKTLGGGLISIISLLVILILVLGEFYLYLQVERFDQLYVDTQQER 62

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDGKPIQEPQKEVVN- 124
           K+PI+++I  P +SCD L LD +D SGE H+H++ H +YK RL LDGKPI E Q E V+ 
Sbjct: 63  KIPIYINITFPAVSCDALNLDVMDVSGEHHVHLDYHTVYKMRLTLDGKPIIEQQAEQVSD 122

Query: 125 -------------AVKKKKVTTE-----NGTTTTELEDPNKCGSCYGAETETRKCCNTCN 166
                        AVK   V              +++DP  CGSCYG+  +  +CCNTC+
Sbjct: 123 DKPTLDILKPPPGAVKHDLVNNAELDKIRAERAKKVKDPKYCGSCYGSNRDANQCCNTCD 182

Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGL 226
           +V+E+YR   WA    + I QC  E    K+K +  EGC ++GY  VN+V+G+FH APG 
Sbjct: 183 DVRESYRRVGWAFSPNEDIEQCYEEILERKMKYSKQEGCNLHGYFLVNKVAGNFHFAPGK 242

Query: 227 SYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV----------A 276
           S+     H+HD   Y    FNT+H I +L FG K+        PLDGT            
Sbjct: 243 SFVRAQQHMHDYTNYEVDHFNTSHIINYLGFGEKIPG---LINPLDGTSKIIGYNAETGQ 299

Query: 277 KAEEGASMFNYYIKIIPTIYERLDGS----------------KLGGGDGGMPGIFFSYEL 320
           + E  +++F Y++K++PTIYE+   S                K       +PG+FF Y+L
Sbjct: 300 RVEGESALFQYFVKVVPTIYEKYGSSNSIITNQYSVTQHSRPKNRLHPNVVPGVFFIYDL 359

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           SP+MV ITE  KS     T +   I G +    L+D +++   KK+++
Sbjct: 360 SPIMVHITENKKSFVQFLTSLCAIIGGVFTVSALLDRVIYGVEKKMNR 407


>gi|313231322|emb|CBY08437.1| unnamed protein product [Oikopleura dioica]
          Length = 386

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/390 (36%), Positives = 214/390 (54%), Gaps = 25/390 (6%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F  +++  DA+TKP EDF E+TV G  +TI C L    L   ++  Y       EL V
Sbjct: 1   MGFLSQIRRFDAYTKPVEDFRERTVTGAVITICCSLLCMLLFFSELNYYLTTEVVSELRV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D++RG KL ++LD+ V  + C+Y ++DA+D +G++    EH ++K R+  DG+ +   +K
Sbjct: 61  DNTRGGKLVMNLDLTVAGLPCNYFSIDAMDLTGDR-ADAEHQLFKVRMK-DGQEVALSEK 118

Query: 121 -EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
            E +NA K      E   T   ++D  +C SCYGAETE + CCN+C EV++AYR K WA 
Sbjct: 119 VEEINAEKLHDEKQEEEETGLAVKD--ECQSCYGAETEEQPCCNSCEEVQQAYRNKGWAF 176

Query: 180 P-ELDTIVQCKNEY--STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
                   QC NE+    E+L+ T  E C+++G+LEVNRVSGS  I+PG +  ++   VH
Sbjct: 177 DHSAQQFSQCVNEHFDLNEELQKTEGESCRVHGHLEVNRVSGSLQISPGKTLVLDGSVVH 236

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
           DI+     +F+T+H I HLSFG      +    PLD T  +AE     ++Y  K+IPT +
Sbjct: 237 DIRGMKHMSFDTSHTIHHLSFGEVFPGQE---NPLDNTEHEAESMNMAWHYNFKVIPTEF 293

Query: 297 ERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
            +LDGS+              L      +PGI F +E++P+ V   E  +S  H  T + 
Sbjct: 294 RKLDGSRTATNQFSVTRHEKALSQMSSRLPGINFHFEIAPIAVIKMETRRSAVHFATSVC 353

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVEIG 372
             I G +    ++D+ +H   K + K E+G
Sbjct: 354 AIIGGVWTISSILDSFIHKTNKLLIKTELG 383


>gi|224066933|ref|XP_002302286.1| predicted protein [Populus trichocarpa]
 gi|222844012|gb|EEE81559.1| predicted protein [Populus trichocarpa]
          Length = 377

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 138/386 (35%), Positives = 207/386 (53%), Gaps = 37/386 (9%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+  DA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNFDAYPKINEDFYSRTLSGGVITLASSIVMFLLFFSELRLYLHAVTETKLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL V+H+I K+RLD  G  I+  Q      
Sbjct: 67  ETLRINFDVTFPALPCSILSLDAMDISGEQHLDVKHDIIKKRLDSHGNVIESRQ------ 120

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETET---RKCCNTCNEVKEAYRYKKWALPEL 182
                    +G    ++E P +         ET     CCN+C EV+EAY+ K WA+   
Sbjct: 121 ---------DGIGAPKIEKPLQRHGGRLEHNETYCDEDCCNSCEEVREAYQKKGWAVTNP 171

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D + QCK E   +++K+   EGC IYG+LEVN+V+G+FH APG S+  + VHVHD+  + 
Sbjct: 172 DLMDQCKREGFLQRIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQQSGVHVHDLLAFQ 231

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
             +FNT+H I  L+FG           PLDG     E  + M+ Y+IK++PT+Y  + G 
Sbjct: 232 KDSFNTSHKINRLAFGEYFPG---VVNPLDGVQWTQETPSGMYQYFIKVVPTVYTDVSGH 288

Query: 303 KLG-----------GGDGG----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
            +            G D G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G
Sbjct: 289 TIQSNQFSVTEHFRGADIGRLQSLPGVFFFYDLSPIKVTFTEEHVSFLHFLTNVCAIVGG 348

Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
            +    ++D+ ++   K I  K+EIG
Sbjct: 349 VFTVSGILDSFIYHGQKAIKKKMEIG 374


>gi|443734710|gb|ELU18591.1| hypothetical protein CAPTEDRAFT_139954 [Capitella teleta]
          Length = 285

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 127/276 (46%), Positives = 172/276 (62%), Gaps = 11/276 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DA+ K +EDF  KT  G AVTIV  + +  L   +   Y       ELFVD++RG 
Sbjct: 10  LRQFDAYPKTFEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLITEVHPELFVDTARGQ 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNA 125
           KL I++D+  PT+ C +L LDA+D SGEQ + V H+I+K+RLDLDG  ++ EP KE +  
Sbjct: 70  KLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGD 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K K    +N      L+D ++C SCYGAE+E  KCCNTCNEV+EAYR K WA  +   I
Sbjct: 130 -KSKDFAVKN-----PLKD-DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNI 182

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E    +L+    EGC+IYG+LEVN+V+G+FH+APG S+S +H H+HD+Q      
Sbjct: 183 EQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMK 242

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG 281
           FN +H I+HLSFG    D   +  PLD +    E+G
Sbjct: 243 FNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQG 275


>gi|115464597|ref|NP_001055898.1| Os05g0490200 [Oryza sativa Japonica Group]
 gi|50080302|gb|AAT69636.1| unknown protein [Oryza sativa Japonica Group]
 gi|113579449|dbj|BAF17812.1| Os05g0490200 [Oryza sativa Japonica Group]
 gi|218197014|gb|EEC79441.1| hypothetical protein OsI_20422 [Oryza sativa Indica Group]
 gi|222632053|gb|EEE64185.1| hypothetical protein OsJ_19017 [Oryza sativa Japonica Group]
          Length = 384

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 139/385 (36%), Positives = 217/385 (56%), Gaps = 22/385 (5%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F ++LKGLDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  +T  +L VD+
Sbjct: 4   FLQKLKGLDAYPKVNEDFYKRTLSGGVVTVVASVVMLLLFVSETRSYFYSATETKLVVDT 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG +L ++ D+  P++ C  L++D +D SGEQH  + H+I KRRLD  G  I E +KE 
Sbjct: 64  SRGERLRVNFDVTFPSVPCTLLSVDTMDISGEQHHDIRHDIEKRRLDAHGNVI-EARKEG 122

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +   K +    ++G   ++ E+   CG+CYGAE    +CCN+C EV+EAY+ K WAL   
Sbjct: 123 IGGAKIESPLQKHGGRLSKGEE--YCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTNP 180

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D I QC  E   E++K    EGC ++G+L+V++V+G+ H APG  +  ++++V ++    
Sbjct: 181 DLIDQCTREDFVERVKTQQGEGCNVHGFLDVSKVAGNLHFAPGKGFYESNINVPELSA-L 239

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
              FN TH I  LSFG +         PLDG           + Y+IK++PTIY  L G 
Sbjct: 240 EHGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDLRGR 296

Query: 303 KLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
           K+            DG +     PG+FF Y+ SP+ V  TE++ SL H  T +   + G 
Sbjct: 297 KIHSNQFSVTEHFRDGNIRPKPQPGVFFFYDFSPIKVIFTEENSSLLHYLTNLCAIVGGV 356

Query: 349 YITFMLVDALLHSCVKKI-SKVEIG 372
           +    ++D+ ++   K +  K+E+G
Sbjct: 357 FTVSGIIDSFIYHGQKALKKKMELG 381


>gi|226494401|ref|NP_001141198.1| uncharacterized protein LOC100273285 [Zea mays]
 gi|194703210|gb|ACF85689.1| unknown [Zea mays]
 gi|238011828|gb|ACR36949.1| unknown [Zea mays]
 gi|413945823|gb|AFW78472.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 384

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/386 (35%), Positives = 212/386 (54%), Gaps = 22/386 (5%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F +RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  +T  +L VD
Sbjct: 3   AFLQRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI   +I C  L++D +D SGEQH  + H+I K RLD  G  I E +K 
Sbjct: 63  TSRGERLRVNFDITFLSIPCTLLSVDTMDISGEQHQDIRHDIEKIRLDAHGNVI-EARKV 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C EV+EAY+ K WAL  
Sbjct: 122 SIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QC  E   E++K    EGC ++G+L+V++V+G+FH APG  +  +++ V ++   
Sbjct: 180 PDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
               FN TH I  LSFG +         PLDG           + Y+IK++PTIY  + G
Sbjct: 239 LEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295

Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             +            DG +     PG+FF Y+ SP+ V  TE+S+SL H  T +   + G
Sbjct: 296 HNIHSNQFSVTEHFRDGNVRPKPQPGVFFFYDFSPIKVIFTEESRSLLHYLTNLCAIVGG 355

Query: 348 TYITFMLVDALLHSCVKKI-SKVEIG 372
            +    ++D+ ++   K +  K+E+G
Sbjct: 356 VFTVSGIIDSFIYHGQKALKKKMELG 381


>gi|443734706|gb|ELU18587.1| hypothetical protein CAPTEDRAFT_139951 [Capitella teleta]
          Length = 285

 Score =  243 bits (621), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 127/276 (46%), Positives = 171/276 (61%), Gaps = 11/276 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DA+ K  EDF  KT  G AVTIV  + +  L   +   Y       ELFVD++RG 
Sbjct: 10  LRQFDAYPKTLEDFRVKTYGGAAVTIVSGILMFVLFVSEFNYYLTTEVHPELFVDTARGQ 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNA 125
           KL I++D+  PT+ C +L LDA+D SGEQ + V H+I+K+RLDLDG  ++ EP KE +  
Sbjct: 70  KLKINVDMTFPTVGCSFLTLDAMDVSGEQQIDVLHDIFKQRLDLDGIEVKAEPSKEDLGD 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K K    +N      L+D ++C SCYGAE+E  KCCNTCNEV+EAYR K WA  +   I
Sbjct: 130 -KSKDFAVKN-----PLKD-DRCESCYGAESEAHKCCNTCNEVREAYRQKGWAFVDAQNI 182

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E    +L+    EGC+IYG+LEVN+V+G+FH+APG S+S +H H+HD+Q      
Sbjct: 183 EQCMREGYVSQLEEGKNEGCRIYGFLEVNKVAGNFHVAPGRSFSQHHAHIHDMQALQGMK 242

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG 281
           FN +H I+HLSFG    D   +  PLD +    E+G
Sbjct: 243 FNMSHRIQHLSFG---DDYPGQVNPLDASEQVTEQG 275


>gi|297850670|ref|XP_002893216.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297339058|gb|EFH69475.1| hypothetical protein ARALYDRAFT_472456 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 386

 Score =  243 bits (621), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 131/383 (34%), Positives = 205/383 (53%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+ LDA+ K  EDF+ +T+ GG +T+V    +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   RLRNLDAYPKINEDFYRRTLSGGVITLVSSFVMLILFFSELQLYIHPVTETQLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ D+  P + C  ++LD++D SGE+HL V H+I KRRLD  G  I+  Q  + + 
Sbjct: 67  EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +K +    G       +   CGSC+GAE     CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +K+K+   EGC ++G+LEVN+V+G+FH  PG S+  +    HD+  +    
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
           +N +H +  L+FG           PLDG      + + ++ Y+IK++P+IY  +  + + 
Sbjct: 244 YNISHTVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300

Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G     PG+FF Y+LSP+ V   E+     H  T +   + G + 
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              +VD+ ++   + I  K+EIG
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIG 383


>gi|18395087|ref|NP_564162.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|9454530|gb|AAF87853.1|AC073942_7 Contains similarity to a PR00989 protein from Homo sapiens
           gi|7959731. EST gb|AI995648 comes from this gene
           [Arabidopsis thaliana]
 gi|13878151|gb|AAK44153.1|AF370338_1 unknown protein [Arabidopsis thaliana]
 gi|21281042|gb|AAM44956.1| unknown protein [Arabidopsis thaliana]
 gi|21553754|gb|AAM62847.1| unknown [Arabidopsis thaliana]
 gi|332192089|gb|AEE30210.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 386

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 130/383 (33%), Positives = 205/383 (53%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   RLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ D+  P + C  ++LD++D SGE+HL V H+I KRRLD  G  I+  Q  + + 
Sbjct: 67  EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +K +    G       +   CGSC+GAE     CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +K+K+   EGC ++G+LEVN+V+G+FH  PG S+  +    HD+  +    
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
           +N +H +  L+FG           PLDG      + + ++ Y+IK++P+IY  +  + + 
Sbjct: 244 YNISHKVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300

Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G     PG+FF Y+LSP+ V   E+     H  T +   + G + 
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKVIFEEQHVEFLHFLTNVCAIVGGIFT 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              +VD+ ++   + I  K+EIG
Sbjct: 361 VSGIVDSFIYHGQRAIKKKMEIG 383


>gi|324511490|gb|ADY44781.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Ascaris suum]
          Length = 382

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 138/390 (35%), Positives = 204/390 (52%), Gaps = 31/390 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    RL+ LDA+TKP +DF  KT  GGAVT++  L I  L   +   +      E+LFV
Sbjct: 1   MSLLARLRDLDAYTKPLDDFRVKTFTGGAVTLLSTLVIVVLFVSETISFLSTDVVEQLFV 60

Query: 61  DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           DS S   +L ++ D+    + C  + +D +D SG+    V+ ++YK+RLD  G  I    
Sbjct: 61  DSTSADQRLDVNFDVTFTKLPCAMVTVDVMDVSGDNQDDVQDDVYKQRLDQQGNNITG-- 118

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
                A  +  V     T  ++L    KCGSCYGA   + +CCNTC +VKEAY  + W +
Sbjct: 119 ----QAAVRLGVNVNTSTPASQLTTEPKCGSCYGA---SDRCCNTCEDVKEAYSARGWQM 171

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            +++++ QCK++     + +   EGC++YG ++V +V+G+FHIAPG        H HD+ 
Sbjct: 172 LDIESVEQCKSDAWVRTINDFKGEGCRVYGKVQVAKVAGNFHIAPGDPLRSLRSHFHDLH 231

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS--MFNYYIKIIPTIYE 297
               A F+T H I HLSFG        +  PLDG      + +S  MF YY+K++PT+YE
Sbjct: 232 SIAPAKFDTAHIINHLSFGTPFPG---KNYPLDGKSFGTNKDSSGIMFQYYMKVVPTMYE 288

Query: 298 RLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
            LD S                +G G  G+PG F  YE SPLMVK  E+ + L      + 
Sbjct: 289 FLDSSNNIFSHQFSVTTHQKDIGMGASGLPGFFVQYEFSPLMVKYEERRQPLSTFLVSLC 348

Query: 343 CNISGTYITFMLVDALLHSCVKKIS-KVEI 371
             I G +    L+D+L++   + I  KVE+
Sbjct: 349 AIIGGVFTVASLIDSLIYHSSRAIQHKVEM 378


>gi|217071774|gb|ACJ84247.1| unknown [Medicago truncatula]
          Length = 384

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 141/384 (36%), Positives = 205/384 (53%), Gaps = 26/384 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG VT+V    + +L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFISELRLYLYTVTESKLLVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LD +D SGE+H  + HNI K+R+D +GK I E +KE + A
Sbjct: 67  ETLNINFDVTFPAVRCSILSLDTMDISGERHHDILHNIMKQRIDANGKVI-EARKEGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K ++   ++G       D   CGSC+GAE     CCN C EV+EAYR K WAL  +D I
Sbjct: 126 PKIERPLQKHGGRLEH--DEKYCGSCFGAEESDDHCCNNCEEVREAYRKKGWALTNIDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E   +K+K+   EGC I+G LEVN+V+G+FH A G S+  + + + D+       
Sbjct: 184 DQCQREGFVQKVKDEEGEGCNIHGSLEVNKVAGNFHFATGQSFLQSAIFLTDLLALQDNH 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
           +N +H I  LSFG           PLDG          M  Y+IK++PT+Y         
Sbjct: 244 YNISHQINKLSFG---HHYPGLVNPLDGIKWVQGNDHGMCQYFIKVVPTVYTDIRGRVIH 300

Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                  E    S+LG     +PG+FF Y++SP+ V   E+     H  T I   I G +
Sbjct: 301 SNQYSVTEHFKSSELG---AAVPGVFFFYDISPIKVNFKEEHIPFLHFLTNICAIIGGIF 357

Query: 350 ITFMLVDALLHSCVKKI-SKVEIG 372
               +VD+ ++   K I  K+EIG
Sbjct: 358 TIAGIVDSSIYYGQKTIKKKMEIG 381


>gi|363806898|ref|NP_001242045.1| uncharacterized protein LOC100781612 [Glycine max]
 gi|255644390|gb|ACU22700.1| unknown [Glycine max]
          Length = 384

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 137/384 (35%), Positives = 206/384 (53%), Gaps = 26/384 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG VT+V    + +L   ++       T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKVNEDFYNRTLAGGVVTVVSAAVMLFLFFSELSLCLYTVTESKLLVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  L+LDA+D SGEQHL + HNI K+R+D +G  I+E +K+ + A
Sbjct: 67  DTLHINFDVTFPAVRCSILSLDAMDISGEQHLDIRHNIVKKRIDANGNVIEE-RKDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G       D   CGSC+GAE     CCN+C EV+EAYR K WA+  +D I
Sbjct: 126 PKIEKPLQKHGGRLGH--DEKYCGSCFGAEESDEHCCNSCEEVREAYRKKGWAMTNMDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E   +++K+   EGC + G LEVN+V+G+FH A G S+  + + + D+       
Sbjct: 184 DQCQREGYVQRVKDEEGEGCNLQGSLEVNKVAGNFHFATGKSFLQSAIFLADVLALQDNH 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--------- 296
           +N +H I  LSFG           PLDG          M+ Y+IK++PTIY         
Sbjct: 244 YNISHRINKLSFGHHFPG---LVNPLDGVRWVQGPTHGMYQYFIKVVPTIYTDIRGRVIH 300

Query: 297 -------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                  E    S+LG     +PG+FF Y++SP+ V   E+     H  T I   I G  
Sbjct: 301 SNQYSVTEHFKSSELG---VAVPGVFFFYDISPIKVNFKEEHTPFLHFLTNICAIIGGVL 357

Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
               ++D+ ++   + I  K+E+G
Sbjct: 358 AVAGIIDSSIYYGQRTIKRKMELG 381


>gi|159470839|ref|XP_001693564.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283067|gb|EDP08818.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 388

 Score =  242 bits (617), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 145/393 (36%), Positives = 202/393 (51%), Gaps = 36/393 (9%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F  +LK LDA+ K  EDF  KT+ GG +TIV  + +  L   ++  +   S+  EL VD 
Sbjct: 6   FLGKLKALDAYPKINEDFFTKTMSGGIITIVSSVVMVLLFLSELRLFLTTSSAHELSVDV 65

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK--RRLDLDGKPIQEPQK 120
            RG K+ IH D+  P + C +L+LDA+D SGE HL +   +Y   RR       + E + 
Sbjct: 66  GRGEKIKIHFDVTFPKVPCAWLSLDAMDISGELHLDLVVELYTLWRR---GAAGLTEGKG 122

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
             +  +      + N T        N CGSCYGAE +   CCNTC+EV+ AYR K WAL 
Sbjct: 123 GGIGVLSVSVSRSRNATALA-----NGCGSCYGAEDKQGDCCNTCDEVRAAYRRKGWALS 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            +D I QC ++  TE +K    EGC I   +EVN+V+G+FH APG SY    +HVHDI P
Sbjct: 178 NVDHIEQCAHDLYTEAIKEQAGEGCHI--GVEVNKVAGNFHFAPGRSYQQGSMHVHDIAP 235

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-----TVAKAEEGASMFNYYIKIIPTI 295
           +  A  +  H I  LSFG   +     + PLDG       A A     MF Y++K++PT 
Sbjct: 236 FGDAVIDFRHVIHKLSFG---EPYPGMKNPLDGAKAGQAAAAAAAATGMFQYFLKVVPTS 292

Query: 296 YERLDGSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           Y  L    L               GG    +PG+FF Y+LSP+ VKI E   S     T 
Sbjct: 293 YTDLSNKTLSTNQFSVTENFREAQGGAGRTLPGVFFFYDLSPIKVKIVEHGSSFLSFLTS 352

Query: 341 IMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
           +   + G +    +VDA +++  + I  K+E+G
Sbjct: 353 VCAIVGGVFTVSGIVDAFVYTGTRMIKKKMELG 385


>gi|449438787|ref|XP_004137169.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 386

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 142/383 (37%), Positives = 208/383 (54%), Gaps = 22/383 (5%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T  GG +T+    F+ +L   ++  Y    T  +L VD+SRG
Sbjct: 7   KLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLVVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L I+ D+  P I C  L+LDA+D SGEQHL + HNI K+R+D  G  I E + + + A
Sbjct: 67  GELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVI-EARPDGIGA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K +K   ++G      E    CGSC+GAE     CCN+C EV+EAYR K WA+   D I
Sbjct: 126 PKIEKPLQKHGGRLEHNE--TYCGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E   +K+K+   EGC I G LEVN+V+GSFH  PG S+  +  +   +    ++ 
Sbjct: 184 DQCQREDFIQKVKDEEGEGCNIEGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSD 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           +N +H I  L+FG      D    PLDG   +  E   M  Y++K++PTIY+ + G  + 
Sbjct: 244 YNVSHRINRLAFG---NHYDGLVNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVH 300

Query: 306 ---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G    +PG+FF Y+LSP+ V  TE+     H  T I   I G + 
Sbjct: 301 SNQYSVTEHFKSVEFGSSQSIPGVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFS 360

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++DA ++   +K+  KVEIG
Sbjct: 361 VAGIIDAFIYHGQRKMKKKVEIG 383


>gi|413949704|gb|AFW82353.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 398

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/360 (37%), Positives = 199/360 (55%), Gaps = 22/360 (6%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F  RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  ST  +L VD
Sbjct: 3   AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI  P+I C  L++D  D SGEQH  + H+I KRRL+  G  I E +KE
Sbjct: 63  TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C EV+EAY+ K WAL  
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QC  E   +++K    EGC + G+L+V++V+G+FH APG  +  +++ V ++   
Sbjct: 180 PDLIDQCAREDFIDRVKTQQDEGCNVLGFLDVSKVAGNFHFAPGKGFYESNIDVPELS-L 238

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
               FN +H I  LSFG +         PLDG           + Y+IK++PTIY  + G
Sbjct: 239 LEGGFNISHKINKLSFGTEFPG---VVNPLDGAQWTQPASDGTYQYFIKVVPTIYTDIRG 295

Query: 302 SKLGGG---------DGGM-----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             +            DG +     PG+FF Y+ SP+ V  TE+++SL H  T  +C I G
Sbjct: 296 RGIHSNQFSVTEHFRDGNVRPKSQPGVFFFYDFSPIKVIFTEENRSLLHYLTN-LCAIVG 354


>gi|358054679|dbj|GAA99605.1| hypothetical protein E5Q_06306 [Mixia osmundae IAM 14324]
          Length = 424

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 137/415 (33%), Positives = 204/415 (49%), Gaps = 53/415 (12%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F    KGLDAF K  ED   KT +GG +T+V +  I+ L  ++  DY +V     + VD 
Sbjct: 7   FGGAFKGLDAFGKTLEDVKIKTGFGGILTLVSFTLIAALTLMEFVDYRRVHLHPSIVVDK 66

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG KL +HL+I  P + C  L++D +D SGE    + H+I K RLD  G  +Q  +   
Sbjct: 67  SRGEKLVVHLNITFPRVPCYLLSVDIMDISGEHQNDIHHDILKNRLDKSGALVQATRDST 126

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +    ++ V  +         +P  CGSCYG       CCNTC+EV+E+Y  + W+    
Sbjct: 127 LKGELERAVGVK--------REPGYCGSCYGGAPGDSGCCNTCDEVRESYVRRGWSFVNP 178

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D I QC  E  +EK+K    EGC + G ++VN+V G+FH++PG S+  N  HVHD+ PY 
Sbjct: 179 DGIDQCVREGFSEKIKEQSEEGCNVAGQVKVNKVIGNFHLSPGKSFQSNMHHVHDLVPYL 238

Query: 243 SAA--FNTTHHIRHLSFGIKLQDDDER-----------RKPLDGTVAKAEEGASMFNYYI 289
           +A    +  H I   SF  +  D   R             PL G  A  E+   MF Y++
Sbjct: 239 AAGQQHDFGHIINRFSFAAEGDDGFNRETARLKQSLNIEDPLTGVRAHTEQSNYMFQYFV 298

Query: 290 KIIPTIYERLDGSKLGG-----------------------------GDGGMPGIFFSYEL 320
           K++ T ++ LDG  L                               G  G+PG+FF+YE+
Sbjct: 299 KVVSTKFKTLDGRTLSSHQYSVTQYERDLSKGNKPGKDEDGHQTSHGYAGVPGLFFNYEI 358

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
           SP++V   E+ +S  H  T     + G      L+D L++S     ++++ GGK+
Sbjct: 359 SPMLVVHREERQSFAHFITSTCAIVGGILTVAGLIDTLVYSSQ---TRLQAGGKS 410


>gi|268581953|ref|XP_002645960.1| C. briggsae CBR-ERV-46 protein [Caenorhabditis briggsae]
          Length = 380

 Score =  237 bits (604), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 129/383 (33%), Positives = 198/383 (51%), Gaps = 26/383 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ KP +DF  KT+ GG VT++  + I  LI ++   +   +  E LFVDS+   
Sbjct: 7   LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETRQFLSTAVLEHLFVDSTTSD 66

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ I  DI    + C+++ +D +D S E   ++  +IY+ RLD DG+ + E  +++   
Sbjct: 67  ERVHIEFDITFNKLPCNFITVDVMDVSSEAQENINDDIYRLRLDADGRNVSESAQKI--E 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + + K   E     TEL    KCGSCYGA  +   CCNTC +VK AY  K W +  ++ +
Sbjct: 125 INQNKTIGE----PTELVQEVKCGSCYGAVADG-ICCNTCEDVKNAYAVKGWQV-NIEEV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCKN+   ++      EGC++YG ++V +V+G+FH+APG  +     HVHD+       
Sbjct: 179 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           F+ +H + H+SFG        +  PLDG V     G  M+ YY+K++PT Y+ LDG    
Sbjct: 239 FDASHTVNHISFGKSFPG---KNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 295

Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       LG    G+PG F  YE SPLMV+  E  +SL      +   + G +  
Sbjct: 296 SHQFSVTTHKKDLGFRQAGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 355

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
             LVD  ++   + +     GGK
Sbjct: 356 AQLVDITIYHTSRYMKSRIAGGK 378


>gi|341884797|gb|EGT40732.1| CBN-ERV-46 protein [Caenorhabditis brenneri]
          Length = 379

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 128/383 (33%), Positives = 195/383 (50%), Gaps = 27/383 (7%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ KP +DF  KT+ GG VT++  + I  LI ++   +      E LFVDS+   
Sbjct: 7   LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVMETRQFLSTDVLEHLFVDSTTSD 66

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ I  DI    + C+++ +D +D S E   ++  +IY+ RLD DGK + E  +++   
Sbjct: 67  ERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGKNVSETAQKI--- 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
               ++        TEL    KCGSCYGA  +   CCNTC +VK AY  K W +  ++ +
Sbjct: 124 ----EINQNKTVDATELIQEVKCGSCYGAAADG-ICCNTCEDVKNAYAIKGWQV-NIEEV 177

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCKN+   ++      EGC++YG ++V +V+G+FH+APG  +     HVHD+       
Sbjct: 178 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQSMRSHVHDLHNLDPVK 237

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           F+ +H + H+SFG        +  PLDG V     G  M+ YY+K++PT Y+ LDG    
Sbjct: 238 FDASHTVNHISFGKSFPG---KNYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 294

Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       LG    G+PG F  YE SPLMV+  E  +SL      +   + G +  
Sbjct: 295 SHQFSVTTHKKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 354

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
             LVD  ++   + +     GGK
Sbjct: 355 AQLVDITIYHSSRYMKNRIAGGK 377


>gi|224073341|ref|XP_002304080.1| predicted protein [Populus trichocarpa]
 gi|222841512|gb|EEE79059.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  235 bits (600), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 204/384 (53%), Gaps = 22/384 (5%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           ++++ LDA+ K  EDF+ +T+ GG +T++  + I +L   ++  Y    T  +L VD+SR
Sbjct: 6   QKVRNLDAYPKINEDFYSRTLSGGLITLISSVLILFLFFSELSLYLHKVTETKLLVDTSR 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  L I+ D+  P I C  L++DA+D SGEQHL + H+I K+R++  G  I E ++E + 
Sbjct: 66  GQSLRINFDVTFPAIRCSLLSVDAIDISGEQHLDIRHDISKKRINAHGDVI-EVRQEGIG 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
           A K  +    +G      E+   CGSC+G E     CCNTC EV+EAYR K WA+  +D 
Sbjct: 125 APKIDRPLQSHGGRLGHNEE--YCGSCFGGEMSHDDCCNTCEEVREAYRRKGWAMTNMDL 182

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QCK E   + +K+   EGC I G LEVNRV+GSFH AP  S+ +++  + D+      
Sbjct: 183 IDQCKREGFIQMIKDEEGEGCNINGSLEVNRVAGSFHFAPWKSFHLSNFLIQDLLDLQKD 242

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
           ++N +H I  L+FG           PL G     +    +  ++IK++PTIY  + G  +
Sbjct: 243 SYNISHRINRLAFGDYFPG---VVNPLAGIQLMHDTPNGVQQFFIKVVPTIYTDIRGRTV 299

Query: 305 GGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                                +PG++F Y+ SP+ V   E+  S  H  T I   I G +
Sbjct: 300 HSNQYSATEHFKKSELTPLDSLPGVYFFYDFSPIKVIFKEEHISFLHFMTSICAIIGGIF 359

Query: 350 ITFMLVDALLHSCVKKIS-KVEIG 372
               ++D+ ++   + I+ KV IG
Sbjct: 360 TIAGIIDSFIYYGQRAITKKVGIG 383


>gi|326497521|dbj|BAK05850.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 391

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 123/341 (36%), Positives = 188/341 (55%), Gaps = 21/341 (6%)

Query: 49  YFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           Y    T   L VD+SRG KL I+ DI  P + C  +++D +D SG++HL V+H+++K+R+
Sbjct: 55  YLHAVTETTLRVDTSRGEKLRINFDITFPALQCSIISVDVMDISGQEHLDVKHDVFKQRI 114

Query: 109 DLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV 168
           D  G  I   Q + V  +K +K    +G      E    CGSCYGA+    +CCN+C +V
Sbjct: 115 DAHGNVIATKQ-DAVGGMKVEKPLQHHGGRLEHNE--TYCGSCYGAQESPEQCCNSCEDV 171

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           +EAYR K W +   D+I QCK+E   + +K+   EGC IYG+LE+N+V+G+FH APG S+
Sbjct: 172 REAYRKKGWGVSNPDSIDQCKSEGFLQTIKDEEGEGCNIYGFLEINKVAGNFHFAPGKSF 231

Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
             ++VHVHD+ P+   +FN +H I  LSFG           PLDG          M  Y+
Sbjct: 232 QQSNVHVHDLLPFQKDSFNLSHKINKLSFGEPFPG---VINPLDGAQWIQHSSYGMAQYF 288

Query: 289 IKIIPTIYERLDGSKL-----------GGGDGG----MPGIFFSYELSPLMVKITEKSKS 333
           +K++PT+Y  ++   +             GD G    +PG+FF Y+LSP+ V  TE+  S
Sbjct: 289 VKVVPTVYSHINEQIILSNQFSVTEHSRSGDSGRVQALPGVFFFYDLSPIKVTFTERHVS 348

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
             H  T +   + G +    ++D+ ++   + I+K    GK
Sbjct: 349 FLHFLTNVCAIVGGVFTVSGIIDSFVYHGQRAITKKRELGK 389


>gi|308483051|ref|XP_003103728.1| CRE-ERV-46 protein [Caenorhabditis remanei]
 gi|308259746|gb|EFP03699.1| CRE-ERV-46 protein [Caenorhabditis remanei]
          Length = 380

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 128/383 (33%), Positives = 197/383 (51%), Gaps = 26/383 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ KP +DF  KT+ GG VT++  + I  LI ++   +      E LFVDS+   
Sbjct: 7   LKHFDAYRKPMDDFRVKTLSGGLVTLIATIVIGLLIVLETKQFLSTDVLEHLFVDSTTSD 66

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ I  DI    + C+++ +D +D S E   ++  +IY+ RLD DG+ I E  +++   
Sbjct: 67  ERVHIEFDITFNKLPCNFITVDVMDVSSEAQDNINDDIYRLRLDADGRNISESAQKI--E 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + + K   +     TEL    KCGSCYGA  +   CCNTC +VK AY  K W +  ++ +
Sbjct: 125 INQNKTIAD----PTELTQEVKCGSCYGAAADG-ICCNTCEDVKSAYAIKGWQV-NIEEV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCKN+   ++      EGC++YG ++V +V+G+FH+APG  +     HVHD+       
Sbjct: 179 EQCKNDKWVKEFTEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           F+ +H + HL+FG        +  PLDG V     G  M+ YY+K++PT Y+ LDG    
Sbjct: 239 FDASHTVNHLTFGKSFPG---KHYPLDGKVNTENRGGIMYQYYVKVVPTRYDYLDGRVDQ 295

Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       LG    G+PG F  YE SPLMV+  E  +SL      +   + G +  
Sbjct: 296 SHQFSVTTHKKDLGFRQSGLPGFFVQYEFSPLMVQYEEFRQSLASFLVSLCAIVGGVFAM 355

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
             L+D  ++   + +     GGK
Sbjct: 356 AQLIDITIYQTHRYMKNRIAGGK 378


>gi|224059030|ref|XP_002299683.1| predicted protein [Populus trichocarpa]
 gi|222846941|gb|EEE84488.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 203/384 (52%), Gaps = 22/384 (5%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           ++L+ LDA+ K  EDF+ +T+ GG +T++  + + +L   +   Y    T  +L VD++R
Sbjct: 6   QKLRNLDAYPKINEDFYSRTLSGGLITLISSIIMLFLFFSEFSLYLHAVTETKLLVDTTR 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  L I+ DI  P I C  L++DA+D SGEQH  + H+I K+R++  G  I E +++ + 
Sbjct: 66  GQTLRINFDITFPAIRCSLLSVDAIDISGEQHHDIRHDITKKRINAHGDVI-EVRQDGIG 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
           A K  K   ++G      E+   CGSC+GAE     CCN+C+EV+EAYR K WAL  +D 
Sbjct: 125 APKIDKPLQKHGGRLEHNEE--YCGSCFGAEMSDDHCCNSCDEVREAYRKKGWALTNMDL 182

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           I QC  E   + +K+   EGC I G LEVNRV+G+FH  PG S+  ++  + D+      
Sbjct: 183 IDQCIREGFVQMIKDEEGEGCNINGSLEVNRVAGNFHFVPGKSFHQSNFQLLDLLDMQKE 242

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
           ++N +H I  L+FG           PLDG          +  ++IK++PTIY  + G  +
Sbjct: 243 SYNISHRINRLAFGDYFPG---VVNPLDGIQLMHGTQNGVQQFFIKVVPTIYTDIRGRTV 299

Query: 305 GGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                                +PG++F Y+ SP+ V   E+  S  H  T I   I G +
Sbjct: 300 HSNQYSVTEHFTKSELMRLDSLPGVYFIYDFSPIKVTFKEEHTSFLHFMTSICAIIGGIF 359

Query: 350 ITFMLVDALLHSCVKKI-SKVEIG 372
               +VD+ ++   + I  K+EIG
Sbjct: 360 TIAGIVDSFIYHGRRAIKKKMEIG 383


>gi|393907059|gb|EFO23462.2| hypothetical protein LOAG_05019 [Loa loa]
          Length = 378

 Score =  234 bits (597), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 135/390 (34%), Positives = 203/390 (52%), Gaps = 42/390 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   ERLK  DA+TKP +DF  +T  GGAVT+V    I ++   +   +  V   E+L+V
Sbjct: 1   MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60

Query: 61  DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           DS+    ++ ++ DI  P + C  + +D +D SG+    +  ++YK  L LDGK     +
Sbjct: 61  DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKISL-LDGKEGNGVR 119

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
           +EV            N  T+T    P     CGSCYGA+     CCNTC EVKEAY  K 
Sbjct: 120 QEV------------NINTSTASSVPASQVLCGSCYGAK---EGCCNTCEEVKEAYMRKG 164

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W L  ++T+ QCK++   +K+     EGC++YG ++V +V+G+FHIAPG     +  H H
Sbjct: 165 WELINIETVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFH 224

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV---AKAEEGASMFNYYIKIIP 293
           D+   + + F+T+H + H SFG        +  PLDG     A+  +G  M+ Y++K++P
Sbjct: 225 DLHSLSPSKFDTSHTVNHFSFGNSFPG---KVYPLDGKFFGSARNSDGI-MYQYHLKLVP 280

Query: 294 TIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           T Y  LD ++               +  G  G+PG F  YE SPLMVK  E+ +SL    
Sbjct: 281 TSYVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFVQYEFSPLMVKYEERQQSLSTFL 340

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISK 368
             I   I G +    L+DA ++   + IS+
Sbjct: 341 VSICAIIGGIFTVASLIDAFIYRSGRIISQ 370


>gi|384252531|gb|EIE26007.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 386

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 138/392 (35%), Positives = 210/392 (53%), Gaps = 40/392 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK LDA+ K  EDF ++T+ GG +TI   + +  L   ++  + +++TT EL VD++RG
Sbjct: 7   KLKNLDAYPKVNEDFFQRTLSGGIITIGSSIIMLCLFLSELSLFMKITTTNELSVDTTRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L I+ D+  P + C++++LD +D SGE HL V+H++YKRRLD +G  I +       +
Sbjct: 67  DQLSINFDMTFPALPCEWISLDLMDISGEMHLDVDHDVYKRRLDSNGVVIPD-------S 119

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           ++K +V  E   T     +  +CGSCYGA  +  +CCN C EV+ AYR K W   +   I
Sbjct: 120 IEKHQVGPELDDTLLHKANETECGSCYGAAPD-EECCNNCEEVRAAYRRKGWGFTDPQQI 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   EKL+    EGC ++G L VN+V+G+FH APG S+    +HVHD+ P+    
Sbjct: 179 SQCAKEGFVEKLRAQEGEGCHMWGSLAVNKVAGNFHFAPGKSFQQGPMHVHDLVPFQGVT 238

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDG------TVAKAEEGASMFNYYIKIIPTIY--- 296
           F+ +H I  LSFG    +      PLD            +     + Y++K++PTIY   
Sbjct: 239 FDLSHRIDKLSFG---HEYPGMTNPLDRVNLPKFNTRNPQGLPGAYQYFLKVVPTIYVNS 295

Query: 297 -------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                        E   GS+       +PG+FF Y+LSP+ VK  E   S  H  T +  
Sbjct: 296 HNHTINSNQYSVTEHFKGSQ--DFQAQLPGVFFYYDLSPIKVKYHETRMSFLHFLTSVCA 353

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    +VDA +   H  +KK  KV++G
Sbjct: 354 IVGGIFTVAGIVDAFIYHGHQAIKK--KVDLG 383


>gi|17568835|ref|NP_510575.1| Protein ERV-46 [Caenorhabditis elegans]
 gi|3878494|emb|CAB01889.1| Protein ERV-46 [Caenorhabditis elegans]
          Length = 380

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 127/383 (33%), Positives = 199/383 (51%), Gaps = 26/383 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ KP +DF  KT+ GG VT++  + I  LI ++   +      E LFVDS+   
Sbjct: 7   LKHFDAYRKPMDDFRVKTLSGGLVTLIATIAIVLLIVLETKQFLSTEVLEHLFVDSTTSD 66

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ I  DI    + C+++ +D +D S E   ++  +IY+ RLD +G+ I E  +++   
Sbjct: 67  ERVHIEFDITFTKLPCNFITVDVMDVSSEAQENINDDIYRLRLDPEGRNISESAQKI--E 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + + K + E    TT++    KCGSCYGA  +   CCNTC++VK AY  K W +  ++ +
Sbjct: 125 INQNKTSVE----TTDVIQEVKCGSCYGAAADG-ICCNTCDDVKSAYAVKGWQV-NIEEV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCKN+   ++      EGC++YG ++V +V+G+FH+APG  +     HVHD+       
Sbjct: 179 EQCKNDKWVKEFNEHKNEGCRVYGTVKVAKVAGNFHLAPGDPHQAMRSHVHDLHNLDPVK 238

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
           F+ +H + H+SFG        +  PLDG V     G  M+ YY+K++PT Y+ LDG    
Sbjct: 239 FDASHTVNHVSFGKSFPG---KNYPLDGKVNTDNRGGIMYQYYVKVVPTRYDYLDGRVDQ 295

Query: 302 ----------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       LG    G+PG F  YE SPLMV+  E  +S       +   + G +  
Sbjct: 296 SHQFSVTTHKKDLGFRQSGLPGFFLQYEFSPLMVQYEEFRQSFASFLVSLCAIVGGVFAM 355

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
             LVD  ++   + +     GGK
Sbjct: 356 AQLVDITIYHSSRYMKSRIAGGK 378


>gi|226498912|ref|NP_001150650.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|194699894|gb|ACF84031.1| unknown [Zea mays]
 gi|195640862|gb|ACG39899.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
          Length = 387

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 139/392 (35%), Positives = 215/392 (54%), Gaps = 28/392 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L+ LDA+ K  EDF+ +T+ GG +TI+  L I  L   ++  Y   +T  +L V
Sbjct: 1   MELWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG +L I+ D+  P + C  +A+D +D SGEQH  + H+I K+R+D  G  I E +K
Sbjct: 61  DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDHLGNVI-ESRK 119

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + V A K ++   ++G      E    CGSCYGAE    +CCN+C EV++AYR K WA+ 
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGAEESDDQCCNSCEEVRDAYRKKGWAVN 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            ++ I QCK E   ++LK+   EGC I+G++ VN+V+G+FH APG S   +   + D+  
Sbjct: 178 NVELIDQCKREGYVQRLKDEQGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
                +N +H I  LSFG +         PLDG   +     G + M+ Y++K++PTIY 
Sbjct: 238 LQPETYNISHKINKLSFGEEFPG---VVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYT 294

Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            + G K+      +              PG++F YE SP+ V  TE++ SL H  T I  
Sbjct: 295 DIRGRKIHSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    ++D+ +   H  +KK  K+E+G
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMELG 384


>gi|393212588|gb|EJC98088.1| endoplasmic reticulum-derived transport vesicle ERV46 [Fomitiporia
           mediterranea MF3/22]
          Length = 421

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/412 (33%), Positives = 201/412 (48%), Gaps = 56/412 (13%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LKG+DAF K  ED   KT  G  +TI+    I     ++  DY +V+    + VD SRG 
Sbjct: 9   LKGIDAFGKTMEDVKVKTKTGAFLTILSAAIILAFTTIEFLDYRRVNLETSIVVDRSRGE 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L + +++  P + C  L+LD +D SGE    + HNI K RLD +G  +        +A 
Sbjct: 69  RLTVRMNVTFPKVPCYLLSLDVMDISGEAQRDISHNIVKARLDANGAVVPNSH----SAE 124

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            + K+   N  T       N CGSCYG       CCNTC EV++AY  K W+    D+I 
Sbjct: 125 LRNKLDVMNDQT-----QDNYCGSCYGGVAPEGGCCNTCEEVRQAYVNKGWSFSNPDSIE 179

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC  E+ +EKL    TEGC I G L VN+V G+ H++PG S+  N++++H++ PY     
Sbjct: 180 QCVREHWSEKLHEQSTEGCNISGRLRVNKVIGNIHLSPGRSFQTNYMNIHELVPYLKEDK 239

Query: 247 NTTHHIRHLSFGIKLQDDDE---RRK---------------PLDGTVAKAEEGASMFNYY 288
           N  H   H+   +  + DDE   R+K               PLDG V KA     MF Y+
Sbjct: 240 NR-HDFGHIVHELSFEGDDEYNFRKKERSKGIKKKLGIEANPLDGAVGKAASLQYMFQYF 298

Query: 289 IKIIPTIYERLDGSKLG---------------GGDG-------------GMPGIFFSYEL 320
           +K++ T +E +DG  +                G  G             GMPG+F +YE+
Sbjct: 299 VKVVSTKFELMDGQTVKTHQYSATHFERDLTTGAIGQTKEGVHIAHTNVGMPGVFINYEI 358

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           SPL+V  +E  +S  H  T     I G      +VD+++ +  +++ K  +G
Sbjct: 359 SPLLVVHSETRQSFAHFLTSTCAIIGGVLTIATIVDSVVFATGRRLKKSGVG 410


>gi|328770814|gb|EGF80855.1| hypothetical protein BATDEDRAFT_19389 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 409

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 137/395 (34%), Positives = 203/395 (51%), Gaps = 39/395 (9%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ KP +DF  +T+ G  VT+V  L I +L   +  D++Q      L VD  R  
Sbjct: 28  LKKYDAYAKPLDDFRIRTISGALVTVVSTLVILFLTFSEFTDWYQKEMLPSLEVDKGRKE 87

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ I+L++    + C  L++D +D SGE   ++ H+++K R+D  G  + E QK++ N  
Sbjct: 88  KMNINLNVTFYHMPCYLLSVDVMDVSGEHQNNLPHSMHKVRIDQLGN-LLEKQKKLGN-- 144

Query: 127 KKKKVTTENGTTTTELED----PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                 T +     E+ D    P  CGSCYG      KCCNTC +V+EAY    W+  + 
Sbjct: 145 ------TNSSGVKKEIRDMALDPKYCGSCYGGVAPESKCCNTCEQVQEAYERSGWSFTDP 198

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D+I QC  E  +++++    E C IYG++EVN+V G+ H APG S+  N +HVHD+  Y 
Sbjct: 199 DSIEQCVREGWSKRMETQINEACNIYGHIEVNKVQGNIHFAPGHSFQQNALHVHDLHDYN 258

Query: 243 S--AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
           +   +FN  H I  LSFG    +      PLD            + YYIK++ T    L+
Sbjct: 259 APNGSFNFKHTIHELSFG----ESSSFVNPLDTVTKTPPTKYFSYQYYIKVVGTDISYLN 314

Query: 301 GSKL------------------GGGDGGMPG-IFFSYELSPLMVKITEKSKSLGHLWTKI 341
           GS+L                  G    GMPG +FF++E+SP++VK  E  K   H  T +
Sbjct: 315 GSQLTTNQFSVTEHEQDVTPLFGALPIGMPGKLFFNFEISPMLVKFKEFRKPFTHFLTDL 374

Query: 342 MCNISGTYITFMLVDALLHSCVKKI-SKVEIGGKT 375
              I G +    ++DALL +  + I +KVEIG  T
Sbjct: 375 CAIIGGVFTVAGMIDALLFATQRSIQAKVEIGKNT 409


>gi|331241265|ref|XP_003333281.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309312271|gb|EFP88862.1| hypothetical protein PGTG_14201 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 421

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 137/419 (32%), Positives = 200/419 (47%), Gaps = 51/419 (12%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F    KGLD F+K  ED   KT +GG +T+     I  LI V+  DY Q+     + VD 
Sbjct: 7   FGGYFKGLDGFSKTMEDVKVKTGFGGMLTMASAALIFTLILVEFRDYRQIHVQPSILVDK 66

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG KL +H++I  P + C  L++D +D SGE    V H++ K RL LDG P+       
Sbjct: 67  SRGEKLLVHMNITFPRVPCYLLSVDVMDISGEHQNDVAHDLAKTRLGLDGVPLS------ 120

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
            N  +K +   E   T       + CGSCYG E     CCN+C EV+E+Y  + W+    
Sbjct: 121 TNTTQKLQGELE---TIIASRAKDYCGSCYGGEPGPSGCCNSCEEVRESYVRRGWSFNNP 177

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D I QC  E+ +E++K    EGC I G L+VN+V G+FH++PG S+  + VHVHD+ PY 
Sbjct: 178 DGIEQCVQEHWSERIKEQSKEGCNINGVLKVNKVIGNFHLSPGRSFQTHQVHVHDLVPYL 237

Query: 243 --SAAFNTTHHIRHLSF-----------GIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
             S   +  H I + +F            ++L+       PLDG  A  E    MF Y++
Sbjct: 238 QDSNLHDFGHVIHNFAFMDANQPTETAHTLRLKKTLGIVNPLDGVKAHTEASNYMFQYFL 297

Query: 290 KIIPTIYERLDGS-------------------------KLGG----GDGGMPGIFFSYEL 320
           K++ T ++ LDG                          +LG     G  G+PG+FF+YE+
Sbjct: 298 KVVGTQFQLLDGQVAKTHQYSVTQYERDLDNSDKSDADELGHLTSHGHSGVPGVFFNYEI 357

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTKR 379
           SP+ V   E  +S  H  T     + G      L+D+ ++    ++      G     R
Sbjct: 358 SPMQVVHQEYRQSFAHFATSTCAIVGGVLTVAGLLDSFVYGAQNRMKGGSSNGAASHSR 416


>gi|242035905|ref|XP_002465347.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
 gi|241919201|gb|EER92345.1| hypothetical protein SORBIDRAFT_01g036890 [Sorghum bicolor]
          Length = 387

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 138/392 (35%), Positives = 214/392 (54%), Gaps = 28/392 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L+ LDA+ K  EDF+ +T+ GG +TI+  L I  L   ++  Y   +T  +L V
Sbjct: 1   MELWSKLRNLDAYPKVNEDFYSRTLSGGLITILSSLAILLLFFSEIRLYLYSATESKLTV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG +L I+ D+  P + C  +A+D +D SGEQH  + H+I K+R+D  G  I E +K
Sbjct: 61  DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDITKKRIDHLGNVI-ESRK 119

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + V A K ++   ++G      E    CGSCYGAE    +CCN+C EV++ YR K WA+ 
Sbjct: 120 DRVGAPKIERPLQKHGGRLDHNE--VYCGSCYGAEETDDQCCNSCEEVRDVYRKKGWAIN 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            ++ I QCK E   ++LK+   EGC I+G++ VN+V+G+FH APG S   +   + D+  
Sbjct: 178 NVELIDQCKREGYVQRLKDETGEGCTIHGFVNVNKVAGNFHFAPGKSLDQSFNFLQDLLN 237

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
                +N +H I  LSFG +         PLDG   +     G + M+ Y++K++PTIY 
Sbjct: 238 IQPETYNISHKINKLSFGEEFPG---VVNPLDGVEWIQDNSNGLTGMYQYFVKVVPTIYT 294

Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            + G K+      +              PG++F YE SP+ V  TE++ SL H  T I  
Sbjct: 295 DIRGRKIYSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 354

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    ++D+ +   H  +KK  K+E+G
Sbjct: 355 IVGGIFTVAGIIDSFVYHGHRAIKK--KMELG 384


>gi|79318328|ref|NP_001031077.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|332192090|gb|AEE30211.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 338

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 184/335 (54%), Gaps = 21/335 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T  +L VD+SRG
Sbjct: 7   RLRNLDAYPKINEDFYRRTLSGGVITLASSIVMLILFFSELQLYIHPVTETQLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I+ D+  P + C  ++LD++D SGE+HL V H+I KRRLD  G  I+  Q  + + 
Sbjct: 67  EKLRINFDVTFPALQCSIISLDSMDISGERHLDVRHDIIKRRLDSSGNVIEAKQDGIGHT 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +K +    G       +   CGSC+GAE     CCN+C EV+EAYR K WAL + ++I
Sbjct: 127 KIEKPLQKHGGRLE---HNETYCGSCFGAEASDDACCNSCEEVREAYRKKGWALSDPESI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   +K+K+   EGC ++G+LEVN+V+G+FH  PG S+  +    HD+  +    
Sbjct: 184 DQCKREGFVQKVKDEEGEGCNVHGFLEVNKVAGNFHFIPGQSFHQSGFQFHDMLLFQQGN 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL- 304
           +N +H +  L+FG           PLDG      + + ++ Y+IK++P+IY  +  + + 
Sbjct: 244 YNISHKVNRLAFGDFFPG---VVNPLDGVQWNQGKQSGVYQYFIKVVPSIYTDVHQNTIQ 300

Query: 305 --------------GGGDGGMPGIFFSYELSPLMV 325
                          G     PG+FF Y+LSP+ V
Sbjct: 301 SNQFSVTEHFQNMEAGRMQSPPGVFFYYDLSPIKV 335


>gi|358334909|dbj|GAA53334.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Clonorchis sinensis]
          Length = 323

 Score =  227 bits (579), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 121/305 (39%), Positives = 169/305 (55%), Gaps = 36/305 (11%)

Query: 84  LALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELE 143
           L LD +DS+GEQ + V   IYK R+D  G PI   +++  N  K + VT          +
Sbjct: 23  LNLDTMDSTGEQKIDVSQQIYKTRIDSTGSPISATRRDDGNPSKGQVVT----------K 72

Query: 144 DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTE 203
           DP+ CGSCYGAE+ETRKCCNTC E++ AY+ + W +  L    QC+ E   + L N  +E
Sbjct: 73  DPDYCGSCYGAESETRKCCNTCKEIQLAYQERHWVVKNLSVFEQCREEQWDDTLANLGSE 132

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+I G L+VN+V+GSFHI PG SY+ + VHVH++Q +     N +H I  L+FG     
Sbjct: 133 GCRIQGSLQVNKVAGSFHITPGNSYASDQVHVHNLQGFDGQKLNMSHKIDKLAFGNMYPG 192

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---------------------GS 302
              +  PLDGT     E A M  YY+K++PT+Y   +                     GS
Sbjct: 193 ---QTNPLDGTTMNVVEPAQMVTYYMKLVPTMYVSYNTTTRSLSTVHTNQYSVTWHSKGS 249

Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-- 360
            L     G+PG+FF+YELSPL+VKI+ + KS  H  T     I G +    L+DA ++  
Sbjct: 250 PLTSDSSGIPGLFFNYELSPLLVKISYEHKSFLHFLTNTCAIIGGVFTVASLLDAFIYQS 309

Query: 361 SCVKK 365
           +CV +
Sbjct: 310 TCVVR 314


>gi|395324643|gb|EJF57079.1| endoplasmic reticulum-derived transport vesicle ERV46 [Dichomitus
           squalens LYAD-421 SS1]
          Length = 423

 Score =  226 bits (577), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 135/412 (32%), Positives = 199/412 (48%), Gaps = 56/412 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +TI+    I     ++  DY +V     + VD 
Sbjct: 5   FLNALKGVDAFGKTMEDVKVKTRTGALLTIIAAAIILSFTTIEFFDYRRVFVDTSIVVDR 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
           SRG KL ++++I  P + C  L+LD +D SGE    + HNI K RLD  GKP+      E
Sbjct: 65  SRGEKLTVNMNITFPRVPCYLLSLDVMDISGETQSDITHNILKTRLDEKGKPVSHSLIAE 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           + N + K     ++G           CGSCYG       CCNTC EV++AY  + W+   
Sbjct: 125 LQNDLDKLNEQRQSGY----------CGSCYGGIEPEGGCCNTCEEVRQAYVNRGWSFNR 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D+I QC  E  ++KLK    EGC I G + VN+V G+ H++PG S+  +  +++++ PY
Sbjct: 175 PDSIEQCVKEGWSDKLKEQAHEGCNIAGRVRVNKVVGNIHLSPGRSFRTSAHNLYELVPY 234

Query: 242 TSAAFNT---THHIRHLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASM 284
                N    TH I H +F           KL  + + R      PLDGT  +  +   M
Sbjct: 235 LRTDGNRHDFTHQIHHFAFEGDDEYDPRNAKLGKELKNRLGIDANPLDGTQGRTIKQQYM 294

Query: 285 FNYYIKIIPTIYERLDGSKLGG----------------------------GDGGMPGIFF 316
           F Y++K++ T ++ +DG K+G                             G+GG+PG FF
Sbjct: 295 FQYFLKVVSTQFQTIDGKKVGTHQYSATHFERDLDKGPSEDSPAGLHVAHGNGGIPGAFF 354

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +YE+SPL+++  E  +S  H  T     + G      L+D+LL +  K   K
Sbjct: 355 NYEISPLLIRHVETRQSFAHFLTSTCAIVGGVLTVASLIDSLLFATRKAFKK 406


>gi|312075860|ref|XP_003140604.1| hypothetical protein LOAG_05019 [Loa loa]
          Length = 365

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 126/388 (32%), Positives = 198/388 (51%), Gaps = 51/388 (13%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   ERLK  DA+TKP +DF  +T  GGAVT+V    I ++   +   +  V   E+L+V
Sbjct: 1   MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60

Query: 61  DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE-P 118
           DS+    ++ ++ DI  P + C  + +D +D SG+    +  ++YK +++++       P
Sbjct: 61  DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIRDDVYKIKVNINTSTASSVP 120

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
             +V+                        CGSCYGA+     CCNTC EVKEAY  K W 
Sbjct: 121 ASQVL------------------------CGSCYGAK---EGCCNTCEEVKEAYMRKGWE 153

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           L  ++T+ QCK++   +K+     EGC++YG ++V +V+G+FHIAPG     +  H HD+
Sbjct: 154 LINIETVEQCKSDLWVKKMSEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLRAHRSHFHDL 213

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV---AKAEEGASMFNYYIKIIPTI 295
              + + F+T+H + H SFG        +  PLDG     A+  +G  M+ Y++K++PT 
Sbjct: 214 HSLSPSKFDTSHTVNHFSFGNSFPG---KVYPLDGKFFGSARNSDGI-MYQYHLKLVPTS 269

Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           Y  LD ++               +  G  G+PG F  YE SPLMVK  E+ +SL      
Sbjct: 270 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFVQYEFSPLMVKYEERQQSLSTFLVS 329

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
           I   I G +    L+DA ++   + IS+
Sbjct: 330 ICAIIGGIFTVASLIDAFIYRSGRIISQ 357


>gi|402083890|gb|EJT78908.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 444

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 142/448 (31%), Positives = 205/448 (45%), Gaps = 86/448 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG +TIV  + + YL   +  DY +++   EL V
Sbjct: 1   MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLALGEWSDYRRIAIHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL        +PQ 
Sbjct: 61  DKSRGDRMEIHLNITFPRMPCELLTLDVMDVSGEQQHGVQHGVVKVRL--------QPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K ++      +    DP  CG CYGA   +      CC+TC+EV+EAY    
Sbjct: 113 EGGGVIDVKALSLHADEDSATHLDPKYCGPCYGAPAPSNAAKAGCCSTCDEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC  E+  E+L     EGCQI G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGRGENVEQCLREHYAERLDEQRQEGCQIAGSLRVNKVIGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR----------------KPLDGTVA 276
           D++ Y     +  H   H    LSFG +L  + ++R                 PLDGT  
Sbjct: 233 DLKNYWDTPVDGGHSFSHVVHSLSFGPQLPLEVQKRLDRGRSLPWADHSHQLNPLDGTSQ 292

Query: 277 KAEEGASMFNYYIKIIPTIYERL---------------------------DGS------- 302
           +  +    F Y++KI+PT Y  L                           DG+       
Sbjct: 293 ETADPNFSFMYFLKIVPTSYLPLGWEGRRAKIATGNHDKDSWVGTYGYSPDGAVETHQYS 352

Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
                  L GGD             GG+PG+FFSY++SP+ ++   E+ K+     T + 
Sbjct: 353 VTSHKRSLAGGDDAAEGHQERLHSKGGIPGVFFSYDISPMKVINREERPKTFAGFLTGLC 412

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
             + GT      VD   +    ++ K+ 
Sbjct: 413 AILGGTLTVAAAVDRTFYEGATRLKKMR 440


>gi|164655211|ref|XP_001728736.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
 gi|159102620|gb|EDP41522.1| hypothetical protein MGL_4071 [Malassezia globosa CBS 7966]
          Length = 427

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 133/417 (31%), Positives = 201/417 (48%), Gaps = 60/417 (14%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F  +L+GLDAF +  +D   +T  G  +T+   L I  LI  +  DY +V T+  L VD
Sbjct: 5   AFFGQLRGLDAFGRMSDDVRIRTNVGALLTLTSALMILVLIVSEFLDYRRVQTSPRLEVD 64

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
            SRG +L +  ++  P I C  L+LD VD  GE  + V H++ +RRLD  GKP+ E   E
Sbjct: 65  LSRGERLAVQFNVTFPRIPCYLLSLDVVDVVGETQMDVHHDVERRRLDETGKPVSE---E 121

Query: 122 VVNAVKK--KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           V+  ++   K+V  E G        P+ CG CYGA+     CCN+C+ V+EAY    W+ 
Sbjct: 122 VIRELESEAKRVIAERG--------PDYCGDCYGADPPEGGCCNSCDAVREAYMLHNWSF 173

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
              D I QC  E+ +E ++    EGC I G + VN+V G+ H  PG ++  N +H HD+ 
Sbjct: 174 TSPDDIEQCAQEHWSEHVREQNHEGCNIAGEVRVNKVVGNLHFIPGRTFHRNDIHTHDLV 233

Query: 240 PYTSAAFNTTHHIRH----LSFGIKLQDDDER----------------RKPLDGTVAKAE 279
           PY     +  HH  H     SFG++ +   ER                +  L+G  AK  
Sbjct: 234 PYLHGTGDDVHHFGHKIHRFSFGMEDEFAIERTSRGRRQGPLKNRMGIKNALEGRSAKTL 293

Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG------------------GGD---------GGMP 312
               MF Y++K++P    +L+G ++                   GG           G+P
Sbjct: 294 SSNYMFQYFLKVVPVEVHKLNGHEMSTYQYSATSYERNLEDFDRGGQMSGHIVRMIEGIP 353

Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           G++F+YE+SPL V  TE   S+ HL + +   I G      L+D  ++   +  + V
Sbjct: 354 GVYFNYEISPLRVIQTEWHHSIWHLVSNLFALIGGIVTVAGLIDGAIYRSRRTFNIV 410


>gi|345569114|gb|EGX51983.1| hypothetical protein AOL_s00043g717 [Arthrobotrys oligospora ATCC
           24927]
          Length = 397

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 152/407 (37%), Positives = 201/407 (49%), Gaps = 51/407 (12%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M  + RL  LDAFTK  ED   +T  GG VTI   L I  L+  +  DY +VS   EL V
Sbjct: 1   MGRASRLMRLDAFTKTVEDARIRTSSGGIVTIFSVLVIFCLVIGEWNDYRKVSVISELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +RG ++ IHL+I  P I C+ L LD +D SG+    V H I K RLD  G  I+    
Sbjct: 61  DKTRGEQMEIHLNITFPHIPCELLTLDVMDVSGDLQPSVSHGIGKHRLDKSGGIIESKFL 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E+     K               DP+ CG CYGA     ++   CC TC++V+EAY  K 
Sbjct: 121 ELHPEHPKHL-------------DPSYCGECYGAVAPDTSKKAGCCQTCDDVREAYAAKG 167

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   + QC+ E   E LK    EGC+I G+L VN+V G+FHIAPG S+S   +HVH
Sbjct: 168 WAFGDGTGVHQCEEEGYKEMLKEQAGEGCRIDGHLWVNKVVGNFHIAPGKSFSNAQMHVH 227

Query: 237 DIQPYTSAAF--NTTHHIRHLSFGIKLQDD-----DERRKPLDGTVAKAEEGASMFNYYI 289
           D+  Y       + TH I  LSFG  L  D       ++ PLD T  K  +    + Y++
Sbjct: 228 DLANYLQGDVHHDFTHTINALSFGPPLPTDLLHENHHQQNPLDATSKKTSDRNYNYLYFL 287

Query: 290 KIIPTIYERLDG----------------SKLGGGD----------GGMPGIFFSYELSPL 323
           KI+ T YE LD                 S  GG D          GG+PGIFFSY++SP+
Sbjct: 288 KIVSTSYEHLDHGYTIHTHQYSVTSHERSLEGGKDDVHPGTVHARGGIPGIFFSYDISPM 347

Query: 324 MVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
            V   E ++KS     T I   I GT      +D  L+   ++I K+
Sbjct: 348 KVVNREIRTKSFSGFLTSICAIIGGTLTVAAALDRGLYEGARRIGKL 394


>gi|347842451|emb|CCD57023.1| similar to endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Botryotinia fuckeliana]
          Length = 439

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 143/443 (32%), Positives = 207/443 (46%), Gaps = 81/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTI   L + YL   +  DY +++   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IHL+I  P I C+ L LD +D SGEQ + V H + K RL         PQ+
Sbjct: 61  DKGRGEKMEIHLNITFPKIPCELLTLDVMDVSGEQQVGVMHGVKKVRLG--------PQE 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  K +   N   +    DPN CG+CYGA      +   CCNTC+EV+EAY    
Sbjct: 113 EGGKVIDIKALDLHNAEDSATHLDPNYCGACYGATPPPNAQKPGCCNTCDEVREAYASVS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  E+L +   EGC+I G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDD--------------DERRKPLDGTVAKA 278
           D+  +           +HHI  L FG +L ++              +    PLD T    
Sbjct: 233 DLNNFFDTPVPGGHVFSHHIHSLRFGPELPEEVFKKLGSDSIIPWTNHHLNPLDNTEQIT 292

Query: 279 EEGASMFNYYIKIIPTIYERL------------------------DGS------------ 302
            E A  F Y++K++ T Y  L                        DGS            
Sbjct: 293 HEAAYNFMYFVKVVSTSYLPLGWETNYNSRPHDASVDIGTYGHSEDGSIETHQYSVTSHR 352

Query: 303 -KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISG 347
             L GGD             GG+PG+FFSY++SP+ ++   E++K+L    T +   + G
Sbjct: 353 RSLNGGDDSAEGHKEKLHARGGIPGVFFSYDISPMKVINKEERTKTLAGFLTGLCAIVGG 412

Query: 348 TYITFMLVDALLHSCVKKISKVE 370
           T      VD  ++    ++ K++
Sbjct: 413 TLTVAAAVDRGVYEGATRLRKMQ 435


>gi|328858670|gb|EGG07782.1| hypothetical protein MELLADRAFT_105603 [Melampsora larici-populina
           98AG31]
          Length = 422

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 136/402 (33%), Positives = 197/402 (49%), Gaps = 51/402 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            KGLD F K  ED   +T +GG +T+   + I  L+ V+  DY  +     + VD SRG 
Sbjct: 11  FKGLDGFGKTMEDVKIRTGFGGFLTLASAILIVTLVLVEFVDYRTLHLNPSIVVDKSRGE 70

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL + ++I  P + C  L++D +D SGE    V H++ K RL+ DG         +V+A 
Sbjct: 71  KLIVDMNITFPRVPCYLLSVDLMDISGEHQNDVNHDMTKTRLNPDGT--------LVSAS 122

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
             K +  E   T      P  CGSCYG       CCNTC EV+E+Y  + W+    D I 
Sbjct: 123 VSKGLKGEL-DTIAATRAPGYCGSCYGGTPPESGCCNTCEEVRESYVRRGWSFSNPDGIE 181

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY--TSA 244
           QC  E+ ++K+K    EGC + G ++VN+V G+FH++PG S+  N +HVHD+ PY  T  
Sbjct: 182 QCVQEHWSDKIKEQEKEGCNMNGQVKVNKVIGNFHMSPGRSFQTNAMHVHDLVPYLQTGN 241

Query: 245 AFNTTHHIRHLSFGIKLQ--DDDERRK---------PLDGTVAKAEEGASMFNYYIKIIP 293
           + +  H I   +F  + Q  DDDE R+         PLDG  A  EE   MF Y++K++ 
Sbjct: 242 SHDFGHIIHKFAFLAEHQSPDDDETRRIKTSLGIVNPLDGIKAHTEESNYMFQYFLKVVG 301

Query: 294 TIYERLDG-------------------SKLGGGD----------GGMPGIFFSYELSPLM 324
           T +  LD                    S  GG D           G+PG+FF+YE+SP+ 
Sbjct: 302 TEFHLLDQRVVKTHQYSVTQYERDLTKSSRGGTDELGHQTSHGYAGVPGLFFNYEISPMQ 361

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           V   E  +S  H  T     I G      L+D+ ++    +I
Sbjct: 362 VIHKEYRQSFAHFATSTCAIIGGVLTVAGLIDSAVYGARNRI 403


>gi|171696240|ref|XP_001913044.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948362|emb|CAP60526.1| unnamed protein product [Podospora anserina S mat+]
          Length = 437

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 146/440 (33%), Positives = 201/440 (45%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  + + +L   +  DY ++    EL V
Sbjct: 1   MAAKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLAWGEWQDYRRIEIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGEQ   V+H + K RL         P  
Sbjct: 61  DKGRGERMEIHLNVSFPRVPCELLTLDVMDVSGEQQHGVQHGVVKTRL--------RPLS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    ++ K +            DPN CG CYGA      +   CC TC+EVKEAY  + 
Sbjct: 113 EGGGVIEAKALALHARDEEAAHLDPNYCGPCYGAAPPVHAQKPNCCQTCDEVKEAYAAQA 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + I QC+ E+  EKL     EGC+I G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGRGEGIEQCEREHYAEKLDEQRNEGCRIEGNVRVNKVIGNFHIAPGKSFSNGNMHVH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR--------------KPLDGTVAKAEE 280
           D++ Y  T      TH I HL FG +L D   ++               PLD T  + ++
Sbjct: 233 DLKNYWDTPVKHTFTHEIHHLRFGPQLPDGLAKKLGKNKALPWTNHHVNPLDNTHQETDD 292

Query: 281 GASMFNYYIKIIPTIY------------------------ERLDGS-------------K 303
               F Y+IKI+PT Y                        +  DGS              
Sbjct: 293 VNYNFMYFIKIVPTSYLPLGWEKTWQGFKDQHHKELGSFGQSADGSLETHQYSVTSHRRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
           L GGD             GG+PG+FFSY++SP+ ++   E+ KS       +   + GT 
Sbjct: 353 LSGGDDGSEGHKERLHAKGGIPGVFFSYDISPMKVINREERPKSFLGFLAGLCAIVGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L     K+ K+
Sbjct: 413 TVAAAVDRALFEGGMKLKKL 432


>gi|429853391|gb|ELA28466.1| copii-coated vesicle membrane protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 437

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 144/427 (33%), Positives = 200/427 (46%), Gaps = 79/427 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  + + +L   +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVLWLAWGEWVDYRRIEIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H + K RL         PQK
Sbjct: 61  DQGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------RPQK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  K ++  +     E  DPN CG CYGA      +   CCNTC EV+EAY    
Sbjct: 113 EGGGVIDVKALSLHSSDEAAEHLDPNYCGPCYGAPAPPNAQKAGCCNTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC  E+  EKL+    EGC+I G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAEKLEEQRREGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y      A  + TH I  L FG +L D   ++              PLD T  +  
Sbjct: 233 DLKNYWETPDDAQHDFTHVIHTLRFGPQLPDTITKKMTKRAYAWTNHHGNPLDSTHQETN 292

Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
           +    F Y++KI+PT Y  L                       DGS              
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSASIQDEESSGLGLLGHLSDGSVETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
           L GGD             GG+PG+FFSY++SP+ ++   E++K+     T +   I GT 
Sbjct: 353 LAGGDDSAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412

Query: 350 ITFMLVD 356
                VD
Sbjct: 413 TVAAAVD 419


>gi|71021625|ref|XP_761043.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
 gi|46100607|gb|EAK85840.1| hypothetical protein UM04896.1 [Ustilago maydis 521]
          Length = 435

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 128/413 (30%), Positives = 206/413 (49%), Gaps = 62/413 (15%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+G+DAF+K  +D   +T  G  +T++  L I+ L   +  DY  V     L VD SRG
Sbjct: 9   QLRGIDAFSKTMDDVRIRTNAGALITLISALLIAVLTIGEFIDYRTVHVKPALEVDRSRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL ++++I  P + C  L+LD +D SGE    ++H++ + R++ DGK I++ +K +   
Sbjct: 69  EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDVERTRINHDGKIIEQGKKSLKGD 128

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +   T          +  + CG CYG +    KCCNTC+EV+EAY  K W+  + D +
Sbjct: 129 AARIANT----------KGKDYCGDCYGGQPPASKCCNTCDEVREAYVRKGWSFADPDHV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E  +EK+K    EGC+I G L VN+V GSFH++PG ++  N +H+HD+ PY S  
Sbjct: 179 DQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGT 238

Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
            +  H   H+    SFG +     L    ER        + PL+G  A+ ++   MF Y+
Sbjct: 239 GSEHHDFGHIIHEFSFGSEQEYHGLTSAKERAVKAKLGVKDPLEGVRAQTQQSQFMFQYF 298

Query: 289 IKIIPTIYERLDGSKL-----------------------------------GGGDGGMPG 313
           +K++ T +  L G  L                                     G  G+PG
Sbjct: 299 VKVVSTEFRPLSGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGAHISHGFAGVPG 358

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           +FF+YE+SPL    +E  +SL H  T     + G      ++D+L+++  +++
Sbjct: 359 VFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLVYNSRRRL 411


>gi|310800359|gb|EFQ35252.1| hypothetical protein GLRG_10396 [Glomerella graminicola M1.001]
          Length = 437

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 142/427 (33%), Positives = 201/427 (47%), Gaps = 79/427 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  + + +L   +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVFWLAWGEWADYRRIEIHSELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGEQ   V H + K RL         P+K
Sbjct: 61  DKGRGERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------RPRK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
           E    +  K +   +   + E  DPN CG CYGA+     +   CCNTC+EV+EAY    
Sbjct: 113 EGGGVIDIKALDLHSRDDSAEHLDPNYCGPCYGAQAPPNAQKPGCCNTCDEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC  E+  E+L+    EGC+I G L VNRV G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGKGEGVEQCTREHYAERLEEQRQEGCRIEGNLRVNRVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y      A  + TH I  L FG +L D   ++              PLD T     
Sbjct: 233 DLKNYWDTPADAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQDTN 292

Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
           +    F Y++KI+PT Y  L                       DGS              
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSTAYQDDDSSSLGLLGQGNDGSVETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
           L GGD             GG+PG+FFSY++SP+ ++   E++K+     T +   I GT 
Sbjct: 353 LAGGDDAAEGHQERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412

Query: 350 ITFMLVD 356
                VD
Sbjct: 413 TVAAAVD 419


>gi|389632999|ref|XP_003714152.1| hypothetical protein MGG_01245 [Magnaporthe oryzae 70-15]
 gi|351646485|gb|EHA54345.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae 70-15]
          Length = 439

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 145/443 (32%), Positives = 201/443 (45%), Gaps = 81/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG +TIV  + + YL   +  DY ++    EL V
Sbjct: 1   MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K +            DPN CG CYGA          CCNTC+EV+EAY    
Sbjct: 113 EGGGVIDAKTLALHAEDEAATHLDPNYCGGCYGAPAPANAKKAGCCNTCDEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC  E+  E+L     EGCQI G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGI--------KLQDDDERR-------KPLDGTVAK 277
           D++ Y         + +H I  L FG         KL + D+          PLDG +  
Sbjct: 233 DLKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQT 292

Query: 278 AEEGASMFNYYIKIIPTIYERL-----------------------DGS------------ 302
             +    + Y++KI+PT Y  L                       DGS            
Sbjct: 293 TVDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHK 352

Query: 303 -KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
             L GGD             GG+PG+FFSY++SP+ V   E ++K+     T +   + G
Sbjct: 353 RSLAGGDDGEDGHKERMHSRGGIPGVFFSYDISPMKVINREVRTKTFAGFLTGLCAILGG 412

Query: 348 TYITFMLVDALLHSCVKKISKVE 370
           T      +D +    V +I K++
Sbjct: 413 TLTVAAAIDRMTFEGVTRIKKMQ 435


>gi|367019108|ref|XP_003658839.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
           42464]
 gi|347006106|gb|AEO53594.1| hypothetical protein MYCTH_2295135 [Myceliophthora thermophila ATCC
           42464]
          Length = 436

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 144/440 (32%), Positives = 204/440 (46%), Gaps = 78/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  + + +L   +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVFFLALGEWSDYRRIVVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P I C+ L LD +D SGEQ   V+H I K RL         P  
Sbjct: 61  DKGRGERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGITKTRL--------RPLS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K++   +        DPN CG CYGA          CCNTC+EV++AY    
Sbjct: 113 EGGGDIDSKEIVLHSRDEAAVHLDPNYCGECYGAPPPNNAKKPGCCNTCDEVRDAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + IVQC+ E+ +EKL     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGRGEGIVQCEREHYSEKLDAQRNEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
           D++ Y  +      TH I HL FG +L +   ++              PLD T  + ++ 
Sbjct: 233 DLKNYWDSPTKHTFTHTIHHLRFGPQLPESLTQKLGTKNLPWTNHHVNPLDDTHQQTDDV 292

Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
              + Y++KI+PT Y  L                        DGS              L
Sbjct: 293 NYNYMYFLKIVPTSYLPLGWEKTWAGFRERHSAELGSFGTSPDGSVETHQYSVTSHKRSL 352

Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
            GG+             GG+PG+FFSY++SP+ ++   E++KS       +   + GT  
Sbjct: 353 AGGNDAAEGHQERQHARGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLT 412

Query: 351 TFMLVDALLHSCVKKISKVE 370
               +D  L     ++ K+ 
Sbjct: 413 VAAAIDRALFEGTVRLKKLR 432


>gi|412994036|emb|CCO14547.1| predicted protein [Bathycoccus prasinos]
          Length = 436

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 134/420 (31%), Positives = 208/420 (49%), Gaps = 57/420 (13%)

Query: 7   LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           L+  DAF K  + DF+ ++  GG +T+V ++    L+  +   Y +     +L+VD+ RG
Sbjct: 17  LRKFDAFPKFVDVDFYSRSFGGGIITVVTYIVAVSLLLAETKLYLKTHVKHDLYVDNGRG 76

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEP------ 118
             + I++D+  P +SC  L LD +D SGE HL V +H + K R D  G  + +       
Sbjct: 77  ETMRINVDVFFPNLSCGSLGLDVMDVSGETHLDVVDHEMRKIRYDRYGVKLADALNDEHG 136

Query: 119 QKEVVNAVKKKKVTTENGTTTTE--------------LEDPNK--CGSCYGAET------ 156
           ++EVVN        TE  ++  +              +ED     CGSCYGA+       
Sbjct: 137 KEEVVNEKAFDSNETETASSLRKNKTKKTAKELIPRYMEDGKTKYCGSCYGADVSGANRG 196

Query: 157 ETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRV 216
             ++CC TC EV+EAY    WA     ++ QCK E  +E L N   EGC+  G+L+VN+V
Sbjct: 197 REQRCCQTCEEVREAYIEVGWAFTGASSMEQCKREGFSEVLGNVHEEGCEFKGFLDVNKV 256

Query: 217 SGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-- 274
            G+FHIAPG S+     HVHD+ P+    FN +H +RHLSFG   +    +  PLDGT  
Sbjct: 257 QGNFHIAPGKSFQQGEQHVHDLSPFPDGKFNFSHEVRHLSFG---EGYPGKVDPLDGTKR 313

Query: 275 VAKAEEGASMFNYYIKIIPTIYERL---------------------DGSKLGGGDGGMPG 313
             K      ++ Y+ +I+PT Y  L                     D + + GG   +PG
Sbjct: 314 TLKLPAETGVYQYFFRIVPTTYTYLNPFKKDISTNQYSVVDHFKPVDAASIQGGSSDLPG 373

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
           +FF Y+LSP+ V I E   S+     ++  ++ G +    +VD +++     I  K+++G
Sbjct: 374 VFFFYDLSPIKVDIAEYRTSVWKFLAEVCASVGGVFAVSGIVDKVVYKGSLAIKKKIQLG 433


>gi|340923948|gb|EGS18851.1| hypothetical protein CTHT_0054620 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 436

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 144/439 (32%), Positives = 205/439 (46%), Gaps = 78/439 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  + + +L   +  +Y ++    EL V
Sbjct: 1   MAGKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLIVVLFLSWSEWREYRRIVVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         P +
Sbjct: 61  DKGRGERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRL--------RPWE 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    + KK++   +   +    DPN CGSCYGA          CC TC+EV+EAY    
Sbjct: 113 EGGGDIDKKELALHSIEESATHLDPNYCGSCYGANPPPNAVKPGCCQTCDEVREAYAQAA 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + I QC+ E+  E+L     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGRGENIEQCQREHYAERLDQQRREGCRIEGGLRVNKVVGNFHIAPGKSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNT--THHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
           D++ Y  +      TH I HL FG +L +   ++              PLD T  + +E 
Sbjct: 233 DLKNYWESPVRHTFTHIIHHLRFGPQLPESLHQKLGNKALPWSNHHVNPLDNTHQETDEV 292

Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
              + Y+IKI+PT Y  L                        DGS              L
Sbjct: 293 NFSYMYFIKIVPTSYLPLGWEKTWDQFREQHHAELGSFGTSADGSVETHQYSVTSHRRSL 352

Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
            GGD             GG+PG+FFSY++SP+ ++   E++KS       +   + GT  
Sbjct: 353 SGGDDAAEGHSERLHSKGGIPGVFFSYDISPMKVINREERAKSFLGFLAGLCAIVGGTLT 412

Query: 351 TFMLVDALLHSCVKKISKV 369
               +D  L     ++ K+
Sbjct: 413 VAAAIDRALFEGTVRLKKL 431


>gi|380489161|emb|CCF36889.1| hypothetical protein CH063_08353 [Colletotrichum higginsianum]
          Length = 437

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 141/427 (33%), Positives = 201/427 (47%), Gaps = 79/427 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  + + +L   +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIVSLIVVFWLAWGEWVDYRKIEIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGEQ   V H + K RL          QK
Sbjct: 61  DKGRGERMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGVIHGVNKVRL--------RSQK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
           E    +  K +   +   T E  DPN CG+CYGA+     +   CCNTC EV+EAY    
Sbjct: 113 EGGGVIDMKALDLHSREATAEHLDPNYCGACYGAQAPANAQKAGCCNTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC  E+  E+L+    EGC++ G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLEEQRQEGCRLEGNLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y      A  + TH I  L FG +L D   ++              PLD T  +  
Sbjct: 233 DLKNYWDTPDDAQHDFTHTIHSLRFGPQLPDQVTKKMGKRAYAWTNHHGNPLDNTHQETT 292

Query: 280 EGASMFNYYIKIIPTIYERL-----------------------DGS-------------K 303
           +    F Y++KI+PT Y  L                       DGS              
Sbjct: 293 DPNYNFMYFVKIVPTSYLALNWQKSSSYQDEENSGLGLLGQGNDGSVETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTY 349
           L GGD             GG+PG+FFSY++SP+ ++   E++K+     T +   I GT 
Sbjct: 353 LAGGDDAAEGHKERLHSRGGIPGVFFSYDISPMKVINREERAKTFTGFLTGLCAIIGGTL 412

Query: 350 ITFMLVD 356
                VD
Sbjct: 413 TVAAAVD 419


>gi|453082617|gb|EMF10664.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 432

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 151/440 (34%), Positives = 200/440 (45%), Gaps = 84/440 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   + I YL+  +  DY +     EL V
Sbjct: 1   MPAKSRFTKLDAFTKTVEDARIRTSTGGIVTITSLILILYLVWGEWTDYRRTVVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IH++I  P + C+ L LD +D SGE    V H + K RLD +GK I     
Sbjct: 61  DKGRGEKMEIHMNISFPRVPCELLTLDVMDVSGEVQSGVMHGVNKVRLDANGKEI----- 115

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA---ETETRK-CCNTCNEVKEAYRYKK 176
                   K+  T N        DP+ CG CYGA   ET T+  CCN C EV+EAY    
Sbjct: 116 -------GKEALTVNSEEQVPHLDPDYCGDCYGAPAPETATKAGCCNNCAEVREAYAGVS 168

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC  E+  E L     EGC+I G + VN+V G+FH APG S+S  ++HVH
Sbjct: 169 WSFGRGEGVEQCTREHYAEHLDEQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 228

Query: 237 DIQPYTSAA---FNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAEE 280
           D++ Y  +     + TH I HL FG +L DD             +    PLD T    +E
Sbjct: 229 DLENYFQSGEVQHSFTHKIHHLRFGPELPDDVVKAVGKKGMAWSNHHLNPLDDTEQVTDE 288

Query: 281 GASMFNYYIKIIPTIYERL--DGS------------------------------------ 302
            A  F Y++K++ T Y  L  DGS                                    
Sbjct: 289 VAYNFMYFVKVVSTAYLPLGWDGSGSLLDIPHELIALGGYGKGEQGSIETHQYSVTSHKR 348

Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
            L GGD             GG+PG+FFSY++SP+ V   E ++KS       +   I GT
Sbjct: 349 SLTGGDAKAEGHEERLHAKGGIPGVFFSYDISPMKVINREARAKSFSGFLVGVCAVIGGT 408

Query: 349 YITFMLVDALLHSCVKKISK 368
                 VD LL+    K+ K
Sbjct: 409 LTVAAAVDRLLYEGGSKLRK 428


>gi|222628979|gb|EEE61111.1| hypothetical protein OsJ_15023 [Oryza sativa Japonica Group]
          Length = 369

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 123/383 (32%), Positives = 192/383 (50%), Gaps = 39/383 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++      +           G
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRHTLTYTF----------G 56

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L +  D+  P + C  ++LDA+D SG++HL V+H+I+K+R+D+ G  I   Q  V   
Sbjct: 57  MILKMQFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQDAV--- 113

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                    NG  +      N             +CCN+C +V+EAYR K W +   D I
Sbjct: 114 -------GGNGPYSGMAAGLNTMRPIVALVMSDEQCCNSCEDVREAYRKKGWGVSNPDLI 166

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG+LEVN+V+G+FH APG S+   +VHVHD+ P+   +
Sbjct: 167 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 226

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD----- 300
           FN +H I  LSFG +         PLDG          M+ Y+IK++PT+Y  ++     
Sbjct: 227 FNVSHKINKLSFGQRFPG---VVNPLDGAQWMQHSSYGMYQYFIKVVPTVYTDINEHIIL 283

Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                      S   G    +PG+FF Y+LSP+ V  TE+  S  H  T +   + G + 
Sbjct: 284 SNQFSVTEHFRSSESGRIQAVPGVFFFYDLSPIKVTFTEQHVSFLHFLTNVCAIVGGVFT 343

Query: 351 TFMLVDALLHSCVKKI-SKVEIG 372
              ++D+ ++   + I  K+EIG
Sbjct: 344 VSGIIDSFVYHGQRAIKKKMEIG 366


>gi|336465550|gb|EGO53790.1| hypothetical protein NEUTE1DRAFT_151014 [Neurospora tetrasperma
           FGSC 2508]
 gi|350295150|gb|EGZ76127.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
           2509]
          Length = 444

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 203/448 (45%), Gaps = 86/448 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L + +L   +  DY +V    EL V
Sbjct: 1   MAGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  K ++      +    DP+ CG CYGA      +   CC+TC EV+EAY    
Sbjct: 113 EGGGEIDAKILSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +  T+ QC+ E+ TE+L     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGDGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR---------------KPLDGTVAK 277
           D+  + S      H   H    L FG +L DD  R+                PLD T  +
Sbjct: 233 DLAQWWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQE 292

Query: 278 AEEGASMFNYYIKIIPTIYERL----------------------------DGS------- 302
            ++    F Y++KI+PT Y  L                            DGS       
Sbjct: 293 TDDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKATWEQDHSVGLGAYGYGSDGSMETHQYS 352

Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
                  L GGD             GG+PG+FFSY++SP+ +V   E++KS       + 
Sbjct: 353 VTSHKRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLC 412

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
             + GT      VD  L     ++ K+ 
Sbjct: 413 AVVGGTLTVAAAVDRGLFEGTVRLKKLR 440


>gi|170089933|ref|XP_001876189.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
 gi|164649449|gb|EDR13691.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
          Length = 421

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 133/416 (31%), Positives = 200/416 (48%), Gaps = 64/416 (15%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +TI+    I     V+  DY  V+    + VD 
Sbjct: 5   FFANLKGVDAFGKTTEDVKVKTRTGALLTIISAAIILAFSFVEFIDYRAVNIDTSIVVDK 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-E 121
           SRG KL ++L++  P + C  L+LD +D SGE    + HN+ K RLD  GK +      E
Sbjct: 65  SRGEKLTVNLNVTFPRVPCYLLSLDIMDISGELQRDISHNVMKVRLDTHGKEVPNSHSAE 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           + N + K            + +  N CGSC+G       CCNTC +V+ AY  + W+   
Sbjct: 125 LRNDLDK----------MNDAKRENYCGSCFGGLEPEGGCCNTCEDVRLAYVNRGWSFSN 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            + I QCKNE   +KLK    EGC I G + VN+V G+ H++PG S+  N  +++++ PY
Sbjct: 175 PEAIEQCKNEGWADKLKEQADEGCNISGRIRVNKVIGNIHLSPGRSFQTNARNLYELVPY 234

Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEE 280
                N    +H I HL+F    + DDE            R++      PLDG +A+  +
Sbjct: 235 LRDDGNRHDFSHTIHHLAF----EGDDEYDYWKAAAGSAMRQRMGLTENPLDGAIARTAK 290

Query: 281 GASMFNYYIKIIPTIYERLDGSKL-----------------------GG-----GDGGMP 312
              MF Y++K++ T +  LDG K+                       GG     G  G+P
Sbjct: 291 AQYMFQYFLKVVSTQFRTLDGRKVNTHQYSTTQFERDLTEGAAGETAGGIHVQHGVSGLP 350

Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           G FF++E+SP++V   E  +S  H  T     I G      ++D++L +  +++ K
Sbjct: 351 GAFFNFEISPILVVHAETRQSFAHFLTSTCAIIGGVLTVASIIDSILFATNRRLKK 406


>gi|402218655|gb|EJT98731.1| ER to Golgi transport-related protein [Dacryopinax sp. DJM-731 SS1]
          Length = 455

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 137/440 (31%), Positives = 203/440 (46%), Gaps = 84/440 (19%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
           VFS+ LKGLDAF K  ED   KT  G  +T++    I +   ++  DY ++     + VD
Sbjct: 6   VFSQ-LKGLDAFGKTMEDVKVKTRTGALLTLISACIIVFFTLMEFVDYRRIHLATSVVVD 64

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQ 119
            SRG KL ++++I  P + C  L+LD +D SGE+   V HN+ + RL   G PI +  P+
Sbjct: 65  RSRGEKLLVNMNITFPRVPCYLLSLDVMDISGERQHDVTHNMQRVRLSPQGIPIPDVLPE 124

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
             + N ++K            E  +  +CGSCYG +     CCNTC +V+EAY  + W+ 
Sbjct: 125 SGLSNEIEK----------VIEAREGGECGSCYGGDPPASGCCNTCEDVREAYMRRGWSF 174

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
              + I QC NE  TEK+K+   EGC I G + VN+V G+FH +PG S+  N +HVHD+ 
Sbjct: 175 SSPEDIKQCVNEGWTEKVKSQSEEGCNISGRVRVNKVIGNFHFSPGKSFQTNAMHVHDLV 234

Query: 240 PYTSAA--FNTTHHIRHLSFGIKLQDDDE--------------RRKPLDGTVA------- 276
           PY   A   +  H I +  F    +   E               + PLDG  A       
Sbjct: 235 PYLKDANRHDFGHEIHYFGFESDGEQQAEVGRLSKSIKTKLGIDKNPLDGLRAHVRSLSR 294

Query: 277 -------------------KAEEGASMFNYYIKIIPTIYERLDGS--------------K 303
                              + E+   MF Y++K++ T YE L G+               
Sbjct: 295 RETRRVPGMSSNRRSYRPEQTEKSNYMFQYFLKVVSTKYEMLRGTVVNSHQYSVTSYERD 354

Query: 304 LGGGD---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
           L  GD                G+PG FF++E+SP++V   E  +S  H  T     + G 
Sbjct: 355 LSQGDKAQRDEHGTMTSHGVSGIPGAFFNFEISPMVVVHQETRQSFAHFLTSTCAIVGGV 414

Query: 349 YITFMLVDALLHSCVKKISK 368
                + D++L S  +K+ K
Sbjct: 415 LTVAAIFDSMLFSAERKLKK 434


>gi|367052857|ref|XP_003656807.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
 gi|347004072|gb|AEO70471.1| hypothetical protein THITE_2121964 [Thielavia terrestris NRRL 8126]
          Length = 436

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 150/440 (34%), Positives = 204/440 (46%), Gaps = 78/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG +TIV  + + +L   +  DY +V    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSIIVVLFLAWGEWADYRRVVVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         P  
Sbjct: 61  DKGRGERMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVTKTRL--------RPLS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K +            DP+ CG CYGA+  T      CCNTC+EVKEAY  + 
Sbjct: 113 EGGGDIDSKALALHAADEAAIHLDPSYCGPCYGAKPPTTAKKPGCCNTCDEVKEAYAQQA 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    D I QC+ E+  E+L     EGC+I G L VN+V G+FHIAPG S+S  +VHVH
Sbjct: 173 WAFGRGDGIEQCEREHYGERLDEQRREGCRIEGGLRVNKVVGNFHIAPGRSFSNGNVHVH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEG 281
           D++ Y  T      TH I HL FG +L D   ++              PLDGT  + ++ 
Sbjct: 233 DLKNYWDTPTKHTFTHIIHHLRFGPQLPDSLHKKLGTKHLPWTNHHLNPLDGTSQETDDV 292

Query: 282 ASMFNYYIKIIPTIYERL------------------------DGS-------------KL 304
              + Y+IKI+PT Y  L                        DGS              L
Sbjct: 293 NFNYMYFIKIVPTSYLPLGWEKTWAGFREEHQAELGSFGTSADGSVETHQYSVTSHKRSL 352

Query: 305 GGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYI 350
            GGD             GG+PG+FFSY++SP+ ++   E+SK+       +   + GT  
Sbjct: 353 AGGDDAAEGHRERLHAKGGIPGVFFSYDISPMKVINREERSKTFLGFIAGLCAIVGGTLT 412

Query: 351 TFMLVDALLHSCVKKISKVE 370
               VD  L     ++ K+ 
Sbjct: 413 VAAAVDRALFEGTVRLKKLR 432


>gi|85115136|ref|XP_964815.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
 gi|28926610|gb|EAA35579.1| hypothetical protein NCU08607 [Neurospora crassa OR74A]
          Length = 444

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 148/448 (33%), Positives = 202/448 (45%), Gaps = 86/448 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L + +L   +  DY +V    EL V
Sbjct: 1   MAGKWRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  K ++      +    DP+ CG CYGA      +   CC+TC EV+EAY    
Sbjct: 113 EGGGEIDAKVLSLHAADESATHLDPSYCGPCYGAPAPYNAKKPGCCSTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +  T+ QC+ E+ TE+L     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGDGATMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNTTHHIRH----LSFGIKLQDDDERR---------------KPLDGTVAK 277
           D+  + S      H   H    L FG +L DD  R+                PLD T  +
Sbjct: 233 DLAQWWSTPVPGGHSFSHIIHSLRFGPQLPDDLVRKLGGNGKNTLWTNHHLNPLDNTKQE 292

Query: 278 AEEGASMFNYYIKIIPTIYERL----------------------------DGS------- 302
             +    F Y++KI+PT Y  L                            DGS       
Sbjct: 293 TNDPNYNFMYFVKIVPTSYLPLGWEKQAAQNKAAWEQDHSVGLGAYGYGSDGSMETHQYS 352

Query: 303 ------KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
                  L GGD             GG+PG+FFSY++SP+ +V   E++KS       + 
Sbjct: 353 VTSHKRSLTGGDDSKEGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFLGFLAGLC 412

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
             + GT      VD  L     ++ K+ 
Sbjct: 413 AVVGGTLTVAAAVDRGLFEGTVRLKKLR 440


>gi|388856238|emb|CCF50047.1| uncharacterized protein [Ustilago hordei]
          Length = 435

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 128/413 (30%), Positives = 203/413 (49%), Gaps = 62/413 (15%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+G+DAF+K  +D   +T  G  +T++  L I  L   +  DY  V     L VD SRG
Sbjct: 9   QLRGIDAFSKTMDDVRIRTNAGALITLISALLILVLTIGEYVDYRTVHLKPALEVDRSRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL ++++I  P + C  L+LD +D SGE    ++H+I + R+  DGK           +
Sbjct: 69  EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRISQDGKV----------S 118

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           ++  K    +       +  + CG CYG +     CCNTC+EV+EAY  K W+  + D +
Sbjct: 119 IQGTKSLKGDAARIANTKGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFSDPDHV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E  +EK+K    EGC+I G L VN+V GSFH++PG ++  N +H+HD+ PY S +
Sbjct: 179 EQCVAEGWSEKIKEQNKEGCRISGKLHVNKVVGSFHLSPGRAFQRNSMHIHDLVPYLSGS 238

Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
               H   H+    SFG +     L    ER        + PL+G  A+ +E   MF Y+
Sbjct: 239 GAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKDKLGVKDPLEGVRARTKESQYMFQYF 298

Query: 289 IKIIP------------------TIYER-----------------LDGSKLGGGDGGMPG 313
           +K++                   T YER                   G+++  G  G+PG
Sbjct: 299 LKVVSTEFRPLAGETLKTQQYSVTTYERDLSPGANAAALAGLSNEGSGARISHGFAGVPG 358

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           +FF+YE+SPL    +E  +SL H  T     + G      ++D+L+++  +++
Sbjct: 359 VFFNYEISPLKTIHSEYRQSLSHFLTSTCAIVGGILTVAGILDSLIYNSGRRL 411


>gi|402590490|gb|EJW84420.1| hypothetical protein WUBG_04668 [Wuchereria bancrofti]
          Length = 341

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 187/352 (53%), Gaps = 39/352 (11%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   ERLK  DA+TKP +DF  +T  GGAVT+V    I ++   +   +  V   E+L+V
Sbjct: 1   MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60

Query: 61  DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           DS+    ++ ++ DI  P + C  + +D +D SG+    ++ ++YK  L L+GK     +
Sbjct: 61  DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISL-LNGKEGNGIR 119

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
           + V            N  TTT    P     CGSCYGA+     CCNTC EVKEAY  K 
Sbjct: 120 QGV------------NINTTTVSSAPASQILCGSCYGAKD---GCCNTCEEVKEAYIKKG 164

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W L  ++T+ QCK++   +K+     EGC++YG ++V +V+G+FHIAPG     +  H H
Sbjct: 165 WELVNIETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFH 224

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTI 295
           D+   + + F+T+H + HLSFG        +  PLDG     A++   M+ Y++K++PT 
Sbjct: 225 DLHSLSPSKFDTSHTVNHLSFGNSFPG---KVYPLDGKFFGSAKDSGIMYQYHLKLVPTS 281

Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
           Y  LD ++               +  G  G+PG F  YE SPLMVK  E+ +
Sbjct: 282 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFIQYEFSPLMVKYEERRQ 333


>gi|170586880|ref|XP_001898207.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
           putative [Brugia malayi]
 gi|158594602|gb|EDP33186.1| Serologically defined breast cancer antigen NY-BR-84 homolog,
           putative [Brugia malayi]
          Length = 341

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 187/352 (53%), Gaps = 39/352 (11%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   ERLK  DA+TKP +DF  +T  GGAVT+V    I ++   +   +  V   E+L+V
Sbjct: 1   MSLLERLKDFDAYTKPLDDFRVRTFAGGAVTLVSSAVIIFMFVSETLSFLSVDIVEQLYV 60

Query: 61  DSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           DS+    ++ ++ DI  P + C  + +D +D SG+    ++ ++YK  L L+GK     +
Sbjct: 61  DSTPAEQRVDVNFDITFPRLPCSVITIDVMDLSGDNQDDIKDDVYKISL-LNGKEGNGIR 119

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKK 176
           + V            N  TTT    P     CGSCYGA+     CCNTC EVKEAY  K 
Sbjct: 120 QGV------------NINTTTVSSVPASQILCGSCYGAKD---GCCNTCEEVKEAYIKKG 164

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W L  ++T+ QCK++   +K+     EGC++YG ++V +V+G+FHIAPG     +  H H
Sbjct: 165 WELVNIETVEQCKSDLWVKKMNEHKNEGCRVYGKVQVAKVAGNFHIAPGDPLKAHRSHFH 224

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTI 295
           D+   + + F+T+H + HLSFG        +  PLDG     A++   M+ Y++K++PT 
Sbjct: 225 DLHSLSPSKFDTSHTVNHLSFGNSFPG---KVYPLDGKFFGSAKDSGIMYQYHLKLVPTS 281

Query: 296 YERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
           Y  LD ++               +  G  G+PG F  YE SPLMVK  E+ +
Sbjct: 282 YVFLDSTRNIFSHLFSVTTYQKDISQGASGLPGFFIQYEFSPLMVKYEERRQ 333


>gi|242803029|ref|XP_002484091.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218717436|gb|EED16857.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 440

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 148/442 (33%), Positives = 205/442 (46%), Gaps = 82/442 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VT+V  + I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL++  P + C+ L LD +D SGEQ + V H + K RL     P+ E  K
Sbjct: 61  DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRL----SPVAEGGK 116

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
             V  V K ++  +N        +P  CG C GA     T    CCNTC EV+EAY  K 
Sbjct: 117 --VIDVAKLELHAQNEVAVHL--NPEYCGQCGGAPPPPNTNKPGCCNTCEEVREAYALKS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + I QC+ E   EK+     EGC+I G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGKGENIEQCQREGYAEKINAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSTGNMHVH 232

Query: 237 DIQPYTSAAFN------TTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEG 281
           D+  Y     +       +H I  L FG +L D+  RR          PLD T    +E 
Sbjct: 233 DLDTYMDRELSDNEKHTMSHIIHQLRFGPQLSDELSRRWQWTDHHHTNPLDDTQQFTDEP 292

Query: 282 ASMFNYYIKIIPTIY----------ERLDG------------------------------ 301
           A  +NYYIK++ T Y          ++L G                              
Sbjct: 293 AYNYNYYIKVVSTSYLPLGWDSSQSDQLHGDDQSTPLGLHGAVHGAAGSLETHQYSVTSH 352

Query: 302 --SKLGGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
             S  GG D            GG+PG+FF+Y++SP+ V   E + K+     T +   I 
Sbjct: 353 KRSLHGGNDAAEGHKERVHAEGGIPGVFFNYDISPMKVVNREVRPKTFTGFLTGVCAVIG 412

Query: 347 GTYITFMLVDALLHSCVKKISK 368
           GT      VD  L+   +++ K
Sbjct: 413 GTLTVAAAVDRFLYEGSRRMRK 434


>gi|452842116|gb|EME44052.1| hypothetical protein DOTSEDRAFT_71753 [Dothistroma septosporum
           NZE10]
          Length = 436

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 142/441 (32%), Positives = 203/441 (46%), Gaps = 82/441 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VT+   L I YL+  +  DY +++   EL V
Sbjct: 1   MPAKSRFTKLDAFTKTVEDARIRTTSGGIVTVTSLLLILYLVWGEWADYRRITVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IH+++  P + C+ L LD +D SGE    V H + K RL         P+ 
Sbjct: 61  DKGRGEKMEIHMNVSFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRL--------RPEA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    ++KK +          L DP+ CG CYGA   +      CCNTC EV+EAY    
Sbjct: 113 EGGGEIEKKALDLGVEEAAQHL-DPDYCGECYGAPAPSNAAKPGCCNTCAEVREAYAGVS 171

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E+ +E L     EGC+I G + VN+V G+FH APG S+S  ++HVH
Sbjct: 172 WSFGRGENVEQCEREHYSEHLDAQRKEGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 231

Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAE 279
           D++ + ++        TH I  L FG +L DD             +    PLDGT    E
Sbjct: 232 DLENFFNSPEGIQHTFTHKIHSLRFGPQLPDDVVNKVGKRGIAWSEHHLNPLDGTSQVTE 291

Query: 280 EGASMFNYYIKIIPTIYERL----DGS--------------------------------- 302
           E +  F Y++K++ T Y  L     GS                                 
Sbjct: 292 EKSYNFMYFVKVVSTAYLPLAWKPSGSLLDLPHELVELGGYGKGEGGSIETHQYSVTSHK 351

Query: 303 -KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
             L GGD             GG+PG+FFSY++SP+ V   E ++K+     T +   I G
Sbjct: 352 RSLQGGDANEEGHKERLHARGGIPGVFFSYDISPMKVVNREARTKTFTGFLTGVAAVIGG 411

Query: 348 TYITFMLVDALLHSCVKKISK 368
           T      VD L++   +++ K
Sbjct: 412 TLTVAAAVDRLMYEGGQRVRK 432


>gi|440473660|gb|ELQ42442.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae Y34]
 gi|440486294|gb|ELQ66175.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Magnaporthe oryzae P131]
          Length = 444

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 145/448 (32%), Positives = 201/448 (44%), Gaps = 86/448 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG +TIV  + + YL   +  DY ++    EL V
Sbjct: 1   MAPKSRFTRLDAFTKTVEDARIRTTSGGIITIVSLIVVLYLAWGEWADYRRIDIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKSRGDRMEIHLNITFPRVPCELLTLDVMDVSGEQQHGVQHGVIKVRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K +            DPN CG CYGA          CCNTC+EV+EAY    
Sbjct: 113 EGGGVIDAKTLALHAEDEAATHLDPNYCGGCYGAPAPANAKKAGCCNTCDEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC  E+  E+L     EGCQI G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGRGENVEQCTREHYAERLDEQRHEGCQIAGSLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGI--------KLQDDDERR-------KPLDGTVAK 277
           D++ Y         + +H I  L FG         KL + D+          PLDG +  
Sbjct: 233 DLKNYWDTPVEGGHSFSHTIHSLRFGPQLPPSALEKLGNKDKNMPWTNHHINPLDGVIQT 292

Query: 278 AEEGASMFNYYIKIIPTIYERL-----------------------DGS------------ 302
             +    + Y++KI+PT Y  L                       DGS            
Sbjct: 293 TVDPNFNYMYFVKIVPTSYLPLGWEKRTHLATMHDHGVGTYGYSGDGSVETHQYSVTSHK 352

Query: 303 -KLGGGD-------------GGMPGIFFSY-----ELSPLMVKITE-KSKSLGHLWTKIM 342
             L GGD             GG+PG+FFSY     ++SP+ V   E ++K+     T + 
Sbjct: 353 RSLAGGDDGEDGHKERMHSRGGIPGVFFSYPFCPQDISPMKVINREVRTKTFAGFLTGLC 412

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
             + GT      +D +    V +I K++
Sbjct: 413 AILGGTLTVAAAIDRMTFEGVTRIKKMQ 440


>gi|302923326|ref|XP_003053651.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256734592|gb|EEU47938.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 437

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 145/443 (32%), Positives = 205/443 (46%), Gaps = 83/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  + + +L   +  +Y +V    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLIVVIFLAWGEWSEYRRVEIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H + K RL        +PQ 
Sbjct: 61  DRGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------QPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRK--CCNTCNEVKEAYRYKK 176
           +    +  K ++  +        DP+ CG CYGA+     RK  CC TC+EV+EAY    
Sbjct: 113 KGGADIDSKSLSLHDDAAAHL--DPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQAS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  EKL     EGC+I G L VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230

Query: 237 DIQPYTSA----AFNTTHHIRHLSFGIKLQDDDERR------------KPLDGTVAKAEE 280
           D++ Y  A    A + TH I  L FG +L D+  R+             PLDGT    ++
Sbjct: 231 DLKNYWDAPKGKAHDFTHIIHSLRFGPQLPDEVARKVGKGTPWTNHHQNPLDGTRQDIKD 290

Query: 281 GASMFNYYIKIIPTIYERL--------------------------DGS------------ 302
               F Y++KI+PT Y  L                          DGS            
Sbjct: 291 PNFNFMYFVKIVPTSYLPLGWDSKGLKIAGLLQDDTSLGAYGYAEDGSVETHQYSVTSHK 350

Query: 303 -KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISG 347
             L GG+             GG+PG+FFSY++SP+ +V   EK K+       +   + G
Sbjct: 351 RSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKGKTFSGFLAGLCAIVGG 410

Query: 348 TYITFMLVDALLHSCVKKISKVE 370
           T      VD  L     ++ K+ 
Sbjct: 411 TLTVAAAVDRGLFEGAARLKKMR 433


>gi|148674215|gb|EDL06162.1| ERGIC and golgi 3, isoform CRA_b [Mus musculus]
          Length = 269

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/259 (43%), Positives = 161/259 (62%), Gaps = 17/259 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 17  KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 76

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 77  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HE 134

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DPN+C SCYGAE+E  KCCN+C +V+EAYR + WA    DTI
Sbjct: 135 LGKVEVTVFDPNSL----DPNRCESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTI 190

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVH ++ +   +
Sbjct: 191 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHAVEIHDLQS 250

Query: 246 F-----------NTTHHIR 253
           F           N TH+I+
Sbjct: 251 FGLDNPSDCLQINMTHYIK 269


>gi|320592791|gb|EFX05200.1| copii-coated vesicle membrane protein [Grosmannia clavigera kw1407]
          Length = 440

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 144/446 (32%), Positives = 205/446 (45%), Gaps = 86/446 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  + + YL   +  DY +V    EL V
Sbjct: 1   MAAKSRFTRLDAFTKTVEDARIRTTSGGVVTIVSLIVVLYLAWGEWLDYRRVIIRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P I C+ L LD +D SGEQ   V+H +   RL        EPQ 
Sbjct: 61  DKGRGERMEIHLNITFPRIPCELLTLDVMDVSGEQQHGVQHGVRMVRL--------EPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
              + ++ K +   +    + L DP  CG CYGA          CCNTC+EV+EAY    
Sbjct: 113 RGGSEIEVKTLDL-HADAASHL-DPEYCGPCYGATPPQHAIKTGCCNTCDEVREAYASSS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC+ E+  E++     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 171 WAFGKGENVEQCQREHYAERIDEQRHEGCRIEGGLRVNKVVGNFHIAPGRSFSNGNMHVH 230

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR----------------KPLDGTVA 276
           D++ Y    T    + TH +  L FG +L +  ++                  PLDG + 
Sbjct: 231 DLKNYWDMPTPNLHSFTHTVHSLRFGPQLPESLQKTLAGGGAKGQPWTNHHINPLDGVMQ 290

Query: 277 KAEEGASMFNYYIKIIPTIYERL-------------------------DGS--------- 302
           +  +    + Y+IKI+PT Y  L                         DGS         
Sbjct: 291 QTSDPNFNYMYFIKIVPTSYLALGWEKTFRGFVDDHDSADVGSYGLLADGSVETHQYSVT 350

Query: 303 ----KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCN 344
                L GGD             GG+PG+FFSY++SP+ +V   E++K+       +   
Sbjct: 351 SHKRSLQGGDDAAEGHQERLHARGGIPGVFFSYDISPMKVVNREERAKTFAGFLAGLCAI 410

Query: 345 ISGTYITFMLVDALLHSCVKKISKVE 370
           I GT      VD  +     ++ K+ 
Sbjct: 411 IGGTLTVAAAVDRTVFEGTIRLKKMR 436


>gi|414586930|tpg|DAA37501.1| TPA: DUF1692 domain, endoplasmic reticulum vescicle transporter
           protein [Zea mays]
          Length = 268

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 104/253 (41%), Positives = 155/253 (61%), Gaps = 3/253 (1%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+V    +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLVSSAVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+++K+R+D  G  I   Q +VV  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDVFKQRIDAHGNVIATRQ-DVVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K +     +G      E    CGSCYGA+    +CCNTC +V+EAYR K W +   D +
Sbjct: 126 MKMEAPLQHHGGRLEHNE--TYCGSCYGAQESDDQCCNTCEDVREAYRKKGWGVSNPDLL 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG++EVN+V+G+FH APG S+  ++VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFIEVNKVAGNFHFAPGKSFQQSNVHVHDLLPFQKDS 243

Query: 246 FNTTHHIRHLSFG 258
           FN +H I  LSFG
Sbjct: 244 FNVSHKINRLSFG 256


>gi|390603136|gb|EIN12528.1| endoplasmic reticulum-derived transport vesicle ERV46 [Punctularia
           strigosozonata HHB-11173 SS5]
          Length = 419

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 131/419 (31%), Positives = 196/419 (46%), Gaps = 58/419 (13%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LKGLDAF K  ED   KT  G  +T +    I     ++  DY +V+    + VD SRG 
Sbjct: 10  LKGLDAFGKTMEDVKVKTRTGAFLTFLSAAIILTFTMIEFVDYRRVNMDTSIVVDKSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
           KL + +++  P + C  L+LD +D SGEQ   + HNI K RLD  GK I   Q+ E+ + 
Sbjct: 70  KLTVRMNVTFPRVPCYLLSLDVMDISGEQQRDISHNILKTRLDSTGKLIPGSQRSELESE 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             ++     +G           CGSCYGAE     CCN+C+ V++AY  + W+    D+I
Sbjct: 130 FDRQNKPMPDGY----------CGSCYGAEPSEGACCNSCDAVRQAYVNRGWSFGNPDSI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E  +EKLK+  +EGC I G + VN+V G+ H++PG S+      ++++ PY    
Sbjct: 180 EQCVKENWSEKLKDQASEGCNIAGRVRVNKVIGNIHLSPGRSFQSQGRSMYELVPYLRED 239

Query: 246 FNTTHHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEEGASMFNY 287
            N  H   H       + DDE            R K      PLDG V +  +   MF Y
Sbjct: 240 GN-RHDFSHTIHEFAFEGDDEYLPDKYKVSKEMRAKMGLEAGPLDGAVGRTIKAQYMFQY 298

Query: 288 YIKIIPTIYERLDGSKL----------------GGGDG------------GMPGIFFSYE 319
           ++K++ T +  LDG  +                G  D             G+PG FF++E
Sbjct: 299 FLKVVSTQFRTLDGQTVNSHQYSATHFERDLDKGSEDNTAEGVHISHTTYGVPGAFFNFE 358

Query: 320 LSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
           +SP+++  +E  +S  H  T     + G      +VD++L +  K + K   G     K
Sbjct: 359 ISPILIVHSETRQSFAHFLTSTCAIVGGVLTIASIVDSVLFATTKALKKGASGSAASGK 417


>gi|134054958|emb|CAK36967.1| unnamed protein product [Aspergillus niger]
          Length = 406

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 136/417 (32%), Positives = 194/417 (46%), Gaps = 62/417 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IHL++  P + C+ L LD +D SGEQ   V H I K RL            
Sbjct: 61  DKSRGEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRL------------ 108

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
              +A +  +V         +  DP+ CG CYGA          CCNTC+EV+EAY  ++
Sbjct: 109 --TSAAEGGRVIDVKALELAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 166

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC+ E   E++     EGC++ G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 167 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 226

Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D+  +  A          TH I  L FG +L D         D     PLDGT  +  E 
Sbjct: 227 DLANFFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEP 286

Query: 282 ASMFNYYIKIIPTIYERLDGS---------------KLGGGDG-------------GMPG 313
              + Y++K++ T Y  L                   L GGD              G+PG
Sbjct: 287 GYNYMYFVKVVSTSYLPLGWDPLIETHQYSVTSHKRSLMGGDASDEGHKERLHAANGIPG 346

Query: 314 IFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           +F +Y++SP+ V   E + K+     T +   I GT      +D  L+  V ++ K+
Sbjct: 347 VFVNYDISPMKVINREARPKTFTGFLTGVCAIIGGTLTVAAALDRGLYEGVSRMKKL 403


>gi|378732932|gb|EHY59391.1| hypothetical protein HMPREF1120_07381 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 437

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 144/445 (32%), Positives = 201/445 (45%), Gaps = 81/445 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L + YLI  +  DY ++    EL V
Sbjct: 1   MPAKTRFTRLDAFTKTVEDARIRTTSGGIVTIVSILVVIYLILGEWADYRRIVVQPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IHL+I  P I C+ L LD +D SGEQ   V H + K RL    +  +    
Sbjct: 61  DKGRGEKMEIHLNITFPRIPCELLTLDVMDVSGEQQSGVVHGVNKVRLTSVAEGSRVIDT 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           + +   ++ +V++          DP+ CGSCY A          CCNTC+EV+EAY    
Sbjct: 121 QALQLHQQAEVSSH--------LDPDYCGSCYSAPAPPNAKKPGCCNTCDEVREAYAANS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E    +L     EGC+I G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGRGEGVEQCEREGYGARLDEQRHEGCRIEGVIRVNKVVGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDER---------RKPLDGTVAKAEEGAS 283
           D+  +           TH I  L FG +L D + +           PLDG   + +E   
Sbjct: 233 DLNNFFDTPIEGGHTFTHEIHSLRFGPQLSDQEAKWTGADHHLNANPLDGLRQETDEPGY 292

Query: 284 MFNYYIKIIPTIYERLD--------------------------GSK-------------- 303
            F Y+IK++ T Y  L                           GS+              
Sbjct: 293 NFMYFIKVVSTSYLPLGWDEDKSIQQHSSLSDLIPLGMHGKGAGSQGSIETHQYSVTSHK 352

Query: 304 --LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
             L GG+             GG+PG+FFSY++SP+ V   E + KS  +  T +   I G
Sbjct: 353 RSLAGGNDAAEGHKERLHAHGGIPGVFFSYDISPMKVINREVRPKSFANFLTGVCAVIGG 412

Query: 348 TYITFMLVDALLHSCVKKISKVEIG 372
           T      +D  L+    ++ KV  G
Sbjct: 413 TLTVAAAIDRGLYEGATRLKKVHQG 437


>gi|440301578|gb|ELP93964.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba invadens IP1]
          Length = 363

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 126/378 (33%), Positives = 188/378 (49%), Gaps = 30/378 (7%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DA+ K  ED   K  +GG +TIVC + I  L+  +   Y Q   T +L VD  R  
Sbjct: 1   MKRFDAYGKVPEDLQVKHGFGGIMTIVCGILIGILVLTEFRYYLQREVTPQLIVDRERDE 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ +H DI  P  SC   ++D +  SGE  + +E NI K RL+ +G P+ E +       
Sbjct: 61  KIKVHFDITFPFSSCPITSVDVLTKSGESMIDIEKNITKTRLNKNGVPLTESEL------ 114

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
              K T +      +  D   C SCYGAET +RKCC TC++V EAY+ + W L  + TI 
Sbjct: 115 ---KATQQKLNANIKTVDQKTCRSCYGAETPSRKCCYTCDDVIEAYKERGWNL-NIRTIA 170

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC N    E  K T  EGC++ G L +N++ G+FHIAPG S +    H H+I+       
Sbjct: 171 QCDNSEKLEMAKLTLEEGCRVEGNLLLNKIGGNFHIAPGTSDNTWTGHHHNIEWTGRTKI 230

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
           + TH    LSFG       E  K   G+   A+    MF Y++ +IP     ++G+K   
Sbjct: 231 DLTHTWNDLSFG-------EGSKTYSGSKKDAKMNG-MFQYFLTLIPKKNNFINGTKFVY 282

Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                     G   G PG+F  Y++SP+++++ E +    H    +   I G +  F L+
Sbjct: 283 DFVINEQTRSGQGEGEPGVFVYYDVSPMLLEVNEFNHGFLHFLIGVCAIIGGVFTVFQLI 342

Query: 356 DALLHSCVKKIS-KVEIG 372
           DA +   +  +  K+E+G
Sbjct: 343 DAFVFDSIHTLQKKIELG 360


>gi|296417040|ref|XP_002838173.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634087|emb|CAZ82364.1| unnamed protein product [Tuber melanosporum]
          Length = 399

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 141/410 (34%), Positives = 203/410 (49%), Gaps = 49/410 (11%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    RL  LDAFTK  ED   +T  GG VT+V    +  L+  +  +Y ++    EL V
Sbjct: 1   MGRGSRLTRLDAFTKTVEDARVRTTSGGIVTLVSLFVVFVLVVGEFREYRRIQVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +RG +LPI L+I  P I C+ L LD +D SGEQ   + H I+  RL     P  E + 
Sbjct: 61  DKTRGEQLPISLNITFPHIPCELLTLDVMDVSGEQQSSITHGIHLTRL----TPFPESKP 116

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRKCCNTCNEVKEAYRYKKWA 178
               ++   + T  +        DP  CG CYGA    + + CC TC +V+EAY    WA
Sbjct: 117 VSTTSLNVHEDTASH-------LDPAYCGKCYGAPGPEKDKGCCQTCEDVREAYASIGWA 169

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
             + + + QC+ E+  E+L     EGC I G+L VN+V G+FHIAPG S+S   +HVHD+
Sbjct: 170 FGKGEGVEQCEREHYAERLDEMREEGCNIAGHLSVNKVIGNFHIAPGKSFSSAQMHVHDL 229

Query: 239 QPY--TSAAFNTTHHIRHLSFGIKLQDDDE-RRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
             Y  ++     TH I HLSFG  L  + + +R PLD +    +E +  F Y+IK++ T 
Sbjct: 230 NQYFASTKEHTFTHTIHHLSFGPDLPANVKVQRNPLDDSRQVTQERSFNFMYFIKVVSTS 289

Query: 296 YERLDGSK----------------------LGGGD----------GGMPGIFFSYELSPL 323
           Y  L  S+                      +GG D          GG+PG+FFSY++SP+
Sbjct: 290 YLPLGTSENSYIPGAIETHQYSVTSHKRSLMGGADKEHASTIHARGGIPGVFFSYDISPM 349

Query: 324 MVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            V   E ++KS     T +   I GT      +D  L+    ++ K+  G
Sbjct: 350 KVINREVRAKSFAGFLTGVCAVIGGTLTVAAAIDRGLYEGGMRVKKLHQG 399


>gi|443894052|dbj|GAC71402.1| hypothetical protein PANT_3d00017 [Pseudozyma antarctica T-34]
          Length = 461

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 127/407 (31%), Positives = 195/407 (47%), Gaps = 62/407 (15%)

Query: 14  TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLD 73
           +K  +D   +T  G  +T+V  L I  L   +  DY  V     L VD SRG KL +++D
Sbjct: 44  SKTMDDVRIRTNAGALITMVSALLIVVLTIGEFVDYRTVHLKPSLEVDRSRGEKLTVNMD 103

Query: 74  IVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTT 133
           I  P + C  L+LD +D SGE    ++H+I + R+  DGKPI + +K +     +   T 
Sbjct: 104 ITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRVTHDGKPITQGKKNLKGDAARIAAT- 162

Query: 134 ENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYS 193
                    +  + CG CYG +     CCNTC+EV+EAY  K W+  + D + QC  E  
Sbjct: 163 ---------KGKDYCGDCYGGQPPASGCCNTCDEVREAYVRKGWSFADPDHVDQCVAEGW 213

Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIR 253
           ++K+K    EGC+I G L VN+V GSFH++PG ++  N VH+HD+ PY S      H   
Sbjct: 214 SDKIKEQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSVHIHDLVPYLSGTGAEHHDFG 273

Query: 254 HL----SFGIKLQ-----DDDER--------RKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
           H+    SFG + Q        ER        + PL+G  A+ ++   MF Y++K++ T +
Sbjct: 274 HIIHDFSFGSEQQYHGLTTAKEREVKQKLGVKDPLEGVRAQTQQSQFMFQYFLKVVSTEF 333

Query: 297 ERLDGSKL-----------------------------------GGGDGGMPGIFFSYELS 321
             L G  L                                     G  G+PG+FF+YE+S
Sbjct: 334 RPLSGDTLKTQQYSVTTYERDLSPGANAAAMAGMSNEGSGAHISHGFAGVPGVFFNYEIS 393

Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           PL    +E  +SL H  T     + G      +VD+L+++  +++ +
Sbjct: 394 PLKTIHSEHRQSLSHFLTSTCAIVGGILTVAGIVDSLVYNSRRRLRR 440


>gi|255941116|ref|XP_002561327.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211585950|emb|CAP93687.1| Pc16g10170 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 412

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 136/417 (32%), Positives = 197/417 (47%), Gaps = 56/417 (13%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L + +L+  +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARIRTNSGGVITIASLLIVMWLVWGEWADYRRIVVQPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL++  P + C+ L LD +D SGEQ + V H + K RL         P  
Sbjct: 61  DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLS--------PHN 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
           E    +  + +   + +   +   P+ CG C GA          CC TC EV+EAY  K+
Sbjct: 113 EGGKVIDVQALDLHSSSEAAKHLAPDYCGECGGATPPANVIKPGCCTTCEEVREAYAEKQ 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QCK E   EKL     EGC+I G L+VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGDGSNIEQCKREGYAEKLAEQRREGCRIEGVLKVNKVVGNFHIAPGRSFTTGNMHVH 232

Query: 237 DIQPYT-----SAAFNTTHHIRH-LSFGIKLQ---------DDDERRKPLDGTVAKAEEG 281
           D+  Y       A  +T  H+ H L FG +L           D     PLD T  + +E 
Sbjct: 233 DLDAYVVPNAGPAEQHTMSHLVHELRFGPQLPTELAGRWGWTDHHHTNPLDDTKQETDEP 292

Query: 282 ASMFNYYIKIIPTIYERLDGS---------------KLGGGD-------------GGMPG 313
           A  F Y++K++ T Y  L                   L GG+             GG+PG
Sbjct: 293 AYNFMYFVKVVSTSYLPLGWDPHIEAHQYSVTSHKRPLSGGNDAAEGHKERVHAGGGIPG 352

Query: 314 IFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           +FF+Y++SP+ V   E + K+  +  T +   I GT      +D  L+    ++ K+
Sbjct: 353 VFFNYDISPMKVINREARPKTFTNFLTGVCAIIGGTLTVAAALDRGLYEGAMRVKKL 409


>gi|452980033|gb|EME79795.1| hypothetical protein MYCFIDRAFT_64499 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 436

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 145/443 (32%), Positives = 202/443 (45%), Gaps = 86/443 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   L I YL   +  DY ++    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVEDARVRTSTGGIVTIASLLLILYLTWGEWADYRKIIIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
           D  RG ++ IHL++  P + C+ L LD +D SGE    V H I K RL    DG  + E 
Sbjct: 61  DKGRGERMEIHLNVSFPRVPCELLTLDVMDVSGEVQTGVLHGINKVRLSSVADGSKVIEK 120

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRY 174
           QK  ++A        EN         P+ CG CYGA      +   CCNTC EV++AY  
Sbjct: 121 QKLDLDAA-------ENSVHLA----PDYCGECYGAPAPDNAKKAGCCNTCAEVRDAYAS 169

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
             W+    + + QC+ E+ +E+L     EGC+I G L VN+V G+FH APG S+S  ++H
Sbjct: 170 VSWSFGRGENVEQCEREHYSEQLDAQRKEGCRIEGALRVNKVVGNFHFAPGKSFSNGNLH 229

Query: 235 VHDIQPYTSAA---FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKA 278
           VHD+  Y ++     + THHI  L FG  L  D ++R              PLD T  + 
Sbjct: 230 VHDLDNYFNSGEVEHSFTHHIHRLRFGPPLPHDFDKRVGKKGMAWSNHHLNPLDDTHQET 289

Query: 279 EEGASMFNYYIKIIPTIYERLDGSK----------------------------------- 303
           ++ A  F Y++K++ T Y  L   K                                   
Sbjct: 290 DDSAFNFMYFVKVVSTAYLPLGWEKTNSFSRSLPHELIDLGDYGHGEQGSIETHQYSVTS 349

Query: 304 ----LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
               L GGD             GG+PG+FFSY++SP+ V   E ++KS       +   I
Sbjct: 350 HKRSLQGGDAKDEGHKERVHARGGIPGVFFSYDISPMKVINRETRAKSFSGFLVGVCAVI 409

Query: 346 SGTYITFMLVDALLHSCVKKISK 368
            GT      VD +L+   +++ K
Sbjct: 410 GGTLTVAAAVDRMLYEGEQRVRK 432


>gi|406866287|gb|EKD19327.1| copii-coated vesicle membrane protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 453

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 148/458 (32%), Positives = 204/458 (44%), Gaps = 97/458 (21%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTI   L + YL   +  DY +++   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWTDYRRIAVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IHL+I  P I C+ L LD +D SGEQ   V H + K RL         P+ 
Sbjct: 61  DKGRGEKMEIHLNISFPRIPCELLTLDVMDVSGEQQTGVMHGVKKVRLG--------PEA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  + +        T L DP+ CG CYGA      +   CCNTC EV+EAY    
Sbjct: 113 EGGKEISIESLDLHGDDQATHL-DPDYCGGCYGATAPPNAKKAGCCNTCEEVREAYASVS 171

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  EKL     EGC+I G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 172 WAFGRGENVEQCEREHYGEKLDAQRKEGCRIEGGIRVNKVVGNFHIAPGRSFSNGNMHVH 231

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D+  Y           THHI  L FG +L +   ++              PLD T   A 
Sbjct: 232 DLNNYFDTPVPGGHVFTHHIHSLRFGPQLPESVTKKLGNKALPWTNHHINPLDDTRQVAP 291

Query: 280 EGASMFNYYIKIIPTIYERL------------------------DGS------------- 302
           E A  F Y++K++PT Y  L                        DGS             
Sbjct: 292 ETAYNFMYFVKVVPTSYLPLGWDNSVTSEQRIDHVDIGSYGHLDDGSVETHQFSVTSHKR 351

Query: 303 KLGGGD-------------GGMPGIFFSY----------------ELSPL-MVKITEKSK 332
            L GGD             GG+PG+FFSY                ++SP+ ++   E++K
Sbjct: 352 SLSGGDDGAEGHKEKLHSRGGIPGVFFSYVSSHFYPQKISTNKTQDISPMKVINREERAK 411

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
           SL    T +   I GT      VD  ++    ++ K++
Sbjct: 412 SLAGFLTGLCAIIGGTLTVAAAVDRGVYEGTTRLKKMQ 449


>gi|61555552|gb|AAX46728.1| serologically defined breast cancer antigen 84 isoform b [Bos
           taurus]
          Length = 283

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 107/237 (45%), Positives = 153/237 (64%), Gaps = 8/237 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVHVHD+Q +
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVHVHDLQSF 235


>gi|6598578|gb|AAF18633.1|AC006228_4 F5J5.4 [Arabidopsis thaliana]
          Length = 440

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/387 (33%), Positives = 197/387 (50%), Gaps = 52/387 (13%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDV-CDYFQVSTTEELFVDSSR 64
           +L+ LDA+ K  EDF+ +T+ GG +T++  + +  L   ++       S  +E +    +
Sbjct: 7   KLRNLDAYPKINEDFYSRTLSGGVITLLSSVVMFLLFFSELRTSLSSYSHRDEAYSRYFK 66

Query: 65  GSKL--PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           G  +    + DI  P ++C  L++DA+D SGE HL V+H+I KRRLD +G  I E +++ 
Sbjct: 67  GRDVTHQRNFDITFPALACSILSVDAMDISGELHLDVKHDIIKRRLDSNGNTI-EARQDG 125

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET-------------------------- 156
           + A K +    ++G      E    CGSCYGAE                           
Sbjct: 126 IGATKIENPLQKHGGRLGHNE--TYCGSCYGAEAVIVLSLYLTLWSMVSQLSSEVCFFPV 183

Query: 157 -ETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNR 215
            E   CCN+C +V+EAYR K W +   D I QCK E   +++K+   EGC IYG+LEVN+
Sbjct: 184 QEEHDCCNSCEDVREAYRKKGWGVTNPDLIDQCKREGFLQRVKDEEGEGCNIYGFLEVNK 243

Query: 216 VSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
           V+G+FH APG S+  + VHVHD+  +   +FN +H I  L++G           PLD   
Sbjct: 244 VAGNFHFAPGKSFHQSGVHVHDLLAFQKDSFNISHKINRLTYGDYFPG---VVNPLDKVE 300

Query: 276 AKAEEGASMFNYYIKIIPTIYERLDG---------------SKLGGGDGGMPGIFFSYEL 320
              +   +M+ Y+IK++PT+Y  + G               S   G    +PG+FF Y+L
Sbjct: 301 WSQDTPNAMYQYFIKVVPTVYTDIRGHTIQSNQFSVTEHVKSSEAGQLQSLPGVFFFYDL 360

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISG 347
           SP+ V  TE+  S  H  T + C I G
Sbjct: 361 SPIKVTFTEEHISFLHFLTNV-CAIVG 386


>gi|409042254|gb|EKM51738.1| hypothetical protein PHACADRAFT_150385 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 422

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 132/416 (31%), Positives = 191/416 (45%), Gaps = 62/416 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LKGLDAF K  ED   KT  G  +TI+    I  +  ++  DY +V+    + VD SRG 
Sbjct: 9   LKGLDAFGKTMEDVKVKTRTGAFLTILSAAIILAITTMEFFDYRRVNVDTSIEVDKSRGE 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV--- 123
           KL +  ++  P + C  L+LD +D SGE    + HN+ K RL+  G P+  P  ++V   
Sbjct: 69  KLIVSFNVTFPRVPCYLLSLDVMDISGETQTDIVHNVIKTRLNEQGNPV--PANKIVELR 126

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           N + K     ++G           CGSCYG       CCNTC +V++AY  + W+    D
Sbjct: 127 NDIDKLNEQRQDGY----------CGSCYGGVEPAGGCCNTCEDVRQAYVNRGWSFTAPD 176

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
           +I QC  E   +KL++   EGC   G L VN+V G+ H++PG S+     +++DI PY  
Sbjct: 177 SIEQCAQEGWADKLRDQANEGCNAAGKLRVNKVVGNIHLSPGRSFRSGSHNIYDIVPYLK 236

Query: 244 AAFNTTHHIRHLSFGIKLQDDDE-------------RR-----KPLDGTVAKAEEGASMF 285
              N  H   H         DDE             RR      PLDGT  K  + A MF
Sbjct: 237 EDGNR-HDFSHTVHAFAFAGDDEFNFQKADHGNSLKRRLGIADGPLDGTTQKTSKQAYMF 295

Query: 286 NYYIKIIPTIYERLDGSKLGG----------------------------GDGGMPGIFFS 317
            Y++K++ T +  LDG  +                              G  G+PG FF+
Sbjct: 296 QYFLKVVSTQFITLDGKSIKTHQHSATHFERDLSKGIAENSQQGMHVMHGMTGIPGAFFN 355

Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
           YE+SP++V   E  +S  H  T     + G      L+D++L +  KK+ K    G
Sbjct: 356 YEISPILVVHRETRQSFAHFLTSTCAVVGGVLTVASLIDSMLFATSKKLKKSGTSG 411


>gi|346979363|gb|EGY22815.1| ER-derived vesicles protein ERV46 [Verticillium dahliae VdLs.17]
          Length = 435

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 138/439 (31%), Positives = 199/439 (45%), Gaps = 77/439 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG +TIV    + +L   +  DY +++   EL V
Sbjct: 1   MAGKSRFTRLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IHL++  P + C+ L LD +D SGEQ   +   I K RL          QK
Sbjct: 61  DKGRGEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRL--------RSQK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
           +    +  K ++            P+ CG CYGA+       + CCNTC EV+EAY    
Sbjct: 113 DGGGVIDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC  E+  E+L     EGC+I G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSA--AFNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAEEG 281
           D++ Y       + TH I  L FG +L +              +    PLDGT     + 
Sbjct: 233 DLKNYWDGDITHDFTHQIHALRFGPQLPESITKNLGNKATPWTNHHLNPLDGTSQITTDP 292

Query: 282 ASMFNYYIKIIPTIYERL-----------DGSKLG------------------------- 305
           +  F Y++KI+PT Y  L           DG  LG                         
Sbjct: 293 SFNFMYFVKIVPTSYLPLGWDSKRSPQDHDGGLLGSFGQGSDGSIETHQYSVTSHKRSLS 352

Query: 306 GGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYIT 351
           GGD             GG+PG+FFSY++SP+ ++   E+SKS     T +   I GT   
Sbjct: 353 GGDDSAEGHAERLHTRGGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTV 412

Query: 352 FMLVDALLHSCVKKISKVE 370
              VD  +     ++ K+ 
Sbjct: 413 AAAVDRGMFEGSLRLKKIR 431


>gi|396471326|ref|XP_003838845.1| similar to endoplasmic reticulum-golgi intermediate compartment
           protein 3 [Leptosphaeria maculans JN3]
 gi|312215414|emb|CBX95366.1| similar to endoplasmic reticulum-golgi intermediate compartment
           protein 3 [Leptosphaeria maculans JN3]
          Length = 439

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 146/454 (32%), Positives = 204/454 (44%), Gaps = 103/454 (22%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VT+V  + I +L   +  DY +V+   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVEDARVRTTSGGIVTLVSLVVIFWLTWGEWADYRRVTVRPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--------LDG 112
           D  RG ++ I L+I  P + C+ L LD +D SGE  + + H I K RL         +D 
Sbjct: 61  DKGRGERMEISLNITFPRMPCELLTLDVMDVSGELQMGITHGINKVRLSPEVDGSKVIDA 120

Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEV 168
           KP+   Q E  +                   DP+ CG+CYGA   T      CCNTC+EV
Sbjct: 121 KPLDLHQDEASHL------------------DPSYCGNCYGAPPPTNAIKHGCCNTCDEV 162

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           ++AY    W+    + + QC+ E+  E L     EGC++ G ++VN+V G+FHIAPG S+
Sbjct: 163 RDAYASISWSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIKVNKVVGNFHIAPGKSF 222

Query: 229 SINHVHVHDIQPY--TSAAFNTTHHIRHLSFGIKL-----QDDDERR------------- 268
           S  ++HVHD++ Y     A   TH I HL FG +L     QD  ++              
Sbjct: 223 SNGNLHVHDLENYFRDEYAHTFTHKIHHLRFGPQLSQAVVQDMAKKHMATGPGGWTNHHV 282

Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERL------DGSKLGGGD-------------- 308
            PLD T  + +E A  + Y+IK++ T Y  L      DGS  GG D              
Sbjct: 283 NPLDHTEQRTDEKAFNYMYFIKVVSTAYLPLGWEKSADGSSSGGYDDLLGTTIHSVNKGS 342

Query: 309 --------------------------------GGMPGIFFSYELSPLMVKITE-KSKSLG 335
                                           GG+PG+FFSY++SP+ V   E + K+  
Sbjct: 343 IETHQYSVTSHKRSLQGGSDEKEGHKERIHARGGIPGVFFSYDISPMKVINREMREKTFS 402

Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
                +   I GT      VD  L+  V KI K+
Sbjct: 403 GFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKKI 436


>gi|299743758|ref|XP_002910702.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
           okayama7#130]
 gi|298405804|gb|EFI27208.1| ER-derived vesicles protein ERV46 [Coprinopsis cinerea
           okayama7#130]
          Length = 416

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 135/410 (32%), Positives = 193/410 (47%), Gaps = 57/410 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +T++    I  +  ++  DY +V     + VD 
Sbjct: 5   FFSTLKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILAITTMEFFDYRKVFIDTSIVVDR 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRG KL ++L++  P + C  L+LD +D SGE    + HN+ K RLD  GK +       
Sbjct: 65  SRGEKLTVNLNVTFPKVPCYLLSLDIMDISGEVQRDISHNVLKVRLDRSGKEVPGSHTAD 124

Query: 123 VNA-VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           ++A V+K   T + G           CGSCYG       CCNTC +V+ AY  + W+   
Sbjct: 125 LSADVEKLSHTKKEGY----------CGSCYGGLEPESGCCNTCEDVRMAYVNRGWSFTN 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QC+NE   +KL++   EGC I G + VN+V G+ H++PG S+  N  +++++ PY
Sbjct: 175 PDAIEQCRNEGWADKLRDQADEGCNISGRIRVNKVIGNIHMSPGRSFQSNSRNIYELVPY 234

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDE------------RRK------PLDGTVAKAEEGAS 283
                N  H   H+      + DDE            RR+      PLDG  A+  +   
Sbjct: 235 LRDDQN-RHDFSHIIHHFGFEGDDEYDYWKAEAGQKMRRRMGLTENPLDGIEARTWKSQY 293

Query: 284 MFNYYIKIIPTIYERLDGS--------------KLGGG----DG---------GMPGIFF 316
           MF Y++K++ T +  LDG                LG G    DG         G+PG FF
Sbjct: 294 MFQYFLKVVSTRFRTLDGQTVNTHQYSTTSFERDLGEGMNQDDGGIRVQHGVSGLPGAFF 353

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           +YE+SP+ V   E  +S  H  T     I G      LVD+ L    K I
Sbjct: 354 NYEISPIQVVHAESRQSFAHFLTSTCAVIGGVLTVAALVDSALFVTAKAI 403


>gi|67524561|ref|XP_660342.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
 gi|40743850|gb|EAA63036.1| hypothetical protein AN2738.2 [Aspergillus nidulans FGSC A4]
 gi|259486349|tpe|CBF84116.1| TPA: COPII-coated vesicle membrane protein Erv46, putative
           (AFU_orthologue; AFUA_1G05120) [Aspergillus nidulans
           FGSC A4]
          Length = 437

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 144/446 (32%), Positives = 208/446 (46%), Gaps = 89/446 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ++   +T  GG +TI   L I +L   +  DY +V+   EL V
Sbjct: 1   MAAKSRFTRLDAFAKTVDEARIRTTSGGIITIASLLIIIWLTWGEWVDYRRVAVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
           D SRG K+ IHL+I  P + C+   LD +D SGEQ + V H + K RL    +G  + + 
Sbjct: 61  DKSRGEKMEIHLNITFPRLPCELTTLDVMDVSGEQQVGVAHGVNKVRLAPAAEGGRVLDV 120

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRY 174
           Q   ++A + K +            DP+ CG C GA          CC+TC+EV+EAY  
Sbjct: 121 QALQLHAEEAKHL------------DPDYCGECGGAPPPPNAIKPGCCSTCDEVREAYAQ 168

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           K+W   +   I QC+ E+ +E++     EGC++ G + VN+V G+FHIAPG S+S N+VH
Sbjct: 169 KQWGFGKGTNIEQCEREHYSERIDAQRREGCRLEGVIRVNKVVGNFHIAPGRSFSSNNVH 228

Query: 235 VHDIQPY-----TSAAFNTTHHIRH-LSFGIKLQD---------DDERRKPLDGTVAKAE 279
           +HDI  Y     + A  +T  HI H L FG +L D         D     PLD T  +A 
Sbjct: 229 IHDIANYEERGLSPAEQHTMSHIIHSLRFGPQLPDELSDRWQWTDHHHTNPLDSTSQEAP 288

Query: 280 EGASMFNYYIKIIPTIYERLD--------------------------GSK---------- 303
           E A  F Y+IK++ T Y  L                           GS+          
Sbjct: 289 EPAYSFMYFIKVVSTSYLPLGWDPLYSASLHAAADTNTPLGAQGLSAGSQGSIETHQYSV 348

Query: 304 ------LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMC 343
                 L GGD             GG+PG+FF+Y++SP+ V   E + K+     T +  
Sbjct: 349 TSHKRSLRGGDASDEAHKERIHAAGGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCA 408

Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
            + GT      +D  L+  V ++ K+
Sbjct: 409 IVGGTLTVAAAIDRTLYEGVSRVRKL 434


>gi|336265645|ref|XP_003347593.1| hypothetical protein SMAC_04901 [Sordaria macrospora k-hell]
 gi|380096460|emb|CCC06508.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 428

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 140/433 (32%), Positives = 199/433 (45%), Gaps = 72/433 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L + +L   +  DY +V    EL V
Sbjct: 1   MAGKSRFTKLDAFTKTVEDARIRTTSGGIVTIVSLLVVLFLSWGEWRDYRKVVIHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKGRGERMEIHLNITFPKVPCELLTLDVMDVSGEQQHGVQHGVKKIRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +  K +       +    DP+ CG CYGA      +   CC+TC E++EAY    
Sbjct: 113 EGGGEIDAKVLALHAADESATHLDPSYCGPCYGAPAPYNAKKAGCCSTCEEIREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +  T+ QC+ E+ TE+L     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGDGSTMEQCQREHYTERLAEQRHEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDD----DERRKPLDGTVAKAEEGASMFNYYIKII 292
           D+  + ++       +R L  G   + +    +    PLD T  + ++    F Y++KI+
Sbjct: 233 DLAQWWNSPL-PDDLVRKLGGGKDGKRNTLWTNHHLNPLDNTRQETDDPNYNFMYFVKIV 291

Query: 293 PTIYERL----------------------------DGS-------------KLGGGD--- 308
           PT Y  L                            DGS              L GGD   
Sbjct: 292 PTSYLPLGWEKQAAQNKASWDQDHSVGLGVFGQGSDGSMETHQYSVTSHKRSLAGGDDAK 351

Query: 309 ----------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                     GG+PG+FFSY++SP+ +V   E++KS       +   + GT      VD 
Sbjct: 352 EGHGERLHSRGGIPGVFFSYDISPMKVVNREERAKSFIGFLAGLCAVVGGTLTVAAAVDR 411

Query: 358 LLHSCVKKISKVE 370
            L     ++ K+ 
Sbjct: 412 GLFEGTVRLKKLR 424


>gi|302688477|ref|XP_003033918.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
 gi|300107613|gb|EFI99015.1| hypothetical protein SCHCODRAFT_75438 [Schizophyllum commune H4-8]
          Length = 415

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 128/412 (31%), Positives = 194/412 (47%), Gaps = 56/412 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +T++    I     ++  DY +V     + VD 
Sbjct: 5   FLSHLKGIDAFGKTAEDVKVKTRTGALLTLIAASIILAFTTLEFFDYRKVIIDTSVTVDQ 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
           SRG +L + +++  P + C  L++D  D SG+    V HN+ K RLD DGK I+     E
Sbjct: 65  SRGERLTVRMNVTFPRVPCYLLSVDVTDISGDVQRDVSHNMLKTRLDKDGKAIRGAHTAE 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           + N + K+           E    + CGSCYG       CCNTC EV+ AY  + W+   
Sbjct: 125 LRNEIDKQN----------EQRGADYCGSCYGGLPPASGCCNTCEEVRTAYVNRGWSFNN 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D+I QCKNE   +KL+    EGC I G L +N+V+G+ H++PG S+     +V+++ PY
Sbjct: 175 PDSIEQCKNEGWADKLREQANEGCNIAGRLRINKVAGNIHLSPGRSFQTGGRNVYELVPY 234

Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASM 284
                N    +H I  LSF      D+ +R+              PLDGTV    +   M
Sbjct: 235 LRDDGNRHDFSHTIHSLSFEGDDAYDNRKRETSKEMRQRMGLSSNPLDGTVRVTNKAQYM 294

Query: 285 FNYYIKIIPTIYERLDGSKL----------------GG------------GDGGMPGIFF 316
           F Y++K++ T +  L+G  +                GG            G  G+PG F 
Sbjct: 295 FQYFVKVVSTKFRPLNGRTVNSHSYSVTHFERDLTDGGQAQTGQNVQVQHGVTGLPGAFI 354

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           ++++SP+ +  TE  +S  H  T     + G      L+D++L +  K + K
Sbjct: 355 NFDVSPIQLVHTEWRQSFAHFVTSTCAIVGGVLTVASLLDSVLFATSKALKK 406


>gi|115388503|ref|XP_001211757.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114195841|gb|EAU37541.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 438

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 141/443 (31%), Positives = 200/443 (45%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARIRTTSGGIITIASLLIILWLVWGEWVDYRRVVVMPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IHL+I  P + C+ L LD +D SGEQ + V H I K RL            
Sbjct: 61  DKSRGEKMEIHLNITFPRLPCELLTLDVMDVSGEQQVGVAHGINKVRL--------ASPA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---ETRKCCNTCNEVKEAYRYKKW 177
           E  + +  + +   +     +  DPN CG C G      E ++CCNTC EV+EAY   +W
Sbjct: 113 EGGHVLDVQALELHSEQEVAKHLDPNYCGECGGIPQQPGEPKRCCNTCEEVREAYAEHQW 172

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
           A  + + I QC+ E    ++     EGC++ G L VN+V G+FHIAPG S+S  ++HVHD
Sbjct: 173 AFGKGENIEQCEREGYAARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFSSGNIHVHD 232

Query: 238 IQPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGA 282
           ++ Y       S     THHI  L FG +L D         D     PLD TV + +  A
Sbjct: 233 LENYFELDQPASEKHTMTHHIHQLRFGPQLPDELSDRWQWTDHHHTNPLDDTVQETDLAA 292

Query: 283 SMFNYYIKIIPTIYERL-----------------------------DGS----------- 302
             + Y++K++ T Y  L                             DGS           
Sbjct: 293 FNYMYFVKVVSTAYLPLGWDPRVSSYIHSASSHNVPLGRHGIGYGHDGSIETHQYSVTSH 352

Query: 303 --KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
              L GG+              G+PG+FF+Y++SP+ V   E + K+     T +   I 
Sbjct: 353 KRPLMGGNAADEGHKERLHAAAGIPGVFFNYDISPMKVINREARPKTFTGFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+    ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGAIRVKKL 435


>gi|218192721|gb|EEC75148.1| hypothetical protein OsI_11348 [Oryza sativa Indica Group]
 gi|222624836|gb|EEE58968.1| hypothetical protein OsJ_10656 [Oryza sativa Japonica Group]
          Length = 355

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 130/392 (33%), Positives = 201/392 (51%), Gaps = 60/392 (15%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L+ LDA+ K  EDF+ +T+ GG +TI   L I  L   ++  Y   +T  +L V
Sbjct: 1   MDLWNKLRSLDAYPKVNEDFYSRTLSGGLITIASSLAILLLFLSEIRLYLYSATDSKLTV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+SRG +L I+ D+  P + C  +A+D +D SGEQH  + H+I K+R+D  G  I E +K
Sbjct: 61  DTSRGERLHINFDVTFPALPCSLVAVDTMDVSGEQHYDIRHDIIKKRIDNLGNVI-ESRK 119

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           + V A K ++   ++G      E    CGSCYG+E    +CCN+C +V++AYR K WAL 
Sbjct: 120 DGVGAPKIERPLQKHGGRLDHNE--VYCGSCYGSEESDDQCCNSCEDVRDAYRKKGWALT 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            ++ I QCK E   ++LK+   EGC I+G++ VN++S                       
Sbjct: 178 NIEEIDQCKREGFVQRLKDEQGEGCSIHGFVNVNKIS----------------------- 214

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGT--VAKAEEGAS-MFNYYIKIIPTIYE 297
                    H I  LSFG++         PLDG   + +   G + M+ Y++K++PTIY 
Sbjct: 215 ---------HKINKLSFGVEFPG---VVNPLDGVEWIQEHTNGLTGMYQYFVKVVPTIYT 262

Query: 298 RLDGSKLGGGDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            + G K+      +              PG++F YE SP+ V  TE++ SL H  T I  
Sbjct: 263 DIRGRKINSNQFSVTEHFREAIGYPRPPPGVYFFYEFSPIKVDFTEENTSLLHFLTNICA 322

Query: 344 NISGTYITFMLVDALL---HSCVKKISKVEIG 372
            + G +    ++D+ +   H  +KK  K+EIG
Sbjct: 323 IVGGIFTVAGIIDSFVYHGHRAIKK--KMEIG 352


>gi|407929248|gb|EKG22082.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
          Length = 442

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 145/453 (32%), Positives = 205/453 (45%), Gaps = 92/453 (20%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   + I +LI  +  ++ QV+   EL V
Sbjct: 1   MPAKSRFMRLDAFTKTVEDARVRTSTGGIVTITSIIMILWLIWGEWAEFRQVTVKPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IH++I  P I C+ L LD +D SGE    V H + K RL     P  E  +
Sbjct: 61  DKSRGEKMEIHMNISFPRIPCELLTLDVMDVSGEIQTGVMHGVNKVRL----TPENEGSR 116

Query: 121 EV-VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYK 175
            + VNA+        +        DP+ CG CYGA   T      CCNTC++V++AY   
Sbjct: 117 PIEVNALNLHADEASH-------MDPDYCGECYGAPAPTTAKKPGCCNTCDDVRDAYAAI 169

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            W+    D + QC+ E+  EKL     EGC++ G + VN+V G+FH APG S+S  ++HV
Sbjct: 170 SWSFTRGDGVEQCEREHYGEKLDAQRREGCRVEGGIRVNKVIGNFHFAPGKSFSNGNMHV 229

Query: 236 HDIQPY--TSAAFNTTHHIRHLSFGIKLQDD-----------------DERRKPLDGTVA 276
           HD++ Y    A  + TH +  L FG +L DD                 +    PLD T  
Sbjct: 230 HDLENYFKDGAPHSFTHQVHSLRFGPQLPDDVIAKLEASGMSASSLWTNHHINPLDNTEQ 289

Query: 277 KAEEGASMFNYYIKIIPTIYERL------------------------------DGS---- 302
           + +E A  F Y++K++ T Y  L                              +GS    
Sbjct: 290 RTDEKAFNFMYFVKVVSTAYLPLGWENKGSSSLSGLLPDADRAPLGSYGLASGEGSIETH 349

Query: 303 ---------KLGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWT 339
                     L GG+             GG+PG+FFSY++SP+ V   E ++KS      
Sbjct: 350 QYSVTSHKRSLAGGNDEKDGHKERLHARGGIPGVFFSYDISPMKVINRESRAKSFSGFLV 409

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            +   I GT      +D  L+    K+ K+  G
Sbjct: 410 GVCAVIGGTLTVAAAIDRALYEGSTKLKKLHQG 442


>gi|358378080|gb|EHK15763.1| hypothetical protein TRIVIDRAFT_86970 [Trichoderma virens Gv29-8]
          Length = 420

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 138/425 (32%), Positives = 201/425 (47%), Gaps = 66/425 (15%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +  DY ++    EL V
Sbjct: 1   MAPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVFFLSWGEWTDYRRIVVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGEQ   V H I K RL    +   E + 
Sbjct: 61  DKGRGERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGISKIRLRPAAQGGGEIES 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
             +  + +K           E   P+ CG CYGA      E   CCNTC+EV+EAY    
Sbjct: 121 NTLTQLHEK----------AEHLAPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQMS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  E+L     EGC+I G L+VN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR------------KPLDGTVAKAEE 280
           D++ Y         + TH I  L FG +L D    R             PLD T  + ++
Sbjct: 231 DLKTYWDFPEGKPHDFTHIIHSLRFGPQLPDTVIERMGGKNTWTNHHLNPLDATHQETKD 290

Query: 281 GASMFNYYIKIIPTIYERL---------DGS-------------KLGGGD---------- 308
               + Y++KI+PT Y  L         DGS              L GGD          
Sbjct: 291 PNFNYMYFVKIVPTSYLPLGWEKRTPGYDGSIETHQYSVTSHKRSLMGGDDSQEGHPERL 350

Query: 309 ---GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
               G+PG+FFSY++SP+ ++   E++K+     + +   + GT      VD  L     
Sbjct: 351 HARNGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAVDRGLFEGAS 410

Query: 365 KISKV 369
           ++ K+
Sbjct: 411 RLKKL 415


>gi|322708973|gb|EFZ00550.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Metarhizium anisopliae ARSEF 23]
          Length = 429

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 142/434 (32%), Positives = 205/434 (47%), Gaps = 75/434 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTI+    + +L   +  +Y +V    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIISLFVVLFLSWGEWAEYRRVVVRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL++  P + C+ L LD +D SGEQ   V H +   RL    +P  E Q 
Sbjct: 61  DKSRGERMQIHLNMTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL----RP--ESQG 114

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET--ETRK--CCNTCNEVKEAYRYKK 176
             V  +K  KV  +      +  DP+ CG CYGA      RK  CCNTC+EV+EAY  + 
Sbjct: 115 GGVIDIKSMKVHDD----PADHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQG 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC  E+  E+L     EGC++ G+LEVN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGRGENVEQCTREHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y         + TH I  L FG +L      R              PLDGT  +  
Sbjct: 231 DLKNYWETPNGKQHDFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHLNPLDGTRQEIG 290

Query: 280 EGASMFNYYIKIIPTIYERL-----------------DGS-------------KLGGGD- 308
           + A  + Y++KI+PT Y  L                 DGS              L GG+ 
Sbjct: 291 DPAFNYMYFVKIVPTSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGND 350

Query: 309 ------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                       GG+PG+FFSY++SP+ ++   E +K+       +   + GT      V
Sbjct: 351 AAEGHAERQHSQGGIPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAV 410

Query: 356 DALLHSCVKKISKV 369
           D  L     ++ K+
Sbjct: 411 DRGLFEGAARLKKM 424


>gi|358391585|gb|EHK40989.1| ER-derived vesicle Erv46-like protein [Trichoderma atroviride IMI
           206040]
          Length = 422

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/428 (31%), Positives = 202/428 (47%), Gaps = 68/428 (15%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +   Y ++    EL V
Sbjct: 1   MAPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWSSYRRIVVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H I K RL        +P  
Sbjct: 61  DKGRGERMDIHLNITFPNMPCELLTLDVMDVSGEQQHGVAHGITKLRL--------QPPS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
                ++   +   +     E  +P+ CG CYGA      E   CCNTC+EV+EAY    
Sbjct: 113 RGGGVIESNSLAQLH--EKAEHLNPDYCGGCYGATAPANAEKPGCCNTCDEVREAYAQAS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+ +E+L     EGC+I G L+VN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYSERLDQQREEGCRIEGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQ-----PYTSAAFNTTHHIRHLSFGIKLQDD------------DERRKPLDGTVAKAE 279
           D++     P    A + TH I  L FG +L  +            +    PLDG   +  
Sbjct: 231 DLKNYWDLPNGMKAHDFTHVIHSLRFGPQLPPEVIARMGRRTAWTNHHLNPLDGIHQETS 290

Query: 280 EGASMFNYYIKIIPTIYERL----------DGS-------------KLGGGD-------- 308
           +    + Y++KI+PT Y  L          DGS              L GGD        
Sbjct: 291 DPNFNYMYFVKIVPTSYLPLGWEQKSASASDGSVETHQYSVTSHKRSLMGGDDAKEGHAE 350

Query: 309 -----GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                GG+PG+FFSY++SP+ ++   E++K+     + +   + GT      +D  L   
Sbjct: 351 RLHSKGGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAAIDRGLFEG 410

Query: 363 VKKISKVE 370
             ++ K+ 
Sbjct: 411 ATRLKKLR 418


>gi|342874382|gb|EGU76396.1| hypothetical protein FOXB_13074 [Fusarium oxysporum Fo5176]
          Length = 439

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 141/445 (31%), Positives = 205/445 (46%), Gaps = 85/445 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +  +Y ++    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVLFLSWGEWAEYRRIEIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H + K RL        +P  
Sbjct: 61  DKGRGERMEIHLNITFPKMPCELLTLDVMDVSGEQQHGVMHGVNKVRL--------QPAN 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRK--CCNTCNEVKEAYRYKK 176
           +    +  K +   +   + +  DP+ CG CYGA+     RK  CC TC+EV+EAY    
Sbjct: 113 QGGAVIDIKSLALHD--ESADHLDPSYCGGCYGAQPPANARKAGCCQTCDEVREAYAQSS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  EKL     EGC+I G L VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQREEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDD-------------DERRKPLDGTVAKAE 279
           D++ Y       + + TH+I  L FG +L D+             +  + PLD T  +  
Sbjct: 231 DLKNYWDVPKGKSHDFTHYIHSLRFGPQLPDNIAKKVGTKSSLWTNHHQNPLDNTRQEIH 290

Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
           +    F Y++KI+PT Y  L                           DGS          
Sbjct: 291 DPNFNFMYFVKIVPTSYLPLGWDSKGIKIAGLLQDDNAGLGAYGYSEDGSVETHQYSVTS 350

Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
               L GG+             GG+PG+FFSY++SP+ +V   EK+K+       +   +
Sbjct: 351 HKRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410

Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
            GT      VD  L     +I K+ 
Sbjct: 411 GGTLTVAAAVDRGLFEGAARIKKMR 435


>gi|116181584|ref|XP_001220641.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
 gi|88185717|gb|EAQ93185.1| hypothetical protein CHGG_01420 [Chaetomium globosum CBS 148.51]
          Length = 438

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 145/442 (32%), Positives = 196/442 (44%), Gaps = 80/442 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  + + +L   +  DY +V    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLVVVFFLAWGEWSDYRRVEVHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P I C+ L LD +D SGEQ   V+H + K RL         PQ 
Sbjct: 61  DKGRGERMEIHLNITFPRIPCELLTLDVMDISGEQQHGVQHGVTKTRL--------RPQS 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K V            DP+ CG CYGA+         CCNTC EVK+AY    
Sbjct: 113 EGGGDIDTKAVALHARDEVATHLDPSYCGPCYGAQPPPNAKKPGCCNTCEEVKDAYAQAA 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + I QC+ E+ +EKL     EGC+I G L VN+V G+FHIAPG S+S  ++HVH
Sbjct: 173 WAFGRGEGIEQCEREHYSEKLDEQRNEGCRIEGGLRVNKVIGNFHIAPGRSFSNGNMHVH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDD-----DERR---------KPLDGTV----- 275
           D++ Y  T      +H I HL FG +L D+     D R+          PLD T      
Sbjct: 233 DLKNYWDTPTKHTFSHQIHHLRFGPQLPDNLHKKLDARKNMRGRSTTFNPLDDTPPGDGT 292

Query: 276 --------------------AKAEEGASMFNYYIKIIPTIYERLDGS------------- 302
                               A  +  A     +   + +     DGS             
Sbjct: 293 TSTTTTCTSSRSCPHRTCRWAGRKTWAGFREEHHAELGSFGASADGSVETHQYSVTSHKR 352

Query: 303 KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGT 348
            L GGD             GG+PG+FFSY++SP+ ++   EK+KS       +   + GT
Sbjct: 353 SLAGGDDSAEGHQERLHARGGIPGVFFSYDISPMKVINREEKAKSFLGFIAGLCAIVGGT 412

Query: 349 YITFMLVDALLHSCVKKISKVE 370
                 +D  L     ++ K+ 
Sbjct: 413 LTVAAAIDRALFEGGVRLKKMR 434


>gi|449299159|gb|EMC95173.1| hypothetical protein BAUCODRAFT_529716 [Baudoinia compniacensis
           UAMH 10762]
          Length = 435

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 142/441 (32%), Positives = 201/441 (45%), Gaps = 83/441 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VT+   L I YL+  +  DY +V+   EL V
Sbjct: 1   MPSKSRFTRLDAFTKTVEDARIRTTSGGIVTLASLLLILYLVWGEWADYRRVTVAPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IH++I  P + C+ L LD +D SGE    V H + K RL  DG+ +     
Sbjct: 61  DKGRGEKMEIHMNISFPRVPCELLTLDVMDVSGEVQTGVMHGVNKVRLGEDGREVGREAL 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E+   V++          + +  DP  CG CYGA          CCNTC EV+EAY    
Sbjct: 121 ELGKEVEE----------SMKHMDPEYCGECYGAPAPGNAIRAGCCNTCAEVREAYASVS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E+ +E L     EGC+I G + VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WSFGRGENVEQCEREHYSEHLDEQRREGCRIEGGIRVNKVVGNFHFAPGKSFSNGNMHVH 230

Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y +         +H I HL FG +L +D  RR              PLD T  K +
Sbjct: 231 DLENYFAGGEGIDHTFSHTIHHLRFGPQLPEDVVRRIGRRGMAWSNHHLNPLDETEQKTD 290

Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG---------------------------------- 305
           E A  + Y++K++ T Y  L   + G                                  
Sbjct: 291 EKAYNYMYFVKVVSTAYLPLGWERTGSILDIPHELVELGGYGKGEAGSVETHQYSVTSHK 350

Query: 306 ----GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISG 347
               GGD             GG+PG+FFSY++SP+ V   E +SKS       +   I G
Sbjct: 351 RSLAGGDGGEEGHKERLHARGGIPGVFFSYDISPMKVINREARSKSFSGFLVGVCAVIGG 410

Query: 348 TYITFMLVDALLHSCVKKISK 368
           T      +D  L+   +++ K
Sbjct: 411 TLTVAAAIDRALYEGGQRVKK 431


>gi|119496763|ref|XP_001265155.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
           fischeri NRRL 181]
 gi|119413317|gb|EAW23258.1| COPII-coated vesicle membrane protein Erv46, putative [Neosartorya
           fischeri NRRL 181]
          Length = 438

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 138/443 (31%), Positives = 198/443 (44%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +T+   + I YL+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IH++I  P + C+ L LD +D SGEQ + V H + K RL    +  +    
Sbjct: 61  DKSRGERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDV 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKK 176
           + ++   K+++            DPN CG C GA+    +    CCNTC+EV+EAY  K 
Sbjct: 121 QALDLHSKEEIAKH--------LDPNYCGDCGGADPLPGSMKEGCCNTCDEVREAYAAKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+ E    ++     EGC++ G L VN+V G+FHIAPG S++   VH H
Sbjct: 173 WAFGKGSNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAH 232

Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D+Q Y             THHI  L FG +L D         D     PLD T  +  + 
Sbjct: 233 DLQNYLDLELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDP 292

Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
           A  F Y++K++ T Y  L              D + LG                      
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSH 352

Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
                GGD              G+PG+FF+Y++SP+ V   E + KS     T +   I 
Sbjct: 353 KRSLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+    ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435


>gi|212540034|ref|XP_002150172.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210067471|gb|EEA21563.1| COPII-coated vesicle membrane protein Erv46, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 440

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 144/444 (32%), Positives = 204/444 (45%), Gaps = 86/444 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VT+V  + I +L+  +  DY +V    EL V
Sbjct: 1   MPPKSRFTRLDAFAKTVEDARVRTTSGGIVTLVSLVVILWLVWGEWADYRRVVVLPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD--LDGKPIQEP 118
           D SRG ++ IHL++  P + C+ L LD +D SGEQ + V H + K RL    DG  + + 
Sbjct: 61  DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQMGVVHGLNKVRLSSVADGGRVID- 119

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRY 174
                  V K ++ ++N        DP  CG C GA      +   CCNTC EV+EAY  
Sbjct: 120 -------VSKLELHSQNEVAIHL--DPEYCGECGGASPPENAKKPGCCNTCEEVREAYAL 170

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           K WA  + + I QC+ E   +++     EGC+I G + VN+V G+FHIAPG S+S  ++H
Sbjct: 171 KSWAFGKGENIEQCQREGYADRIDAQRREGCRIEGDIRVNKVIGNFHIAPGRSFSSGNMH 230

Query: 235 VHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAE 279
           VHD+  Y             +H I  L FG +L D+  +R          PLD T     
Sbjct: 231 VHDLDTYLDRELADYEKHTMSHIIHQLRFGPQLSDEVSQRWQWTDHHHTNPLDSTQQLTN 290

Query: 280 EGASMFNYYIKIIPTIY----------ERLDG---------------------------- 301
           E A  +NYYIK++ T Y          ++L G                            
Sbjct: 291 EPAYNYNYYIKVVSTSYLPLGWDSARSDQLHGDDQFTPLGLHGAAHGTAGSIETHQYSVT 350

Query: 302 ----SKLGGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCN 344
               S  GG D            GG+PG+FF+Y++SP+ V   E ++K+     T +   
Sbjct: 351 SHKRSLHGGNDAAEGHQERIHAEGGIPGVFFNYDISPMKVVNREARAKTFTGFLTGVCAV 410

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           I GT      VD  L+   ++I K
Sbjct: 411 IGGTLTVAAAVDRFLYEGSRRIRK 434


>gi|408400673|gb|EKJ79750.1| hypothetical protein FPSE_00030 [Fusarium pseudograminearum CS3096]
          Length = 439

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 141/445 (31%), Positives = 203/445 (45%), Gaps = 85/445 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +  DY ++    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L+LD +D SGEQ   V H + K RL        +P+ 
Sbjct: 61  DKGRGERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRL--------QPES 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           +    +  K ++  +        DP+ CG CYGA      +   CC TC+EV+EAY    
Sbjct: 113 QGGAVIDTKSLSLHD--DAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQAS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  EKL    +EGC+I G L VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230

Query: 237 DIQPYTSAAFNTTH---HIRH-LSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y       +H   HI H L FG +L D   R+              PLD T  +  
Sbjct: 231 DLKNYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETH 290

Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
           +    F Y++KI+PT Y  L                           DGS          
Sbjct: 291 DPNYNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTS 350

Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
               L GG+             GG+PG+FFSY++SP+ +V   EK+K+       +   +
Sbjct: 351 HRRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410

Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
            GT      VD  L     ++ K+ 
Sbjct: 411 GGTLTVAAAVDRGLFEGAARLKKMR 435


>gi|46105482|ref|XP_380545.1| hypothetical protein FG00369.1 [Gibberella zeae PH-1]
          Length = 444

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 141/445 (31%), Positives = 203/445 (45%), Gaps = 85/445 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +  DY ++    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLSWGEWADYRRIDIHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L+LD +D SGEQ   V H + K RL        +P+ 
Sbjct: 61  DKGRGERMEIHLNITFPKMPCELLSLDVMDVSGEQQHGVMHGVNKVRL--------QPES 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           +    +  K ++  +        DP+ CG CYGA      +   CC TC+EV+EAY    
Sbjct: 113 QGGAVIDTKSLSLHD--DAAHHLDPSYCGGCYGATPPANAQKAGCCQTCDEVREAYAQAS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  EKL    +EGC+I G L VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYGEKLDAQRSEGCRIEGGLRVNKVIGNFHFAPGRSFSSGNMHVH 230

Query: 237 DIQPYTSAAFNTTH---HIRH-LSFGIKLQDDDERR-------------KPLDGTVAKAE 279
           D++ Y       +H   HI H L FG +L D   R+              PLD T  +  
Sbjct: 231 DLKNYWDVPKGFSHDFTHIVHSLRFGPQLPDHIARKVGHKNTLWTNHHQNPLDDTRQETH 290

Query: 280 EGASMFNYYIKIIPTIYERL---------------------------DGS---------- 302
           +    F Y++KI+PT Y  L                           DGS          
Sbjct: 291 DPNYNFMYFVKIVPTSYLPLGWDKKGIKIAGLLQEDNAGLGAYGYGEDGSVETHQYSVTS 350

Query: 303 ---KLGGGD-------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNI 345
               L GG+             GG+PG+FFSY++SP+ +V   EK+K+       +   +
Sbjct: 351 HRRSLAGGNDAAEGHAERQHTSGGIPGVFFSYDISPMKVVNREEKAKTFSGFLAGLCAIV 410

Query: 346 SGTYITFMLVDALLHSCVKKISKVE 370
            GT      VD  L     ++ K+ 
Sbjct: 411 GGTLTVAAAVDRGLFEGAARLKKMR 435


>gi|70990824|ref|XP_750261.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus fumigatus
           Af293]
 gi|66847893|gb|EAL88223.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           fumigatus Af293]
 gi|159130735|gb|EDP55848.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           fumigatus A1163]
          Length = 438

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 138/443 (31%), Positives = 199/443 (44%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +T+   + I YL+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARIRTTSGGIITLASLVVILYLVWGEWLDYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IH++I  P + C+ L LD +D SGEQ + V H + K RL    +  +    
Sbjct: 61  DKSRGERMEIHMNITFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLSSPAEGGRVLDV 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKK 176
           + ++   K+++            DPN CG C GA+    +    CCNTC+EV+EAY  K 
Sbjct: 121 QALDLHSKEEIAKH--------LDPNYCGDCGGADPLPGSIKEGCCNTCDEVREAYAAKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+ E    ++     EGC++ G L VN+V G+FHIAPG S++   VH H
Sbjct: 173 WAFGKGTNIEQCEREGYAARIDAQRREGCRLEGILRVNKVVGNFHIAPGRSFTSGQVHAH 232

Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D+Q Y  +          THHI  L FG +L D         D     PLD T  +  + 
Sbjct: 233 DLQNYLDSELPDNEKHTMTHHIHQLRFGPQLPDEVSDRWQWTDHHHTNPLDSTSQETNDP 292

Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
           A  F Y++K++ T Y  L              D + LG                      
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSAAHNAHDQTPLGSHGIAYGSGGSIETHQYSVTSH 352

Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
                GGD              G+PG+FF+Y++SP+ V   E + KS     T +   I 
Sbjct: 353 KRSLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKSFSGFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+    ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435


>gi|340520521|gb|EGR50757.1| predicted protein [Trichoderma reesei QM6a]
          Length = 430

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 138/436 (31%), Positives = 203/436 (46%), Gaps = 76/436 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L   +  DY ++    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIVSLLVVVFLAWGEWTDYRRIVVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGEQ   V H I K RL        E + 
Sbjct: 61  DKGRGERMDIHLNMTFPNMPCELLTLDVMDVSGEQQHGVAHGITKIRLQPAALGGGEIES 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           + ++ + +K           E  DPN CG CYGA      +   CCNTC+EV+EAY    
Sbjct: 121 KSLSQLHEK----------AEHLDPNYCGGCYGAIAPSTAQKPGCCNTCDEVREAYALAS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  E+L     EGC+I G L+VN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGRGEGVEQCEREHYAERLDQQREEGCRIEGLLQVNKVIGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQD------------DDERRKPLDGTVAKAEE 280
           D++ Y       + + TH I  L FG +L D             +    PLD T    ++
Sbjct: 231 DLKNYWDLPEGKSHDFTHIIHSLRFGPQLPDTVIERLGGKNTWSNHHLNPLDNTRQDTKD 290

Query: 281 GASMFNYYIKIIPTIYERL-------------------DGS-------------KLGGGD 308
               + Y++KI+PT Y  L                   DGS              L GGD
Sbjct: 291 PNFNYMYFVKIVPTSYLPLGWEKRKPSTTNGGVTTFYSDGSIETHQYSVTSHKRSLMGGD 350

Query: 309 -------------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFML 354
                         G+PG+FFSY++SP+ ++   E++K+     + +   + GT      
Sbjct: 351 DAKEGHPERLHARNGIPGVFFSYDISPMKVINREERAKTFLGFLSGLCAIVGGTLTVAAA 410

Query: 355 VDALLHSCVKKISKVE 370
           VD  L     ++ K+ 
Sbjct: 411 VDRGLFEGATRLKKLR 426


>gi|403417426|emb|CCM04126.1| predicted protein [Fibroporia radiculosa]
          Length = 419

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 121/404 (29%), Positives = 189/404 (46%), Gaps = 51/404 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+G+DAF K  +D   KT  G  +TI+    I     ++  DY QV     + VD SRG 
Sbjct: 10  LRGVDAFGKTTDDVKVKTRTGAFLTILSAAIILAFTMMEFLDYRQVKIDTSVVVDKSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL + +++  P + C  L+LD +D SGE    + HNI K RL+  G P+Q   K      
Sbjct: 70  KLNVRMNVTFPRVPCYLLSLDVMDISGESQADITHNILKTRLNEKGIPLQSLAKSAELRN 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
              K+  + G         N CGSCYG +     CCNTC++V++AY  + W+    D+I 
Sbjct: 130 DLDKINEQRG--------DNYCGSCYGGQAPPGGCCNTCDQVRQAYIDRGWSFTRPDSIE 181

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC NE  +EKLK   +EGC I G + VN+V G+  ++PG S+     +++D+ PY     
Sbjct: 182 QCTNEGWSEKLKEQASEGCNIAGKVRVNKVIGNIQLSPGRSFRTAAQNMYDLVPYLKEDK 241

Query: 247 NTTHHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASMFNYYIKII 292
           N  H   H       + D E+ +              PLD T  K  +   MF Y++K++
Sbjct: 242 N-RHDFSHTIHQFAFESDQEKERHRARDFQKRVGIESPLDNTERKTSKQQYMFQYFLKVV 300

Query: 293 PTIYERLD--------------------GSKLGGGDG--------GMPGIFFSYELSPLM 324
            T +  LD                    G +    +G        G+PG+F +Y++SP++
Sbjct: 301 STHFAMLDNKVYKTHQYSATHFERDLTKGQQEDNKEGVHIAHTATGIPGVFINYDISPML 360

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +  +E  +S  H  T     + G      L+D++L +  + + K
Sbjct: 361 ILHSETRQSFAHFLTSTCAIVGGVLTVASLIDSVLFATTRALKK 404


>gi|392591676|gb|EIW81003.1| ER-derived vesicles protein ERV46 [Coniophora puteana RWD-64-598
           SS2]
          Length = 419

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 131/417 (31%), Positives = 198/417 (47%), Gaps = 64/417 (15%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +T++    I     ++  DY +V T   + VD 
Sbjct: 5   FLAGLKGIDAFGKTTEDVKVKTRTGAFLTLLSAAIILSFTLMEFVDYRRVYTDTSIVVDR 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-E 121
           SRG KL + +++  P + C  L++D +D SGE    V HN+ K+RLD  GK I   +  +
Sbjct: 65  SRGEKLSVRMNVTFPHVPCYLLSVDVMDISGETQRDVSHNVVKQRLDKTGKGIAGSRSGD 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALP 180
           + N + K            EL  P+ CGSCYG  T T   CCN+C EV++AY  K W+  
Sbjct: 125 LRNEIDK----------LAELRGPDYCGSCYGGYTSTDNGCCNSCEEVRQAYVNKGWSFG 174

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
             + I QC  E  T+K+K+   EGC I G + VN+V G+ +I+PG S+     + +D  P
Sbjct: 175 NPEGIEQCTQEGWTDKVKDQADEGCNISGRIRVNKVVGNINISPGRSFQTGSRNFYDFVP 234

Query: 241 Y---TSAAFNTTHHIRHLSFGIKLQDDD------------ERR-----KPLDGTVAKAEE 280
           Y        + TH+I  L+F   L DD+            ++R      PLDG  A   +
Sbjct: 235 YLKEDGGQHDFTHYIDELTF---LADDEYNPNKMKHGKELKQRMGLDSNPLDGFKASTTK 291

Query: 281 GASMFNYYIKIIPTIYERLDGSK------------------LGGGD-----------GGM 311
              M+ Y++K++ T +  L+G                    +GGG+           GG 
Sbjct: 292 KMFMYQYFLKVVSTQFRTLNGRTINTHQYSATHFERDLSRGMGGGENNQGVYVQHGAGGA 351

Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           PG +F++E+SP+ V   E  +S  H  T     + G      L+D+ L +  + + K
Sbjct: 352 PGAYFNFEISPIQVVHAETRQSFAHFLTSTCAIVGGVLTVAALLDSFLFATSRALKK 408


>gi|330919615|ref|XP_003298687.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
 gi|311327999|gb|EFQ93219.1| hypothetical protein PTT_09471 [Pyrenophora teres f. teres 0-1]
          Length = 437

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 139/443 (31%), Positives = 199/443 (44%), Gaps = 85/443 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   L I +L   +  DY +V+   EL V
Sbjct: 1   MPAKSRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ I ++I  P + C+ L LD +D SGE  + V H I K RL         P+ 
Sbjct: 61  DKSRGERMEIAMNISFPRMPCELLTLDVMDVSGELQMGVTHGINKVRLS--------PEA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           +   A++ K V     T       P+ CG CYGA   +      CCNTC+EV++AY    
Sbjct: 113 DGSKAIEIKAVDLH--TDEASHLAPDYCGQCYGAPAPSNAKKPTCCNTCDEVRDAYASVS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E+  E L     EGC++ G ++VN+V G+FH APG S+S  ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGNIKVNKVVGNFHFAPGKSFSNGNLHVH 230

Query: 237 DIQPYTSAAFNT--THHIRHLSFGIKLQD------------------DDERRKPLDGTVA 276
           D++ Y    +    THHI  L FG +L D                   +    PLD T+ 
Sbjct: 231 DLENYFKDEYTHTFTHHIHQLRFGPQLSDVVVQNMQKKHQESGIGGWSNHHINPLDETMQ 290

Query: 277 KAEEGASMFNYYIKIIPTIYERLDGSKL-------------------------------- 304
             +E A  + Y+IK++ T+Y  L   K+                                
Sbjct: 291 HTDEKAYNYMYFIKVVTTVYLPLGWEKVFPHPSKFSDILGATIDESYKGSIETHQYSVTS 350

Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
                 GG D            GG+PG+FFSY++SP+ V   E + K+       +   I
Sbjct: 351 HKRSLQGGNDEKDGHKERIHARGGIPGVFFSYDISPMEVINREVREKTFSGFLVGLCAVI 410

Query: 346 SGTYITFMLVDALLHSCVKKISK 368
            GT      +D  L+  V +I K
Sbjct: 411 GGTLTVAAAIDRALYEGVNRIKK 433


>gi|388581981|gb|EIM22287.1| endoplasmic reticulum-derived transport vesicle ERV46 [Wallemia
           sebi CBS 633.66]
          Length = 407

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 193/400 (48%), Gaps = 53/400 (13%)

Query: 9   GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
           G DAF K  E+   +T +G  +TI+C + IS+L   +  DY  V     + VD SR  KL
Sbjct: 8   GFDAFAKTLEESRIRTNFGAYLTIICAILISFLTFNEFRDYRAVDFKPRIIVDQSRSEKL 67

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
            ++ ++  P + C  L+LD +D SGEQ   + H I + RL   G        E ++ +K 
Sbjct: 68  QLNFNVTFPRVPCYLLSLDLMDVSGEQVRDLRHAIVRTRLSEKG--------ETIDGMKT 119

Query: 129 KKVTTENGTTTTELEDPNKCGSCYGA-ETETRKCCNTCNEVKEAYRYKKWALPELDTIVQ 187
             ++        E+  P +CGSCYG       KCC TC++V+E+Y  + W+    D + Q
Sbjct: 120 AGMSG----YLNEVAKPRECGSCYGGVPPNEEKCCYTCDDVRESYVKQGWSFVNPDGVKQ 175

Query: 188 CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
           C +E+  E++K   +EGC + G ++VN+V G+FHI+PG S+  N  H+HD+ PY   A N
Sbjct: 176 CLDEHWAERVKEQSSEGCNVAGLVDVNKVVGNFHISPGRSFQSNAHHIHDLVPYLKNANN 235

Query: 248 ---TTHHIRHLSFGIKLQ--DDDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIY 296
                H + H SF    +  D D  ++      PL  T A  E    MF Y++K++ T +
Sbjct: 236 HHDFGHILHHFSFKSSNEPADTDNLKEMLNINDPLSNTKAHTEVSNYMFQYFLKVVSTDF 295

Query: 297 ERLDGSKLGG-----------------------------GDGGMPGIFFSYELSPLMVKI 327
           + L+G KL                               G  G PG+FF+Y++SPL V  
Sbjct: 296 DFLNGEKLNSHQYSATAYERNLDEKGIYAQDGHGQTILHGVEGFPGVFFNYDISPLRVIY 355

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           TE  +S     T     + G      ++DA +    +K++
Sbjct: 356 TESRRSFASFLTSTCAIVGGVLTVASIIDAGVFGARQKLT 395


>gi|261188384|ref|XP_002620607.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis SLH14081]
 gi|239593207|gb|EEQ75788.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis SLH14081]
 gi|239609349|gb|EEQ86336.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis ER-3]
 gi|327354450|gb|EGE83307.1| COPII-coated vesicle membrane protein Erv46 [Ajellomyces
           dermatitidis ATCC 18188]
          Length = 435

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 135/440 (30%), Positives = 195/440 (44%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   + I +LI  +  +Y +V    EL V
Sbjct: 1   MPPKSRFARLDAFTKTVEDARIRTRSGGVVTITALIIIFFLIWGEWSEYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V H + K RL         P +
Sbjct: 61  DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTEVVHGVNKLRL--------SPAE 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +    +   + T   +  DPN CGSCYGA      +   CCNTC+EV+EAY  K+
Sbjct: 113 EGGQVLDITALQLHSKTDNAKDLDPNYCGSCYGAPAPPNAQKPGCCNTCDEVREAYAAKR 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E  +  L     EGC++ G + VN+V G+FHIAPG S++  ++H H
Sbjct: 173 WSFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVIGNFHIAPGRSFTNGNMHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T    N  H I +L FG +L D+  RR          PLD T          F
Sbjct: 233 DLNNYYNTPIPHNVGHKIHYLRFGPQLPDEVSRRWKWTDHHHTNPLDNTEQHTTNPRLNF 292

Query: 286 NYYIKIIPTIYERL----------------------DGSKLGGG---------------- 307
            Y++K++ T Y  L                       G  LG G                
Sbjct: 293 AYFVKVVATSYLPLGWDDDWSSTVHSKVSNNVPLGKQGVSLGSGGSIETHQYSVTSHKRS 352

Query: 308 -----------------DGGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
                             GG+PG+F +Y++SP+ V   E ++K+     T +   I GT 
Sbjct: 353 VDGGNDAEEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                +D  L+    ++ K+
Sbjct: 413 TVAAAIDRALYEGSVRVKKL 432


>gi|407034208|gb|EKE37117.1| hypothetical protein ENU1_208770 [Entamoeba nuttalli P19]
          Length = 361

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 191/378 (50%), Gaps = 32/378 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  D + K  ED   +  +GG +TI+C + I  L   +   Y Q     +L VD  R S
Sbjct: 1   MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+P+H DI  P  SC   ++D +  SGE  + +E N+ K R+  DG  + E + + + + 
Sbjct: 61  KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTENEMKAIQS- 119

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                       +TE  DP +C SCYGAET  +KCC TC++VKEAY+ K W L +L+ + 
Sbjct: 120 ----------KLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVS 168

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+N    +  K T  EGC++ G   +N++ G+FHIAPG S  +   H H+++       
Sbjct: 169 QCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQI 228

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
           + +H    LSFG       E  K    T  K  +  SMF YY+ IIP     ++G+    
Sbjct: 229 DLSHKWNELSFG-------ENSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280

Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                     G   G PG+F  Y++SP+++++TE +    H    I   + G + TF L 
Sbjct: 281 DYSIQENTRSGKGEGQPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340

Query: 356 DALLHSCVKKI-SKVEIG 372
           DA++   +  +  KVE+G
Sbjct: 341 DAIVFESIHTLKKKVELG 358


>gi|343425773|emb|CBQ69306.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 435

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/419 (30%), Positives = 204/419 (48%), Gaps = 62/419 (14%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+G+DAF+K  +D   +T  G  +T+V  L I  L   +  DY  V     L VD SRG
Sbjct: 9   QLRGIDAFSKTMDDVRIRTNAGALITLVSVLLIVVLTIGEFVDYRTVHLKPALEVDRSRG 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL ++++I  P + C  L+LD +D SGE    ++H+I + R+  DGK +++ +K +   
Sbjct: 69  EKLTVNMNITFPRVPCYLLSLDVMDISGEHVNDIQHDIERTRISHDGKVVEQGKKHLKGD 128

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
             +   T          +  + CG CYG +     CCNTC+EV+EAY  + W+  + D +
Sbjct: 129 AARIANT----------KGKDYCGDCYGGQPPASGCCNTCDEVREAYVRRGWSFADPDHV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E  ++K+K    EGC+I G L VN+V GSFH++PG ++  N +H+HD+ PY S  
Sbjct: 179 DQCVAEGWSDKIKQQNKEGCRISGKLHVNKVVGSFHLSPGKAFQRNSMHIHDLVPYLSGT 238

Query: 246 FNTTHHIRHL----SFGIK-----LQDDDER--------RKPLDGTVAKAEEGASMFNYY 288
               H   H+    SFG +     L    ER        + PL G  A+ ++   MF Y+
Sbjct: 239 GAEHHDFGHIIHEFSFGSEQEYHGLTTAKERAVKAKLGVKDPLAGVRAQTQQSQFMFQYF 298

Query: 289 IKIIP------------------TIYER-----------------LDGSKLGGGDGGMPG 313
           +K++                   T YER                   G+ +  G  G+PG
Sbjct: 299 VKVVATEFRPLAGETLKTQQYSVTTYERDLSPGASAAALAGMSNEGSGAHISHGFAGVPG 358

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +FF+YE+SPL     E  +SL H  T     + G      ++D+L+++  +++   + G
Sbjct: 359 VFFNYEISPLKTIHAEYRQSLAHFLTSTCAIVGGILTVAGILDSLVYNSRRRLGLRDAG 417


>gi|451849936|gb|EMD63239.1| hypothetical protein COCSADRAFT_38106 [Cochliobolus sativus ND90Pr]
          Length = 437

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 143/444 (32%), Positives = 202/444 (45%), Gaps = 85/444 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L I +L   +  DY +V+   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVEDARVRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ I L+I  P + C+ + LD +D SGE  + V H I K RL         P++
Sbjct: 61  DKGRGERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLS--------PER 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    ++ K +   +    + L  P+ CG C+GA          CCNTC+EV++AY    
Sbjct: 113 EGSKTIEIKALDL-HADEASHLA-PDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASIS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E+  E L     EGC++ G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVH 230

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD------DDERR------------KPLDGTVA 276
           D++ Y     A   TH I  L FG +L D       D+ R             PLD T  
Sbjct: 231 DLENYFKDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHRGSGPGSWSNHHINPLDNTEQ 290

Query: 277 KAEEGASMFNYYIKIIPTIY---------------ERLDGSKL----------------- 304
             +E A  F Y+IK++ T Y               + L GS +                 
Sbjct: 291 HTDEKAFNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDATHKGSIETHQYSVTS 350

Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
                 GG D            GG+PG+FFSY++SP+ V   E + K+       +   I
Sbjct: 351 HKRNLKGGNDEKDGHKERVHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVI 410

Query: 346 SGTYITFMLVDALLHSCVKKISKV 369
            GT      VD  L+  V +I K+
Sbjct: 411 GGTLTVAAAVDRALYEGVNRIKKI 434


>gi|317025332|ref|XP_001388859.2| COPII-coated vesicle membrane protein Erv46 [Aspergillus niger CBS
           513.88]
 gi|350638031|gb|EHA26387.1| hypothetical protein ASPNIDRAFT_196625 [Aspergillus niger ATCC
           1015]
          Length = 438

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 138/443 (31%), Positives = 198/443 (44%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWADYRRVVVMPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IHL++  P + C+ L LD +D SGEQ   V H I K RL            
Sbjct: 61  DKSRGEKMEIHLNVTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRL--------TSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K +   +   + +  DP+ CG CYGA          CCNTC+EV+EAY  ++
Sbjct: 113 EGGRVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC+ E   E++     EGC++ G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 232

Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D+  +  A          TH I  L FG +L D         D     PLDGT  +  E 
Sbjct: 233 DLANFFDADLPDAEKHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDGTKQETNEP 292

Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
              + Y++K++ T Y  L              D + LG                      
Sbjct: 293 GYNYMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSH 352

Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
                GGD              G+PG+F +Y++SP+ V   E + K+     T +   I 
Sbjct: 353 KRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+  V ++ K+
Sbjct: 413 GTLTVAAALDRGLYEGVSRMKKL 435


>gi|12060847|gb|AAG48265.1|AF308298_1 serologically defined breast cancer antigen NY-BR-84, partial [Homo
           sapiens]
          Length = 239

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 105/229 (45%), Positives = 147/229 (64%), Gaps = 6/229 (2%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 15  KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 74

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+    +   + 
Sbjct: 75  DKLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAER--HE 132

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K +VT  +  +     DP++C SCYGAE E  KCCNTC +V+EAYR + WA    DTI
Sbjct: 133 LGKVEVTVFDPDSL----DPDRCESCYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTI 188

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
            QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVH
Sbjct: 189 EQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 237


>gi|440636941|gb|ELR06860.1| hypothetical protein GMDG_08151 [Geomyces destructans 20631-21]
          Length = 441

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 138/448 (30%), Positives = 203/448 (45%), Gaps = 89/448 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTI   L + YL   +  DY ++    EL V
Sbjct: 1   MPPKSRFTRLDAFTKTVDEARIRTTSGGIVTIASLLIVIYLAFGEWADYRRIVVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL---DLDGKPIQE 117
           D SRG K+ I ++I  P + C+ L LD +D SGE    V+H + K RL   D  G     
Sbjct: 61  DKSRGEKMEIWMNITFPYVPCELLTLDVMDVSGEMQTGVKHGVSKVRLNSPDAGG----- 115

Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYR 173
                  A+  K +   +        DP+ CG CYGA      +   CCNTC+EV++AY 
Sbjct: 116 ------GAIDVKALDLHSTEEKAAHLDPSYCGQCYGATPPPNAQKAGCCNTCDEVRDAYA 169

Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
              WA    + + QC+ E+ +E+L     EGC+I G + VN+V G+FHIAPG SYS  ++
Sbjct: 170 SASWAFGRGENVEQCEREHYSERLDEQRKEGCRIEGGVRVNKVIGNFHIAPGRSYSNGNM 229

Query: 234 HVHDIQ-----PYTSAAFNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTV 275
           HVHD+      P      +  H I H+ FG +L +   ++              PLDGT 
Sbjct: 230 HVHDLANYWDTPSLERGHSFAHTIHHVRFGPQLPEGLSKKFGGKNQPWTNHHLNPLDGTQ 289

Query: 276 AKAEEGASMFNYYIKIIPTIY--------------------------ERLDGS------- 302
               + A  + Y++K++ T Y                            +DGS       
Sbjct: 290 QHTRDPAFNYMYFVKVVSTSYLPLGWNSKSAAKTQISEENIGLGAYGHAVDGSVETHQYS 349

Query: 303 ------KLGGGDG-------------GMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIM 342
                  L GGD              G+PG+FFSY++SP+ ++   E++K+L    T + 
Sbjct: 350 VTSHKRSLSGGDDGAEGHKERLHSRTGIPGVFFSYDISPMKVINREERTKTLSGFITGLC 409

Query: 343 CNISGTYITFMLVDALLHSCVKKISKVE 370
             + GT      VD  L+  V +I K++
Sbjct: 410 AIVGGTLTVAAAVDRGLYEGVSRIKKLQ 437


>gi|189203047|ref|XP_001937859.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984958|gb|EDU50446.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 437

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 143/445 (32%), Positives = 201/445 (45%), Gaps = 89/445 (20%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI   L I +L   +  DY +V+   EL V
Sbjct: 1   MPVKSRFNKLDAFTKTVEDARVRTTSGGIVTIASLLVIFWLSWGEWADYRRVTVRPELMV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL--DLDGKPIQEP 118
           D  RG ++ I +++  P I C+ L LD +D SGE  + V H I K RL  + DG  + E 
Sbjct: 61  DKGRGERMEIAMNVSFPRIPCELLTLDVMDVSGELQMGVTHGINKVRLSPEADGSKVIET 120

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRY 174
           +   ++A +   +             P+ CG CYGA   T      CCNTC+EV++AY  
Sbjct: 121 KALDLHADEASHLA------------PDYCGQCYGAPPPTNAKKPNCCNTCDEVRDAYAS 168

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
             W+    + + QC+ E+  E L     EGC++ G ++VN+V G+FH APG S+S  ++H
Sbjct: 169 ISWSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGSIKVNKVVGNFHFAPGKSFSNGNLH 228

Query: 235 VHDIQPY--TSAAFNTTHHIRHLSFGIKLQD---DDERRK---------------PLDGT 274
           VHD++ Y     A   TH I  L FG +L D    D ++K               PLD T
Sbjct: 229 VHDLENYFKDDYAHTFTHRIHQLRFGPQLSDVVVRDMQKKHLDSGHNGWSNHHVNPLDNT 288

Query: 275 VAKAEEGASMFNYYIKIIPTIY------------------------ERLDGS-------- 302
           V   +E A  + Y+IK++ T Y                        E   GS        
Sbjct: 289 VQHTDEKAYNYMYFIKVVSTAYLPLGWEQEFPHPSKYSDILGTTIDESYKGSIETHQYSV 348

Query: 303 -----KLGGG-------------DGGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMC 343
                 L GG              GG+PG+FFSY++SP+ V   E + KS       +  
Sbjct: 349 TSHKRSLQGGTDEKDGHKERIHARGGIPGVFFSYDISPMKVVNREVREKSFSGFLVGLCA 408

Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
            I GT      +D  L+  V +I K
Sbjct: 409 VIGGTLTVAAAIDRALYEGVNRIKK 433


>gi|167376738|ref|XP_001734125.1| endoplasmic reticulum-golgi intermediate compartment protein
           [Entamoeba dispar SAW760]
 gi|165904489|gb|EDR29705.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba dispar SAW760]
          Length = 361

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/378 (33%), Positives = 192/378 (50%), Gaps = 32/378 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  D + K  ED   +  +GG +TI+C + I  L   +   Y Q     +L VD  R S
Sbjct: 1   MKRFDTYGKLPEDLRTRHCFGGFLTIICVVIIIILSIAEFTFYLQREVVPQLLVDRDRSS 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+P+H DI  P  SC   ++D +  SGE  + +E N+ K R+  DG  + E + + + + 
Sbjct: 61  KIPVHFDITFPYSSCPITSVDILTKSGESMIDIEQNVTKIRIHHDGSLVTESEMKAIQS- 119

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                       +TE  DP +C SCYGAET  +KCC TC++VKEAY+ K W L +L+ + 
Sbjct: 120 ----------KLSTETHDPKECRSCYGAETPEKKCCFTCDDVKEAYKKKGWRL-DLNIVS 168

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+N    +  + T  EGC++ G   +N++ G+FHIAPG S      H H+++       
Sbjct: 169 QCQNHEKIQMARLTKDEGCRVIGDFLLNKIGGNFHIAPGSSEQSWGRHSHNLEWTGKTQI 228

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP----------TIY 296
           + +H    LSFG       E  K    T  K  +  SMF YY+ IIP          T Y
Sbjct: 229 DLSHKWNELSFG-------EHSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280

Query: 297 ERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
           +      +  G+G G PG+F  Y++SP+++++TE +    H    I   + G + TF L 
Sbjct: 281 DYSIQENIRSGEGEGSPGVFVYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340

Query: 356 DALLHSCVKKIS-KVEIG 372
           DA++   +  +  KVE+G
Sbjct: 341 DAIVFESIHSLEKKVELG 358


>gi|389744843|gb|EIM86025.1| ER-derived vesicles protein ERV46 [Stereum hirsutum FP-91666 SS1]
          Length = 419

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 122/411 (29%), Positives = 195/411 (47%), Gaps = 55/411 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +T++    I     ++  DY +V+    + VD 
Sbjct: 5   FFGALKGVDAFGKTMEDVKVKTRTGAFLTLMAAAIILTFTTMEFFDYRRVTMDTSVEVDR 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
           SRG KL + +++  P + C  L+LD +D SGE    + HNI K RL+ DG  +      +
Sbjct: 65  SRGEKLTVRMNVTFPRVPCYLLSLDVMDISGETQRDISHNIVKTRLNSDGTQVPNSANMQ 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           + N + K     ++G           CGSCYG       CCNTC++V+EAY  + W+   
Sbjct: 125 LRNELDKLNAQRQDGY----------CGSCYGGTPPEGGCCNTCDQVREAYVQRGWSFGN 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D+I QC  E+ +EKL    +EGC I G + VN+V G+ H++PG S+  +   ++++ PY
Sbjct: 175 PDSIEQCVQEHWSEKLHEQSSEGCNISGRVRVNKVIGNIHLSPGKSFQNSASSIYELVPY 234

Query: 242 TSAAFNT---THHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASM 284
                N    +H +  L+FG   + D  + K              PLDG  A+  + ++M
Sbjct: 235 LKDDKNRHDFSHIVHSLTFGADDEYDSRKTKIANEMKQRMGLDSNPLDGYHARTSQPSTM 294

Query: 285 FNYYIKIIPTIYERLDGSKLGG---------------------------GDGGMPGIFFS 317
           F Y++K + T +  +DG  +                             G  G+PG FF+
Sbjct: 295 FQYFLKAVSTQFRTIDGKVVNTHQYQVTHYNRDAGNPQDKTNQGVNVMHGITGVPGAFFN 354

Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           YE+SP+ V   E  +S  H  T     + G      ++D++L +  +++ K
Sbjct: 355 YEISPIKVIHEETRQSFAHFLTSTCAIVGGVLTVTSILDSVLFAANQRLKK 405


>gi|384483831|gb|EIE76011.1| hypothetical protein RO3G_00715 [Rhizopus delemar RA 99-880]
          Length = 408

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 121/388 (31%), Positives = 194/388 (50%), Gaps = 36/388 (9%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F +R + LDA+ K  +DF  +T  GGAVTI+  L I  L+  +   Y       E+ VD 
Sbjct: 26  FIKRFRKLDAYAKTLDDFRVRTATGGAVTIISGLCILILVLFETVQYLTPIMKPEILVDG 85

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
               KLPI  DI  P + C  L+LD +D SGE   + +H++YK RLD        P  EV
Sbjct: 86  GNMEKLPIKFDITFPHLPCYMLSLDIMDESGEHISNYDHDVYKERLD--------PNGEV 137

Query: 123 VNAVKKKKVTTENGTTTTE--LEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           + A K   ++        E  +  P+  CGSCYGA+  + +CCNTC E++ AY    W +
Sbjct: 138 ITAEKSNDLSNSQAKNAREHSMNVPDDYCGSCYGAKG-SNECCNTCEEIQNAYSELGWNV 196

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            + D   QC  E   EK+++   EGC+++G L VN++ G+FH + G ++  +  H+HD+ 
Sbjct: 197 -DPDNFEQCIREGWKEKIESQSREGCRMHGTLLVNKIRGNFHFSAGKAFKQSGSHIHDMS 255

Query: 240 PY--TSAAFNTTHHIRHLSFGIKLQDDDERRK--------PLDGTVAKAEEGASMFNYYI 289
            +       N  H I+HL FG    + +++++        PL+   +   E A M+ Y++
Sbjct: 256 TFLHNDKNQNFMHTIQHLQFGNHDYNSEKQKRTKSRELIHPLENIKSGNSETAIMYQYFL 315

Query: 290 KIIPTIYERLDGSKLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGH 336
           KI+PT +  L+G ++                 GG+PG+FF  + SP+ +  +E   SL  
Sbjct: 316 KIVPTEFNFLNGKRIRTFQYSVSKQDHIVSYLGGLPGVFFMLDHSPMRIIYSETKTSLAS 375

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVK 364
             T +   I G +    ++D  +   +K
Sbjct: 376 YLTSLCAIIGGIFTVASVIDGSIQHMLK 403


>gi|400602673|gb|EJP70275.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Beauveria bassiana ARSEF 2860]
          Length = 423

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 201/428 (46%), Gaps = 69/428 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  L + +L+  +  DY  ++   EL V
Sbjct: 1   MAAKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLLVVLFLVWGEWADYRTIAIRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H ++K RL         P+ 
Sbjct: 61  DQGRGERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL--------RPEA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
           E    +    +   N     E  DP+ CG C GA   +      CCNTC E++EAY    
Sbjct: 113 EGGGVIDVSSLDLHN--DAAEHLDPSYCGDCGGAPAPSNVKKAGCCNTCEEIREAYAQVS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +     QC+ E+  E+L+    EGC+I G L+VN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGDGKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQD-------------DDERRKPLDGTVAKAE 279
           D++ Y         + TH+I HL FG +L +              +    PLD T    +
Sbjct: 231 DLKNYWETTDDKKHDFTHYIHHLRFGPQLPEAVVKKMGKGATPWTNHHANPLDNTKQLTD 290

Query: 280 EGASMFNYYIKIIPTIYERL-----------DGS-------------KLGGGD------- 308
           +    F Y++KI+PT +  L           DGS              L GGD       
Sbjct: 291 DPNYNFMYFVKIVPTSFLPLGWEKMSRAMNTDGSVETHQYSVTSHKRSLTGGDDAAEGHA 350

Query: 309 ------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
                 GG+PG+FFSY++SP+ ++   E+ KS       +   + GT      VD  L  
Sbjct: 351 ERLHSRGGIPGVFFSYDISPMKVINREEQGKSFLGFIAGLCAVVGGTLTVAAAVDRGLFE 410

Query: 362 CVKKISKV 369
              ++ K+
Sbjct: 411 GTTRLKKI 418


>gi|398398231|ref|XP_003852573.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
 gi|339472454|gb|EGP87549.1| hypothetical protein MYCGRDRAFT_72288 [Zymoseptoria tritici IPO323]
          Length = 435

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 138/440 (31%), Positives = 198/440 (45%), Gaps = 80/440 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF+K  ED   +T  GG VT+   L I +L   +  DY +++   E+ V
Sbjct: 1   MPVKSRFTKLDAFSKTVEDARIRTTSGGFVTVFSMLLIIWLAWGEWSDYRRITIQPEIIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +RG K+ IHL++  P I C+ L LD +D SG+    V H I K RL        +P+ 
Sbjct: 61  DKARGEKMEIHLNVTFPRIPCELLTLDVMDVSGDVQTGVLHGIVKTRL--------KPES 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    + K ++         +    + CG CYGA          CCNTC EV+EAY    
Sbjct: 113 EGGGDIDKGRLQVNEVEEAAKHLARDYCGDCYGAPPPANAIKSGCCNTCAEVREAYASVS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC  E+ +E L     EGC++ G + VN+V G+FH APG S+S  ++HVH
Sbjct: 173 WSFGRGENVEQCTREHYSEHLDEQRKEGCRVDGVIRVNKVVGNFHFAPGKSFSNGNMHVH 232

Query: 237 DIQPYTSAAFNTT--HHIRHLSFGIKLQD-------DDERR-------KPLDGTVAKAEE 280
           D++ Y +   + T  H I HL FG  L +       D ER         PLDG   +  E
Sbjct: 233 DLENYLTGGGDHTPSHIIHHLRFGPLLPESYKHRVRDTERHWSNNHHLSPLDGFRQETNE 292

Query: 281 GASMFNYYIKIIPTIYERLD------------------------GSK------------- 303
            A  + Y++K++PT Y  L                         GS              
Sbjct: 293 KAYNYMYFVKVVPTAYLPLGYENLPSVGDYPHEHAHVGEYGISHGSSIETHQYSVTSHKR 352

Query: 304 -LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
            LGGGD             GG+PG+FFSY++SP+ V   E ++KS       I   + GT
Sbjct: 353 HLGGGDANDEGHKERLHARGGIPGVFFSYDISPMKVIDREVRAKSFSSFLVGICGVLGGT 412

Query: 349 YITFMLVDALLHSCVKKISK 368
                 VD +     +++ K
Sbjct: 413 LTVAAAVDRIWFEGTQRVKK 432


>gi|58264656|ref|XP_569484.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134109945|ref|XP_776358.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50259032|gb|EAL21711.1| hypothetical protein CNBC5750 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57225716|gb|AAW42177.1| ER to Golgi transport-related protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 422

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 191/412 (46%), Gaps = 63/412 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +G DAF K  ED   KT  G  +T +    I   + ++  DY ++     + VD SRG 
Sbjct: 10  FQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHMEPSIIVDRSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
           KL I  DI  P + C  L+LD +D SGE     EH + K R++ DG  I + Q  ++   
Sbjct: 70  KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQGGQLKGD 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           V++  +           +DPN CGSCYGA      CCN+C EV++AY  K W+  + + I
Sbjct: 130 VERANLN----------QDPNYCGSCYGALPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   +K+K    EGC+I G++ VN+V G+ H +PG S+  N + + ++ PY    
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237

Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
            +  HH     +    FG  +   +E                R PL G  A  E    MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGADMTKAEELTVLPKEQRWRDKLGLRDPLQGIKAHTEVSNYMF 296

Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
            Y++K++ T +  L G ++                G   G             G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLSGEEISSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +YE+SP+ V  TE+ +S  H  T     + G      LVD+L+ +  K++ K
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSLIFNSSKRLKK 408


>gi|67479189|ref|XP_654976.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56472072|gb|EAL49587.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
          Length = 361

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 192/378 (50%), Gaps = 32/378 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  D + K  ED   +  +GG +TI+C + I  L   +   Y Q     +L VD  R S
Sbjct: 1   MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+P+H DI  P  SC   ++D +  SGE  + +E N+ K R+  DG  + E + + + + 
Sbjct: 61  KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQS- 119

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                       + E  DP +C SCYGAET  +KCC TC++VKEAY+ + W L +L+ + 
Sbjct: 120 ----------KLSIETPDPKECRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL-DLNIVS 168

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC+N    +  K T  EGC++ G   +N++ G+FHIAPG S  +   H H+++       
Sbjct: 169 QCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGGNFHIAPGSSEQLWGRHSHNLEWTGKTQI 228

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP----------TIY 296
           + +H    LSFG       E  K    T  K  +  SMF YY+ IIP          T Y
Sbjct: 229 DLSHKWNELSFG-------ENSKKFT-TEKKDTQMNSMFQYYLTIIPIKNNFINGTSTFY 280

Query: 297 ERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
           +      +  G+G G PG+F  Y++SP+++++TE +    H    I   + G + TF L 
Sbjct: 281 DYSIQENIRSGEGEGQPGVFIYYDVSPMVLEVTESNHGFLHFLIGICSIVGGIFTTFQLF 340

Query: 356 DALLHSCVKKI-SKVEIG 372
           DA++   +  +  KVE+G
Sbjct: 341 DAIVFESIHTLKKKVELG 358


>gi|358372047|dbj|GAA88652.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus kawachii
           IFO 4308]
          Length = 438

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 138/443 (31%), Positives = 197/443 (44%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARVRTTSGGVITIASLLVILWLVWGEWVDYRRVVVMPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IHL+I  P + C+ L LD +D SGEQ   V H I K RL            
Sbjct: 61  DKSRGEKMEIHLNITFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLT--------SAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    +  K +   +   + +  DP+ CG CYGA          CCNTC+EV+EAY  ++
Sbjct: 113 EGGRVIDVKALELHSKDESAKHLDPDYCGECYGATAPAGASKPGCCNTCDEVREAYAQQQ 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC+ E   E++     EGC++ G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGKGENVEQCELEGYAERIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNMHVH 232

Query: 237 DIQPYTSAAF------NTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D+  +  A          TH I  L FG +L D         D     PLD T  +  E 
Sbjct: 233 DLATFFDAELPESERHTMTHEIHQLRFGPQLPDELSDRWQWTDHHHTNPLDNTKQETNEP 292

Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
              + Y++K++ T Y  L              D + LG                      
Sbjct: 293 GYNYMYFVKVVSTSYLPLGWDPLFSSSIHSAYDQAPLGSHGIAYGAEGSIETHQYSVTSH 352

Query: 306 -----GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
                GGD              G+PG+F +Y++SP+ V   E + K+     T +   I 
Sbjct: 353 KRSLMGGDASDEGHKERLHAANGIPGVFVNYDISPMKVINREARPKTFTGFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+  V ++ K+
Sbjct: 413 GTLTVAAALDRGLYEGVSRMKKL 435


>gi|213409826|ref|XP_002175683.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
           yFS275]
 gi|212003730|gb|EEB09390.1| COPII-coated vesicle component Erv46 [Schizosaccharomyces japonicus
           yFS275]
          Length = 394

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 124/405 (30%), Positives = 201/405 (49%), Gaps = 49/405 (12%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+F  +L+  DAFTK  ED   KT  GG ++I+  + +  ++ ++  +Y ++    E+ V
Sbjct: 1   MLFRAQLRRFDAFTKTVEDAKIKTAGGGLISIISAVIVFVIVFLEWKNYQRIVVQPEIVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SR  ++ I+ +I  P + C Y+ +D +D SG+    V+H++ K RLD  G  I     
Sbjct: 61  DPSRNERMEINFNITFPHVPCHYMGVDVMDISGDFQQDVQHSVTKTRLDKYGNIIAVIDS 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           ++ +A  +  +  +   T         CG CYGA      ET  CCN C  V++AY  K+
Sbjct: 121 DIGSATDESAMDKDGEVT---------CGDCYGAGDAAPPETPGCCNNCKAVRDAYARKQ 171

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA+ + D   QC++E    +  +   EGC I G+L VNRV+G+FH APG S+     H+H
Sbjct: 172 WAIGDYDAFQQCRDENYKAEHASQKGEGCNIAGHLFVNRVAGNFHFAPGRSFQTQQGHLH 231

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII-- 292
           D++ Y     A + TH I  LSFG  ++   E   PLDG     ++    + Y+IK +  
Sbjct: 232 DLRGYEEEQEAHDMTHMIHQLSFGPPIKPSAEHTDPLDGHFKNTDDALHNYAYFIKCVAH 291

Query: 293 ---------PTI---------YERLDGSKLGGGD----------GGMPGIFFSYELSPLM 324
                    PTI         +ER   S  GG +          GG+PG+FF+ ++SP++
Sbjct: 292 KFVPLDPADPTINTNEFSVTQHER---SVTGGRENDNPSHLNRRGGIPGVFFNIDISPML 348

Query: 325 VKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           V   + +  + G   + ++  + G      LVD  L++   K+ K
Sbjct: 349 VIQRQIRGNTFGGFISNVLSFLGGFITLTTLVDRGLYAAELKMKK 393


>gi|303322923|ref|XP_003071453.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240111155|gb|EER29308.1| hypothetical protein CPC735_069900 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320033474|gb|EFW15422.1| COPII-coated vesicle membrane protein Erv46 [Coccidioides posadasii
           str. Silveira]
          Length = 435

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 139/440 (31%), Positives = 195/440 (44%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTIV  + +  L+  +  DY +V    EL V
Sbjct: 1   MPVKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWRDYRRVVVLPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C  L LD +D SGEQ   V H + K RL    +       
Sbjct: 61  DKGRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALDV 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E V+  KK +             DP  CGSCY        +   CCNTC+EV+EAY  + 
Sbjct: 121 ETVDLDKKDQAPLH--------LDPGYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E    K+ +   EGC++ G L VN+V G+FH+APG S++  ++H H
Sbjct: 173 WAFGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D++ Y  T      +H I  L FG +L D         D     PLD T    E+    F
Sbjct: 233 DLKTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNF 292

Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
            Y++K++ T Y  L                       G +LG                  
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRS 352

Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
             GGD             GG+PG+FF+Y++SP+ V   E ++KSL    T +   I GT 
Sbjct: 353 IEGGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L+    ++ K+
Sbjct: 413 TVAAAVDRALYEGSVRVKKL 432


>gi|169770949|ref|XP_001819944.1| COPII-coated vesicle membrane protein Erv46 [Aspergillus oryzae
           RIB40]
 gi|238486566|ref|XP_002374521.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           flavus NRRL3357]
 gi|83767803|dbj|BAE57942.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|220699400|gb|EED55739.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           flavus NRRL3357]
 gi|391874294|gb|EIT83200.1| COPII vesicle protein [Aspergillus oryzae 3.042]
          Length = 436

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 139/441 (31%), Positives = 198/441 (44%), Gaps = 80/441 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L I +L+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARIRTTSGGIITIASLLAILWLVWGEWVDYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG K+ IHL++  P + C+ L LD +D SGEQ   V H I K RL            
Sbjct: 61  DKSRGEKMEIHLNMTFPRLPCELLTLDVMDVSGEQQTGVVHGINKVRLS--------SPA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETE--TRKCCNTCNEVKEAYRYKKWA 178
           E  + +  K +   +     +  DPN CG C G       ++CCNTC EV+EAY  ++WA
Sbjct: 113 EGGHVIDVKALELHSEQEAAKHLDPNYCGDCGGVPQPGGEKRCCNTCEEVREAYAQQQWA 172

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
             + + I QC+ E   ++L     EGC++ G L VN+V G+FHIAPG S++  +VHVHD+
Sbjct: 173 FGKGENIEQCEREGYAQRLDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTSGNVHVHDL 232

Query: 239 QPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGAS 283
           + Y       +     TH I  L FG +L D         D     PLD T  +  + A 
Sbjct: 233 ENYFEGDLPDAEKHTMTHIIHQLRFGPQLPDELSDRWQWTDHHHTNPLDSTQQETSDPAY 292

Query: 284 MFNYYIKIIPTIYERL--------------DGSKLG------------------------ 305
            F Y++K++ T Y  L              + S LG                        
Sbjct: 293 NFMYFVKVVSTSYLPLGWDPLFSSAVHSAYEDSPLGSHGIAYGSQSSIETHQYSVTSHKR 352

Query: 306 ---GGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGT 348
              GGD              G+PG+FF+Y++SP+ V   E + K+     T +   I GT
Sbjct: 353 SLRGGDASDEGHKERLHAANGIPGVFFNYDISPMKVINKEARPKTFTGFLTGVCAIIGGT 412

Query: 349 YITFMLVDALLHSCVKKISKV 369
                 +D  L+    ++ K+
Sbjct: 413 LTVAAALDRGLYEGALRVKKL 433


>gi|301106576|ref|XP_002902371.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262098991|gb|EEY57043.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 393

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 127/376 (33%), Positives = 192/376 (51%), Gaps = 40/376 (10%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK +D + K + +F  +T +G  V+IV  +F++ L   ++  Y+ V+T E + VDS+ G 
Sbjct: 30  LKKVDVYPKMHREFKVQTEFGATVSIVAGIFMAILFLSELSTYWTVNTHEHMVVDSTLGE 89

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL ++LD+    ++C    ++A+D +GE  +++   + K RLD +G+ I           
Sbjct: 90  KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDANGRSIS---------- 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPELDTI 185
                TT +    T+L     CGSCYG      ++CCNTC EVKEA+ +   +L E +  
Sbjct: 140 -----TTADELAKTDLP-AGYCGSCYGTRHPAGKECCNTCEEVKEAFIHSDLSLEEAEQK 193

Query: 186 VQCKNE-YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
            QC  E   TEKL     EGC+  G + VNRV+G+FH+A G ++      VH  +P    
Sbjct: 194 EQCVRESIDTEKLAQD-GEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEH 252

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
            FN++H I  LSFG  +        PLDG    AE+   +F YYIKI+PTIY  +D S +
Sbjct: 253 TFNSSHIIHSLSFGEPIPGAT---SPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDESAI 309

Query: 305 ----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                            G    +PG FF ++LSP MVK+        H  TKI C I G 
Sbjct: 310 HSYQFSVTQQSNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRVPFTHFLTKI-CAIVGG 368

Query: 349 YITFM-LVDALLHSCV 363
            I+    VD+ +++ +
Sbjct: 369 VISIAGFVDSFMYNSL 384


>gi|452001785|gb|EMD94244.1| hypothetical protein COCHEDRAFT_1202021 [Cochliobolus
           heterostrophus C5]
          Length = 437

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 142/444 (31%), Positives = 200/444 (45%), Gaps = 85/444 (19%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV  L I +L   +  DY +V+   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVEDARIRTTSGGIVTIVSLLVIFWLTWGEWADYRRVTVRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ I L+I  P + C+ + LD +D SGE  + V H I K RL         P+K
Sbjct: 61  DKGRGERMEIALNISFPRVPCELITLDVMDVSGELQMGVTHGINKVRLG--------PEK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E    ++ K +   +    + L  P+ CG C+GA          CCNTC+EV++AY    
Sbjct: 113 EGSKTIEIKALDL-HADEASHLA-PDYCGECFGAPPPANAKKPGCCNTCDEVRDAYASIS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W+    + + QC+ E+  E L     EGC++ G + VN+V G+FHIAPG S+S  ++HVH
Sbjct: 171 WSFGRGEGVEQCEREHYAEHLDEQRQEGCRLEGSIRVNKVVGNFHIAPGKSFSNGNMHVH 230

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD------------------DDERRKPLDGTVA 276
           D++ Y     A   TH I  L FG +L D                   +    PLD T  
Sbjct: 231 DLENYFKDEYAHTFTHKIHQLRFGPQLSDVVIQGIQDKHKGSGPGSWSNHHINPLDNTEQ 290

Query: 277 KAEEGASMFNYYIKIIPTIY---------------ERLDGSKL----------------- 304
             +E A  F Y+IK++ T Y               + L GS +                 
Sbjct: 291 HTDEKAFNFMYFIKVVSTAYLPLGWEDAAPRLTKHDELLGSTIDASHKGSIETHQYSVTS 350

Query: 305 ------GGGD------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNI 345
                 GG D            GG+PG+FFSY++SP+ V   E + K+       +   I
Sbjct: 351 HKRNLKGGNDEKDGHKERIHARGGIPGVFFSYDISPMKVINREVREKTFSGFLVGLCAVI 410

Query: 346 SGTYITFMLVDALLHSCVKKISKV 369
            GT      VD  L+  V +I K+
Sbjct: 411 GGTLTVAAAVDRALYEGVNRIKKI 434


>gi|121702771|ref|XP_001269650.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           clavatus NRRL 1]
 gi|119397793|gb|EAW08224.1| COPII-coated vesicle membrane protein Erv46, putative [Aspergillus
           clavatus NRRL 1]
          Length = 438

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 139/443 (31%), Positives = 196/443 (44%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   + I YL+  +  DY +V    EL V
Sbjct: 1   MPAKSRFTRLDAFAKTVEDARVRTTSGGIVTIASLIVILYLVWGEWVDYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IH++I  P + C+ + LD +D SGEQ + V H + K RL            
Sbjct: 61  DKSRGERMEIHMNITFPRLPCELVTLDVMDVSGEQQVGVAHGVNKVRL--------SSPA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E  + +  + +   +     +  DPN CG C GA+         CCNTC+EV+EAY  K 
Sbjct: 113 EGGHVLDIRSLDLHSKDEVAKHLDPNYCGDCGGADPLPGAIKPGCCNTCDEVREAYAAKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+ E  T ++     EGC++ G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGKGANIEQCEREGYTARIDAQRREGCRLEGVLRVNKVVGNFHIAPGRSFTNGNIHVH 232

Query: 237 DIQPY------TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEG 281
           D Q Y        A     H I  L FG +L D         D     PLD T  +  + 
Sbjct: 233 DTQAYFDLDLPDDAKHTMEHEIHQLRFGPQLPDELSARWQWTDHHHTNPLDNTHQETNDP 292

Query: 282 ASMFNYYIKIIPTIYERL----------------------------DGS----------- 302
           A  F Y++K++ T Y  L                             GS           
Sbjct: 293 AYNFVYFVKVVSTSYLPLGWDPLFSSALHSTYEKAPLGAHGIGYGASGSIETHQYSVTSH 352

Query: 303 --KLGGGDG-------------GMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
              L GGD              G+PG+FF+Y++SP+ V   E + K+L    T +   I 
Sbjct: 353 KRSLRGGDAEDEGHKERLHAANGIPGVFFNYDISPMKVINREARPKTLSSFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  L+    ++ K+
Sbjct: 413 GTLTVAAAIDRGLYEGALRVKKL 435


>gi|297602842|ref|NP_001052965.2| Os04g0455900 [Oryza sativa Japonica Group]
 gi|255675519|dbj|BAF14879.2| Os04g0455900 [Oryza sativa Japonica Group]
          Length = 253

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 99/243 (40%), Positives = 148/243 (60%), Gaps = 3/243 (1%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ LDA+ K  EDF+ +T+ GG +T+   + +  L   ++  Y    T   L VD+SRG
Sbjct: 7   KLRSLDAYPKVNEDFYSRTLSGGIITLASSVVMLLLFVSELRLYLHAVTETTLRVDTSRG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I+ D+  P + C  ++LDA+D SG++HL V+H+I+K+R+D+ G  I   Q + V  
Sbjct: 67  ETLRINFDVTFPALQCSIISLDAMDISGQEHLDVKHDIFKQRIDVHGNVIATKQ-DAVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +K ++    +G      E    CGSCYGAE    +CCN+C +V+EAYR K W +   D I
Sbjct: 126 MKVEQPLQRHGGRLEHNE--TYCGSCYGAEESDEQCCNSCEDVREAYRKKGWGVSNPDLI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QCK E   + +K+   EGC IYG+LEVN+V+G+FH APG S+   +VHVHD+ P+   +
Sbjct: 184 DQCKREGFLQSIKDEEGEGCNIYGFLEVNKVAGNFHFAPGKSFQKANVHVHDLLPFQKDS 243

Query: 246 FNT 248
           FN 
Sbjct: 244 FNV 246


>gi|51214107|emb|CAH17876.1| hypothetical protein (22C8.0001), conserved [Pneumocystis carinii]
          Length = 388

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 132/398 (33%), Positives = 190/398 (47%), Gaps = 48/398 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +  DAF+K  E+   KT+ GG +TI+  + I  LI  +  DY Q+    EL +D SRG 
Sbjct: 7   FRRFDAFSKTIENAQIKTINGGFITILSIIVIFVLIYFEWRDYRQIVILPELTIDRSRGE 66

Query: 67  KLPIHLDIVVPTISCD---YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
           KL I+L++  P I C     L+LD +D SGE    V HN+ K RLD +G  I       +
Sbjct: 67  KLQINLNLTFPKIPCSRLLVLSLDVMDVSGELETDVSHNVVKNRLDSNGIFINSTSLNTL 126

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           N  +  K              P+ CGSCYGA+     CCNTC +V +AY    W +P+  
Sbjct: 127 NFQQPAKT-----------RPPDYCGSCYGAK---EGCCNTCQQVIDAYASNNWPVPDTK 172

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT- 242
              QCK +Y+     N F EGC   G +EVN+V G+FH APG S  I   H+HDI  Y  
Sbjct: 173 AFEQCKEKYNN---LNEFDEGCNFVGRIEVNKVVGNFHFAPGHSSQIMRNHIHDIYDYMT 229

Query: 243 -SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
            S+  + +H I  LSFG +++     + PLD    + +     ++Y+IK +   +E L  
Sbjct: 230 DSSPHDFSHTINKLSFGPEVE-GRSLQNPLDNVKKETDNPTLRYSYFIKCVAYRFEYLSK 288

Query: 302 SKLG-------------GGDG------------GMPGIFFSYELSPLMVKITEKSKSLGH 336
             L               GD             G+PG+FFSY++SP+ +   E   +   
Sbjct: 289 PSLDTNKYSVTVHERSISGDSDPNYPTHISPKDGIPGVFFSYDISPIKIIERETRGNFST 348

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
             T  +  ISG      +VD +L+   ++I K    GK
Sbjct: 349 FLTSTVIIISGVLTIAGIVDRILYETERQIEKKLREGK 386


>gi|393233667|gb|EJD41236.1| endoplasmic reticulum-derived transport vesicle ERV46 [Auricularia
           delicata TFB-10046 SS5]
          Length = 419

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 123/413 (29%), Positives = 185/413 (44%), Gaps = 57/413 (13%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
           +FS  LKG+DAF K  +D   KT  G  +T++    I     ++  DY +++    + VD
Sbjct: 5   IFST-LKGVDAFGKTMDDVKVKTRTGALLTLISIAIIFTFTTIEFVDYRRINHDTSMVVD 63

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
            SRG KL ++L++  P I C  L+LD +D SGE+   V HNI K R+D + + I +    
Sbjct: 64  KSRGEKLTVNLNVTFPKIPCYLLSLDVMDISGERQADVTHNILKTRIDANRQRIADQTTT 123

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                + +KV    G         N CGSCYG       CC TC  V++AY  + WA  +
Sbjct: 124 YDLQNEAEKVVAARG--------ANYCGSCYGGLEPEGGCCQTCEAVRQAYINRGWAFSD 175

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D I QCK E   EK++    EGC + G + VN+V GS   + G S+ +N + +HD+ PY
Sbjct: 176 PDAIEQCKQEGWKEKIQAQMNEGCNVEGRVRVNKVVGSIQFSFGRSFQMNQMSLHDLVPY 235

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDE------------------RRKPLDGTVAKAEEGAS 283
                   H  RH         DDE                     PLDG     E    
Sbjct: 236 LRD--ENVHDWRHRVQHFYFSSDDEFNIYKAGISSSMKQRLGIAANPLDGNYGHTESTEY 293

Query: 284 MFNYYIKIIPTIYERL----------------------------DGSKLGGGDGGMPGIF 315
           MF Y++K++ T +  +                            DG  +  G  G+PG+F
Sbjct: 294 MFQYFLKVVSTQFRTIGGEVINTHQYSATHFDRDLAEGVRGKTEDGVVVTHGVQGLPGVF 353

Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           F++E+SP+ +  +E  +S  H  T     + G      +VD+LL +  + + K
Sbjct: 354 FNFEISPMRIIHSETRQSFAHFITSTCAIVGGVLTIASIVDSLLFTTQQALKK 406


>gi|302414546|ref|XP_003005105.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
 gi|261356174|gb|EEY18602.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
          Length = 349

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 127/376 (33%), Positives = 182/376 (48%), Gaps = 39/376 (10%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG +TIV    + +L   +  DY +++   EL V
Sbjct: 1   MAGKSRFTKLDAFTKTVDEARIRTSSGGIITIVSLFIVFWLAWGEWADYRRITLHPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG K+ IHL++  P + C+ L LD +D SGEQ   +   I K RL          QK
Sbjct: 61  DKGRGEKMEIHLNMTFPKMPCELLTLDVMDVSGEQQHGIVSGISKVRL--------RSQK 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
           +    +  K ++            P+ CG CYGA+       + CCNTC EV+EAY    
Sbjct: 113 DGGGVIDTKALSLHAADEAATHLAPDYCGDCYGAKAPANAVKQGCCNTCEEVREAYAQAS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  + + + QC  E+  E+L     EGC+I G L VN+V G+FH+APG S+S  ++HVH
Sbjct: 173 WAFGKGENVEQCTREHYAERLDEQRAEGCRIEGGLRVNKVVGNFHLAPGRSFSNGNMHVH 232

Query: 237 DIQPYTSAAF--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           D++ Y  A    + TH I  L F +     DE +  L G    AE  A            
Sbjct: 233 DLKNYWDAEIIHDFTHQIHALRFVLS----DEPQAQLSGGDDSAEGHA------------ 276

Query: 295 IYERLDGSKLGGGDGGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFM 353
             ERL         GG+PG+FFSY++SP+ ++   E+SKS     T +   I GT     
Sbjct: 277 --ERLHTR------GGIPGVFFSYDISPMKVINREERSKSFTGFLTGLCAVIGGTLTVAA 328

Query: 354 LVDALLHSCVKKISKV 369
            VD  +     ++ K+
Sbjct: 329 AVDRGMFEGSLRLKKI 344


>gi|74267709|gb|AAI02327.1| ERGIC and golgi 3 [Bos taurus]
          Length = 231

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 103/230 (44%), Positives = 147/230 (63%), Gaps = 8/230 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I+++++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKININVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKKRLDKDGFPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAE E  KCCN+C +V+EAYR + WA    DT
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAEMEDIKCCNSCEDVREAYRRRGWAFKNPDT 178

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           I QC+ E  ++K++    EGCQ+YG+LEVN+V+G+FH APG S+  +HVH
Sbjct: 179 IEQCRREGFSQKMQEQKNEGCQVYGFLEVNKVAGNFHFAPGKSFQQSHVH 228


>gi|258565913|ref|XP_002583701.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237907402|gb|EEP81803.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 435

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 136/440 (30%), Positives = 198/440 (45%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTIV  + +  L+  +  DY +V    EL V
Sbjct: 1   MAAKSRFTRLDAFAKTVEDARIRTRSGGVVTIVALIAVILLVWGEWKDYRRVVVLSELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   + H I K RL    +       
Sbjct: 61  DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEQQSGLIHGIKKVRLGPASEGGHVLDA 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           + ++  KK +V            DP  CGSCY        + + CCNTC+EV+EAY  + 
Sbjct: 121 QTLDLHKKDEVAVH--------LDPEYCGSCYDGVPPPNAQKQGCCNTCDEVREAYASRG 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E    ++     EGC++ G L VN+V G+FHIAPG S++  ++H H
Sbjct: 173 WAFGRGEGVAQCEREGYGARIDAQRHEGCRLEGILRVNKVIGNFHIAPGRSFTNGYMHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D++ Y  T       H I  L FG +L D         D     PLD T    E+    F
Sbjct: 233 DLKIYHETPVKHTMAHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKYNF 292

Query: 286 NYYIKIIPT--------------IYERL--------DGSKLG------------------ 305
            Y++K++ T              ++ RL         G +LG                  
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLASDAPLGKQGIQLGRHGSIETHQYSVTSHKRS 352

Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
             GGD             GG+PG+FF+Y++SP+ V   E ++KS     T +   I GT 
Sbjct: 353 VEGGDDSAEGHKERIHTAGGIPGVFFNYDISPMKVINREARTKSFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                +D +L+    ++ K+
Sbjct: 413 TVAAAIDRMLYEGAVRVKKL 432


>gi|295672798|ref|XP_002796945.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226282317|gb|EEH37883.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 435

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 137/440 (31%), Positives = 192/440 (43%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV    IS+LI  +  +Y ++    EL V
Sbjct: 1   MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGE    V H I K RL         P+ 
Sbjct: 61  DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGVIHGISKVRL--------APES 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E  + +    +     T   +  DP+ CG CYGA          CC+TC EV+EAY  + 
Sbjct: 113 EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPPHATKPGCCSTCEEVREAYASQS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E  ++ L     EGC+I G L VN+V G+FHIAPG S+S  ++H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVIGNFHIAPGRSFSNGNLHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T       H I  L FG +L D+   R          PLD T     +    F
Sbjct: 233 DLDTYYHTPVPHYMAHKIHQLRFGPQLPDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292

Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
            Y++K++ T Y  L  S                                           
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLRDTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
           + GGD             GG+PG+F +Y++SP+ V   E ++K+     T +   I GT 
Sbjct: 353 IDGGDDAAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L+    ++ K+
Sbjct: 413 TVAAAVDRALYEGAVRVKKL 432


>gi|119189667|ref|XP_001245440.1| hypothetical protein CIMG_04881 [Coccidioides immitis RS]
 gi|392868334|gb|EAS34105.2| COPII-coated vesicle membrane protein Erv46 [Coccidioides immitis
           RS]
          Length = 435

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 137/440 (31%), Positives = 195/440 (44%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTIV  + +  L+  +  DY +V    EL V
Sbjct: 1   MPVKSRFTRLDAFAKTVEDARIRTRSGGVVTIVSLIVVILLVWGEWKDYRRVVVLPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C  L LD +D SGEQ   V H + K RL    +       
Sbjct: 61  DKGRGERMEIHLNITFPHLPCQLLTLDVMDVSGEQQSGVIHGVNKVRLSAASEGGHALDV 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E ++  K+ +             DP  CGSCY        +   CCNTC+EV+EAY  + 
Sbjct: 121 ETLDLDKRDQAPLH--------LDPAYCGSCYDGIPPPNAKKPGCCNTCDEVREAYALRN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E    K+ +   EGC++ G L VN+V G+FH+APG S++  ++H H
Sbjct: 173 WAFGRGEGVEQCEQEGYGSKIDSQRNEGCRLEGILRVNKVVGNFHVAPGRSFTNGYMHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D++ Y  T      +H I  L FG +L D         D     PLD T    E+    F
Sbjct: 233 DLKTYYETPVKHTMSHIIHQLRFGPQLPDELSQKWKWTDHHHTNPLDSTSQTTEDPKFNF 292

Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
            Y++K++ T Y  L                       G +LG                  
Sbjct: 293 MYFVKVVSTSYLPLGWDASLSSEVHSRLSSDAPLGKQGIQLGQYGSIETHQYSVTSHKRS 352

Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
             GGD             GG+PG+FF+Y++SP+ V   E ++KSL    T +   I GT 
Sbjct: 353 IEGGDDSAEGHKERVHTAGGIPGVFFNYDISPMKVINREARTKSLSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L+    ++ K+
Sbjct: 413 TVAAAVDRALYEGSVRVKKL 432


>gi|425772976|gb|EKV11354.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
           digitatum PHI26]
 gi|425782132|gb|EKV20058.1| COPII-coated vesicle membrane protein Erv46, putative [Penicillium
           digitatum Pd1]
          Length = 438

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 136/443 (30%), Positives = 200/443 (45%), Gaps = 82/443 (18%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG +TI   L + +L+  +  DY +V    EL V
Sbjct: 1   MSAKSRFTRLDAFAKTVEDARIRTKSGGVITIASLLIVMWLVWGEWADYRRVVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D SRG ++ IHL++  P + C+ L LD +D SGEQ + V H + K RL         P+ 
Sbjct: 61  DKSRGERMEIHLNMTFPRLPCELLTLDVMDVSGEQQVGVAHGVNKVRLS--------PRN 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKK 176
           E    +  + +   + +   +  DP  CG C GA          CC TC EV++AY  K+
Sbjct: 113 EGGKVIDVQALDLHSPSEAAKHLDPEYCGECGGATPPPNVIKPGCCTTCEEVRQAYAEKQ 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC  E   E+L     EGC+I G L+VN+V G+FHIAPG S++  ++HVH
Sbjct: 173 WAFGDGSNIEQCTREGYAERLAEQRREGCRIEGVLKVNKVIGNFHIAPGRSFTTGNMHVH 232

Query: 237 DIQPYTS-----AAFNTTHHIRH-LSFGIKLQ---------DDDERRKPLDGTVAKAEEG 281
           D+  Y       A  +T  H+ H L FG +L           D     PLD T  + +E 
Sbjct: 233 DLDTYIDPNAGPAEQHTMSHLVHELRFGPQLPAELAGRWGWTDHHHTNPLDDTKQETDEP 292

Query: 282 ASMFNYYIKIIPTIYERL--------------DGSKLG---------------------- 305
           A  F Y++K++ T Y  L              D + LG                      
Sbjct: 293 AYNFLYFVKVVSTSYLPLGWDPQFSTAIHNAYDKAPLGYHGLAYGTQGSIEAHQYSVTSH 352

Query: 306 -----GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNIS 346
                GG+             GG+PG+FF+Y++SP+ V   E + K+  +  T +   I 
Sbjct: 353 KRPLSGGNDAAEGHKERVHAGGGIPGVFFNYDISPMKVVNREARPKTFTNFLTGVCAIIG 412

Query: 347 GTYITFMLVDALLHSCVKKISKV 369
           GT      +D  ++    ++ K+
Sbjct: 413 GTLTVAAALDRGVYEGAMRVKKL 435


>gi|348680250|gb|EGZ20066.1| CopII vesicle protein [Phytophthora sojae]
          Length = 409

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 123/376 (32%), Positives = 195/376 (51%), Gaps = 32/376 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK +D + K + +F  +T +G  V+IV  + ++ L   ++  Y+ ++T E + VDSS G 
Sbjct: 31  LKKVDVYPKMHREFKVQTEFGATVSIVAGIVMAILFLSELSAYWSLNTHEHMVVDSSLGE 90

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL ++LD+    ++C    ++A+D +GE  +++   + K RLD DG  I  P    ++ +
Sbjct: 91  KLQVNLDVSFLAVNCRDAHINAMDVAGELQVNMHQTVVKTRLDADGNTIGRP----ISMI 146

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPELDTI 185
             +    +  T   E      CGSC+GA+    ++CCNTC +VKEA+ Y  ++L + +  
Sbjct: 147 TDEGAEEQAKTALPE----GYCGSCHGAQHPAGKECCNTCEDVKEAFIYSDFSLEDAEQK 202

Query: 186 VQCKNE-YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
            QC  E    EKL     EGC+  G + VNRV+G+FH+A G ++      VH  +P    
Sbjct: 203 EQCVREIMEAEKLAQD-GEGCRFTGKMFVNRVAGNFHVALGRTFHRQGRLVHQFRPGQEH 261

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---- 300
            +N++H I  LSFG  +        PLDG    AE+   +F YYIKI+PTIY  +D    
Sbjct: 262 TYNSSHIIHSLSFGEPMPG---VAGPLDGVSKIAEQSGGVFQYYIKIVPTIYSDIDENTI 318

Query: 301 ----------GSKLG--GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                     G+ L   G    +PG FF ++LSP MVK+        H  TK+ C I G 
Sbjct: 319 HSYQFSVTQQGNYLNPRGQMTSLPGTFFVFDLSPFMVKVENDRMPFTHFLTKV-CAIVGG 377

Query: 349 YITFM-LVDALLHSCV 363
            I+    VD+ +++ +
Sbjct: 378 VISIAGFVDSFMYNSL 393


>gi|426196003|gb|EKV45932.1| hypothetical protein AGABI2DRAFT_207344 [Agaricus bisporus var.
           bisporus H97]
          Length = 1000

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 121/399 (30%), Positives = 184/399 (46%), Gaps = 61/399 (15%)

Query: 18  EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
           ED   KT  G  +T +    I     ++  DY +V T   + VD SRG KL ++L+I  P
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
            + C  L+LD +D SGE    + HNI K RL+ +G         +V A    ++  E   
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGT--------IVPASYSAQLQNE-LD 712

Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
              E++    CGSCYG       CCNTC+EV++AY  + W+    D I QCK E  +EK+
Sbjct: 713 KMNEVQQSGYCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKM 772

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--SAAFNTTHHIRHL 255
           K+   EGC + G L VN+V G+ H++PG S+  N  +++++ PY       + +H I H 
Sbjct: 773 KDQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHF 832

Query: 256 SFGIKLQDDDE------------------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           +F    + DDE                     PLDG   +  +   MF Y++K++ T + 
Sbjct: 833 AF----EGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFR 888

Query: 298 RLDGS----------------------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
            LDG                              +  G  G+PG FF+YE+SP++V   +
Sbjct: 889 TLDGKIVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHAD 948

Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
             +S  H  T     + G      LVD+LL +  + + K
Sbjct: 949 SRQSFAHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987


>gi|50548631|ref|XP_501785.1| YALI0C13112p [Yarrowia lipolytica]
 gi|49647652|emb|CAG82095.1| YALI0C13112p [Yarrowia lipolytica CLIB122]
          Length = 401

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 134/400 (33%), Positives = 195/400 (48%), Gaps = 45/400 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF KP  D   KT  GG VT++  L I  L   +   Y       ++ VD  RG
Sbjct: 4   KLFRYDAFAKPTADATIKTASGGIVTLLAILLIVVLTISEYWAYTTPVMRSQMTVDRYRG 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L IHL+I  P + C  + LD +DSSGE    V+H++ K  LD  G  +      +   
Sbjct: 64  DRLDIHLNITFPQLPCSLVTLDIIDSSGEVQQSVDHDMTKVTLDERGNILSSEALTLGEN 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
              K V        T L+DPN CGSCYGAE+E  +CCNTC +V+ AY  K WA  +   +
Sbjct: 124 PDSKAVAKR-----TFLDDPNYCGSCYGAESEPDQCCNTCEQVRAAYATKGWAFTDGSGV 178

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS-- 243
            QC+     E+LK  + +GC I G   V +V+G+FH APG+S   +  H+HD+  +    
Sbjct: 179 EQCEVIGFKEQLKAQYNQGCNIAGKFTVQKVAGNFHFAPGVSSHRDEQHLHDLSHFKDPE 238

Query: 244 AAFNTTHHIRHLSFGIKLQ----DDDE----RRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
           A F  +H I  LSFG ++     D D+       PL+ T    +     FNY+ K++ T 
Sbjct: 239 APFTFSHIIHDLSFGEQVDVSGLDWDKGVAMETSPLENTPHHTDNKWFRFNYFTKVVSTR 298

Query: 296 YERLDGSKL---------------GGGD----------GGMPGIFFSYELSPL-MVKITE 329
           +E LDG K+               GG D          GG+PG+FFSY++SP+ +V   E
Sbjct: 299 FEFLDGKKIETNQYAATAHERPLQGGRDEDHQNTRHMRGGLPGVFFSYDISPMRIVNKQE 358

Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
                G    +++  I G     + V A+L   + ++ +V
Sbjct: 359 YRSHFGAFVMQVVATIGGV----LTVAAVLDRGIYEVDQV 394


>gi|409079094|gb|EKM79456.1| hypothetical protein AGABI1DRAFT_120853 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 1000

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 121/399 (30%), Positives = 184/399 (46%), Gaps = 61/399 (15%)

Query: 18  EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
           ED   KT  G  +T +    I     ++  DY +V T   + VD SRG KL ++L+I  P
Sbjct: 602 EDVKVKTRTGAFLTFIAAAIILSFTTLEFLDYRRVYTDTSIVVDKSRGEKLTVNLNITFP 661

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
            + C  L+LD +D SGE    + HNI K RL+ +G         +V A    ++  E   
Sbjct: 662 RVPCFLLSLDVMDISGEVQRDISHNILKTRLENNGT--------IVPASYSAQLQNE-LD 712

Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
              E++    CGSCYG       CCNTC+EV++AY  + W+    D I QCK E  +EK+
Sbjct: 713 KMNEVQQSGYCGSCYGGVEPASGCCNTCDEVRQAYVNRGWSFSSPDAIEQCKREGWSEKM 772

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--SAAFNTTHHIRHL 255
           K+   EGC + G L VN+V G+ H++PG S+  N  +++++ PY       + +H I H 
Sbjct: 773 KDQADEGCNVSGRLRVNKVIGNIHLSPGRSFQTNSRNLYELVPYLRDENKHDFSHEIHHF 832

Query: 256 SFGIKLQDDDE------------------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           +F    + DDE                     PLDG   +  +   MF Y++K++ T + 
Sbjct: 833 AF----EGDDEYVYWKASAGREMKNRLGLNINPLDGAKYRTSKVQYMFQYFLKVVSTQFR 888

Query: 298 RLDGS----------------------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
            LDG                              +  G  G+PG FF+YE+SP++V   +
Sbjct: 889 TLDGKIVNTHQYSVTHFERDLEEGGGGQSPGGINIQHGAQGLPGAFFNYEISPILVVHAD 948

Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
             +S  H  T     + G      LVD+LL +  + + K
Sbjct: 949 SRQSFAHFLTSTCAIVGGVLTVASLVDSLLFATTRALKK 987


>gi|392566201|gb|EIW59377.1| endoplasmic reticulum-derived transport vesicle ERV46 [Trametes
           versicolor FP-101664 SS1]
          Length = 423

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 127/412 (30%), Positives = 193/412 (46%), Gaps = 56/412 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F   LKG+DAF K  ED   KT  G  +T++    I+    ++  DY +V+    + VD 
Sbjct: 5   FLSALKGVDAFGKTMEDVKVKTRTGALLTLIAAAIITSFTTIEFFDYRRVNVDTSIVVDR 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KE 121
           SRG KL +++++  P + C  L+LD +D SGE    + HNI K R+D  G P+      E
Sbjct: 65  SRGEKLTVNMNVTFPRVPCYLLSLDVMDISGETQSDITHNILKTRMDERGFPVPTTVITE 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           + N + K           ++ E         G E E   CCNTC +V++AY  + W+   
Sbjct: 125 LQNDLDK---------INSQREGGYCGSCYGGVEPEG-GCCNTCEDVRQAYVNRGWSFNR 174

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            D+I QC  E  +EKLK   TEGC I G + VN+V G+ H++PG S+  +   ++++ PY
Sbjct: 175 PDSIEQCVQEGWSEKLKEQATEGCNIAGRVRVNKVVGNIHLSPGRSFRTSSHSLYELVPY 234

Query: 242 TSAAFNT---THHIRHLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASM 284
                N    TH I HL+F           KL  + ++R      PLDGT  +  +   M
Sbjct: 235 LKTDGNRHDFTHTIHHLAFEGDDEWDLAKAKLGKELKQRLGIAANPLDGTTGRTIKQQYM 294

Query: 285 FNYYIKIIPTIYERLDGSKL----------------------------GGGDGGMPGIFF 316
           F Y++K++ T +  L G  +                              G+GG+PG FF
Sbjct: 295 FQYFLKVVATQFRTLSGKTINTHQYSATHFERDLDKGSQENTPTGVHVAHGNGGIPGAFF 354

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +YE+SPL +   E  +S  H  T     + G      L+D+ L +  K + K
Sbjct: 355 NYEISPLRIVHAETRQSFAHFLTSTCAIVGGVLTVASLIDSALFATRKALKK 406


>gi|346324387|gb|EGX93984.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Cordyceps militaris CM01]
          Length = 423

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 137/429 (31%), Positives = 195/429 (45%), Gaps = 69/429 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTIV  + + +L   +   Y  V    EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARIRTTSGGVVTIVSLVVVLFLAWGEWASYRTVVIRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGEQ   V H ++K RL         P+ 
Sbjct: 61  DQGRGERMDIHLNITFPRMPCELLTLDVMDVSGEQQHGVAHGVHKVRL--------RPEG 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----RKCCNTCNEVKEAYRYKK 176
           E    +    +   N     E  DP+ CG C GA   T      CCNTC E++EAY    
Sbjct: 113 EGGGVIDVSSLNLHN--DAAEHLDPSYCGDCGGAPAPTTVTKAGCCNTCEEIREAYAQVS 170

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +     QC+ E+  E+L+    EGC+I G L+VN+V G+FH+APG S+S  ++HVH
Sbjct: 171 WAFGDGKAFEQCEREHYAERLEEQRHEGCRIDGLLQVNKVVGNFHLAPGRSFSNGNMHVH 230

Query: 237 DIQPYTSAA----FNTTHHIRHLSFGIKLQD-------------DDERRKPLDGTVAKAE 279
           D++ Y         + THHI HL FG +L +              +    PLD T     
Sbjct: 231 DLKNYWETTDDKKHDFTHHIHHLRFGPQLPETVVQKLGKGATPWTNHHGNPLDSTKQLTN 290

Query: 280 EGASMFNYYIKIIPTIYERLDGSKLG------------------------GGD------- 308
           +    F Y++KI+PT +  L   K+                         GGD       
Sbjct: 291 DPNFNFMYFVKIVPTSFLPLGWEKMARTMNVDASVETHQYSVTSHKRSLTGGDDSAEGHA 350

Query: 309 ------GGMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
                 GG+PG+FFSY++SP+ ++   EK KS       +   + GT      VD  L  
Sbjct: 351 ERLHSRGGIPGVFFSYDISPMKVINREEKGKSFLGFVAGLCAVVGGTLTVAAAVDRGLFE 410

Query: 362 CVKKISKVE 370
              ++ K+ 
Sbjct: 411 GTTRLKKIR 419


>gi|405123077|gb|AFR97842.1| COPII-coated vesicle component Erv46 [Cryptococcus neoformans var.
           grubii H99]
          Length = 422

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 186/403 (46%), Gaps = 63/403 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +G DAF K  ED   KT  G  +T +    I   + ++  DY ++     + VD SRG 
Sbjct: 10  FQGFDAFGKTMEDVKIKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIVDRSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-KEVVNA 125
           KL I  DI  P + C  L+LD +D SGE     EH + K R++ DG  I + Q  ++   
Sbjct: 70  KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRMNKDGNVISKVQGSQLKGD 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           V++  +           +DPN CGSCYGA      CCN+C EV++AY  K W+  + + I
Sbjct: 130 VERANLN----------QDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   +K+K    EGC+I G++ VN+V G+ H +PG S+  N + + ++ PY    
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIDGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237

Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
            +  HH     +    FG  +   +E                R PL G  A  E    MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLRDPLQGMKAHTEVSNYMF 296

Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
            Y++K++ T +  L+G ++                G   G             G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
           +YE+SP+ V  TE+ +S  H  T     + G      LVD+ +
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLVDSFI 399


>gi|321253192|ref|XP_003192660.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
 gi|317459129|gb|ADV20873.1| ER to Golgi transport-related protein, putative [Cryptococcus
           gattii WM276]
          Length = 435

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 191/412 (46%), Gaps = 63/412 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +G DAF K  ED   KT  G  +T +    I   + ++  DY ++     + VD SRG 
Sbjct: 10  FQGFDAFGKTMEDVKVKTRTGALLTFISLSIILTSVMLEFIDYRRIHLEPSIIVDRSRGE 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNA 125
           KL I  DI  P + C  L+LD +D SGE     EH + K R+D +GK I + Q  ++   
Sbjct: 70  KLVIDFDIEFPRVPCYLLSLDVMDISGEHQTEFEHQVTKTRIDKNGKIISKVQGGQLKGD 129

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +++  +           +DPN CGSCYGA      CCN+C EV++AY  K W+  + + I
Sbjct: 130 LERANLN----------QDPNYCGSCYGAPPPESGCCNSCEEVRQAYGRKGWSFSDPEGI 179

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E   +K+K    EGC+I G++ VN+V G+ H +PG S+  N + + ++ PY    
Sbjct: 180 EQCVEEGWMDKMKEQNEEGCRIGGHIRVNKVIGNLHFSPGRSFQNNMMQMLELVPYLR-- 237

Query: 246 FNTTHH-----IRHLSFGIKLQDDDE---------------RRKPLDGTVAKAEEGASMF 285
            +  HH     +    FG  +   +E                + PL G     E    MF
Sbjct: 238 -DKNHHDFGHIVHKFRFGGDMTKAEELTVLPKEQRWRDKLGLKDPLQGIKVHTEVSNYMF 296

Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDG-------------GMPGIFF 316
            Y++K++ T +  L+G ++                G   G             G+PG+FF
Sbjct: 297 QYFLKVVSTNFISLNGEEIPSHQYSVTQYERDLRTGNAPGKDAHGHMTSHGMMGVPGVFF 356

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +YE+SP+ V  TE+ +S  H  T     + G      L+D+ + +  K++ K
Sbjct: 357 NYEISPMKVIHTEERQSFAHFLTSTCAIVGGVLTVASLLDSFIFNSSKRLKK 408


>gi|296811622|ref|XP_002846149.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma otae CBS 113480]
 gi|238843537|gb|EEQ33199.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma otae CBS 113480]
          Length = 435

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 139/440 (31%), Positives = 199/440 (45%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVIQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL    +  +    
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRLSSAAEGGKVIDV 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
             ++  KK               DPN CG+CYG  A +  +K  CCNTC EV++AY  K 
Sbjct: 121 TALDLHKKDDSPAH--------LDPNYCGNCYGVPAPSTAKKPGCCNTCAEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGEGVTQCMDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      TH I  L FG +L ++   R          PLD +  + +E    F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHRTDEVRYNF 292

Query: 286 NYYIKIIPTIYERLD--------------------------GSK---------------- 303
            Y++K++ T Y  L                           GS+                
Sbjct: 293 LYFVKVVSTSYLPLGWDATWSSEVHSQAHKDIPLGNHGVYFGSQGSIETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
           L GGD             GG+P + F+YE+SP+ V   E + KSL   +T +   I GT 
Sbjct: 353 LDGGDDSAEGHKERQYARGGIPSVMFNYEISPMKVINRETRPKSLSTFFTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD LL+    ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432


>gi|336369994|gb|EGN98335.1| hypothetical protein SERLA73DRAFT_109778 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336382751|gb|EGO23901.1| hypothetical protein SERLADRAFT_450196 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 988

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 123/403 (30%), Positives = 189/403 (46%), Gaps = 57/403 (14%)

Query: 18  EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
           ED   KT  G  +TI+    I     ++  DY  V+    + VD SRG KL + +++  P
Sbjct: 591 EDVKVKTRTGAFLTILSAAIILAFTAMEFFDYRTVNVDTSIIVDRSRGEKLSVRMNMTFP 650

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNAVKKKKVTTENG 136
            + C  L+LD +D SGEQ   V HNI+K R+  +G P+   +  E+ N + K      NG
Sbjct: 651 RVPCYLLSLDIMDISGEQQRDVSHNIHKTRITPEGGPVPGARNGELRNEIDKLNDQRSNG 710

Query: 137 TTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEK 196
                      CGSCYG       CCN+C +V++AY  + W+    D I QC  E  +EK
Sbjct: 711 Y----------CGSCYGGVEPEGGCCNSCEDVRQAYVNRGWSFNNPDNIEQCVAEGWSEK 760

Query: 197 LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT---THHIR 253
           LK+   EGC I G L VN+V G+ +++PG S+  +  + +++ PY     N    +H I 
Sbjct: 761 LKDQAEEGCNISGRLRVNKVIGNINVSPGRSFQSSSRNFYELVPYLREDNNRHDFSHVIH 820

Query: 254 HLSF---------GIKLQDDDERR-----KPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
             SF           KL  D ++R      PLDG  AK  +   MF Y++K++ T +  +
Sbjct: 821 EFSFMTDDEYNLHKAKLGKDMKQRMGIAENPLDGLNAKTNKAQYMFQYFLKVVSTQFRTI 880

Query: 300 DGSKL-------------------GGGDG----------GMPGIFFSYELSPLMVKITEK 330
           DG  +                   GG +G          G+PG FF++E+SP++V  +E 
Sbjct: 881 DGKTINTHQYSATHFERDLSKGSQGGDNGEGVVTQHGVSGVPGAFFNFEISPILVVHSEG 940

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
            +S  H  T     + G      L+D+ L +  +++ K    G
Sbjct: 941 RQSFAHFLTSTCAIVGGVLTVAALLDSFLFATGRRLKKGSSNG 983


>gi|19113757|ref|NP_592845.1| COPII-coated vesicle component Erv46 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|1351651|sp|Q09895.1|YAI8_SCHPO RecName: Full=Uncharacterized protein C24B11.08c
 gi|1061296|emb|CAA91773.1| COPII-coated vesicle component Erv46 (predicted)
           [Schizosaccharomyces pombe]
          Length = 390

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 128/411 (31%), Positives = 196/411 (47%), Gaps = 58/411 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M F   L+  DAF K  ED   KT  GG +T+V  L + +++ ++  +Y +V    E+ V
Sbjct: 1   MQFRSPLRRFDAFQKTVEDARIKTASGGLITLVSGLIVIFIVLMEWINYRRVIAVHEIIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           + S G ++ I+ +I  P I C  L +D +D SGE    + H + K RL   G+ I     
Sbjct: 61  NPSHGDRMEINFNITFPRIPCQILTVDVLDVSGEFQRDIHHTVSKTRLSPSGEIISVDDL 120

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET----ETRKCCNTCNEVKEAYRYKK 176
           ++ N    ++  +++G          +CG CYGA      +T  CCNTC+ V++AY    
Sbjct: 121 DIGN----QQSISDDGAA--------ECGDCYGAADFAPEDTPGCCNTCDAVRDAYGKAH 168

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           W + ++D   QCK+E   E  +    EGC + G L VNR++G+FHIAPG S    + HVH
Sbjct: 169 WRIGDVDAFKQCKDENFKELYEAQKVEGCNLAGQLSVNRMAGNFHIAPGRSTQNGNQHVH 228

Query: 237 DIQPYTSA--AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           D + Y +     + +H I HLSFG  L        PLDGTV K       + Y+IK +  
Sbjct: 229 DTRDYINELDLHDMSHSIHHLSFGPPLDASVHYSNPLDGTVKKVSTADYRYEYFIKCVSY 288

Query: 295 IYERLDGSKL-----------------GGGD----------GGMPGIFFSYELSPLMVKI 327
            +  L  S L                 GG +          GG+PG++F +++SP+ V  
Sbjct: 289 QFMPLSKSTLPIDTNKYAVTQHERSIRGGREEKVPTHVNFHGGIPGVWFQFDISPMRV-- 346

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
                    +  ++  N  G +++ +L  ALL  CV   S V+ G   V K
Sbjct: 347 ---------IERQVRGNTFGGFLSNVL--ALLGGCVTLASFVDRGYYEVQK 386


>gi|349804919|gb|AEQ17932.1| putative ergic and golgi 3 [Hymenochirus curtipes]
          Length = 228

 Score =  197 bits (500), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 98/237 (41%), Positives = 143/237 (60%), Gaps = 26/237 (10%)

Query: 94  EQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCY 152
           EQ L VEHN++K RLD D +P+  E ++  +   ++  +            DP++C SCY
Sbjct: 1   EQQLDVEHNLFKLRLDKDRQPVSSEAERHDLGKAEEPVIFDPKSL------DPDRCESCY 54

Query: 153 GAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLE 212
           GAET+  +CCN+C++V+EAYR + WA    D+I QCK E  ++K++    EGC++YG+LE
Sbjct: 55  GAETDDFRCCNSCDDVREAYRRRGWAFKTPDSIEQCKREGFSQKMQEQKNEGCRVYGFLE 114

Query: 213 VNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD 272
           VN+V+G+FH APG S+  +HVHVHD+Q +     N TH I+HLSFG+   D      PLD
Sbjct: 115 VNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHEIKHLSFGM---DYPGLVNPLD 171

Query: 273 GTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPG 313
           GT   A + + MF Y++KI+PT+Y ++DG  L                  GD G+PG
Sbjct: 172 GTSVSAVQSSMMFQYFVKIVPTVYVKVDGEVLRTNQFSVTRHEKVTNGLIGDQGLPG 228


>gi|194224360|ref|XP_001916465.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Equus caballus]
          Length = 342

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 128/387 (33%), Positives = 186/387 (48%), Gaps = 68/387 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
            KL I++D+  P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+  E ++  + 
Sbjct: 66  DKLKINIDVFFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPVSSEAERHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V+ K    ++        DP++C SCYGAETE  K    C                   
Sbjct: 126 KVEVKVFDPDS-------LDPDRCESCYGAETEDIKPPYFC------------------- 159

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                                 +  +L  +              +++ V +HD+Q +   
Sbjct: 160 ----------------------LQDHLHSSLAGKGLPWGRDQEEALHAVEIHDLQSFGLD 197

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N TH+IRHLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG  L
Sbjct: 198 NINMTHYIRHLSFG---EDYPGIVNPLDRTNVTAPQASMMFQYFVKVVPTVYMKVDGEVL 254

Query: 305 GG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             GD G+PG+F  YELSP+MVK+TEK +S  H  T +   I G 
Sbjct: 255 RTNQFSVTRHEKVANGLMGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGM 314

Query: 349 YITFMLVDALLHSCVKKISKVEIGGKT 375
           +    L+D+L++   + I K    GKT
Sbjct: 315 FTVAGLIDSLIYHSARAIQKKIDLGKT 341


>gi|145340712|ref|XP_001415464.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144575687|gb|ABO93756.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 379

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/388 (31%), Positives = 190/388 (48%), Gaps = 36/388 (9%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SSRGSK 67
           D F K  +DF  +T  GGA+  +    +  L      +  + +T  +L VD   +    K
Sbjct: 1   DLFPKISDDFARRTATGGAIATIGLALMVILFLQQTAELMRTTTAYDLRVDDGVAGATKK 60

Query: 68  LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHN-IYKRRLDLDGKPI-QEPQKEVVNA 125
           + I++D+ +  + C  ++LDA+D +GE  L V  + +   R+D  G+ I    ++  VNA
Sbjct: 61  IVINVDLTLRAMHCAQVSLDAMDVTGETRLDVSRSEVRTTRVDARGRAIAMTSERTAVNA 120

Query: 126 VKKKKVTTENGTTTTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                  TE G    E     + CG CYGA  E   CC+ C+ V+EAYR K WALP+L  
Sbjct: 121 ------KTEAGEREREATGGRSACGDCYGA-AEAGTCCDDCDSVREAYRVKGWALPDLRR 173

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           + QC  EY    ++N   EGC   G+ EVN+V+G+FHIAPG SY+    HVHD+ P+   
Sbjct: 174 VTQCTKEYDVVAMRNEHKEGCHFSGHFEVNKVAGNFHIAPGKSYNNLGQHVHDLSPFAGV 233

Query: 245 -AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG-ASMFNYYIKIIPTIYERLD-- 300
            +FN +H I  LSFG +         PLDG     ++  A ++ Y + ++P  Y+ L   
Sbjct: 234 ESFNFSHIIHKLSFGEEFPG---VVNPLDGVTRTMDDANAGVYQYRLSVVPARYKYLGFR 290

Query: 301 -----------GSKLGGGD----GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
                           G D     G+PG+FF Y+LSPL V+  E+        + +   I
Sbjct: 291 ARVVESNDYSVTDHFRGFDVTKNPGLPGLFFFYDLSPLRVEYEERRIGFFQYLSNVAAII 350

Query: 346 SGTYITFMLVDALLHSCVKKI-SKVEIG 372
            G      +VD L++   + +  KV++G
Sbjct: 351 GGVSAVVNIVDGLVYRGQRALREKVDLG 378


>gi|307105810|gb|EFN54058.1| hypothetical protein CHLNCDRAFT_25376, partial [Chlorella
           variabilis]
          Length = 312

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 102/309 (33%), Positives = 166/309 (53%), Gaps = 26/309 (8%)

Query: 81  CDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTT 140
           C +L++DA+D SGE  L V+H++YKRRL  DG P+ E        +K       N +   
Sbjct: 1   CSWLSIDAMDISGEVQLEVDHDVYKRRLSPDGTPLDEGGCPRAGWLKP---VPGNDSEAD 57

Query: 141 ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNT 200
             + P  CGSCYG+E+   +CCNTC EV++AYR K WAL +++ + QC +E   E++   
Sbjct: 58  PTKAPGYCGSCYGSESRAGQCCNTCAEVRDAYRTKGWALLDVEKVEQCHHEGYKEEIDEQ 117

Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
             EGC ++G L++N+V+G+FHIAPG SY   ++H+HD+ P+   AF+ +H I  L+FG +
Sbjct: 118 KGEGCHVWGELQINKVAGNFHIAPGRSYQQGNMHIHDLSPFAGQAFDFSHTIHKLAFGRE 177

Query: 261 LQDDDERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------- 304
                 R + L     +V    E   ++ Y++K++PT Y  L  + +             
Sbjct: 178 YP--GTRGQALSTFCLSVGTRRERMGLYQYFLKVVPTSYSDLRNNTIYTNQFSVTEHFRE 235

Query: 305 ----GGGDGGMPGIFFSYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITFMLVDALL 359
                 G G +PG+F  Y+LSP+   +  +++ S     T +   I G +    ++DA +
Sbjct: 236 TASPTAGGGQLPGVFLFYDLSPIKASLEGRARLSFLSFLTSLCAIIGGVFTVSGIIDATV 295

Query: 360 HSCVKKISK 368
           +   + I K
Sbjct: 296 YHGQQAIKK 304


>gi|226292523|gb|EEH47943.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb18]
          Length = 435

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 190/440 (43%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV    IS+LI  +  +Y ++    EL V
Sbjct: 1   MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL+I  P + C+ L LD +D SGE    + H I K RL         P+ 
Sbjct: 61  DKGRGERMEIHLNITFPHLPCELLTLDVMDVSGEMQSGIIHGISKVRL--------APES 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E  + +    +     T   +  DP+ CG CYGA   +            EV+EAY  + 
Sbjct: 113 EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGVALPAKEVREAYASQS 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E  ++ L     EGC+I G L VN+V G+FHIAPG S+S  ++H H
Sbjct: 173 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D+  Y  T    + +H I  L FG +L D         D     PLD T     +    F
Sbjct: 233 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 292

Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
            Y++K++ T Y  L  S                                           
Sbjct: 293 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 352

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
           + GGD             GG+PG+F +Y++SP+ V   E ++K+     T +   I GT 
Sbjct: 353 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L+  V ++ K+
Sbjct: 413 TVAAAVDRALYEGVARVKKL 432


>gi|254581328|ref|XP_002496649.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
 gi|238939541|emb|CAR27716.1| ZYRO0D04972p [Zygosaccharomyces rouxii]
          Length = 404

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 129/414 (31%), Positives = 193/414 (46%), Gaps = 60/414 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF+K  ED   +T  GG + ++C L   +L+  +  ++ QV    EL VD  R  
Sbjct: 7   LRTFDAFSKTEEDVRIRTRTGGIIALLCCLVTIFLLISEWLNFNQVVNRPELVVDKDRQL 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +  DI  P++ CD L+LD +DS+GE  L + E    K RLD +G+ +          
Sbjct: 67  KLELEADITFPSMPCDMLSLDIMDSAGEIQLDLLESGFTKTRLDQNGQSL---------G 117

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
               KV+ E    + + +D N CG+CYGA+ ++R          CC TCN+V+ AY    
Sbjct: 118 SSSLKVSDE----SYDPKDENYCGACYGAKDQSRNNEVPKEERVCCQTCNDVRRAYLEAN 173

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+ E   +++     EGC++ G   +NR+ G+ H APG+++     H H
Sbjct: 174 WAFFDGKNIEQCEREGYVDRVNEQLNEGCRVQGSALLNRIQGTLHFAPGVAFQNPKGHFH 233

Query: 237 DIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FNY 287
           D+  Y  +   N  H I HLSFG  +  +   R       PLDG  A  +    M  F+Y
Sbjct: 234 DLSLYEKTHNLNFNHIINHLSFGKPVTSNARGRGASVATAPLDGRQAFPDRDTHMHQFSY 293

Query: 288 YIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSP 322
           + KI+PT YE +D   +               GG D          GG PG+F  +E+SP
Sbjct: 294 FTKIVPTRYEYMDKMVVETAQFSATLHDRPLHGGADQDHPTTLHTKGGFPGLFVYFEMSP 353

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           L V   E+       W+  + N   +    + V  +L     K  K   G K+V
Sbjct: 354 LKVINREQH---AQTWSGFILNCITSIGGVLAVGTVLDKITYKAQKSIWGKKSV 404


>gi|154280410|ref|XP_001541018.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150412961|gb|EDN08348.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 435

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 132/440 (30%), Positives = 193/440 (43%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI     I +LI  +  +Y ++    EL V
Sbjct: 1   MPPKSRFARLDAFTKTVEDARIRTRLGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V H + K RL      ++E  +
Sbjct: 61  DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRL----SSVEEGGR 116

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
            +     +    T  GT      DP+ CG CYGA   +      CCNTC EV++AY  K 
Sbjct: 117 VLDITALQLHSQTNKGTDV----DPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKG 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E  +  L     EGC++ G + VN+V G+FHIAPG S++  ++H H
Sbjct: 173 WAFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D+  Y  T    N  H + +L FG +L +         D+    PLD T          F
Sbjct: 233 DLDNYYHTPVQHNMGHRVHYLRFGPQLPEELSSRWKWTDNHHTNPLDNTEQHTTNPRFNF 292

Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
            Y++K++ T Y  L                       G   G                  
Sbjct: 293 IYFVKVVSTSYLPLGWDPDASSSAHSKYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRS 352

Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
             GGD             GG+PG+F +Y++SP+ V   E ++KS     T +   I GT 
Sbjct: 353 VDGGDDSAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKSFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                +D +L+    ++ K+
Sbjct: 413 TVAAAIDRVLYEGAVRVKKL 432


>gi|348667280|gb|EGZ07106.1| hypothetical protein PHYSODRAFT_319656 [Phytophthora sojae]
          Length = 398

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 114/361 (31%), Positives = 178/361 (49%), Gaps = 27/361 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RLKGLDA+ K  E+F  +T+ GG  +++ +  IS L+  ++  Y    T +++ VD  R 
Sbjct: 11  RLKGLDAYPKTIEEFKVRTLQGGLFSLLAFACISLLLVSELSFYLATDTVDKMTVDGGRN 70

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           + + I+ D+  P ++C  +AL++ D +G     +EHNI K  LD  G+ + E   +V+  
Sbjct: 71  TMVAINFDVEFPRMACSVVALESADMAGNVQHDIEHNIRKIPLDHTGQALAEGMHDVIGG 130

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                  T N     E + P  CGSCY A  E  +CC+TC  VK AY  K W +P L TI
Sbjct: 131 A-----LTNNTELHGETDKP-ACGSCYSA-GEPGECCDTCESVKAAYARKSWMMPSLHTI 183

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+     + L+    EGC+I G L V++V+G  + AP   +   ++   D+   T   
Sbjct: 184 AQCQEVEIEKVLRGEVNEGCRIQGSLVVSKVAGKLYFAPSKFFRSGYLSSKDLVDATFKV 243

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA--EEGASMFNYYIKIIPTIYERLDGSK 303
           F+T+H IR LSFG    D    + PLD    +   E+    F Y++K++PT Y  L  S+
Sbjct: 244 FDTSHTIRSLSFGEAYPD---MKNPLDNRKKELPDEKTRGSFQYFLKVVPTEYTFLSASR 300

Query: 304 L---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
           +                  D G+P + FSY  SP+M +I +         T +   + G 
Sbjct: 301 IITNQFSATEHFRQLTPVSDKGLPMVTFSYTFSPIMFRIEQYRVGFLQFLTSVCAIVGGV 360

Query: 349 Y 349
           +
Sbjct: 361 F 361


>gi|225562998|gb|EEH11277.1| COPII coated vesicle component Erv46 [Ajellomyces capsulatus
           G186AR]
 gi|240279818|gb|EER43323.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H143]
 gi|325092948|gb|EGC46258.1| COPII-coated vesicle component Erv46 [Ajellomyces capsulatus H88]
          Length = 435

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 132/440 (30%), Positives = 193/440 (43%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTI     I +LI  +  +Y ++    EL V
Sbjct: 1   MPPKSRFARLDAFTKTVEDARIRTRSGGVVTISALFVIFFLIWGEWSEYRRIVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V H + K RL      ++E  +
Sbjct: 61  DKGRGERMEIHLNVTFPNLPCELLTLDVMDISGEYQTGVIHGVNKVRL----SSVEEGGR 116

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
            +     +    T  GT      DP+ CG CYGA   +      CCNTC EV++AY  K 
Sbjct: 117 VLDITALQLHSQTNKGTDV----DPDYCGQCYGATPPSNAKKPGCCNTCEEVRDAYAAKG 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E  +  L     EGC++ G + VN+V G+FHIAPG S++  ++H H
Sbjct: 173 WAFGRGENVEQCEKEGYSANLDAQRKEGCRVEGVIRVNKVVGNFHIAPGRSFTNGNLHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D+  Y  T    N  H I +L FG +L +         D+    PLD T          F
Sbjct: 233 DLDNYYHTPVQHNMGHRIHYLRFGPQLPEQLSSRWKWTDNHHTNPLDNTEQHTTNPRFNF 292

Query: 286 NYYIKIIPTIYERL----------------------DGSKLG------------------ 305
            Y++K++ T Y  L                       G   G                  
Sbjct: 293 MYFVKVVSTSYLPLGWDPDASSSAHSQYSKNAPLGKQGLSFGSYGSIETHQYSVTSHKRS 352

Query: 306 --GGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
             GGD             GG+PG+F +Y++SP+ V   E ++K+     T +   I GT 
Sbjct: 353 VDGGDDSAEGHKERLHSQGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                +D +L+    ++ K+
Sbjct: 413 TVAAAIDRVLYEGAVRVKKL 432


>gi|406606433|emb|CCH42207.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Wickerhamomyces ciferrii]
          Length = 405

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 199/404 (49%), Gaps = 53/404 (13%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF+K  ED   KT  GG +T+ C L +  LI  +   + +++   EL VD  R  
Sbjct: 7   LLSFDAFSKTVEDARVKTTSGGLITVTCILTLFSLIINEWRQFNEITIDPELVVDRDRNL 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I+LD+  P + CD ++LD +D SG+  L V +  + +        I+  +       
Sbjct: 67  KLDINLDVTFPDLPCDIMSLDIMDVSGDLQLDVTNYGFTK--------IRLTETGEEIGE 118

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRYKKW 177
           ++ K+  ++G    ++   + CG CYGA+          E + CCN C+ V++AY    W
Sbjct: 119 EEMKIGDDHGHADADIP-ADYCGPCYGAKNQDKNENKPQEEKVCCNDCDSVRKAYASVGW 177

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
           A  +   + QC+ E   +K+ +   EGC++ G  ++NR++G+ H APG SYS  + HVHD
Sbjct: 178 AFFDGKNVEQCEREGYVKKINDRLGEGCRVKGTAKLNRINGNIHFAPGASYSAPNRHVHD 237

Query: 238 IQPY-TSAAFNTTHHIRHLSFGIKLQDD------DERRKPLDGTVAKAEEGASMFNYYIK 290
           +  Y  +  FN  H I H SFG  +         +    PLDGT A       +++Y++K
Sbjct: 238 LSLYGKNKDFNFRHVINHFSFGPDVNSKYTAETLELSSHPLDGTNAIQGSRDHLYSYFLK 297

Query: 291 IIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMV 325
           ++PT YE L+G+K+               GG D          GG+PG+FF +E+SPL  
Sbjct: 298 VVPTRYEYLNGTKVETNQFSSTYHDRPLTGGRDEDHPNTFHARGGIPGLFFHFEMSPL-- 355

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           KI  K ++ G  W+  + N+       + V A++   V    KV
Sbjct: 356 KIINK-ETYGTSWSGFLLNVISAIGGILTVGAVVDRTVFVADKV 398


>gi|149237735|ref|XP_001524744.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451341|gb|EDK45597.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 411

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 135/415 (32%), Positives = 203/415 (48%), Gaps = 54/415 (13%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T++C L    LI  +  DY  + T  EL VD    
Sbjct: 7   KLISLDAFAKTVEDARIKTASGGIITLLCCLVALILIRNEYIDYTTIVTLPELVVDRDIN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV------EHNIYKRRLDLDGKPIQEPQ 119
            +L I++D+  P + CD + +D  D +G+  L V      ++ I KR    + K ++E  
Sbjct: 67  KQLEINMDMSFPNLPCDMINMDLFDETGDMKLDVINSGLEKYRIIKRG---NNKVVEELD 123

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKW 177
            +   A+++++   E      E E   +CGSCYGA  + +K  CCN+C  V+ AY +KKW
Sbjct: 124 DQP--ALRREQPLHEICKGLGENEQ-GECGSCYGALPQDKKEYCCNSCAAVRRAYAHKKW 180

Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
              + + I QC+ E   +KLK+     EGC++ G  ++NRV+G+   APG+S + N  HV
Sbjct: 181 QFFDGENIEQCEKEGYVQKLKDRINQNEGCRVKGSAKINRVAGTMDFAPGISTTSNGQHV 240

Query: 236 HDIQPYTS--AAFNTTHHIRHLSFG------IKLQDDDERRKPLDGTVAKAEEGASMFNY 287
           HD+  YT     FN  H I HLSFG        LQ+ D    PLDG      +   M NY
Sbjct: 241 HDLSLYTKYPDKFNFDHVIHHLSFGKIPTAITNLQETDS-LSPLDGHSFLQHKRYHMNNY 299

Query: 288 YIKIIPTIYERLDGSK----------------LGGGD----------GGMPGIFFSYELS 321
           Y+KI+ T +E LDG+K                +GG D          GG+P + F +++S
Sbjct: 300 YLKIVSTRFENLDGTKKVDTNQFSVITHDRPLVGGKDEDHQHTLHARGGVPSVAFHFDIS 359

Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           PL +   E+       W+  +  +  +    ++V ALL   V    +   G K +
Sbjct: 360 PLKIINRER---YAKTWSGFVLGVVSSVAGVLMVGALLDRSVFAAQQAMKGKKDL 411


>gi|353237029|emb|CCA69011.1| related to ERV46-component of copii vesicles [Piriformospora indica
           DSM 11827]
          Length = 428

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/405 (30%), Positives = 188/405 (46%), Gaps = 50/405 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            KG+DAF +  ED   KT  G  +T++   FI+    ++  D+ +V     + VD SRG 
Sbjct: 9   FKGIDAFGRTSEDVKVKTRTGAFLTLISAFFIATFTFIEFMDFRRVGVDTAIVVDRSRGE 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL +  +I  P + C  L LD  D SG+    + H++ K RLD       +P  + +   
Sbjct: 69  KLQVVFNITFPRVPCFLLNLDVTDISGDVVREITHHVVKTRLD---PAAHQPIPDGIYRT 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
             K   ++  T T++      CGSCYG +     CCNTC++V+ AY  + WA    D I 
Sbjct: 126 DLKSDLSKQLTATSK----GYCGSCYGGQPPEGGCCNTCDDVRRAYTDRGWAFGNPDQID 181

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC +E  TEK+     EGC I G + VN+V+G+   +PG S+ +N   V+ + PY   + 
Sbjct: 182 QCVSENWTEKIMAMQREGCNIEGRVRVNKVTGNMQFSPGRSFVVNRPEVYALVPYLKDSN 241

Query: 247 N-TTHHIRHLSFGIKLQDDDERRK--------------PLDGTVAKAEEGASMFNYYIKI 291
           +   HHI  L      +D   RR               PL+   A  E    MF Y++K+
Sbjct: 242 HFFGHHIHSLEIYDYEEDTWTRRNLPEQIKERLGITKPPLEDVYAHTESADYMFQYFLKV 301

Query: 292 IPTIYERLDG-----------------SKLGGG---DG--------GMPGIFFSYELSPL 323
           + + Y+ LDG                 + +  G   DG        G+PG+FF++E+SP+
Sbjct: 302 VKSSYKGLDGKAYSTHQYSTSSFERDLATMSHGKNEDGIEIVHERQGVPGVFFNFEISPM 361

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            V   E+ +S  H  T +   I G      LVDALL +    I K
Sbjct: 362 EVIHIEQRQSWAHFITSMAAIIGGVLTVATLVDALLFNTQGLIKK 406


>gi|260950825|ref|XP_002619709.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
 gi|238847281|gb|EEQ36745.1| hypothetical protein CLUG_00868 [Clavispora lusitaniae ATCC 42720]
          Length = 415

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 132/420 (31%), Positives = 199/420 (47%), Gaps = 49/420 (11%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    RL  LDAF K  ED   KT  GG +T+VC L + +LI  +  DY  V    EL V
Sbjct: 1   MSSRPRLLSLDAFAKTVEDARVKTASGGVITLVCVLIVLFLIRNEYSDYMSVVVRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
           +     +L I+LDI  P + C  ++LD +D +G+ HL  VE      R+   G+ I +  
Sbjct: 61  NRDVNRQLDINLDITFPDVPCGVMSLDILDMTGDLHLDIVESGFEMFRVLPSGEEISDDL 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKW 177
             +  A K + V      T  E+     CG CYGA  +T+ ++CCNTC  V+ AY  ++W
Sbjct: 121 PLLSGAKKFEDVCGP--LTEDEISRGVPCGPCYGAVDQTDNKRCCNTCEAVRMAYAVQEW 178

Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
              +   I QC+ E   EK+ +     EGC+I G  ++NR+SG+ H APG+  S N  H 
Sbjct: 179 GFFDGSNIEQCEREGYVEKMVSRINNNEGCRIKGSAKINRISGNLHFAPGVPLSRNGRHS 238

Query: 236 HDIQPYT--SAAFNTTHHIRHLSFG------IKLQDDDERRK----PLDGTVAKAEEGAS 283
           HD+  +T  S  F+  H I H SFG       +L   D+ ++    PLDG     ++   
Sbjct: 239 HDLSLWTKYSNKFSIDHKINHFSFGEDPSASRRLASTDDSQEPSIHPLDGFHFDLKKKNH 298

Query: 284 MFNYYIKIIPTIYERLDGSK-----------------LGGGD----------GGMPGIFF 316
           + +YY+ ++ T +E LDG K                 +GG D          GG+PG FF
Sbjct: 299 VASYYLSVVSTRFEFLDGKKEAVDTNQFSVITHDRPIVGGRDDDHQNTMHAQGGVPGAFF 358

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
            +++SP+ +   E+       W+  +  +  +    + V A L   V    +V  G K +
Sbjct: 359 HFDISPMKIISREE---YAKTWSGFILGVVSSIAGVLTVGAALDRSVWTAEQVLRGKKDM 415


>gi|156844136|ref|XP_001645132.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156115789|gb|EDO17274.1| hypothetical protein Kpol_538p34 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 405

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 129/419 (30%), Positives = 195/419 (46%), Gaps = 59/419 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L  LDAF +P E+   +T  GG +TI C L   YL+  +   + +V +  +L V
Sbjct: 1   MSKKSKLSSLDAFARPDEEVRIRTKMGGIITISCILTTLYLLSWEWSKFREVISKPQLVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQ 119
           D    SKL ++LDI  P + CD++ LD +D SG+  L V E+   K RLD DGK ++   
Sbjct: 61  DRDHSSKLELNLDISFPNVPCDFINLDIMDDSGDLQLDVLEYGFTKTRLDPDGKVLETDD 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA---------ETETRKCCNTCNEVKE 170
            ++           ++G  +T   DPN CG CYG+         E   R CC TC +V++
Sbjct: 121 FDMYK---------QDGAPST---DPNYCGPCYGSIDQSKNDEVEASERVCCQTCEDVRK 168

Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
           AY    WA  +   I QC+ E   +K+ +   EGC++ G   +NR+ G+ H APG S+  
Sbjct: 169 AYVKAGWAFYDGKGIEQCEQEGYVKKINSHLNEGCRVAGSASLNRIQGNIHFAPGKSFQT 228

Query: 231 NHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGAS 283
              H HD   Y  +   N  H I H SFG ++      R       PLDG     E    
Sbjct: 229 VRGHFHDQSLYERNPQLNFNHIIHHFSFGKEIPTKLASRHSKNIVNPLDGRSVAPERDTH 288

Query: 284 M--FNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFF 316
           +  F+YY KI+PT +E L+ + +               GG D           G+PG+FF
Sbjct: 289 LHQFSYYTKIVPTRFEYLNKAVVDTAQFSATYHDRPLRGGADDDHPNTFHFRSGIPGVFF 348

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            ++ SP  +K+  K    G  W+    N   +    + V ++L   + K  +  +G K+
Sbjct: 349 FFDASP--IKVINKEYISGS-WSSFFLNCITSIGGVLAVGSMLDRLMYKAQRSFLGKKS 404


>gi|401839164|gb|EJT42494.1| ERV46-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 415

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 128/415 (30%), Positives = 191/415 (46%), Gaps = 66/415 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RLD +G+P+ +        
Sbjct: 66  KLELNIDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMTRLDKEGRPVGD-------- 117

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
             + +V  +        +DPN CG CYGA  +T+          CC  C+ V+ AY    
Sbjct: 118 AAELQVGGDGDGVAPVNDDPNYCGPCYGARDQTQNENLAQADKVCCQDCDAVRSAYLDAG 177

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  +   + H H
Sbjct: 178 WAFFDGKNIEQCEREGYVSKINEHLHEGCRIEGSAQINRIQGNIHFAPGRPFQNANGHFH 237

Query: 237 DIQPY-TSAAFNTTHHIRHLSFGI------KLQDDDERR-------KPLDGTVAKAEEG- 281
           D+  Y  +   N  H I HLSFG       KL ++D+R         PLDG     E   
Sbjct: 238 DVSLYEKTPDLNFNHMINHLSFGKPIESRNKLLENDDRHGGAVIATSPLDGRKVFPERTT 297

Query: 282 -ASMFNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIF 315
            + +F+Y+ KI+PT YE LD   +               GG D          GG+PG+F
Sbjct: 298 HSHLFSYFAKIVPTRYEYLDDVVIETAQFSATYHSRPLRGGRDQDHPNTFHARGGIPGLF 357

Query: 316 FSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
             +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 358 VFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|443925078|gb|ELU44001.1| ER-derived vesicles protein ERV46 [Rhizoctonia solani AG-1 IA]
          Length = 383

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 122/390 (31%), Positives = 180/390 (46%), Gaps = 64/390 (16%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
           +F    KGLD F K  ED   KT  G  +T++    I     ++  DY +V     + VD
Sbjct: 5   LFGGAFKGLDGFGKTMEDVKVKTRTGAFLTMLSAAIILTFTIIEFIDYRRVVVDSSILVD 64

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEP--Q 119
            SRG KL + ++I  P +    L+LD  D SGE    + HN+ K RLD +G+ IQ+    
Sbjct: 65  RSRGEKLTVKMNITFPRVPL--LSLDVTDISGEIQQDLTHNMVKTRLDSNGQIIQDGFHN 122

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
            E+ N V+K          T +      CGSCYG E     CC TC  V++AY  + W+ 
Sbjct: 123 NELDNDVEK----------TMKARPQGYCGSCYGGEPPEGGCCQTCESVRQAYMNRGWSF 172

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            + D I QC  E+ T K+    +EGC I G + VN+V+G+FH +PG S+ +N  H  D+ 
Sbjct: 173 GDPDAIEQCVAEHWTAKIHEQNSEGCHISGRVRVNKVTGNFHFSPGRSFVLNRGHFQDLV 232

Query: 240 PYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-------------- 283
           PY       +  H++    F  + + +DE R    GT  + + G S              
Sbjct: 233 PYLKDGNHHDFGHYVHEFRFEGESEAEDEWRGTDRGTRWRKKVGISANPLDQVSAHVVDD 292

Query: 284 -----MFNYYIKIIPTIYERLDGS--------------KLGGGDG--------------- 309
                MF Y++K++ T ++ LDG                L  GDG               
Sbjct: 293 RASNYMFQYFMKVVSTEFKYLDGDIIRSHQYSVTSYERDLTHGDGAERDSHGTLTAHGVQ 352

Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           G+PG FF++E+SP+MV   E  ++  H  T
Sbjct: 353 GLPGAFFNFEISPMMVVHRETRQTFAHFAT 382


>gi|151941348|gb|EDN59719.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
 gi|190406692|gb|EDV09959.1| ER-Golgi transport vesicle protein [Saccharomyces cerevisiae
           RM11-1a]
 gi|207348028|gb|EDZ74008.1| YAL042Wp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|256272276|gb|EEU07261.1| Erv46p [Saccharomyces cerevisiae JAY291]
 gi|259144662|emb|CAY77603.1| Erv46p [Saccharomyces cerevisiae EC1118]
 gi|323334778|gb|EGA76150.1| Erv46p [Saccharomyces cerevisiae AWRI796]
 gi|323338873|gb|EGA80087.1| Erv46p [Saccharomyces cerevisiae Vin13]
 gi|323349926|gb|EGA84136.1| Erv46p [Saccharomyces cerevisiae Lalvin QA23]
 gi|365767200|gb|EHN08685.1| Erv46p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 415

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  Y   + H 
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
           HD   Y  ++  N  H I HLSFG  +Q       +D+R         PLDG     +  
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296

Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
                F+Y+ KI+PT YE LD                 L GG            GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           F  +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|6319274|ref|NP_009358.1| Erv46p [Saccharomyces cerevisiae S288c]
 gi|1723191|sp|P39727.2|ERV46_YEAST RecName: Full=ER-derived vesicles protein ERV46
 gi|1326054|gb|AAC04989.1| Yal042wp [Saccharomyces cerevisiae]
 gi|285810158|tpg|DAA06944.1| TPA: Erv46p [Saccharomyces cerevisiae S288c]
 gi|392301230|gb|EIW12318.1| Erv46p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 415

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  Y   + H 
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
           HD   Y  ++  N  H I HLSFG  +Q       +D+R         PLDG     +  
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296

Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
                F+Y+ KI+PT YE LD                 L GG            GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHVRGGIPGM 356

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           F  +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|349576209|dbj|GAA21381.1| K7_Erv46p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 415

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  Y   + H 
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
           HD   Y  ++  N  H I HLSFG  +Q       +D+R         PLDG     +  
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296

Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
                F+Y+ KI+PT YE LD                 L GG            GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           F  +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|320583549|gb|EFW97762.1| COPII-coated vesicle membrane protein Erv46, putative [Ogataea
           parapolymorpha DL-1]
          Length = 400

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 130/387 (33%), Positives = 185/387 (47%), Gaps = 52/387 (13%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF+K  +D   KT  GG +T+VC L    L+  +  DY ++ T  EL VD  R  KL 
Sbjct: 10  FDAFSKTVDDARIKTTSGGILTLVCILTTLLLLINEYTDYSRIVTRPELVVDRDRHKKLE 69

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
           I+LDI    + CD L +D +D SG+  L +      K RLD  G  I +         + 
Sbjct: 70  INLDISFQNMPCDLLTMDIMDQSGDMQLDLLSSGFSKIRLDRQGNEIGQ---------EN 120

Query: 129 KKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKKWAL 179
            +V  E   T++   DP  CGSCYGA  ++R          CCN+C  VK+AY    W  
Sbjct: 121 MRVNQEFALTSS---DPTYCGSCYGAADQSRNDELPQDQKVCCNSCESVKQAYARNAWKF 177

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            +   I QC+ E   +++     EGC++ G  E+ R+ G+ H APG S + N  HVHD+ 
Sbjct: 178 YDGKDIEQCEKEGYVDRINARLDEGCRVRGTAEIARIGGNLHFAPGSSMNFNEKHVHDLS 237

Query: 240 PYT--SAAFNTTHHIRHLSFGIKLQD--DDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
            Y   S  FN  H I H SFG+      D +   PLD T  +      +++Y++K++ T 
Sbjct: 238 LYDMHSNKFNFDHTINHFSFGLDDHSVADYKTTHPLDATTHRDGRKYHVYSYFLKVVNTR 297

Query: 296 YERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKITEK 330
           YE LDG K+               GG D          GG+PG+FF +E+SPL +   E+
Sbjct: 298 YEFLDGRKVETNQFSATQHDRPFRGGRDEDHPNTIHAQGGLPGVFFHFEISPLKIINREQ 357

Query: 331 -SKSLGHLWTKIMCNISGTYITFMLVD 356
            +K+           ISG    F L+D
Sbjct: 358 YNKTWSAFALGACAAISGVLTVFTLLD 384


>gi|323356370|gb|EGA88170.1| Erv46p [Saccharomyces cerevisiae VL3]
          Length = 415

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 128/416 (30%), Positives = 189/416 (45%), Gaps = 68/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWGQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTXPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  Y   + H 
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
           HD   Y  ++  N  H I HLSFG  +Q       +D+R         PLDG     +  
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296

Query: 282 ASM--FNYYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGI 314
                F+Y+ KI+PT YE LD                 L GG            GG+PG+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDNVVIETAQFSATFHSRPLAGGRDKDHPNTLHARGGIPGM 356

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           F  +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 357 FVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|365989554|ref|XP_003671607.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
 gi|343770380|emb|CCD26364.1| hypothetical protein NDAI_0H01900 [Naumovozyma dairenensis CBS 421]
          Length = 438

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 135/405 (33%), Positives = 184/405 (45%), Gaps = 68/405 (16%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S +L   DAF K  E+   +T  GG +TI C L   YL+  +   +  V T+ +L VD  
Sbjct: 6   SAKLLSFDAFAKTEEEVRIRTNTGGIITISCILVTLYLLLNEWSQFNSVITSPQLVVDRD 65

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKP------IQ 116
           R  KL ++LDI  P ISCD + LD +D SGE  L  ++    K RLD  G P      + 
Sbjct: 66  RNLKLELNLDISFPNISCDLINLDIMDESGELQLDLLDSTFIKTRLDPQGNPLDNDNNVA 125

Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNE 167
           +   ++V  V       E        +DP+ CGSCYG++ +T           CC TCN+
Sbjct: 126 DTDADLVIGVDDLTKNGEKRLKEILAKDPDYCGSCYGSQDQTENESKSKDQKICCQTCND 185

Query: 168 VKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS 227
           V+++Y    WA  +   I QC+NE    K+     EGC+I G   +NR+ G+ H APG S
Sbjct: 186 VRDSYLNAGWAFFDGAQIEQCENEGYVAKINKHLEEGCRIKGQALLNRIQGNIHFAPGKS 245

Query: 228 YS----INHVHVHDIQPYTSA-AFNTTHHIRHLSFGIK--------LQDDDERRK----P 270
           YS        H HD   Y      N  H I HLSFG          L+D  +R+K    P
Sbjct: 246 YSNYKAKGSTHRHDTSLYDKVKKMNFNHIIHHLSFGKSIDKVGKNDLKDYSDRKKFSINP 305

Query: 271 LDG---TVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------------GGGD- 308
           LD     V         F+YY KI+PT YE LD  K+                  GG D 
Sbjct: 306 LDDRKVIVKDFNPAFHQFSYYTKIVPTRYEFLD-EKISSIETAQFSATYHSRPIQGGTDE 364

Query: 309 ---------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
                    GG+PG+FF +E+SP  +K+  K       W+  + N
Sbjct: 365 DHPTTFHSRGGIPGLFFFFEMSP--IKVINKEHHF-RTWSSFLLN 406


>gi|148674214|gb|EDL06161.1| ERGIC and golgi 3, isoform CRA_a [Mus musculus]
          Length = 238

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 107/253 (42%), Positives = 150/253 (59%), Gaps = 18/253 (7%)

Query: 29  AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDA 88
           AVTIV  L +  L   ++  Y       EL+VD SRG KL I++D++ P + C YL++DA
Sbjct: 1   AVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGDKLKINIDVLFPHMPCAYLSIDA 60

Query: 89  VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
           +D +GEQ L VEHN++K+RLD DG P+    +   + + K +VT  +  +     DPN+C
Sbjct: 61  MDVAGEQQLDVEHNLFKKRLDKDGVPVSSEAER--HELGKVEVTVFDPNSL----DPNRC 114

Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
            SCYGAE+E  KCCN+C +V+EAYR + WA    DTI QC+ E  ++K++    EGCQ+Y
Sbjct: 115 ESCYGAESEDIKCCNSCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVY 174

Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
           G+LEVN+V G          S     VHD+Q +     N TH+I+HLSFG   +D     
Sbjct: 175 GFLEVNKVPGG---------SKARQLVHDLQSFGLDNINMTHYIKHLSFG---EDYPGIV 222

Query: 269 KPLDGTVAKAEEG 281
            PLD T   A +G
Sbjct: 223 NPLDHTNVTAPQG 235


>gi|325189930|emb|CCA24410.1| hypothetical protein BRAFLDRAFT_63528 [Albugo laibachii Nc14]
          Length = 699

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 196/380 (51%), Gaps = 27/380 (7%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ +D   K  ++F  KT+ GG ++++    I YL+  ++  Y  V   +++ VD SR 
Sbjct: 322 KLRNVDFNPKTLDEFKVKTINGGILSLLSIGLIGYLLVSELIFYLSVDIVDKMLVDGSRN 381

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + I+ D+  P + C  + L++  SSGE H  ++H+++K+ +DL+GK +    K  +++
Sbjct: 382 RMVTINFDVEFPRMPCSIVTLESTGSSGEIHHDIQHSVHKQAIDLNGKILSAGMK--LDS 439

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + K   T ++ T   E     +CGSCYGA   + +CCNTC +V++AY  ++W +P L TI
Sbjct: 440 IGKAW-TNQSDTVAEEKTVKVECGSCYGAGA-SGECCNTCEDVQQAYASRRWNIPSLHTI 497

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+     + L +T  EGC+IYG + V +V G    AP  +    ++   +I   T   
Sbjct: 498 EQCQKSEIEKLLHSTVEEGCRIYGSIAVTKVHGKVLFAPAKALLSGYISTEEILDKTIKI 557

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYERLDGS 302
           F+T+H I +L FG   +   E + PL+G    + K   G   + Y+++++PT Y  L+G 
Sbjct: 558 FDTSHKINYLDFG---ERYPEMKSPLNGHNTILPKGTRGT--YQYFLQVVPTAYYYLNGG 612

Query: 303 KLGG---------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
            +                 G+  +P I F Y+ SP+M +I ++ +      T +   + G
Sbjct: 613 IIDTNQYSVTQHYQELTPLGEQQLPMITFQYKFSPIMFQIEQRRRGYLQFLTSLCAILGG 672

Query: 348 TYITFMLVDALLHSCVKKIS 367
            +     VD++L +   + S
Sbjct: 673 VFTMVGAVDSILFAYSNQFS 692


>gi|47214843|emb|CAF95749.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 299

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 100/231 (43%), Positives = 140/231 (60%), Gaps = 14/231 (6%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  Y       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYYLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ-EPQKEVVN 124
            KL I++DIV P + C YL++DA+D +GEQ L VEHN++K+RLD + KP+  E +K  + 
Sbjct: 66  DKLKINIDIVFPHMPCVYLSIDAMDVAGEQQLDVEHNLFKQRLDKNLKPVSTEAEKHELG 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             +  +V   +        DPN+C SCYGAET+  KCCN+C++V+EAYR + WA    DT
Sbjct: 126 GAEDVEVFDPSTL------DPNRCESCYGAETDDLKCCNSCDDVREAYRRRGWAFKNADT 179

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS-------GSFHIAPGLSY 228
           I QCK E  T+K++    EGCQ+YG LEVN+VS       G F +  G  +
Sbjct: 180 IEQCKREGFTQKMQEQKNEGCQVYGVLEVNKVSLIAQEGGGKFSLCSGKKF 230


>gi|254569250|ref|XP_002491735.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv41p [Komagataella pastoris GS115]
 gi|238031532|emb|CAY69455.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv41p [Komagataella pastoris GS115]
 gi|328351763|emb|CCA38162.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Komagataella pastoris CBS 7435]
          Length = 401

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 121/388 (31%), Positives = 187/388 (48%), Gaps = 44/388 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  +D   KT  GG +T++C +    L+  +  DY  V    EL VD    
Sbjct: 5   KLLSLDAFAKTADDVKVKTTSGGVITLICLIVTLILVTNEYFDYQTVVIRPELVVDRDHA 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I L++    I C+ LA+D +D +G+  + +  + +++   +DG   +E  +  VN 
Sbjct: 65  KKLDISLNVTFHHIPCELLAMDIMDITGDLQIDLLMSGFQKTRVVDGLA-KETTELRVNE 123

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETET---------RKCCNTCNEVKEAYRYKK 176
            K+     EN   T    +P  CGSCYGA  +          + CCNTC  VK+AY    
Sbjct: 124 YKQ-----ENNKLTNS-NNPYYCGSCYGALNQKDNENKPFDEKLCCNTCESVKKAYAKAG 177

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC+NE   + + +   EGCQ+ G  ++NRVSG+ H APG S +    H+H
Sbjct: 178 WAFYDGRNIEQCENEGYVQLVTSMVDEGCQVSGTAQINRVSGNLHFAPGSSLTSGSRHIH 237

Query: 237 DIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           D+  +      FN  H + HLSFG  + + +    PLDG  A       +++Y++K++ T
Sbjct: 238 DLSLFEKYPDKFNFDHTVNHLSFGKTIDNQEMSTHPLDGYEAATGNKNHLYSYFLKVVAT 297

Query: 295 IYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKITE 329
            YE + G K                GG D          GG+PG FF +E+SPL +   E
Sbjct: 298 RYESMSGLKWDTNQFSATYHDRPLEGGRDSDHPNTLHASGGIPGAFFHFEISPLKIINRE 357

Query: 330 K-SKSLGHLWTKIMCNISGTYITFMLVD 356
           + SK+       +  +++G      ++D
Sbjct: 358 QYSKTRSAFALGVSASVAGVLTLGSVLD 385


>gi|255712984|ref|XP_002552774.1| KLTH0D01144p [Lachancea thermotolerans]
 gi|238934154|emb|CAR22336.1| KLTH0D01144p [Lachancea thermotolerans CBS 6340]
          Length = 402

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 127/407 (31%), Positives = 191/407 (46%), Gaps = 61/407 (14%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  +DAF K  ED   +T  GG +T+ C +    L+  +     +V T  +L VD  R 
Sbjct: 5   KLLSIDAFAKTEEDVRIRTRTGGLITLSCVVVTFLLLLSEWFHLKEVVTRPQLVVDRDRH 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQ----EPQK 120
            KL +++DI  P I C  L +D +DS+GE  L V +  + K RLD  G+ +     +P K
Sbjct: 65  LKLDLNMDITFPHIPCYLLNMDIMDSAGEMQLEVLNKGWSKTRLDPSGQVLDTKQFKPGK 124

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET---------RKCCNTCNEVKEA 171
           +VV+                  ED N CG CYGA  ++         R CC TC++V+EA
Sbjct: 125 DVVDYAP---------------EDENYCGPCYGARDQSKNDEVNVDERVCCQTCDDVREA 169

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           Y  K+WA  +   I QC+ E   E++     EGC+I G  ++NR+ G+ H APG  +   
Sbjct: 170 YAEKQWAFFDGKNIEQCEREGYVEQVNEHIEEGCRIKGMAKLNRIGGNLHFAPGKGFHNI 229

Query: 232 HVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQD---DDERRKPLDGTVAKAE--EGASMF 285
             H HD   Y  S + N  H I HLSFG +++D         PLDGT    E       F
Sbjct: 230 RGHFHDASLYQNSPSLNFNHIIHHLSFGKEVEDITGQGASTAPLDGTNVSPEFDTHKHQF 289

Query: 286 NYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYEL 320
           +Y+ KI+PT YE L G  +               GG D          GG P ++F +E+
Sbjct: 290 SYFAKIVPTRYEYLSGETVETTQFTTTYHSRPLKGGRDSDHPTTLHSQGGFPSVYFYFEM 349

Query: 321 SPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           SPL ++   + ++S    W   + +I G      ++D + +   + +
Sbjct: 350 SPLKVINKQQYAQSWSGFWLNCITSIGGVLAVGTVLDKITYKAQRSM 396


>gi|403215799|emb|CCK70297.1| hypothetical protein KNAG_0E00290 [Kazachstania naganishii CBS
           8797]
          Length = 408

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 190/411 (46%), Gaps = 53/411 (12%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+    L  +DAF++  +D   +T  G  VT+ C +   +L+  +   +  + +   L +
Sbjct: 1   MMRRSTLLSMDAFSRAEDDVRVRTRAGAYVTLACLVTTVFLLLSEYRQWNTIVSRSSLVI 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE---HNIYKRRLDLDGKPIQE 117
           D   G KL + LD+  P + CD ++ D +D SG   L V+   ++  K R+D  G+P+  
Sbjct: 61  DREHGLKLDLRLDVTFPHLPCDLVSFDVLDDSGVLLLDVDDENNHFTKTRIDQRGEPL-- 118

Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEV 168
                 +A        +         DP+ CGSCYG+  +TR          CCNTC+ V
Sbjct: 119 ------DAAAAASFKLDAEAAQLPPTDPDYCGSCYGSRDQTRNDELDPANKVCCNTCSSV 172

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           +EAY    WA  +   I QC+ E   +K+    TEGC+I G + +NRV G+ H APG ++
Sbjct: 173 REAYLDAGWAFFDGKNIEQCEREGYVDKISQRITEGCRIKGGVRLNRVQGNIHFAPGDAF 232

Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRK-------PLDG--TVAKA 278
                H HD   Y  + + N  H I HLSFG  + +     K       PLDG   + + 
Sbjct: 233 RSARGHFHDTSMYDQTGSLNFDHIIHHLSFGPSVDNMQSLEKASNVAIAPLDGKQVLPRY 292

Query: 279 EEGASMFNYYIKIIPTIYERLDGS--------------KLGGG--------DGGMPGIFF 316
           +  A  + Y+ KI+PT +E   GS               +GGG         GG PG++F
Sbjct: 293 DSHAYQYTYFTKIVPTRFEYFSGSVIETTQFSSTFSARPIGGGTTETATYTSGGTPGLYF 352

Query: 317 SYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + E+SPL V   E++K S        + +I G      +VD +L+   + +
Sbjct: 353 NIEMSPLKVIHKEQNKISWSGFLLNCITSIGGVLAVGTVVDKILYRAERTL 403


>gi|327296796|ref|XP_003233092.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
 gi|326464398|gb|EGD89851.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
          Length = 435

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 190/440 (43%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL            
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
           E    +    ++      +    DPN CG CYG  A +  +K  CCNTC+EV++AY  K 
Sbjct: 113 EGGRVIDVTALSLHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      TH I  L FG +L ++   R          PLD +  K  E    F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292

Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
            Y++K++ T Y  L        +           G  G+FF                   
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352

Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
                                     +YE+SP+ V   E + KSL   +T +   I GT 
Sbjct: 353 LDAEDASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD LL+    ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432


>gi|302666755|ref|XP_003024974.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
 gi|291189052|gb|EFE44363.1| hypothetical protein TRV_00895 [Trichophyton verrucosum HKI 0517]
          Length = 435

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 189/440 (42%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL            
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
           E    +    +       +    DPN CG CYG  A +  +K  CCNTC+EV++AY  K 
Sbjct: 113 EGGRVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      TH I  L FG +L ++   R          PLD +  K  E    F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292

Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
            Y++K++ T Y  L        +           G  G+FF                   
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352

Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
                                     +YE+SP+ V   E + KSL   +T +   I GT 
Sbjct: 353 LDAEDASADGHKERQHSRGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD LL+    ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432


>gi|302511557|ref|XP_003017730.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
 gi|291181301|gb|EFE37085.1| hypothetical protein ARB_04613 [Arthroderma benhamiae CBS 112371]
          Length = 435

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 189/440 (42%), Gaps = 79/440 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL            
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
           E    +    +       +    DPN CG CYG  A +  +K  CCNTC+EV++AY  K 
Sbjct: 113 EGGRVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      TH I  L FG +L ++   R          PLD +  K  E    F
Sbjct: 233 DLDNYYHTPVPHTMTHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEVRYNF 292

Query: 286 NYYIKIIPTIYERLDGSKLGGGDG----------GMPGIFF------------------- 316
            Y++K++ T Y  L        +           G  G+FF                   
Sbjct: 293 LYFVKVVSTSYLPLGWDPTLSSEAHSQAHRDIPLGNHGVFFGSQGSIETHQYSVTSHQRS 352

Query: 317 --------------------------SYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
                                     +YE+SP+ V   E + KSL   +T +   I GT 
Sbjct: 353 LDAEDASADGHKERQHARGGIPSVMFNYEISPMKVINRETRPKSLSAFFTGVCAVIGGTL 412

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD LL+    ++ K+
Sbjct: 413 TVAAAVDRLLYEGSLRVKKL 432


>gi|323454843|gb|EGB10712.1| hypothetical protein AURANDRAFT_2571, partial [Aureococcus
           anophagefferens]
          Length = 380

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 181/373 (48%), Gaps = 32/373 (8%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L+ +D + K  ++F  +T+ GG  ++   +    L+  ++     VST + LFV+SS G
Sbjct: 7   KLRNMDMYPKTKDEFRVRTMQGGVSSLFAVVVAIILVRSELKHSLAVSTHDRLFVNSSHG 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV--V 123
             L +  ++  P  +C+ LA+DA D SG+    V+ ++ K RLD +G+ +   +K    V
Sbjct: 67  DGLSVRFELEFPRANCELLAIDANDESGQPLEGVQQHVIKTRLDTNGRRVLVNRKAANSV 126

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           + V     + E+     E +    CG CYGA+ + R CC TC++V+ AYR + W   E  
Sbjct: 127 HKVGDTATSEEHLAAPDEAKPEVACGDCYGAQDDERPCCATCDDVRSAYRKRGWTFHE-H 185

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-HDIQPYT 242
           T+ QC  E +   L     EGC I G LE+  VSG+FH+APG     + +    D+   T
Sbjct: 186 TVAQCAGELAEAALDLDSDEGCSIKGTLELPAVSGNFHVAPGRHLQTSGLFKGMDLVQLT 245

Query: 243 SAAFNTTHHIRHLSFGI---KLQDDDERRK----------PLDGTVAKAEEGASMFNYYI 289
              FN +H ++ L FG     L+     RK           LDG      +G  M  YY+
Sbjct: 246 FDKFNVSHTVKQLRFGPDERSLEPARASRKVVGPDVDLSSQLDGESRTLGDGYGMHQYYL 305

Query: 290 KIIPTIYERLDGSK--------------LGGGDG-GMPGIFFSYELSPLMVKITEKSKSL 334
           K++PT+Y+ L G                +  G G G+PG+FF YE+SPL  +  E+    
Sbjct: 306 KVVPTVYKNLGGKTRELWQYSVTEHVRHVAPGSGKGLPGVFFFYEVSPLCAEFVERRNGW 365

Query: 335 GHLWTKIMCNISG 347
             L T +   + G
Sbjct: 366 LALLTGLAAIVGG 378


>gi|315044047|ref|XP_003171399.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma gypseum CBS 118893]
 gi|311343742|gb|EFR02945.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Arthroderma gypseum CBS 118893]
          Length = 435

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 154/311 (49%), Gaps = 23/311 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGIVTITALLVVLYLVWGEWKDYRRVVVQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL            
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
           E    +    +       +    DPN CG CYG  A +  +K  CCNTC EV++AY  K 
Sbjct: 113 EGGKVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSNAKKPGCCNTCEEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      +H I  L FG +L ++   R          PLD +  K +E    F
Sbjct: 233 DLDNYYHTPVPHTMSHTIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSDHKTDEARYNF 292

Query: 286 NYYIKIIPTIY 296
            Y++K++ T Y
Sbjct: 293 MYFVKVVSTSY 303



 Score = 41.2 bits (95), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 309 GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           GG+P + F+YE+SP+ V   E + KSL   +T +   I GT      VD LL+    ++ 
Sbjct: 371 GGIPSVIFNYEISPMKVINREARPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGGLRVK 430

Query: 368 KV 369
           K+
Sbjct: 431 KL 432


>gi|326476034|gb|EGE00044.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
 gi|326481270|gb|EGE05280.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Trichophyton equinum CBS 127.97]
          Length = 435

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 109/311 (35%), Positives = 154/311 (49%), Gaps = 23/311 (7%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAF K  ED   +T  GG VTI   L + YL+  +  DY +V    EL V
Sbjct: 1   MAGKSRFTRLDAFAKTVEDARIRTRSGGVVTIAALLIVIYLVWGEWKDYRRVVVQPELIV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG ++ IHL++  P + C+ L LD +D SGE    V+H + K RL            
Sbjct: 61  DKGRGERMEIHLNMTFPNLPCELLTLDVMDVSGELQTDVDHGVNKVRL--------SSAA 112

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG--AETETRK--CCNTCNEVKEAYRYKK 176
           E    +    +       +    DPN CG CYG  A +  +K  CCNTC+EV++AY  K 
Sbjct: 113 EGGKVIDVTALALHKKEDSPAHLDPNYCGDCYGVPAPSTAKKPGCCNTCDEVRDAYAEKN 172

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC +E  ++++     EGC+I G L VN+V+G+FHIAPG S +  + H H
Sbjct: 173 WAFGRGENVAQCIDEGYSQRIDEQRHEGCRIEGILRVNKVAGNFHIAPGRSLTAGNFHAH 232

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASMF 285
           D+  Y  T      +H I  L FG +L ++   R          PLD +  K  E    F
Sbjct: 233 DLDNYYHTPVPHTMSHIIHKLRFGPQLPEELYSRWKWTHQDTINPLDKSEHKTNEARYNF 292

Query: 286 NYYIKIIPTIY 296
            Y++K++ T Y
Sbjct: 293 LYFVKVVSTSY 303



 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 34/62 (54%), Gaps = 1/62 (1%)

Query: 309 GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           GG+P + F+Y++SP+ V   E + KSL   +T +   I GT      VD LL+    ++ 
Sbjct: 371 GGIPSVMFNYDISPMKVINRESRPKSLSAFFTGVCAVIGGTLTVAAAVDRLLYEGSLRVK 430

Query: 368 KV 369
           K+
Sbjct: 431 KL 432


>gi|344301666|gb|EGW31971.1| hypothetical protein SPAPADRAFT_50577 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 410

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 127/401 (31%), Positives = 195/401 (48%), Gaps = 44/401 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL  LDAF K  +D   +T  GG +T++C L    LI  +  DY  V T  EL VD    
Sbjct: 8   RLLSLDAFAKTVDDARIRTTSGGIITLLCVLITLVLIRNEYIDYTTVITRPELVVDRDIN 67

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQKEVVN 124
            +L I+LDI    + CD  ++D +D +G+  L+ +     K RL  D   I     +   
Sbjct: 68  KQLVINLDISFINLPCDMASIDLLDETGDMQLNIINAGFQKLRLIKDKGNIVREISDDTP 127

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALPEL 182
           A+   +  +E      E  DP  CGSCYGA  + + + CCN C  VK AY  ++W+  + 
Sbjct: 128 ALNLDRPLSEVVKGLPEGGDPKTCGSCYGALPQEKHQYCCNDCYSVKRAYAERRWSFFDG 187

Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
           + I QC+ E   ++L+      EGC+I G  ++NRVSG+   APG S++ +  HVHD+  
Sbjct: 188 ENIEQCEKEGYVKRLRQRINDNEGCRIKGSAKINRVSGTMDFAPGASFTSDGRHVHDVSL 247

Query: 241 YT--SAAFNTTHHIRHLSFGIKLQDDDERRK------PLDGTVAKAEEGASMFNYYIKII 292
           Y      FN  H I HLSFG     +D R +      PLDG      +   + +YY+K++
Sbjct: 248 YGKYQDKFNFDHIINHLSFG----SNDAREEILNSVHPLDGYQFMLHKKHHVASYYLKVV 303

Query: 293 PTIYERLDGSK----------------LGGGD----------GGMPGIFFSYELSPLMVK 326
            T +E LD SK                 GG D          GG+PG+ F +++SPL + 
Sbjct: 304 ATRFESLDQSKRLDTNQFSVITHDRPLTGGKDEDHEHTLHARGGIPGVEFHFDISPLKII 363

Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
             E+ +K+       ++ +I+G  +   L+D  +++  + I
Sbjct: 364 NKEQYAKTWSGFVLGVISSIAGVLMVGTLIDRSVYATQQAI 404


>gi|387219467|gb|AFJ69442.1| endoplasmic reticulum-golgi intermediate compartment protein 3
           [Nannochloropsis gaditana CCMP526]
          Length = 432

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 187/389 (48%), Gaps = 43/389 (11%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE-LFVDSSRG 65
           L+ +D FTK +++   +T  G ++ +  W+ +  L+C +  + F  S T+E L VD+S G
Sbjct: 34  LERMDVFTKFHDEDKIQTSRGASMALFSWVLVLVLLCSEAYEAFLTSRTKEHLVVDTSLG 93

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I LD+    ++C  + +DA+D +G+  + VEHN+ K+RL   G+ I  P  E    
Sbjct: 94  DKLNITLDMTFHALTCADVHVDAMDVAGDNQMQVEHNMLKQRLSSQGERIGFPFLEDPTD 153

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
              KK     G    +      CGSC+ A T T  CCN+C ++++AY  +   + ++ T 
Sbjct: 154 FDSKKADALLGAAPWDY-----CGSCFQARTHTGACCNSCQDLEQAYLTQGLPMGKIKTT 208

Query: 186 V-QCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
             QC   +            EGC + G++ VN+V+G+FHIA G S   +  H+H   P  
Sbjct: 209 APQCLPGFQAPAPSGPMQKGEGCNLKGFMSVNKVAGNFHIAFGDSVVKDGRHIHQFIPSE 268

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV--AKAEEGASMFNYYIKIIPTIYE--- 297
           +  FN +H I+H+SFG    +   R  PLDG V    +  G  +F Y+IK+IPT Y+   
Sbjct: 269 APFFNVSHTIQHVSFG---DEYPGRVNPLDGKVKYVSSTVGTGLFQYFIKVIPTHYKGRA 325

Query: 298 ------------------------RLDGSKLGGGD--GGMPGIFFSYELSPLMVKITEKS 331
                                   RL G      D    +PG+FF Y+LSP  V+++  S
Sbjct: 326 GEAIRTNRISVTERFKPLHKEGEARLTGDSHAHNDQTSVLPGVFFIYDLSPFNVEVSTVS 385

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLH 360
               H   K+     G +    L+D + +
Sbjct: 386 VPFSHFLVKLCAIAGGVFSISRLLDNVFY 414


>gi|344230637|gb|EGV62522.1| hypothetical protein CANTEDRAFT_131007 [Candida tenuis ATCC 10573]
          Length = 410

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 122/401 (30%), Positives = 193/401 (48%), Gaps = 42/401 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T+V    + +LI  +  DY  + T  EL VD    
Sbjct: 6   KLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDRDIN 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
            KL I LDI  P+I C  + LD +D SG   L +  N +++ R+   G+ +      +++
Sbjct: 66  QKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKNAPLID 125

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALPEL 182
           +   + +    G    E  +   CG CYG+  + RK  CCN C  ++ AY  K WA  + 
Sbjct: 126 STPLEVMA--KGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDG 183

Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
           + I  C++E   + +++     EGC++ G  ++NR+SG+ H APG S++    HVHD+  
Sbjct: 184 ENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSL 243

Query: 241 YTSAA--FNTTHHIRHLSFG----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           Y      FN  H I HLSFG         D +   PLDG     +E   +++Y++K++ T
Sbjct: 244 YNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVST 303

Query: 295 IYERL------------------DGSKLGGGD----------GGMPGIFFSYELSPLMVK 326
            YE L                  D    GG D          GG+PG++F +++SPL + 
Sbjct: 304 RYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKII 363

Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
             E+ SK+       ++ +I+G  +   L+D  + +  K I
Sbjct: 364 NKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAI 404


>gi|344230638|gb|EGV62523.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
          Length = 409

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 122/401 (30%), Positives = 193/401 (48%), Gaps = 42/401 (10%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T+V    + +LI  +  DY  + T  EL VD    
Sbjct: 5   KLLSLDAFAKTVEDARVKTASGGIITLVSITIVLFLIRNEYLDYTSIITRPELVVDRDIN 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
            KL I LDI  P+I C  + LD +D SG   L +  N +++ R+   G+ +      +++
Sbjct: 65  QKLDITLDISFPSIPCSMINLDILDVSGNVELDILQNGFQKYRILSSGEEVLMKNAPLID 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWALPEL 182
           +   + +    G    E  +   CG CYG+  + RK  CCN C  ++ AY  K WA  + 
Sbjct: 125 STPLEVMA--KGLDKPEDAEHTPCGDCYGSLPQDRKQYCCNNCETIRRAYAAKVWAFYDG 182

Query: 183 DTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
           + I  C++E   + +++     EGC++ G  ++NR+SG+ H APG S++    HVHD+  
Sbjct: 183 ENIKPCEDEGYVKAIQSEIFNNEGCRVKGTTQINRISGNLHFAPGASFTEPSRHVHDLSL 242

Query: 241 YTSAA--FNTTHHIRHLSFG----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           Y      FN  H I HLSFG         D +   PLDG     +E   +++Y++K++ T
Sbjct: 243 YNKFPDRFNFDHTINHLSFGKDPETNANTDKKTLHPLDGETRNLKEKYHLYSYFLKVVST 302

Query: 295 IYERL------------------DGSKLGGGD----------GGMPGIFFSYELSPLMVK 326
            YE L                  D    GG D          GG+PG++F +++SPL + 
Sbjct: 303 RYEYLQEKLKAPLETNQFSAIYHDRPIKGGKDEDHQHTLHARGGLPGLYFYFDISPLKII 362

Query: 327 ITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
             E+ SK+       ++ +I+G  +   L+D  + +  K I
Sbjct: 363 NKEQYSKTWSGFVLGVISSIAGVLMIGSLLDRSVWAAEKAI 403


>gi|367017984|ref|XP_003683490.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
 gi|359751154|emb|CCE94279.1| hypothetical protein TDEL_0H04200 [Torulaspora delbrueckii]
          Length = 406

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 126/414 (30%), Positives = 189/414 (45%), Gaps = 62/414 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MV    L   DAF+K  ED   +T  GG +++ C +   +L+  +  ++ QV T  +L V
Sbjct: 1   MVQKSALLSFDAFSKTEEDVRIRTRSGGLISLSCVVLTIFLLISEWLNFNQVVTRPQLVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQ 119
           D  R  KL   +DI  P++ C  ++LD +D++GE  L + E    K R+D +GK I    
Sbjct: 61  DRDRQLKLDFVVDITFPSMPCAMISLDIMDNAGELQLDIMEAGFTKTRIDSNGKEI---- 116

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE---------TETRKCCNTCNEVKE 170
                          + ++    +D N CGSCYGA+          E R CC TC++V++
Sbjct: 117 -------STSSFDASDSSSDYVPDDENYCGSCYGAKDQDKNDELPKEERVCCQTCDDVRK 169

Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
           AY   +WA  +   I QC+ E   E++     EGC++ G   ++R+ G+ H APG  +  
Sbjct: 170 AYLEAEWAFYDGKNIEQCEREGYVERINQQLNEGCRVQGNALLSRIQGTIHFAPGRGFQN 229

Query: 231 NHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGAS 283
           N  H HD+  Y  +   N  H I HLSFG  +    E R       PLDG     +    
Sbjct: 230 NRGHFHDMSLYDNTPQLNFNHIIHHLSFGKPINSGAEDRGAATSTHPLDGRQVFPDRDTH 289

Query: 284 M--FNYYIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFF 316
           +  F+Y+ KI+PT YE LD   +               GG D          GG PG+F 
Sbjct: 290 LHQFSYFAKIVPTRYEYLDDVVVETAQFSTTYHDRPLRGGVDDDHPNTLHSRGGSPGMFV 349

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
            +E+SPL V   E+       W+  + N    I G      ++D +L+   K I
Sbjct: 350 YFEMSPLKVINKEQH---AQTWSGFLLNCITSIGGVLAVGTVLDKVLYKAQKSI 400


>gi|224011116|ref|XP_002294515.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220970010|gb|EED88349.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 454

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 187/385 (48%), Gaps = 21/385 (5%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVS--TTEELFVD 61
           ++ +K LD F K   D+  +T  GG  T+V ++ +  LI  +   +  ++  + E + VD
Sbjct: 71  AKTVKKLDFFPKLERDYEVRTERGGQATLVGYVIMLVLILAEFWTWRGLNGESLEHIVVD 130

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S G ++ ++L+I  P + CD L LD +D +G+  L +   ++K RL+LDG  ++   K 
Sbjct: 131 TSLGKRMRVNLNITFPNLHCDDLHLDVIDVAGDSQLDLSDTLFKHRLNLDGT-LRSKAKI 189

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
              A  K     +     ++    + CG CYGA+ +   CCNTC++V E Y+ K+W    
Sbjct: 190 ATEANIKADEDKKKQEALSKDIPADYCGPCYGADEKEGDCCNTCDDVMERYKKKRWNENA 249

Query: 182 LDTIV-QCKNE--YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           +  +  QC  E     E  + +  EGC + G+  VNRV+G+FHIA G     +  H+H  
Sbjct: 250 VQPLAEQCIREGKGKNEPKRMSNGEGCNLSGHFTVNRVAGNFHIAMGEGVDRDGRHIHQF 309

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDD--------DERRKPLDGTVAKAEEGASMFNYYIK 290
            P     FN +H +  L F  +   D        +     +   V +      +F Y+IK
Sbjct: 310 LPEDRMNFNASHVVHELIFMDEEYGDMVIAGVPGETSMNSVSKVVTEDTGTTGLFQYFIK 369

Query: 291 IIPTIYERLDGSKL-------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           ++PT Y+   G  L          +  +PG+FF YE+ P  V++T+      HL  +IM 
Sbjct: 370 VVPTKYKGKSGGTLHEKVEHHDTQNAVLPGVFFVYEIYPFAVEVTKNKVPFMHLLIRIMA 429

Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
            + G +     +D+ L+S  KK S+
Sbjct: 430 TVGGVFTIMGWIDSALYSREKKSSR 454


>gi|294657513|ref|XP_459821.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
 gi|199432751|emb|CAG88060.2| DEHA2E11792p [Debaryomyces hansenii CBS767]
          Length = 402

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 125/411 (30%), Positives = 194/411 (47%), Gaps = 54/411 (13%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  +D F K  ED   KT  GG +T+VC   + +LI  +  DY  + T  EL VD    
Sbjct: 6   KLISIDVFAKTVEDAKIKTASGGIITLVCIFIVMFLIRNEYKDYTSIITRPELVVDRDIN 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           +KL I+LD+  P + CD L LD +D SG+  L +  + +++      + ++E   E+++ 
Sbjct: 66  TKLDINLDVSFPNMPCDVLTLDILDISGDLQLDILKSGFQKY-----RILKESNHEILDE 120

Query: 126 VKKKKVTTENGTTTTELED----PNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWAL 179
                    N  +  E+        KCG CYGA  +     CCN+C  VK AY  K WA 
Sbjct: 121 AP----VLSNDLSLEEMAKGVGANGKCGPCYGALPQDNNEYCCNSCETVKLAYAEKMWAF 176

Query: 180 PELDTIVQCKNE----YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            +   I QC+NE      TE++ N   EGC++ G  ++NR+SG+ H APG S +    H+
Sbjct: 177 YDGKDIEQCENEGYVSRLTERINN--NEGCRVKGTAQINRISGNLHFAPGSSSTAPGRHI 234

Query: 236 HDIQPYT--SAAFNTTHHIRHLSFGIKLQDDDERRK--PLDGTVAKAEEGASMFNYYIKI 291
           HD+  +      FN  H I H SFG    D++ ++   PLD      +E   + +YY+K+
Sbjct: 235 HDLSLFEKYEDKFNFDHVINHFSFGSDPHDNNLQQSTHPLDNHQLVFDEKYHVASYYLKV 294

Query: 292 IPTIYERLDGS---------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
           + T +E +D S                L GG            GG+PG+FF +E+SP+  
Sbjct: 295 VATRFEFIDTSLPLDTNQFSVISHHRPLRGGKDEDHKHTLHARGGLPGVFFHFEISPM-- 352

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           KI  K +     W+  +  +  +    ++V  +L   V    K   G K +
Sbjct: 353 KIINKEQ-YAKTWSGFILGVISSVAGVLMVGTVLDRSVWAAEKAIKGKKDM 402


>gi|444321132|ref|XP_004181222.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
 gi|387514266|emb|CCH61703.1| hypothetical protein TBLA_0F01610 [Tetrapisispora blattae CBS 6284]
          Length = 414

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 190/412 (46%), Gaps = 58/412 (14%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  E+   +T  GG +T+ C L   YL+  +  +Y++++   ++ VD  R 
Sbjct: 4   KLLSFDAFNKTDEEVRIRTRTGGIITLFCILTTLYLLQKEWIEYYKITNKPQVVVDRDRH 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            KL ++LDI  P++SCD + LD VD SGE  L V E    K R+D +G  + +  +  V 
Sbjct: 64  LKLELNLDITFPSLSCDLIGLDIVDDSGETSLDVLESGFTKIRVDTNGNELDDGSQLDVG 123

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETET----------RKCCNTCNEVKEAYRY 174
                  T     ++ +++    CG CYGA  ++          + CC TC +V++AY  
Sbjct: 124 -------TDRESLSSLDMDKAKYCGPCYGALDQSGNDNIDVASEKVCCQTCYDVRKAYTD 176

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
             WA  +   I QC+ E   +++ +   EGC+I G   +NR+ G+ H APG ++     H
Sbjct: 177 VGWAFFDGKDIEQCEREGYVDRINDHLHEGCRIVGSALLNRIQGNVHFAPGAAFETAKGH 236

Query: 235 VHDIQPY-TSAAFNTTHHIRHLSFGIKLQD----------DDERRKPLDGTVAKAEEGAS 283
            HD   Y  +   N  H I HLSFG    +             RR+PLDG V   E   +
Sbjct: 237 FHDTSLYDKTEQLNFNHIINHLSFGKTGHELLTPKSSKSFSVSRRQPLDGRVMIPESRNT 296

Query: 284 ---MFNYYIKIIPTIYERLDGS---------------KLGG----------GDGGMPGIF 315
               F+Y+ KI+PT +E L G                  GG          G  G+PG+F
Sbjct: 297 HFFQFSYFAKIVPTRFESLSGKVEEAAQYSVTFHSRPLQGGRDEDHPNTFHGRSGIPGLF 356

Query: 316 FSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
             ++++PL ++ I   S++   L    +  I G      ++D + +   + I
Sbjct: 357 IYFQMAPLKVIDIEAHSQTFSGLLLNCITTIGGVLAVGTMMDKVFYKAQRSI 408


>gi|209876426|ref|XP_002139655.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555261|gb|EEA05306.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 395

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 190/389 (48%), Gaps = 52/389 (13%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + +K +D + K ++D+  K+  G  ++I+ ++ +  L   +   Y    T E + VD + 
Sbjct: 32  KSVKYIDIYGKVHDDYCAKSTSGSIMSILVYILVIILTIGEFLKYIGGETVEHIGVDDNM 91

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KL I LDI  P++ C  +++D VD+ GE  ++   N+ K  +D+ G  +QE       
Sbjct: 92  NQKLDIRLDISFPSLRCSEISVDTVDNVGENQVNAHGNLLKIPIDIHGNEVQE------- 144

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                ++  +   +T+      KC SC+GAE+   KCCNTC  +K A+RYK W+  ++ +
Sbjct: 145 -----EIMAQYNESTSM-----KCLSCFGAESIHYKCCNTCESLKSAFRYKGWSYLDIAS 194

Query: 185 IV-QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-T 242
              QC N           T GC+++G L+VN+VSG+ H+A G +   +  HVH+      
Sbjct: 195 KAPQCIN-----------TVGCRLHGSLQVNKVSGNIHVALGQATVRDGKHVHEFNMNDI 243

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDER-RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
           S  FNT+H I  L FG   +D+ E    PL+ T      G SMF+YY+K++PT + +   
Sbjct: 244 SRGFNTSHTIHELRFG---KDNIEFIGSPLENTKKIVTTGTSMFHYYLKLVPTQFIKSGY 300

Query: 302 SKL------------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           SK+                   G   G+PG+F  Y+  P +++    S    H  T    
Sbjct: 301 SKVLFSNQYTYTERQKDVLVKDGELSGLPGVFIVYDFQPFVIRKIHNSIPTTHFLTSFCA 360

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIG 372
            I G Y    LVD++L   +K+ S +  G
Sbjct: 361 IIGGIYSLMSLVDSILFWFIKRTSAILSG 389


>gi|401626934|gb|EJS44847.1| erv46p [Saccharomyces arboricola H-6]
          Length = 415

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 126/416 (30%), Positives = 184/416 (44%), Gaps = 68/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTKAGGLITLSCILTTLFLLVNEWRQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ C+ + LD +D SGE  L + +      R+D DG P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCELVNLDIMDDSGELQLDILDAGFTMTRVDKDGHPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAE---------TETRKCCNTCNEVKEAYRYKK 176
                    NG   T  +DPN CG CYGA           E + CC  C+ V+ AY  K 
Sbjct: 126 ---------NGEGATPNDDPNYCGQCYGARDQSNNENLAQEDKVCCQNCDSVRSAYLDKG 176

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS-INHVHV 235
           WA  +   I QC+ E    K+ +   EGC+I G  ++NR+ G+ H APG  +      H 
Sbjct: 177 WAFFDGKDIEQCEKEGYVNKINDHLHEGCRIEGSAQINRIQGNIHFAPGKPFQDTRGNHR 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDER-------------RKPLDGTVAKAEEG 281
           HD   Y  +   N  H I  LSFG  +Q   +R               PLDG     +  
Sbjct: 237 HDTSLYDKTPDLNFNHIINRLSFGKPIQSHHKRLGNDKLHGGAVVSTSPLDGRQVFPDRP 296

Query: 282 ASM--FNYYIKIIPTIYERLDGS--------------KLGGG-----------DGGMPGI 314
                F+Y+ KI+PT YE LD +               LGGG            GG+ G+
Sbjct: 297 THFHQFSYFAKIVPTRYEYLDSTVIETAQFSATYHSRPLGGGRDQDHPNTFHARGGISGL 356

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           +  +E+SPL V   E+    G  W+  + N    I G      ++D L +   + I
Sbjct: 357 YVFFEMSPLKVINKEQH---GQTWSGFILNCITSIGGVLAVGTVMDKLFYKAQRSI 409


>gi|345319994|ref|XP_001507420.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Ornithorhynchus anatinus]
          Length = 203

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 122/205 (59%), Gaps = 19/205 (9%)

Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
           CCNTC +V+EAYR + WA    DTI QCK E  ++K++    EGCQ+YG+LEVN+V+G+F
Sbjct: 1   CCNTCEDVREAYRRRGWAFKNPDTIEQCKREGFSQKMQEQKNEGCQVYGFLEVNKVAGNF 60

Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
           H APG S+  +HVH  +         N TH+I HLSFG   +D      PLDGT   A +
Sbjct: 61  HFAPGKSFQQSHVHGKERLRIHPRPINMTHYIEHLSFG---EDYPGIVNPLDGTDVSAPQ 117

Query: 281 GASMFNYYIKIIPTIYERLDG-------------SKLGG---GDGGMPGIFFSYELSPLM 324
            + MF Y++K++PT+Y + DG              K+     GD G+PG+F  YELSP+M
Sbjct: 118 ASMMFQYFVKVVPTVYVKADGEVVRTNQFSVTRHEKVANGLIGDQGLPGVFVLYELSPMM 177

Query: 325 VKITEKSKSLGHLWTKIMCNISGTY 349
           VK+TEK +S  H  T +   I G +
Sbjct: 178 VKLTEKHRSFTHFLTGVCAIIGGVF 202


>gi|300123299|emb|CBK24572.2| unnamed protein product [Blastocystis hominis]
          Length = 376

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 49/368 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F+++L+ LD + K  +D+  KT  GG V++     I  L   ++ +Y +V+ T+ + +D+
Sbjct: 25  FTKKLEKLDIYPKIGDDYVIKTESGGFVSLFSGFIIIILFVSELTNYLKVNRTDVITIDN 84

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           +R  KL I+ +I +  I C   +LD +D SG+Q + V   I +  LD + KP+      V
Sbjct: 85  TRNEKLQINFNISLYGIPCSEASLDIMDISGQQQMGVTSRIVQLDLDENHKPVNMALSSV 144

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +    +K +            DP  CGSC+GA   +  CCNTC++V  AY  + W     
Sbjct: 145 L---YEKNI------------DP-ACGSCFGASL-SNVCCNTCDDVLSAYERRGW----- 182

Query: 183 DTIV------QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           DT        QC+      K     ++GC ++G LEVN+V+G+FHIA G + + +  H+H
Sbjct: 183 DTWFVSKYSPQCRKNNDEVKKPRVNSQGCMMWGVLEVNKVAGNFHIAVGHAANRDSHHIH 242

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
              P   + FN THHI  LSFG  +      + PLDG    AE   S  NYY+K++PT+Y
Sbjct: 243 SFNPLMISKFNVTHHIEKLSFGEHIPG---IQNPLDGHDMVAESLTSQ-NYYLKVMPTVY 298

Query: 297 ERLDGSKLG-----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
                + +                  G    +PGIFF Y+++P M  +TE   +  H   
Sbjct: 299 SNRTSTVVSNELSVNEVSRRVEMTPFGQITSLPGIFFIYDITPFMHVVTESRIAFAHFLV 358

Query: 340 KIMCNISG 347
           ++   I G
Sbjct: 359 RVCAVIGG 366


>gi|354544621|emb|CCE41346.1| hypothetical protein CPAR2_303350 [Candida parapsilosis]
          Length = 412

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 193/403 (47%), Gaps = 45/403 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T++C     +LI  +  DY  V    EL VD    
Sbjct: 7   KLISLDAFAKTVEDARIKTASGGIITLLCIFVALFLIRNEYIDYTTVIARPELVVDRDIN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDG---KPIQEPQKE 121
            +L I+LDI    + CD +++D  D SG+  L  +   + K R+   G   KP++   K+
Sbjct: 67  KQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKQGHSSKPVE--IKD 124

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWAL 179
              A++++    +      E +   +CGSCYGA  + +K  CCNTC  V+ AY    W  
Sbjct: 125 EQPALQREVPLEQIAPGLPEGQTEGECGSCYGAVPQDKKQYCCNTCAAVRRAYAEANWQF 184

Query: 180 PELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
            + + I QC+ E   ++LK      EGC++ G  ++NR+SG+   APG S + +  HVHD
Sbjct: 185 FDGENIAQCEQEGYVQRLKQRIGENEGCRVKGTAKINRISGTMDFAPGASMTKDGRHVHD 244

Query: 238 IQPYT--SAAFNTTHHIRHLSFG-----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIK 290
           +  Y      FN  H I HLSFG      KL D      PLDG      +     NY++K
Sbjct: 245 LSLYQKYKDKFNFDHVINHLSFGNNPPASKLVDTGS-ITPLDGHKFLQHKKYHSINYFLK 303

Query: 291 IIPTIYERLDGS---------------KLGGGD-----------GGMPGIFFSYELSPL- 323
           I+ T +E LDG                 L GG            GG+PG+ F++++SPL 
Sbjct: 304 IVATRFESLDGKHKFDTNQFSVITHDRPLAGGKDEDHQHTLHARGGVPGVAFNFDISPLK 363

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           ++   E +K+       ++ +I+G  +   L+D  + +  + I
Sbjct: 364 IINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAI 406


>gi|366987855|ref|XP_003673694.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
 gi|342299557|emb|CCC67313.1| hypothetical protein NCAS_0A07550 [Naumovozyma castellii CBS 4309]
          Length = 425

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 124/396 (31%), Positives = 182/396 (45%), Gaps = 65/396 (16%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S +L   DAF K  E+   +T  GG +T+ C +   YL+  +   +  V T+ +L VD  
Sbjct: 8   SAKLLSFDAFAKTEEEVRVRTNTGGIITLSCIIVTLYLLLNEWSQFNSVITSPQLVVDRD 67

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEV 122
           R  KL ++ D+  P+ISCD + LD +D SGE  L +  + + K R+D DG  +     EV
Sbjct: 68  RNLKLELNFDVTFPSISCDLINLDIMDDSGELQLDLLDSAFTKIRVDADGNELGSSTLEV 127

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYR 173
                  +V   N        DP+ CGSCYG++          E+R CC TCN+V+EAY 
Sbjct: 128 GTDDLASEVQQRNN-------DPDYCGSCYGSKVQDENDKLPRESRVCCQTCNDVREAYL 180

Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY----- 228
              W   +   I QC+ E    K+     EGC++ G   ++R+ G+ H APG SY     
Sbjct: 181 NIGWGFFDGKGIEQCEKEGYVAKINEHLKEGCRVKGQTLLSRIQGNIHFAPGKSYTSYKR 240

Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRK---------PLDG---TV 275
           S +  H HD   Y  ++  N  H I HLSFG  +   DE+ +         PLDG     
Sbjct: 241 STSASHYHDTSLYDKTSNLNFNHKINHLSFGKPIDKLDEKVQDHSTEFSISPLDGREVIP 300

Query: 276 AKAEEGASMFNYYIKIIPTIYERLDGSK-----------------LGGGD---------- 308
              +    +++YY KI+PT YE L+  +                  GG D          
Sbjct: 301 TDIDTHYHVYSYYAKIVPTRYEFLNKKEKSIETAQFSTTFHSRPLRGGRDADHPTTMHSQ 360

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           GG+PG+F  +E+S   VK+  K       W+  + N
Sbjct: 361 GGIPGLFIYFEMSA--VKVINKEHHF-RSWSSFLLN 393


>gi|255732259|ref|XP_002551053.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
 gi|240131339|gb|EER30899.1| hypothetical protein CTRG_05351 [Candida tropicalis MYA-3404]
          Length = 414

 Score =  177 bits (448), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 126/414 (30%), Positives = 201/414 (48%), Gaps = 54/414 (13%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L   DAF K  ED   KT  GG +T++C L    LI  +  DY  + T  EL V
Sbjct: 1   MSSRPKLLSFDAFAKTVEDARIKTASGGIITLICVLITLILIRNEYIDYTTIITRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
           D     +L I+LDI    + CD +++D +D +G+Q L  ++  + K RL      ++  Q
Sbjct: 61  DRDINKQLDINLDISFINLPCDLISVDLLDVTGDQQLDIIDSGLKKVRL------LKNKQ 114

Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
            +V +N ++  K    +  +  EL        D N  CG CYGA  + +K  CCN CN V
Sbjct: 115 GDVIINEIEDDKPALNSDVSLKELAKGLPEGSDQNAYCGPCYGALPQDKKQFCCNDCNTV 174

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
           + AY  K+W   + + I QC+ E   ++L+      EGC+I G  ++NRVSG+   APG 
Sbjct: 175 RRAYAEKQWQFFDGENIEQCEKEGYVKRLRERINNNEGCRIKGSTKINRVSGTMDFAPGS 234

Query: 227 SYSINHVHVHDIQPYT--SAAFNTTHHIRHLSFG-IKLQDDDERR----KPLDGTVAKAE 279
           S++ +  H HD+  Y   +  FN  H I HLSFG +   +  E       PLD       
Sbjct: 235 SFNHDGRHFHDLSLYKKYNDKFNFDHVINHLSFGEVPTNNGAEEMFDSIHPLDDYQFMLH 294

Query: 280 EGASMFNYYIKIIPTIYERLDGSK----------------LGGGD----------GGMPG 313
           +   + +Y++K++ T YE LD SK                +GG D          GG+PG
Sbjct: 295 KKDHVVSYFLKVVATRYESLDYSKRVDTNQFSVITHDRPLIGGKDEDHQHTLHARGGIPG 354

Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + F++++SPL ++   + +K+       ++ +I+G  +   L+D  + +  + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFILGVVSSIAGVLMVGTLLDRSVFAAQQAI 408


>gi|407044387|gb|EKE42566.1| hypothetical protein ENU1_017250 [Entamoeba nuttalli P19]
          Length = 354

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/371 (31%), Positives = 175/371 (47%), Gaps = 41/371 (11%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + +K  DA+ K   +   K   GG ++IVC + + ++   ++ DYF +     L VD S+
Sbjct: 2   QNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDESK 61

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KLPI+ DI  P  +C + ++D +D++GE  + +  NI K RL+L    + E +     
Sbjct: 62  NKKLPINFDITFPHSACSFTSVDVLDTTGEVIIDISKNIKKERLNL----VNEDE----- 112

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             KKK   T  GT         +C  C   E +  KCC TC E+ E+Y+     +P+   
Sbjct: 113 ISKKKFAKTVYGT---------ECPPC-NNEIDKDKCCFTCEELTESYQKLNKEVPKGSP 162

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
             + KN +      N   EGC+I G + VNR SG+FHIAPG S  +   H+H +  + S 
Sbjct: 163 QCEIKNIHKMTTFYN--GEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSVD-WISG 219

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
             N TH    LSFG           PLDG V       SM+ Y+++++P  Y  LD    
Sbjct: 220 GINLTHTWNFLSFGDSFPG---MINPLDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVI 276

Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L   + G+PG+F  Y++S + V   E+  S GHL T I   I G 
Sbjct: 277 NTNGYSVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGV 336

Query: 349 YITFMLVDALL 359
           +  F L+D  +
Sbjct: 337 FALFSLLDYFI 347


>gi|449549110|gb|EMD40076.1| hypothetical protein CERSUDRAFT_132878 [Ceriporiopsis subvermispora
           B]
          Length = 1001

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 112/398 (28%), Positives = 175/398 (43%), Gaps = 58/398 (14%)

Query: 18  EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
           ED   KT  G  +TI+    I     ++  DY +V+    + VD SRG KL + +++  P
Sbjct: 598 EDVKVKTRTGALLTILSAAIILAFTTIEFFDYRRVNVDTSIQVDKSRGEKLTVKMNVTFP 657

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVNAVKKKKVTTENG 136
            + C  L+LD +D SGE    + HNI K RL   G P+      E+ N + K     + G
Sbjct: 658 RVPCYLLSLDVMDISGETQTDISHNIIKTRLTEKGLPVPNAASSELRNDIDKLNEQRQGG 717

Query: 137 TTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEK 196
              +        G C          CN+C +V++AY  + W+    + I QC +E  +EK
Sbjct: 718 YCGSCYGGVEPAGGC----------CNSCEDVRQAYVNRGWSFNRPEGIEQCVDEGWSEK 767

Query: 197 LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
           LK+   EGC I G + VN+V G+ H++PG S+     +++D+ PY     N  H   H  
Sbjct: 768 LKDQANEGCNIAGRVRVNKVVGNIHLSPGRSFRSGSQNLYDLVPYLKDDGN-RHDFSHTI 826

Query: 257 FGIKLQDDDE------------RRK------PLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                + DDE            RR+      PLDG + +  +   MF Y++K++ T +  
Sbjct: 827 HEFAFEGDDEYDILKAKSGKEMRRRMGIEGNPLDGAIGRTSKQQYMFQYFLKVVSTQFRT 886

Query: 299 LDG--------------SKLGGGDG--------------GMPGIFFSYELSPLMVKITEK 330
           LDG                L  G                G+PG FF+YE+SP+++   E 
Sbjct: 887 LDGMSVNTNQYSATHFERDLTAGQQEKDQAGLHVAHTSVGIPGAFFNYEISPILISHAES 946

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            +S  H  T     + G      L+D++L    + + K
Sbjct: 947 RQSFAHFLTSTCAIVGGVLTVASLIDSVLFVAGRTLKK 984


>gi|363752862|ref|XP_003646647.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356890283|gb|AET39830.1| hypothetical protein Ecym_5030 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 399

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 123/364 (33%), Positives = 173/364 (47%), Gaps = 53/364 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K  ED   +T  GG +++ C +    L+  +   +  V    +L +D  R  
Sbjct: 6   LLSFDAFAKTEEDVRVRTKAGGIISLGCIVVTLLLLFNEWSQFNTVIQRPQLVLDRDRRL 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEVVNA 125
           K+ ++LD     + C  L LD +D+SGE  L ++   + K RLD  G PI+  + EV + 
Sbjct: 66  KMDLNLDFEFSNMPCAMLNLDVMDTSGEVQLDLQDAGFTKTRLDHSGTPIRTEKLEVGS- 124

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRYKK 176
              K V           +DPN CGSCYG+++         E + CC TC EV+EAY  K 
Sbjct: 125 --NKAVHLP--------DDPNYCGSCYGSKSQDNNDALPKEQKVCCQTCEEVREAYSEKG 174

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHVHV 235
           WA  +   I QC  E   EK+ +   EGC++ G  ++NR+ G+ H APG  + S    H 
Sbjct: 175 WAFFDGQKIEQCIREGYVEKINSQLHEGCRVKGSAKLNRIQGNIHFAPGRTTNSGKRTHT 234

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMFNYYIKI 291
           HD+  Y T +  N  H I  LSFG     D     PLDG    +   +   S F+Y+ KI
Sbjct: 235 HDVSLYDTHSHLNFNHIIHKLSFGSDA--DGALSNPLDGHKNIIQGDDAHFSTFSYFTKI 292

Query: 292 IPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVK 326
           +PT YE LDG KL               GG D          GG+ G+   +E+SPL V 
Sbjct: 293 VPTRYEYLDGRKLETTQFSVTTHSRPLKGGKDDDHPNTIHHRGGIAGVTIFFEMSPLKVI 352

Query: 327 ITEK 330
            +EK
Sbjct: 353 NSEK 356


>gi|323449476|gb|EGB05364.1| hypothetical protein AURANDRAFT_30967 [Aureococcus anophagefferens]
          Length = 368

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 173/350 (49%), Gaps = 36/350 (10%)

Query: 29  AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDA 88
           ++T+  W+     +C ++  + +V   + + VD S G +L I L+I  P ++C  + LDA
Sbjct: 22  SLTVGHWVMALLFLC-ELLVFLRVEERDHVVVDRSMGQRLKIGLNITFPALTCAEVHLDA 80

Query: 89  VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
           +D +G+ H ++E ++ K+RLD  G PI  P +    A+ ++    E+G   T       C
Sbjct: 81  MDVAGDYHPYMEQHMTKQRLDGRGSPI--PHR----AIPERANEYEHGPEDTG----AGC 130

Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT-IVQCKNEYSTEKLKNTFT-EGCQ 206
            SC+GAET  + CCNTC+E+  AY  K W+  E+     QC ++   + ++     EGC 
Sbjct: 131 QSCFGAETAEQPCCNTCDELLRAYGNKGWSAQEIKKEAPQCVDDTRDDSIRAIKKGEGCN 190

Query: 207 IYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDE 266
           + G+LEVN+V+G+ H+A G S   N   VH   P  +  FN +H I  L+FG   +  D 
Sbjct: 191 LAGWLEVNKVAGNVHVAMGESAIQNGRFVHQFDPTRAPEFNVSHVIHDLAFG---ETYDG 247

Query: 267 RRKPLDGT--VAKAEEGASMFNYYIKIIPTIYERL-DGSKL-----------------GG 306
              PL GT  +  A  G  +F Y+IK++PTIY    D + +                   
Sbjct: 248 MALPLSGTSRIVDAATGTGLFQYFIKLVPTIYRAAPDAAPVRTVRYSYTQRFRPLHNQPP 307

Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
               +PGIF  Y+ S  MV++T    SL H   ++   + G       VD
Sbjct: 308 PTAMLPGIFLVYDFSAFMVEVTRHRSSLAHFLVRVCAIVGGVSTVVAFVD 357


>gi|224000966|ref|XP_002290155.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973577|gb|EED91907.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 396

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 189/394 (47%), Gaps = 52/394 (13%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           +++  + +D  +    +F  +T+ G A+++   LF  YLI  +    F  +  + + V  
Sbjct: 5   YTDYFRSIDTHSPISSEFRIRTLSGAAISLFTLLFTLYLISSEYSYNFSTTFLDHVHVMP 64

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-QHLHVE--HNIYKRRLDLDGKPIQEPQ 119
                L +  DI  P I C  LA DA D +G+ Q  H++  H I+K RL+ DGKPI    
Sbjct: 65  QSPDGLEVEFDITFPHIPCALLASDANDPTGQSQSFHIDKKHRIWKHRLNKDGKPI---- 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
                   +K      GT T+   D  +CGSCYGA  E  +CCNTC++VK AYR K+W +
Sbjct: 121 -------GRKSRFELGGTLTSSDHDEEECGSCYGAGGEG-ECCNTCDDVKRAYRTKQWHI 172

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP------------GLS 227
            ++  I QC +     ++K+   EGC I+GY+ ++   G+ H AP            GL 
Sbjct: 173 TDMTKITQCAH---LVRVKDEDGEGCNIHGYVALSTGGGNLHFAPDRQWEKEGDKQNGLM 229

Query: 228 YSINHVHVHDIQPYTSAA---FNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEE 280
                +++  I    + A   FN TH +  LSFG  +    ++       LDG      +
Sbjct: 230 IMGGFINLDSIVEMFNDAYEQFNVTHTVNKLSFGPYMPKHVKNSLNLTSQLDGATRTVTD 289

Query: 281 GASMFNYYIKIIPTIYERLDGSKL---------------GGGDGGMPGIFFSYELSPLMV 325
           G  MF +Y++I+PT+Y  L+G+ +                G + GMPG+FF YE+S L V
Sbjct: 290 GYGMFQFYLQIVPTVYRFLNGTTIETFQYSVTEHVRHVDPGSNRGMPGVFFFYEVSALHV 349

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
           +  E  +   H +T +   + G +    ++D L+
Sbjct: 350 EFEEYRRGWTHFFTGVCAAVGGAFTVMGMLDRLV 383


>gi|67479077|ref|XP_654920.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56472012|gb|EAL49533.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449701866|gb|EMD42605.1| endoplasmic reticulumgolgi intermediate compartment protein,
           putative [Entamoeba histolytica KU27]
          Length = 354

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 114/371 (30%), Positives = 176/371 (47%), Gaps = 41/371 (11%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + +K  DA+ K   +   K   GG ++IVC + + ++   ++ DYF +     L VD S+
Sbjct: 2   QNIKRFDAYPKINSNNRVKHWIGGLLSIVCIITMIWMFSSELNDYFTIRKKPVLRVDESK 61

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KLPI+ DI  P  +C + ++D +D++GE  + +  NI K RL+L    + E +     
Sbjct: 62  NKKLPINFDITFPHSACSFSSVDVLDTTGEVIIDISKNIKKERLNL----VNEDE----- 112

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             KKK   T  GT         +C  C   E++  KCC TC E+ E+Y+     +P+   
Sbjct: 113 ISKKKFAKTVYGT---------ECPPC-NNESDKDKCCFTCEELTESYQKLNKEVPKGSP 162

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
             + +N +      N   EGC+I G + VNR SG+FHIAPG S  +   H+H +  + S 
Sbjct: 163 QCEIRNIHKMTTFYN--GEGCRISGTVFVNRASGNFHIAPGSSQQLTQEHIHSVD-WISG 219

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--- 301
             N TH    LSFG           P+DG V       SM+ Y+++++P  Y  LD    
Sbjct: 220 GINLTHTWNFLSFGDSFPG---MINPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNKVI 276

Query: 302 -------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L   + G+PG+F  Y++S + V   E+  S GHL T I   I G 
Sbjct: 277 HTNGYSVTEHYRPGSLKSPEQGIPGVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGV 336

Query: 349 YITFMLVDALL 359
           +  F L+D  +
Sbjct: 337 FALFSLLDYFI 347


>gi|367007030|ref|XP_003688245.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
 gi|357526553|emb|CCE65811.1| hypothetical protein TPHA_0N00300 [Tetrapisispora phaffii CBS 4417]
          Length = 407

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 124/406 (30%), Positives = 188/406 (46%), Gaps = 60/406 (14%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF+K  ED   +T  GG +T+ C L   YL+  +   + +V++   L VD  R 
Sbjct: 7   KLAKFDAFSKTDEDVRIRTRLGGIITLGCILTAIYLLGGEWAAFNEVTSVPRLVVDKDRS 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
             L ++LDI  P I CD + LD +D +G   L +  + +K+ RLD +GK ++  + ++ +
Sbjct: 67  IDLNMNLDISFPFIPCDIINLDIMDDAGGLQLDILDSGFKKTRLDPNGKQLEFREFDLKD 126

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA--------ETETRKCCNTCNEVKEAYRYKK 176
               K++ +E G        PN CGSCYGA        E   + CCNTC +V+ AY    
Sbjct: 127 --NSKRIVSEKG--------PNYCGSCYGAIDQSHNDEEGAKKVCCNTCEDVRLAYVTAN 176

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   I QC++E   +++     EGC++ G  ++NRV G+ H APG     +  H+H
Sbjct: 177 WAFFDGKNIEQCEDEGYVKRINEHLNEGCRVTGKAKINRVKGNIHFAPGKPMQNSKGHLH 236

Query: 237 DIQPY-TSAAFNTTHHIRHLSFG------IKLQDDDERRKPLD--GTVAKAEEGASMFNY 287
           D   Y  S   N  H I H SFG       K +  D    PLD        +     F+Y
Sbjct: 237 DTSLYEKSPNMNFKHIIHHFSFGEPIDRKAKSKGADVLTNPLDDYDVQPNIDTHYHQFSY 296

Query: 288 YIKIIPTIYERL---------------DGSKLGGGD----------GGMPGIFFSYELSP 322
           Y+K++PT YE L               D    GG D           G+PG+FF +++S 
Sbjct: 297 YMKVVPTRYEYLNRMVVETAQFSVTFHDRPLRGGKDEDHPNTIHARNGIPGVFFFFDISS 356

Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVK 364
           + V   E+   +   W+  + N    I G      +VD L +   K
Sbjct: 357 IKVINNEQ---ITQTWSGFILNCIITIGGVLAVGSMVDRLSYKAQK 399


>gi|3860008|gb|AAC72954.1| unknown [Homo sapiens]
          Length = 198

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 84/187 (44%), Positives = 115/187 (61%), Gaps = 19/187 (10%)

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
           CYGAE E  KCCNTC +V+EAYR + WA    DTI QC+ E  ++K++    EGCQ+YG+
Sbjct: 8   CYGAEAEDIKCCNTCEDVREAYRRRGWAFKNPDTIEQCRREGFSQKMQEQKNEGCQVYGF 67

Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKP 270
           LEVN+V+G+FH APG S+  +HVHVHD+Q +     N TH+I+HLSFG   +D      P
Sbjct: 68  LEVNKVAGNFHFAPGKSFQQSHVHVHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNP 124

Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGI 314
           LD T   A + + MF Y++K++PT+Y ++DG  L                  GD G+PG+
Sbjct: 125 LDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGV 184

Query: 315 FFSYELS 321
           F    LS
Sbjct: 185 FAHLPLS 191


>gi|190347075|gb|EDK39286.2| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 404

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 190/404 (47%), Gaps = 51/404 (12%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  ED   +T  GG +T++C + + YLI  +  +Y  +    EL VD    
Sbjct: 5   KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYSEYTSIINRPELVVDRDIN 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
            KL I+LDI  P I CD L +D +D SG+  + +  + +++ RL  DG  I++    + +
Sbjct: 65  KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLSSGFEKFRLLKDGSEIRDESPVMSS 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA---ETETRKCCNTCNEVKEAYRYKKWALPE 181
           A + ++         +       CGSCYGA   +  +  CCN C  V+ AY  K W   +
Sbjct: 125 AGELEERARGRAPDGS-------CGSCYGALPQDENSDYCCNDCETVRLAYAQKAWGFFD 177

Query: 182 LDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            + I QC+ E    +L    N F EGC+I G  ++NR+SG+ H APG S++    H HD+
Sbjct: 178 GENIEQCEREGYVARLNEKINNF-EGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDL 236

Query: 239 QPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYIKII 292
             +      F   H I HLSFG    +    + +   PLD +    +    +++YY+K++
Sbjct: 237 SLFNKYDDKFTFDHVINHLSFGSDPHNIQFFEKQSTHPLDKSSMILKSKDRLYSYYLKVV 296

Query: 293 PTIYERLDGS----------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
            T +E L  +                 L GG            GG+PG+FF +E+SP+  
Sbjct: 297 ATRFEFLTPNTPALETNQFSVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEISPM-- 354

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           KI  K +     W+  +  +  +    ++V ALL   V    +V
Sbjct: 355 KIINKEQ-YAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERV 397


>gi|440299607|gb|ELP92159.1| endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Entamoeba invadens IP1]
          Length = 361

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 173/375 (46%), Gaps = 42/375 (11%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + +K  DA+ K   D   +   GG ++I+C L + ++   +V DY+ V     L VD S+
Sbjct: 2   DTIKRFDAYPKLNYDVRVRYWLGGLLSILCLLTMGWMFYSEVQDYYTVQMRPTLRVDESK 61

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KLPI+ DI  P ISC  + +D +D++GE  + +E N+ K+RL+           E  N
Sbjct: 62  SEKLPINFDITFPRISCSLMTIDVLDTTGEVSIDIESNVNKKRLN------PHSMTESSN 115

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                KV         E  D N             KCC TC+E+KE+Y+     +P    
Sbjct: 116 KATAHKVYGIECPACEESVDKN-------------KCCFTCDELKESYKKAGKEVPP--N 160

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
            VQC+ +   +       EGC +YG + VNRVSG+FHIAPG+S      H H  +   S 
Sbjct: 161 AVQCQLKNIQKMALALDGEGCHMYGSVFVNRVSGNFHIAPGMSEQQGEGHRHSAEWIGS- 219

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD---- 300
             N TH    LSFG          KP+D          SM+ Y+++++P  Y  LD    
Sbjct: 220 -LNLTHTWNSLSFGDNFPG---MIKPMDSIQKVDVTNNSMYQYFVQVVPMTYFGLDKKVV 275

Query: 301 ------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                          L   + G+PG+F  YE+S + V  TE++ S GHL T I   + G 
Sbjct: 276 KTNGYSVTEHYRSGNLKTMEQGVPGVFVLYEISSMEVLYTEETGSFGHLLTGICGIVGGI 335

Query: 349 YITFMLVDALLHSCV 363
           +  F L+DA +   V
Sbjct: 336 FTIFSLLDAFIFHTV 350


>gi|365982867|ref|XP_003668267.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
 gi|343767033|emb|CCD23024.1| hypothetical protein NDAI_0A08710 [Naumovozyma dairenensis CBS 421]
          Length = 410

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/413 (28%), Positives = 182/413 (44%), Gaps = 58/413 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           +V    L  LDAF+K  ED   +T  G  ++I C L    L+  +   Y Q+ T   L V
Sbjct: 2   LVNKSTLLSLDAFSKTQEDVRIRTKTGAIISISCILVTVLLLLNEWIQYSQIVTRPTLVV 61

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV--EHNIYKRRLDLDGKPIQEP 118
           D  R  KL ++LDI  P++ CD L LD +D +G+  L +  +    K RLD  G      
Sbjct: 62  DRERNLKLDLNLDISFPSMPCDILNLDILDDAGDLQLDILNQGQFTKTRLDRMG------ 115

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----------ETETRKCCNTCNEV 168
                N ++  K   ++        D N CG CYG+            + + CC TC +V
Sbjct: 116 -----NVIEVSKFKIDDDVAEFPPNDENYCGPCYGSIDQSGNDKIESVKDKICCQTCEQV 170

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           +EAY    WA  +   I QC+ E    K+     EGC++ G + +NR+ G+ H APG ++
Sbjct: 171 REAYLKAGWAFFDGKNIEQCEREGYVTKINKHLNEGCRVKGNVLLNRIQGNIHFAPGKAF 230

Query: 229 SINHVHVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEG 281
                H HD   Y TS   N  H I HLSFG  ++   + R       PLDG        
Sbjct: 231 QNVKGHFHDSSLYETSPDLNFNHIIHHLSFGKTIEQLAQLRGATVATSPLDGQQISPSFD 290

Query: 282 ASM--FNYYIKIIPTIYERLD-------------------------GSKLGGGDGGMPGI 314
           + +  ++Y++KI+PT YE LD                            +     G+PG+
Sbjct: 291 SHLYRYSYFVKIVPTRYEYLDKMISETAQFSATFHQSLVTGERDPENPNIKYSRTGLPGL 350

Query: 315 FFSYELSPLMVKITEKS-KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           F  +E+SPL +  TE+  KS   ++   + +I G      ++D   +   + +
Sbjct: 351 FIYFEMSPLKIINTEQHFKSWSGVFLHCITSIGGILAVGTILDKFFYKAQRTV 403


>gi|50305633|ref|XP_452777.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49641910|emb|CAH01628.1| KLLA0C12947p [Kluyveromyces lactis]
          Length = 405

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/408 (29%), Positives = 185/408 (45%), Gaps = 62/408 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  +DAF K  ED   +T  GG +T+ C +    L+  +   +  + T  +L VD  R  
Sbjct: 6   LLSIDAFGKTEEDVRVRTRTGGLITVSCIIITMLLLVSEWKQFSTIVTRPDLVVDRDRHL 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL ++LD+  P++ C+ L LD +D SGE  +++ +    K R+  +GK + + + +V + 
Sbjct: 66  KLDLNLDVTFPSMPCNVLNLDILDDSGEFQINLLDSGFTKIRISPEGKELSKEKFQVGDK 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYKK 176
             K+    E             CG CYGA  +++          CC TC++V+ AY  K 
Sbjct: 126 SSKQSFNEEG-----------YCGPCYGALDQSKNDELPQDQKVCCQTCDDVRAAYGQKG 174

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA  +   + QC+ E   E +     EGC++ G  ++NR+ G+ H  PG S      H H
Sbjct: 175 WAFKDGKGVEQCEREGYVESINARIHEGCRVQGRAQLNRIQGTIHFGPGSSMRNIRGHFH 234

Query: 237 DIQPYTS-AAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FNY 287
           D   Y +    N  H I  L+FG K +D D          PLD      +       F+Y
Sbjct: 235 DTSLYDAYPHLNFNHIINTLTFGEKPKDGDSELIGSASISPLDSRQVFPDRDTHFHEFSY 294

Query: 288 YIKIIPTIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSP 322
           + KIIPT +E LDG K+               GG D          GG+PG+FF++E+SP
Sbjct: 295 FCKIIPTRFEFLDGKKVETTQFSATYHDRPLRGGRDEDHPNTVHSKGGVPGVFFNFEMSP 354

Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           L V   E+  +    W+  + N    I G      ++D + +   K I
Sbjct: 355 LKVINKEQHAT---SWSGFLLNCITSIGGVLAVGTVIDKITYRAQKSI 399


>gi|323306137|gb|EGA59869.1| Erv46p [Saccharomyces cerevisiae FostersB]
          Length = 349

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 106/321 (33%), Positives = 156/321 (48%), Gaps = 36/321 (11%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    K+     EGC+I G  ++NR+ G+ H APG  Y   + H 
Sbjct: 177 GWAFFDGKNIEQCEREGYVSKINEHLNEGCRIKGSAQINRIQGNLHFAPGKPYQNAYGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQD------DDERR-------KPLDGTVAKAEEG 281
           HD   Y  ++  N  H I HLSFG  +Q       +D+R         PLDG     +  
Sbjct: 237 HDTSLYDKTSNLNFNHIINHLSFGKPIQSHSKLLGNDKRHGGAVVATSPLDGRQVFPDRN 296

Query: 282 ASM--FNYYIKIIPTIYERLD 300
                F+Y+ KI+PT YE LD
Sbjct: 297 THFHQFSYFAKIVPTRYEYLD 317


>gi|225680824|gb|EEH19108.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb03]
          Length = 413

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 128/440 (29%), Positives = 180/440 (40%), Gaps = 101/440 (22%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ED   +T  GG VTIV    IS+LI  +  +Y ++    EL V
Sbjct: 1   MAPKSRFARLDAFTKTVEDARIRTRSGGLVTIVALFVISFLIWGEWYEYRRIVVLPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  R                      D +D SGE    V H I K RL         P+ 
Sbjct: 61  DKGR----------------------DVMDVSGEMQSGVIHGISKVRL--------APES 90

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----CCNTCNEVKEAYRYKK 176
           E  + +    +     T   +  DP+ CG CYGA   +      CC+TC EV+EAY  + 
Sbjct: 91  EGGHVIDTTALVLHTQTDAAKHLDPDYCGPCYGAPPPSHATKPGCCSTCEEVREAYASQS 150

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E  ++ L     EGC+I G L VN+V G+FHIAPG S+S  ++H H
Sbjct: 151 WAFGRGENVEQCEREGYSKNLDAQRNEGCRIEGVLRVNKVVGNFHIAPGRSFSSGNIHAH 210

Query: 237 DIQPY--TSAAFNTTHHIRHLSFGIKLQD---------DDERRKPLDGTVAKAEEGASMF 285
           D+  Y  T    + +H I  L FG +L D         D     PLD T     +    F
Sbjct: 211 DLDTYYHTPVPHHMSHKIHQLRFGPQLSDEISSRWKWTDHHHTNPLDNTSQHTTDPRYNF 270

Query: 286 NYYIKIIPTIYERLDGS------------------------------------------K 303
            Y++K++ T Y  L  S                                           
Sbjct: 271 MYFVKVVSTSYLPLGWSPEFSSSVHETTLGNTPLGKQGVHFGSSGSIETHQYSVTSHKRS 330

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITE-KSKSLGHLWTKIMCNISGTY 349
           + GGD             GG+PG+F +Y++SP+ V   E ++K+     T +   I GT 
Sbjct: 331 IDGGDDAAEGHKERLHSHGGIPGVFVNYDISPMKVINREARTKTFSGFLTGVCAVIGGTL 390

Query: 350 ITFMLVDALLHSCVKKISKV 369
                VD  L+    ++ K+
Sbjct: 391 TVAAAVDRALYEGAARVKKL 410


>gi|150866674|ref|XP_001386342.2| hypothetical protein PICST_85013 [Scheffersomyces stipitis CBS
           6054]
 gi|149387930|gb|ABN68313.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 407

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/402 (29%), Positives = 194/402 (48%), Gaps = 48/402 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  ED   +T  GG +T+ C   + +LI  +  DY  V T  EL VD    
Sbjct: 7   KLLTFDAFAKTVEDARIRTTSGGIITLFCIFVVMFLIRNEYSDYTSVITRPELVVDRDIN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN- 124
             L I+LD+    + CD L+LD +D +G+  L +  + +++      + +++ ++E+++ 
Sbjct: 67  KPLDIYLDVSFHNLPCDLLSLDIMDEAGDLQLDILKSGFEKF-----RIVKDSEEEIIDR 121

Query: 125 ---AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKWAL 179
               +       E      E ED  +CGSCYGA  + +K  CCN C  VK AY  K W  
Sbjct: 122 ESTPINADLSIEEMAKGLKEGED-GECGSCYGALPQDKKQYCCNDCETVKLAYAEKLWGF 180

Query: 180 PELDTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
            + + I QC+NE   +++++     EGC+I G   +NR+SG+   APG S++ +  HVHD
Sbjct: 181 YDGENIEQCENEGYVQRVQSRINGKEGCRIKGNARINRISGTMDFAPGASFTSSGHHVHD 240

Query: 238 IQPYTS-AAFNTTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFNYYIKII 292
           +  Y      N  H +  L+FG  + D+     E   PLD       +   +F YY+K++
Sbjct: 241 LSLYDKHPHLNFDHIVNKLTFG-PIPDESVPTAESTHPLDNYGVALNDKNHVFTYYLKVV 299

Query: 293 PTIYERLDGSK-----------------LGGGD----------GGMPGIFFSYELSPLMV 325
            T +E L+G+                   GG D          GG+PG+ F +++SPL +
Sbjct: 300 ATRFEFLNGASKALDANQFSVITHDRPISGGKDNDHQHTLHAKGGIPGVVFHFDISPLKI 359

Query: 326 KITEK-SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
              E+ +KS       ++ +++G  I   L+D  +++    I
Sbjct: 360 INREQYAKSWSGFVLGVVSSVAGVLIVGSLLDRSVYAAESAI 401


>gi|448081831|ref|XP_004194985.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
 gi|359376407|emb|CCE86989.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
          Length = 405

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 127/395 (32%), Positives = 187/395 (47%), Gaps = 46/395 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T+VC L +  LI  +  +Y  V    EL VD    
Sbjct: 7   KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            KL I++DI  P + CD + LD +D SG+    V +    K RL      I    +EV++
Sbjct: 67  RKLDINIDITFPNLPCDLVTLDILDVSGDTQADVLKSGFEKYRL------IPSSNEEVLD 120

Query: 125 --AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALP 180
              V +  ++ E+       E    CGSCYGA  + +   CCN C  V+ AY  + WA  
Sbjct: 121 NAPVLRNDLSLEDIARNPNKEGGGFCGSCYGALPQGDNEYCCNDCETVRLAYAERMWAFY 180

Query: 181 ELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           +   I QC+NE    +L       EGC+I G  ++NRVSG+ H APG + +    H+HD+
Sbjct: 181 DGANIEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDL 240

Query: 239 QPYTS--AAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIP 293
             Y      FN  H I HLSFG+    +D   +   PLDG      + + + +YY+K++ 
Sbjct: 241 SLYEKHFDKFNFDHVINHLSFGLDPVKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVA 300

Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
           T +E L G  +               GG D          GG+PG+FF +++SP+  KI 
Sbjct: 301 TRFEFLSGLAMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPM--KII 358

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
            K +     W+  +  +  +    + V A+L   V
Sbjct: 359 NKEQ-YAKTWSGFVLGVVSSIAGVLTVGAVLDRSV 392


>gi|353242343|emb|CCA73995.1| related to ERV46-component of copii vesicles [Piriformospora indica
           DSM 11827]
          Length = 420

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/416 (29%), Positives = 180/416 (43%), Gaps = 69/416 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            K +DAF K  ED   +T  G  +T +    I  L  ++  DY  V     + +  +R  
Sbjct: 11  FKAIDAFGKTLEDVKIRTRTGAFLTFLSIGIICLLTLIEFIDYRTVYLDTNIEIMKARDE 70

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L ++++I  P + C  L+LDA D SGE    V HNI K RLD +GKP   P ++ ++ +
Sbjct: 71  RLTVNMNITFPRVPCFLLSLDATDVSGEHMREVSHNIVKVRLDSEGKPY--PNQDHISDL 128

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +       +   ++  P  CGSCYG       CCNTC +V+++Y  + WA    + I 
Sbjct: 129 RNEI------SRVKDIGKPGYCGSCYGGLEPEGGCCNTCEDVRKSYLDRGWAFSAPEHIE 182

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           QC  E  TEK+K    +GCQI G + + +V+ S   + G S+  N  H  ++ PY     
Sbjct: 183 QCVREGWTEKIKVQANDGCQISGRVRIKKVASSLIFSFGRSFQANSFHAQELVPYLKDGL 242

Query: 247 --NTTHHIRHLSFGIKLQDDDE---RRK---------------PLDGTVA-----KAEEG 281
             +  HHI  L F    Q DDE   RR                PL+G  +         G
Sbjct: 243 IHDFGHHIETLQF----QSDDEYDPRRANEAARLKKHLGVPKDPLNGFNSHYAKYSGRRG 298

Query: 282 AS----MFNYYIKIIPTIYERLD----------------------------GSKLGGGDG 309
                 MF Y+IK++   +E LD                            G +   G  
Sbjct: 299 PDITTYMFQYFIKVVSADFETLDHEHVSSHLYSYSSHTRNVGEAYHLKNTEGIETTHGYD 358

Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             PG+F + ++SP+ V  TEK K   H  T     I G      LVD+ L + + K
Sbjct: 359 AAPGLFINIDVSPMQVIHTEKRKPFAHFLTTFCAIIGGVLTVASLVDSALFNTINK 414


>gi|50294900|ref|XP_449861.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49529175|emb|CAG62841.1| unnamed protein product [Candida glabrata]
          Length = 415

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 121/413 (29%), Positives = 183/413 (44%), Gaps = 62/413 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K  ED   +T  GG +T+ C +    L+  +  D+  V T  EL +D  R  
Sbjct: 6   LLSFDAFAKTEEDVRIRTRSGGFITLGCLVVTLMLLLSEWRDFNSVVTRPELVIDRDRSL 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPIQEPQKEVVNA 125
           +L ++LDI  P++ C+ L LD +D SGE  L + +  + K RL  +GK +     ++  A
Sbjct: 66  RLDLNLDITFPSMPCELLTLDIMDDSGEVQLDIMNAGFEKTRLSKEGKVLGTADMKIGEA 125

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCNEVKEAYRYK 175
            KK K               N CG+CYGA  + +           CC TC++V++AY  K
Sbjct: 126 AKKDK------EAQLAKLGANYCGNCYGARDQGKNNDDTPRDQWVCCQTCDDVRQAYFEK 179

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E   +K+ +   EGC++ G  ++NR+ G+ H A G  +     H 
Sbjct: 180 NWAFFDGKDIEQCEREGYVQKIADQLQEGCRVSGSAQLNRIDGNLHFAAGPGFQNIRGHF 239

Query: 236 HDIQPYTS-AAFNTTHHIRHLSFGIKLQDDDERR---------KPLDGTVAKAEEGASM- 284
           HD   Y      N  H I HLSFG  ++   + +          PLDG        A   
Sbjct: 240 HDDSLYIQHPNLNFNHIINHLSFGKAVEPTKKGKVMGIEKVTVNPLDGHSMFPPRDAHFL 299

Query: 285 -FNYYIKIIPTIYERLDGSKL----------------GGGD----------GGMPGIFFS 317
            ++YY KI+PT YE L+   +                GG D          GG P ++ +
Sbjct: 300 QYSYYAKIVPTRYEGLNKKNMVETAQFSSTFHIRPVGGGSDDDHPNTVHQRGGSPSMWIN 359

Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKI 366
           +E+SPL V   E+    G  W+  + N    I G      ++D  L+   + I
Sbjct: 360 FEMSPLKVINREEH---GQSWSGFVLNCITSIGGVLAVGTVLDKALYKAQRTI 409


>gi|448531492|ref|XP_003870264.1| Erv46 protein [Candida orthopsilosis Co 90-125]
 gi|380354618|emb|CCG24134.1| Erv46 protein [Candida orthopsilosis]
          Length = 411

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 124/404 (30%), Positives = 190/404 (47%), Gaps = 48/404 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T++C L   +LI  +  DY  V    EL VD    
Sbjct: 7   KLISLDAFAKTVEDARIKTASGGIITLICILVALFLIRNEYIDYTTVIARPELVVDRDIN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDG---KP--IQEPQ 119
            +L I+LDI    + CD +++D  D SG+  L  +   + K R+   G   KP  I++ Q
Sbjct: 67  KQLDINLDISFLNLPCDLVSIDLFDESGDLKLDIINSQLEKFRIIKSGHSSKPTEIKDDQ 126

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK--CCNTCNEVKEAYRYKKW 177
             +   +  +++        TE E    CGSCYGA  + +K  CCN+C  V+ AY    W
Sbjct: 127 PPLQREMPLEQIAPGLPDGQTEGE----CGSCYGAVPQDKKQYCCNSCAAVRRAYAEANW 182

Query: 178 ALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
              + + I QC+ E   ++L+      EGC++ G  ++NRV+G+   APG S +    HV
Sbjct: 183 QFYDGENIAQCEEEGYVQRLRQRINDNEGCRVKGTTKINRVAGTMDFAPGASMT-KERHV 241

Query: 236 HDIQPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYI 289
           HD+  Y      FN  H I HLSFG    D    D     PLDG      +     NY++
Sbjct: 242 HDLSLYMKYKDKFNFDHVINHLSFGNNPPDSQLVDTGSISPLDGHKFLQHKKLHSINYFL 301

Query: 290 KIIPTIYERLDGSK---------------LGGGD-----------GGMPGIFFSYELSPL 323
           KI+ T +E L+G                 L GG             G+PG+ F++++SPL
Sbjct: 302 KIVATRFESLEGKDKFDTNQFSAITHDRPLAGGKDDDHQHTLHARAGVPGVAFNFDISPL 361

Query: 324 -MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            ++   E +K+       ++ +I+G  +   L+D  + +  + I
Sbjct: 362 KIINREEYAKTRSGFILGVVSSIAGVLMVGSLMDRSVFAAQQAI 405


>gi|325191973|emb|CCA26442.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 401

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 171/369 (46%), Gaps = 36/369 (9%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K + +F  +T  G  V+I+  +    L   ++ +Y  V   E + VDS+   
Sbjct: 33  LKRFDVYPKLHTEFKVQTETGAIVSIITAVIALILFLAELREYMSVRMHEHMVVDSTISE 92

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI    ++C    L A+D +GE  + +  +I   RLD  G PI       +++ 
Sbjct: 93  KLRINIDISYLALTCKESYLTAMDVTGELQMDLHRSIGMTRLDAKGNPIN-----TLDSA 147

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCY-GAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           K+            E+   N CGSCY       + CCNTC+EVKEA+      L + D  
Sbjct: 148 KE------------EVLPANYCGSCYETVHPLGKTCCNTCDEVKEAFVANDLRLFDADQK 195

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E + E+ +    EGC++ GY+ VNRV+G+FH+  G ++      +H   P   + 
Sbjct: 196 EQCVREMTEEQRQAQAGEGCRLKGYMMVNRVAGNFHVGLGRTFHRKGKLIHQFLPGQESV 255

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--- 302
           FN +  +  LSFG    +    +  LDGT    ++   +  Y++KI+PTIY  +  S   
Sbjct: 256 FNASFLLHSLSFGTPYAN---VKNGLDGTQYITKKKGGVMKYFLKIVPTIYSDISSSVHS 312

Query: 303 ------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          G   G+PG +F +E SP MVKI  +     H   +I   + G   
Sbjct: 313 YQYSHTKQEKYMNAMGQISGLPGAYFMFEFSPFMVKIDSEQIPFTHFVIRIFAILGGMIS 372

Query: 351 TFMLVDALL 359
               VD+++
Sbjct: 373 IAGFVDSVI 381


>gi|312376736|gb|EFR23738.1| hypothetical protein AND_12338 [Anopheles darlingi]
          Length = 265

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 89/198 (44%), Positives = 119/198 (60%), Gaps = 22/198 (11%)

Query: 83  YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
           +++LDA DS+GEQHLH+EH+IYKRRLDL+G  I+EP+KE +    K+  +TE   T++ +
Sbjct: 30  HVSLDAQDSTGEQHLHIEHSIYKRRLDLEGNQIEEPKKEDIQVSTKRVSSTETPVTSSTI 89

Query: 143 EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT 202
           +                     C  V +AYR +KW  P ++   QCKN          F 
Sbjct: 90  KP-------------------ACGNVIDAYRERKWN-PNVEDFEQCKNSNHGAIEGKAFN 129

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           EGC IYG +EVNRV G FHIAPG S+SI ++HVHD+QPY+S+ FNT+H I  LSFG +  
Sbjct: 130 EGCHIYGTMEVNRVEGRFHIAPGKSFSIQNIHVHDVQPYSSSRFNTSHRINTLSFGEQF- 188

Query: 263 DDDERRKPLDGTVAKAEE 280
            D    +PLDG    A E
Sbjct: 189 -DFGTTQPLDGLNVVATE 205


>gi|401888400|gb|EJT52358.1| ER to golgi family transport-related protein [Trichosporon asahii
           var. asahii CBS 2479]
 gi|406696432|gb|EKC99721.1| ER to transport-related protein [Trichosporon asahii var. asahii
           CBS 8904]
          Length = 378

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 163/375 (43%), Gaps = 65/375 (17%)

Query: 44  VDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNI 103
           ++  DY +V+    + VD SRG KL I LDI  P + C  L+LD +D SGE+   + H++
Sbjct: 2   IEFIDYRRVTLEPTIIVDRSRGEKLEIDLDITFPRVPCFLLSLDVMDISGERQNDITHDM 61

Query: 104 YKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCN 163
            K RL   G+ ++         V +            +  DPN CGSCYGA+     CCN
Sbjct: 62  AKHRLSASGEELE---------VTRSGQLKGEAERAAQNRDPNYCGSCYGAQAPESGCCN 112

Query: 164 TCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA 223
           +C++V++AY    W  P   TI QC  E   E +    TEGC+I G ++VN+V G+    
Sbjct: 113 SCDDVRKAYSESGWQFPNPSTIEQCVEENWAENMAQQNTEGCRIVGQVKVNKVVGNLQFT 172

Query: 224 PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL---------------QDDDERR 268
            G  ++  H    D+ PY     N  H   H+    +                + +DE R
Sbjct: 173 HGNVFTRGHT---DLLPYLRDG-NVHHDFGHIINKFRFTGEMPGQLYHRSQIQKKEDETR 228

Query: 269 K------PLDGTVAKAEEGAS--MFNYYIKIIPTIYERLDGSKLGGGD------------ 308
           K      PL G  + AE   S  M+ Y++K++ T +  L+G  +                
Sbjct: 229 KELGIHDPLQGVRSHAENDGSNIMYQYFVKVVSTAFVYLNGQNINTNQYSATEYERDLKH 288

Query: 309 -----------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                              +PG+F +YE+SP+ V  TE  +S  H  T     + G    
Sbjct: 289 GNLPTKDQHGHVTTHYTNAIPGVFINYEISPMKVVHTETRQSFAHFVTSTCAIVGGVLTV 348

Query: 352 FMLVDALLHSCVKKI 366
             L+DA + +  K++
Sbjct: 349 ASLIDAAIFNSRKRL 363


>gi|146416067|ref|XP_001484003.1| hypothetical protein PGUG_03384 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 404

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 189/404 (46%), Gaps = 51/404 (12%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  ED   +T  GG +T++C + + YLI  +  +Y  +    EL VD    
Sbjct: 5   KLLSFDAFAKTVEDARVRTPAGGIITLICVIVVLYLIRNEYLEYTSIINRPELVVDRDIN 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
            KL I+LDI  P I CD L +D +D SG+  + +  + +++ RL  DG  I++    + +
Sbjct: 65  KKLEINLDISFPDIPCDVLTMDILDVSGDLQVDLLLSGFEKFRLLKDGLEIRDESPVMSS 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---CCNTCNEVKEAYRYKKWALPE 181
           A + ++     G     L     CGSCYGA  +      CCN C  V+ AY  K W   +
Sbjct: 125 AGELEE--RARGRAPDGL-----CGSCYGALPQDENLDYCCNDCETVRLAYAQKAWGFFD 177

Query: 182 LDTIVQCKNEYSTEKLK---NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            + I QC+ E    +L    N F EGC+I G  ++NR+SG+ H APG S++    H HD+
Sbjct: 178 GENIEQCEREGYVARLNEKINNF-EGCRIKGTGKINRISGNLHFAPGASFTAPGSHFHDL 236

Query: 239 QPYT--SAAFNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFNYYIKII 292
             +      F   H I HL FG+   +    + +   PLD +    +    +++YY+K++
Sbjct: 237 SLFNKYDDKFTFDHVINHLLFGLDPHNIQFFEKQLTHPLDKSSMILKSKDRLYSYYLKVV 296

Query: 293 PTIYERLDGS----------------KLGGGD-----------GGMPGIFFSYELSPLMV 325
            T +E L  +                 L GG            GG+PG+FF +E+ P+  
Sbjct: 297 ATRFEFLTPNTPALETNQFLVISHHRPLAGGKDDDHQHTLHARGGLPGVFFHFEILPM-- 354

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           KI  K +     W+  +  +  +    ++V ALL   V    +V
Sbjct: 355 KIINKEQ-YAKTWSGFVLGVISSIAGVLMVGALLDRSVWAAERV 397


>gi|219111025|ref|XP_002177264.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411799|gb|EEC51727.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 404

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 116/406 (28%), Positives = 193/406 (47%), Gaps = 57/406 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   +RLK  D  +   ++F   TV G  ++IV  +F+ YL+  D    FQV+  E++ V
Sbjct: 1   MDLKDRLKRFDTHSPVSKEFRVYTVQGAVLSIVTLVFVGYLVTADFFFNFQVTLQEKVHV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQ---HLHVEHNIYKRR---------- 107
           ++S  S + +  D+ +P + C  L++DA D +G++   HL  +H+++K R          
Sbjct: 61  NASSPSGIELEFDVSLPDVPCSKLSIDANDPNGQKQSLHLDTDHHVWKHRITLLPNGHRQ 120

Query: 108 -------LDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK 160
                  L+L    + E   EV    ++ +   +N  + TE+     CG CYGA  E  +
Sbjct: 121 LLGERSKLELGSTLLTEKDLEV--KAEELQNAKDNSESRTEM---TPCGDCYGAGEEG-E 174

Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
           CC +C +VK AY+ + W+L +   + QC+ E    + +    EGC ++G + ++   G+ 
Sbjct: 175 CCKSCEDVKRAYKRRGWSLRDTSGVSQCRRESGIAEAEG---EGCNVHGVVALSSGGGNL 231

Query: 221 HIAPGLSYSINH---VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK 277
           HIAPG     N    +++ D    +   +N +H I  L FG   +D       LDG    
Sbjct: 232 HIAPGRDTEANFPGGMNIFDALLQSFHQWNVSHQIHKLRFG---KDYPAGVYQLDGETRT 288

Query: 278 AEEGASMFNYYIKIIPTIYERLDGSKLG---------------GGDGG------MPGIFF 316
             +G  M+ YY +++PT Y  L+G+ +                G + G      MPGIFF
Sbjct: 289 ITDGYGMYQYYFQVVPTRYTFLNGTTIQTHQYSVTEHLRHVSPGSNRGYSLNSRMPGIFF 348

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM-LVDALLHS 361
            YE+SPL V I E  +     +   +C I G  +T   L+D ++ S
Sbjct: 349 FYEVSPLHVDIMEVYQKGWIAFLTSVCAIVGGVVTIAGLIDHVIFS 394


>gi|156838396|ref|XP_001642904.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156113483|gb|EDO15046.1| hypothetical protein Kpol_367p1 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 404

 Score =  167 bits (423), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 121/411 (29%), Positives = 177/411 (43%), Gaps = 71/411 (17%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPI 70
           D FTK  ED   +T  GG +T+ C  F + L+  +  ++  V T   L +D     KL +
Sbjct: 10  DVFTKTEEDVRIRTRVGGIITLCCLSFTAILLFSEWINFNHVITKPNLVIDREHHLKLEL 69

Query: 71  HLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           ++DI  P I C  L LD +D SG   L + E    K R+  DG+                
Sbjct: 70  NIDITFPFIPCQLLNLDIMDDSGNVQLDITESGFTKTRIGSDGQ---------------- 113

Query: 130 KVTTENGTTTTEL-----EDPNKCGSCYGAETETRK----------CCNTCNEVKEAYRY 174
           ++ T N   + +L     +D N CGSCYGA  +++           CC TC +VK AY  
Sbjct: 114 QLGTTNFKVSEDLLEYSPKDKNYCGSCYGARDQSKNDEAESVDKKVCCQTCEDVKNAYSD 173

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
             WA  +   I QC+ E   EK+ +   EGC+I G   +NR+ G+ H APG ++     H
Sbjct: 174 AGWAFFDGKNIEQCEREGYVEKMNDQLNEGCRISGEALLNRIHGNIHFAPGKAFQNRGGH 233

Query: 235 VHDIQPYTS-AAFNTTHHIRHLSFGIKL------QDDDERRKPLDG--TVAKAEEGASMF 285
            HD   Y      N  H I HLSFG  +      +D      PLDG   +   +     F
Sbjct: 234 FHDTSFYNDHKNLNFKHMIEHLSFGRPVAQFKSNKDLVAMTSPLDGHQELPSIDAHNHQF 293

Query: 286 NYYIKIIPTIYERL-----------------------DGSKLGGGDGGMPGIFFSYELSP 322
            Y+ KI+PT +E L                       D S       G+PG+F  YE+SP
Sbjct: 294 IYFAKIVPTRFEYLNKQAQETSQLVVTSHMKPIGDATDYSTTMNSRQGIPGLFIDYEISP 353

Query: 323 LMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKISKV 369
           L V   E+  +    W+  + N    I G      + D ++H+  + +S +
Sbjct: 354 LKVINREQHAT---TWSGFLLNCITSIGGILAVGTVADKIVHATQRVVSHI 401


>gi|413945824|gb|AFW78473.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
           partial [Zea mays]
          Length = 284

 Score =  167 bits (423), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 98/287 (34%), Positives = 151/287 (52%), Gaps = 22/287 (7%)

Query: 101 HNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK 160
           H+I K RLD  G  I E +K  +   K ++   ++G    + E    CG+CYGAE    +
Sbjct: 2   HDIEKIRLDAHGNVI-EARKVSIGGAKIERPLQKHGGRLDKGE--QYCGTCYGAEESDEQ 58

Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
           CCN+C EV+EAY+ K WAL   D I QC  E   E++K    EGC ++G+L+V++V+G+F
Sbjct: 59  CCNSCEEVREAYKKKGWALTNPDLIDQCAREDFVERVKTQQDEGCNVHGFLDVSKVAGNF 118

Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
           H APG  +  +++ V ++       FN TH I  LSFG +         PLDG       
Sbjct: 119 HFAPGKGFYESNIDVPELS-LLEGGFNITHKINKLSFGTEFPG---VVNPLDGAQWTQPA 174

Query: 281 GASMFNYYIKIIPTIYERLDGSKLGGG---------DGGM-----PGIFFSYELSPLMVK 326
               + Y+IK++PTIY  + G  +            DG +     PG+FF Y+ SP+ V 
Sbjct: 175 SDGTYQYFIKVVPTIYTDIRGHNIHSNQFSVTEHFRDGNVRPKPQPGVFFFYDFSPIKVI 234

Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
            TE+S+SL H  T +   + G +    ++D+ ++   K +  K+E+G
Sbjct: 235 FTEESRSLLHYLTNLCAIVGGVFTVSGIIDSFIYHGQKALKKKMELG 281


>gi|219110527|ref|XP_002177015.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411550|gb|EEC51478.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 500

 Score =  167 bits (422), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 113/413 (27%), Positives = 188/413 (45%), Gaps = 66/413 (15%)

Query: 7   LKGLD-AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVDSS 63
           +K LD  F K   ++  +T  GG  ++V +L I+ L   +   +      T + + VD+S
Sbjct: 76  VKKLDFLFPKVDTEYTVQTDRGGLASLVAYLLIAVLALAETASWLSHNRDTVDHVRVDTS 135

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
            G ++ ++L+I  P+++CD L +D +D +G+  L++E  + KR++D  G+     Q E++
Sbjct: 136 LGQRMRVNLNITFPSLACDDLHVDVMDVAGDSQLNIEDTLTKRKMDRTGR---YGQAEIL 192

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP-EL 182
            + + ++  +       +      CG CYGA+ +   CCN C+ + +AY+ K W     L
Sbjct: 193 QSNQHEQEQSRKAKLRQDPLPDTYCGPCYGAQPDVDACCNNCDALLDAYKLKGWRTDLVL 252

Query: 183 DTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            T  QC  E   +K        EGC + G++ +NRV+G+FHIA G     +  H+H   P
Sbjct: 253 YTAEQCIREGRDQKKLRPLIQGEGCNLSGFMSLNRVAGNFHIAMGEGLQRDGRHIHVFDP 312

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDER----RKPLDGT--VAKAEEGAS-MFNYYIKIIP 293
             S  +N +H I HLSFG ++Q   +        L+G   +   E G + +F Y+IK++P
Sbjct: 313 EDSEHYNASHVIHHLSFGPEIQGKTKSGNLDSSSLNGVTKMVTPEHGTTGLFQYFIKVVP 372

Query: 294 TIYERLDGSK----------------------------------------LGGG------ 307
           T Y    G +                                         GGG      
Sbjct: 373 TTYLGPGGRRDESGTFETNRYFYTERFRPLMKEYLPEEAVAEDPKQAAVHAGGGHRTHDH 432

Query: 308 ----DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
               +  +PG+FF YE+ P  V+I   S  L HL  ++M  I G +     VD
Sbjct: 433 HHVRNSVLPGVFFLYEIYPFAVEIHPVSVPLTHLLIRLMATIGGVFTIVRWVD 485


>gi|322792513|gb|EFZ16471.1| hypothetical protein SINV_10123 [Solenopsis invicta]
          Length = 141

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 75/121 (61%), Positives = 93/121 (76%), Gaps = 3/121 (2%)

Query: 160 KCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGS 219
           +CCNTC +V EAYR KKWA P+   + QC+N+ S EKLK+ FT+GCQIYGY+EVNRV GS
Sbjct: 11  RCCNTCEDVWEAYRRKKWAPPDPADVKQCQNDKSMEKLKHAFTQGCQIYGYMEVNRVGGS 70

Query: 220 FHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAE 279
           FHIAPG+S+S+NHVHVHD+QPYTS+ FN TH IRHLSFG+ +     +  P+D T   A 
Sbjct: 71  FHIAPGVSFSVNHVHVHDVQPYTSSHFNMTHKIRHLSFGLNIPG---KTNPMDDTTVVAM 127

Query: 280 E 280
           E
Sbjct: 128 E 128


>gi|45188262|ref|NP_984485.1| ADR389Cp [Ashbya gossypii ATCC 10895]
 gi|44983106|gb|AAS52309.1| ADR389Cp [Ashbya gossypii ATCC 10895]
          Length = 392

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/408 (29%), Positives = 183/408 (44%), Gaps = 57/408 (13%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   +T  GG +T+ C +    L+  +    ++V    ++ +D  R 
Sbjct: 5   KLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDRDRQ 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            KL + LDI    + C+ L LD +D +GE  L++ E    K RLD  G+ + + +  V  
Sbjct: 65  QKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRV-- 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETE---------TRKCCNTCNEVKEAYRYK 175
                      G T    +D + CG CYGA  +          R CC TC EV+ AY   
Sbjct: 123 -----------GETLPSTDDQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEM 171

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +     QCK E  TE+L+    EGC++ G  ++NRV G+ H APG S  +   H 
Sbjct: 172 NWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHA 230

Query: 236 HDIQPYTSAAFNTTHHIRH-LSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIP 293
           HD   Y      + +H+ H LSFG ++  +     PL+G   +   G S  F+Y+ K++P
Sbjct: 231 HDDSFYKEHPHLSFNHVIHSLSFGPEIAGNP---GPLNGRAMEVPNGHSHFFSYFAKVVP 287

Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
             YE L G+                 GG D          GGM G+  ++E+SPL V   
Sbjct: 288 IRYETLAGTITESAEFSVTAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQR 347

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           E+  S    WT  + N   +    + V  +L        +  +G KT+
Sbjct: 348 EQYAS---TWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGKKTL 392


>gi|374107698|gb|AEY96606.1| FADR389Cp [Ashbya gossypii FDAG1]
          Length = 392

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/408 (29%), Positives = 183/408 (44%), Gaps = 57/408 (13%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   +T  GG +T+ C +    L+  +    ++V    ++ +D  R 
Sbjct: 5   KLLSLDAFAKTEEDVRVRTRAGGLITLGCVVVTLLLLVSEWRRLWEVEKRPQVVLDRDRQ 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            KL + LDI    + C+ L LD +D +GE  L++ E    K RLD  G+ + + +  V  
Sbjct: 65  QKLELRLDITFSQMPCELLNLDIIDDTGEAQLNLLEEGFTKTRLDKHGRTLGKEEFRV-- 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETE---------TRKCCNTCNEVKEAYRYK 175
                      G T    +D + CG CYGA  +          R CC TC EV+ AY   
Sbjct: 123 -----------GETLPSTDDQDYCGPCYGARDQDQNENLPRSERVCCQTCGEVRAAYAEM 171

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +     QCK E  TE+L+    EGC++ G  ++NRV G+ H APG S  +   H 
Sbjct: 172 NWATFDGKGFEQCKREGYTERLQEQINEGCRVAGTAQLNRVHGNIHFAPG-SAHVGKGHA 230

Query: 236 HDIQPYTSAAFNTTHHIRH-LSFGIKLQDDDERRKPLDGTVAKAEEGAS-MFNYYIKIIP 293
           HD   Y      + +H+ H LSFG ++  +     PL+G   +   G S  F+Y+ K++P
Sbjct: 231 HDDSFYKEHPHLSFNHVIHSLSFGPEIAGNP---GPLNGRAMEVPNGHSHFFSYFAKVVP 287

Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
             YE L G+                 GG D          GGM G+  ++E+SPL V   
Sbjct: 288 IRYETLAGTITESAEFSATAHDRPVHGGRDADHPNTVHFRGGMAGMTINFEMSPLKVIQR 347

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
           E+  S    WT  + N   +    + V  +L        +  +G KT+
Sbjct: 348 EQYAS---TWTAFVLNAITSIGGVLAVGTVLDRVTYHTQRTLMGKKTL 392


>gi|156030895|ref|XP_001584773.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980]
 gi|154700619|gb|EDO00358.1| hypothetical protein SS1G_14228 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 381

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 120/390 (30%), Positives = 164/390 (42%), Gaps = 102/390 (26%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    R   LDAFTK  ++   +T  GG VTI   L + YL   +  DY +++   EL V
Sbjct: 1   MPAKSRFTRLDAFTKTVDEARVRTTSGGIVTIASLLIVLYLAFGEWADYRRITVHPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  R                      D +D SGEQ + V H + K RL          Q+
Sbjct: 61  DKGR----------------------DVMDVSGEQQVGVMHGVKKVRLSA--------QE 90

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKK 176
           E    +    +   N        DPN CG CYGA      + + CCNTC+EV+EAY    
Sbjct: 91  EGGKVIDTTALDLHNADEAATHLDPNYCGPCYGATPPPNAKKQGCCNTCDEVREAYASVS 150

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           WA    + + QC+ E+  E+L +   EGC+I G L VN+V G+FHIAPG S++  ++HVH
Sbjct: 151 WAFGRGENVEQCEREHYGERLDSQRKEGCRIEGGLRVNKVIGNFHIAPGRSFTNGNMHVH 210

Query: 237 DIQPY----TSAAFNTTHHIRHLSFGIKLQDDDERR--------------KPLDGTVAKA 278
           D+  Y           +HHI  L FG +L ++  ++               PLD T    
Sbjct: 211 DLNNYFDTPVPGGHVFSHHIHSLRFGPELPEEVTKKLGSDSIIPWTNHHLNPLDNTEQIT 270

Query: 279 EEGASMFNYYIKIIPTIYERL------------------------DGS------------ 302
            E A  F Y++K++ T Y  L                        DGS            
Sbjct: 271 HEAAYNFMYFVKVVSTSYLPLGWETTYNSPPHDASVDIGTYGHSEDGSIETHQYSVTSHR 330

Query: 303 -KLGGGD-------------GGMPGIFFSY 318
             L GGD             GG+PG+FFSY
Sbjct: 331 RSLNGGDDSAEGHKEKLHARGGIPGVFFSY 360


>gi|215704311|dbj|BAG93745.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 261

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 95/257 (36%), Positives = 140/257 (54%), Gaps = 21/257 (8%)

Query: 89  VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
           +D SGEQH  + H+I KRRLD  G  I E +KE +   K +    ++G   ++ E+   C
Sbjct: 1   MDISGEQHHDIRHDIEKRRLDAHGNVI-EARKEGIGGAKIESPLQKHGGRLSKGEE--YC 57

Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
           G+CYGAE    +CCN+C EV+EAY+ K WAL   D I QC  E   E++K    EGC ++
Sbjct: 58  GTCYGAEESDEQCCNSCEEVREAYKKKGWALTNPDLIDQCTREDFVERVKTQQGEGCNVH 117

Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
           G+L+V++V+G+ H APG  +  ++++V ++       FN TH I  LSFG +        
Sbjct: 118 GFLDVSKVAGNLHFAPGKGFYESNINVPELSA-LEHGFNITHKINKLSFGTEFPG---VV 173

Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG---------DGGM-----PGI 314
            PLDG           + Y+IK++PTIY  L G K+            DG +     PG+
Sbjct: 174 NPLDGAQWTQPASDGTYQYFIKVVPTIYTDLRGRKIHSNQFSVTEHFRDGNIRPKPQPGV 233

Query: 315 FFSYELSPLMVKITEKS 331
           FF Y+ SP+ V   E++
Sbjct: 234 FFFYDFSPIKVVTMERN 250


>gi|407852879|gb|EKG06122.1| hypothetical protein TCSYLVIO_002790, partial [Trypanosoma cruzi]
          Length = 472

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/411 (27%), Positives = 190/411 (46%), Gaps = 53/411 (12%)

Query: 7   LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F    TK  +D  ++T  GG  +++  L I+ L+  +V  +F      E++VD 
Sbjct: 70  LGQLDVFPKFDTKFEQDARQRTAVGGIFSLISLLIIAVLVIGEVRYFFSTVEQHEMYVDP 129

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
             G  + I ++I  P + CD +  DA+D+ G     VE +  K R+           + +
Sbjct: 130 DLGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 189

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           V+  +KKK+T     +  E E+   C SCYGAE E   CC+TC +V+ AY  ++W   E 
Sbjct: 190 VD--EKKKITKALDPSGAEKEN---CPSCYGAEPEPGACCHTCEDVRRAYSLRRWVFNED 244

Query: 183 D-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
           D ++ QC  E   +    +  EGC ++   +V RV+G+ H  PG  +++   H+HD +  
Sbjct: 245 DISVEQCAEERLRKAATLSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQHLHDFRGK 304

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYYIKIIPTI 295
           T    N +H +  L FG +      +  P+D      G V   EE    F+Y++K++PT 
Sbjct: 305 TVRQLNLSHIVHTLGFGERFPG---QVNPMDGLVNSRGAVDATEEVNGRFSYFVKVVPTQ 361

Query: 296 YERLDGSKLGGGD------------------------------GGMPGIFFSYELSPLMV 325
           Y+    S LG G                                 +PG+F +Y+LSP+ V
Sbjct: 362 YQ--SASVLGVGSVVESNQYSVTRHFTPSPSAELSAAAAESSPVVVPGVFITYDLSPIKV 419

Query: 326 KITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
            + EK    S+ HL  ++     G +    LVD+++   V+++ +    GK
Sbjct: 420 FVIEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 470


>gi|241955457|ref|XP_002420449.1| COPII-coated vesicle complex subunit, putative; ER-derived vesicle
           protein, putative [Candida dubliniensis CD36]
 gi|223643791|emb|CAX41527.1| COPII-coated vesicle complex subunit, putative [Candida
           dubliniensis CD36]
          Length = 414

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 128/414 (30%), Positives = 202/414 (48%), Gaps = 54/414 (13%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L   DAF K  ED   KT  GG +T++C L    LI  +  DY  + T  EL V
Sbjct: 1   MSSRPKLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
           D     +L I+LDI    + CD +++D +D +G+  L+ ++  + K RL      ++  Q
Sbjct: 61  DRDINKQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRL------LKNKQ 114

Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
            +V VN ++  +    N    T+L        D N  CGSCYGA  + +K  CCN CN V
Sbjct: 115 GDVIVNEIEDDEPAFNNDIELTDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTV 174

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
           + AY  K W+  + + I QC+ E    +L+      EGC+I G  ++NRVSG+   APG 
Sbjct: 175 RRAYAEKHWSFYDGENIEQCEKEGYVARLRERINNNEGCRIKGTTKINRVSGTMDFAPGA 234

Query: 227 SYSINHVHVHDIQPYT--SAAFNTTHHIRHLSFG---IKLQDDD--ERRKPLDGTVAKAE 279
           S++    H HD+  YT     FN  H I HLSFG   +  Q D   +   PLD       
Sbjct: 235 SFTREGRHFHDLSLYTKYEDKFNFDHIINHLSFGEMPVDGQADQLFDSIHPLDDHQFMLH 294

Query: 280 EGASMFNYYIKIIPTIYE------RLDGSKL----------GGGD----------GGMPG 313
           + A + +YY+K++ T +E      R+D ++           GG D          GG+PG
Sbjct: 295 KKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVITHDRPLRGGKDEDHQHTLHARGGIPG 354

Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + F++++SPL ++   + +K+       ++ +I+G  +   L+D  + +  + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAI 408


>gi|68483709|ref|XP_714213.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
 gi|68483794|ref|XP_714172.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
 gi|46435713|gb|EAK95089.1| hypothetical protein CaO19.8218 [Candida albicans SC5314]
 gi|46435761|gb|EAK95136.1| hypothetical protein CaO19.586 [Candida albicans SC5314]
 gi|238882494|gb|EEQ46132.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 414

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 127/414 (30%), Positives = 201/414 (48%), Gaps = 54/414 (13%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M    +L   DAF K  ED   KT  GG +T++C L    LI  +  DY  + T  EL V
Sbjct: 1   MSSRPKLLSFDAFAKTVEDARIKTTSGGIITLICILITLVLIRNEYVDYTTIITRPELVV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLH-VEHNIYKRRLDLDGKPIQEPQ 119
           D     +L I+LDI    + CD +++D +D +G+  L+ ++  + K RL      ++  Q
Sbjct: 61  DRDINKQLDINLDISFINLPCDLISIDLLDVTGDLSLNIIDSGLKKIRL------LKNKQ 114

Query: 120 KEV-VNAVKKKKVTTENGTTTTEL-------EDPNK-CGSCYGAETETRK--CCNTCNEV 168
            +V VN ++  +    N    ++L        D N  CGSCYGA  + +K  CCN CN V
Sbjct: 115 GDVIVNEIEDDEPAFNNDIELSDLAKGLPEGSDENAYCGSCYGALPQDKKQFCCNDCNTV 174

Query: 169 KEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGL 226
           + AY  K W+  + + I QC+ E    +L+      EGC+I G  ++NRVSG+   APG 
Sbjct: 175 RRAYAEKHWSFYDGENIEQCEKEGYVGRLRERINNNEGCRIKGTTKINRVSGTMDFAPGA 234

Query: 227 SYSINHVHVHDIQPYTS--AAFNTTHHIRHLSFG---IKLQDDD--ERRKPLDGTVAKAE 279
           S++    H HD+  YT     FN  H I HLSFG   +  Q D+  +   PLD       
Sbjct: 235 SFTREGRHFHDLSLYTKYPDKFNFDHIINHLSFGEMPVDGQADELFDSIHPLDDHQFMLH 294

Query: 280 EGASMFNYYIKIIPTIYERLDGSK----------------LGGGD----------GGMPG 313
           + A + +YY+K++ T +E LD                   +GG D          GG+PG
Sbjct: 295 KKAHLVSYYLKVVATRFESLDYKNRIDTNQFSVITHDRPLVGGKDEDHQHTLHARGGIPG 354

Query: 314 IFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + F++++SPL ++   + +K+       ++ +I+G  +   L+D  + +  + I
Sbjct: 355 VNFNFDISPLKIINRQQYAKTWSGFVLGVISSIAGVLMVGTLLDRSVFAAQQAI 408


>gi|71407913|ref|XP_806393.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870127|gb|EAN84542.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 406

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 114/418 (27%), Positives = 188/418 (44%), Gaps = 67/418 (16%)

Query: 7   LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F    TK  +D  ++T  GG  +++  L I+ L+  +V  +F      E++VD 
Sbjct: 4   LGQLDVFPKFDTKFEQDARQRTAIGGIFSLLSLLIIAVLVIGEVRYFFSTVEQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-------DGKPI 115
             G  + I ++I  P + CD +  DA+D+ G     VE +  K R+         + +P+
Sbjct: 64  DIGGTMEITVNITFPRVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 123

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
            + +K++  A+       EN            C SCYGAE E   CC+TC +V+ AY  +
Sbjct: 124 VDEKKKITKALDPSGAEKEN------------CPSCYGAEPEPGACCHTCEDVRRAYSLR 171

Query: 176 KWALPELDTIV-QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           +W   E D  V QC  E   +    +  EGC ++   +V RV+G+ H  PG  +++   H
Sbjct: 172 RWVFNEDDVSVEQCAEERLRKAAILSSQEGCNLFVNYKVARVTGNIHFVPGRMFNLMGQH 231

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYY 288
           +HD +  T    N +H +  L FG +      +  P+D      G V   EE    F+Y+
Sbjct: 232 LHDFRGKTVRQLNLSHIVHTLGFGERFPG---QVNPMDGLVNLRGAVDATEEVNGRFSYF 288

Query: 289 IKIIPTIYERLDGSKLGGGD------------------------------GGMPGIFFSY 318
           +K++PT Y+    S LG G                                 +PG+F +Y
Sbjct: 289 VKVVPTQYQ--SASILGVGSVVESNQYSVTHHFTPSPSAELSAAAAESSPVMVPGVFITY 346

Query: 319 ELSPLMVKITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
           +LSP+ V + EK    S+ HL  ++     G +    LVD+++   V+++ +    GK
Sbjct: 347 DLSPIKVFVFEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 404


>gi|255578837|ref|XP_002530273.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223530205|gb|EEF32113.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 265

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 94/252 (37%), Positives = 133/252 (52%), Gaps = 21/252 (8%)

Query: 89  VDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKC 148
           +D  GEQH  ++HNI K+R++  G  I E +KE + A K +K    +G      E    C
Sbjct: 1   MDIMGEQHFDIKHNITKKRINAHGDVI-EVRKEGIGAPKIEKPLQRHGGRLEHNE--TYC 57

Query: 149 GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIY 208
           GSCYGAE     CCN+C+EV+EAYR K WAL  +D I QCK E   +K+K+   EGC IY
Sbjct: 58  GSCYGAEMSDDDCCNSCDEVREAYRKKGWALTGVDLIDQCKREGFIQKVKDEEGEGCNIY 117

Query: 209 GYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR 268
           G LEVN+V+G+FH +PG     +   + D+  +   ++N +H I  L+FG          
Sbjct: 118 GSLEVNKVAGNFHFSPGKGLHQSSFFIQDLLVFQGDSYNISHTINRLAFGDYFPG---VV 174

Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGGM---------------PG 313
            PLDG     E    M  Y++K++PTIY  + G  +      +               PG
Sbjct: 175 NPLDGVPWVHETPNGMHQYFLKVVPTIYTDIRGRTVRSNQYSVTEHFKKSEFARLDSPPG 234

Query: 314 IFFSYELSPLMV 325
           +FF Y+ SP+ V
Sbjct: 235 VFFFYDFSPIKV 246


>gi|322693278|gb|EFY85144.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Metarhizium acridum CQMa 102]
          Length = 356

 Score =  164 bits (415), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 166/361 (45%), Gaps = 75/361 (20%)

Query: 74  IVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTT 133
           +  P + C+ L LD +D SGEQ   V H +   RL    +P  E Q   V  +K  KV  
Sbjct: 1   MTFPRMPCELLTLDVMDVSGEQQHGVSHGVKNVRL----RP--ESQGGGVIDIKSMKVHD 54

Query: 134 ENGTTTTELEDPNKCGSCYGAET--ETRK--CCNTCNEVKEAYRYKKWALPELDTIVQCK 189
           +      E  DP+ CG CYGA      RK  CCNTC+EV+EAY  + WA    + + QC 
Sbjct: 55  D----PAEHLDPSYCGECYGATAPPNARKAGCCNTCDEVREAYASQGWAFGRGENVEQCT 110

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY----TSAA 245
            E+  E+L     EGC++ G+LEVN+V G+FH+APG S+S  ++HVHD++ Y        
Sbjct: 111 REHYAERLDEQREEGCRVEGHLEVNKVVGNFHLAPGRSFSNGNMHVHDLKNYWETPNGKQ 170

Query: 246 FNTTHHIRHLSFGIKLQDDDERR-------------KPLDGTVAKAEEGASMFNYYIKII 292
            + TH I  L FG +L      R              PLDGT  +  + A  + Y++KI+
Sbjct: 171 HDFTHTIHQLRFGPQLPAAVSDRLGKGSMPWTNHHINPLDGTRQETGDPAFNYMYFVKIV 230

Query: 293 PTIYERL-----------------DGS-------------KLGGGD-------------G 309
           PT Y  L                 DGS              L GG+             G
Sbjct: 231 PTSYLPLGWEKRFKNAAGSTYGNADGSLETHQYSVTSHKRSLEGGNDAAEGHAERQHSQG 290

Query: 310 GMPGIFFSYELSPL-MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           G+PG+FFSY++SP+ ++   E +K+       +   + GT      VD  L     ++ K
Sbjct: 291 GIPGVFFSYDISPMKVINREEPAKTFTGFLAGLCAIVGGTLTVAAAVDRGLFEGAARLKK 350

Query: 369 V 369
           +
Sbjct: 351 M 351


>gi|340053482|emb|CCC47775.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 404

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/404 (28%), Positives = 182/404 (45%), Gaps = 53/404 (13%)

Query: 7   LKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L+ LD F K      +D  ++TV GG ++  C   I+ L+  +V  +       E++VD 
Sbjct: 4   LRCLDVFPKFDVRFEQDARQRTVVGGLLSFACMTAIAVLVVGEVRYFLSTVDQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
             G ++ I L++  P + CD +  DA+DS GE    V  +  K R+  D  +PI E +  
Sbjct: 64  HIGGEMHITLNVTFPRVPCDLMTADAIDSFGEYAKDVIRSTRKMRVHADTLQPISEARGL 123

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           V   V+K++ +T   +   E      C SCYGAE     CCNTC++V+ A++ K W+  E
Sbjct: 124 V---VEKRQSSTNADSGGAE-----GCPSCYGAEKNPGDCCNTCDDVRNAFKDKGWSFNE 175

Query: 182 LDT-IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            D  I QC  E       ++  EGC IY     +RV G+ H  PG  +     H+H ++ 
Sbjct: 176 DDIGIAQCAEERLRHAESSSSREGCNIYAKFSASRVKGNIHFVPGSMFDYYGQHMHVLKG 235

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYYIKIIPT 294
                 N +H I  L FG +      ++ PLD      G V K+E     F+Y+++++PT
Sbjct: 236 EIIRKMNLSHIIHQLDFGERFPG---QKNPLDGMVNSRGVVDKSESTNGRFSYFVQVVPT 292

Query: 295 IYERLD--------------------------GSKLGGGDGG--MPGIFFSYELSPL--M 324
            Y+ +                           G      D    +PGIF  Y++SP+   
Sbjct: 293 QYQHVSIFGTGRLLETNQYSVTHYFTESWNATGRDKSANDAPSVVPGIFILYDISPIKTS 352

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           VK T    S+ HL  ++     G +    L+D+ L    +++ K
Sbjct: 353 VKATHPYPSVVHLVLQLCAVGGGVFNVASLIDSFLFHGTRQVQK 396


>gi|407418919|gb|EKF38246.1| hypothetical protein MOQ_001547 [Trypanosoma cruzi marinkellei]
          Length = 406

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 112/418 (26%), Positives = 188/418 (44%), Gaps = 67/418 (16%)

Query: 7   LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F    TK  +D  ++T  GG  +++    I+ L+  +V  +F      E++VD 
Sbjct: 4   LGQLDVFPKFDTKFEQDARQRTAVGGVFSLLSLFIIAVLVIGEVRYFFSTVEQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-------DGKPI 115
             G  + I ++I  P + CD +  DA+D+ G     VE +  K R+         + +P+
Sbjct: 64  DLGGTMEITVNITFPHVPCDLITADAIDAFGTFAEGVERDTVKSRVAASTLEKISEARPL 123

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
            + +K++  A+       EN            C SCYGAE E   CC+TC++V+ AY  +
Sbjct: 124 VDEKKKITKALDPNGAEKEN------------CPSCYGAEPEPGACCHTCDDVRRAYSLR 171

Query: 176 KWALPELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           +W   E D ++ QC  E   +       EGC ++   +V RV+G+ H  PG  +++   H
Sbjct: 172 RWVFNEDDISVEQCAGERLRKAAILISQEGCNLFVKYKVARVTGNIHFVPGRMFNLMGQH 231

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD------GTVAKAEEGASMFNYY 288
           +HD +  T    N +H +  L FG +      +  P+D      G V   EE    F+Y+
Sbjct: 232 LHDFRGKTVRQLNLSHIVHTLCFGERFPG---QVNPMDGLVNSRGAVDATEEVNGRFSYF 288

Query: 289 IKIIPTIYERLDGSKLGGGDGG------------------------------MPGIFFSY 318
           +K++PT Y+    S LG G                                 +PG+F +Y
Sbjct: 289 VKVVPTQYQA--ASILGVGSVVESNQYSVTHHFTASPSAELSTTTPESTPVIVPGVFITY 346

Query: 319 ELSPLMVKITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
           +LSP+ V + EK    S+ HL  ++     G +    LVD+++   V+++ +    GK
Sbjct: 347 DLSPIKVFVMEKHPYSSVLHLVLQLCAVGGGVFTVAGLVDSVIFHGVRRVQRKMQQGK 404


>gi|261327856|emb|CBH10834.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 405

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 113/406 (27%), Positives = 191/406 (47%), Gaps = 60/406 (14%)

Query: 7   LKGLDAFTKPYEDF----HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F K  E F     ++T  GG +++   L I++L+  +V  +F      E++VD 
Sbjct: 4   LSRLDVFPKFDERFERDARQRTALGGVLSMASILIITFLVVGEVRYFFSSVEQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
             G  + + ++I  P + CD +  DA+D+ GE   +V  +  + R++ D   P+ E +  
Sbjct: 64  HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLVPLGEARPL 123

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           +     KK+    NG       +  KC SCYGAE+    CC+TC++V+ A+  ++W   E
Sbjct: 124 MD---MKKQPADGNGA------EHGKCPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174

Query: 182 LD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            D +IVQC +E       +  TEGC ++    V RV+G+ H  PG  ++    H+H  + 
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA------KAEEGASMFNYYIKIIPT 294
            T    N +H +  L FG +      +  P+DG          +E     F+Y++K++PT
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPG---QSNPMDGMANVRGATDPSEPLIGRFSYFVKVVPT 291

Query: 295 IYERLDGSKLGGG---------------------DGG-----------MPGIFFSYELSP 322
           +Y R++ S +GGG                      GG           +PG+F SY+LSP
Sbjct: 292 VY-RIE-SLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSP 349

Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + V +  T    S+ HL  ++     G Y    L+D+L    ++++
Sbjct: 350 IRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRM 395


>gi|66363024|ref|XP_628478.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
           and possible N region transmembrane [Cryptosporidium
           parvum Iowa II]
 gi|46229502|gb|EAK90320.1| ER vesicle protein; Erv41p, transmembrane region near C terminus
           and possible N region transmembrane [Cryptosporidium
           parvum Iowa II]
          Length = 397

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 109/377 (28%), Positives = 180/377 (47%), Gaps = 57/377 (15%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           ++K +D + K +ED+  K+     ++++ ++ + +L   ++  YF+    + + VD++  
Sbjct: 33  KVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGVDNTIN 92

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           +KL I LDI  P + C+ +++D+VD  GE  +  +  + K  +DL+G+ ++  +    N 
Sbjct: 93  NKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMVKIPIDLNGQEVRNIKYNQQND 152

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA-LPELDT 184
           +K                   +C SCYGAET    CCN C+ +K AYR K W+ L  +  
Sbjct: 153 LKI------------------ECMSCYGAETNEFLCCNDCDSLKTAYRSKGWSYLDIVSK 194

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-TS 243
             QC      EK+      GC+I G ++VN+VSG+ H+A G +   N  HVH+      S
Sbjct: 195 APQC-----IEKV------GCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVS 243

Query: 244 AAFNTTHHIRHLSFGIKLQDDDE---RRKPLDGTVAKAEEGASMFNYYIKIIPTIY---- 296
             FNT+H I  L FG      D+      PL+       +G  MF+YY+K+IPT Y    
Sbjct: 244 RGFNTSHIIHELRFG-----SDKIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSGN 298

Query: 297 -------------ERLDGSKLGGGD-GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
                        ER     +  G+  G+PGIF  Y+  P +++   K   + HL T   
Sbjct: 299 GEVNLYGNQYAFTERERDVHVQNGELSGLPGIFIVYDFQPFLLQKIYKRVPISHLITSFC 358

Query: 343 CNISGTYITFMLVDALL 359
             + G Y    L+D  +
Sbjct: 359 AIVGGIYSIMSLLDTFV 375


>gi|298706631|emb|CBJ29569.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 453

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 117/413 (28%), Positives = 187/413 (45%), Gaps = 65/413 (15%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK LD +++P  +F   TV+G  VTIV    +  L   ++    +  T E LFV+S+  
Sbjct: 28  KLKRLDIYSRPKREFQRATVHGAMVTIVLVGAVLVLTWRELVFSMKRETVENLFVNSTIN 87

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVN 124
             + +  D+V   I C +L+LDA D+ G     + H++ + RLD  G+ + + +K E+ N
Sbjct: 88  PTVNVTFDVVFARIPCGFLSLDAEDALGIPQEDLRHDVTRTRLDSIGRALDDGEKHEMGN 147

Query: 125 AVK-------KKKVTTENGTTTTELEDPNKCG------------------------SCYG 153
            +K       +K+   +      +L+  ++ G                        +CYG
Sbjct: 148 TLKAVIAKEEEKQAEADASPGDEDLDSKSRAGDGGDGDVEQRALEDTATTGQEDECNCYG 207

Query: 154 AETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF------TEGCQI 207
           A  E  +CC TC +V++AYR K W L   + I  C  E  +    NT        EGC++
Sbjct: 208 AGAEG-ECCRTCEDVRKAYRRKGWRLNPAE-IPACAGEALSANSANTMESPPVENEGCRL 265

Query: 208 YGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDD 265
            G+LEV+R  G+FH APG  L    N +   D       +FNTTH I  L+FG +     
Sbjct: 266 AGHLEVSRTEGNFHFAPGHRLHRHANELSFVDRIQVALESFNTTHTINTLTFGDQPPPGH 325

Query: 266 ERRKP------LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------- 304
              K       L+G     ++  +M  Y+++++PT+Y RLD  +                
Sbjct: 326 ASPKHAVASTVLEGHQKTVQDTHAMHQYFLQLVPTVY-RLDNGETVHSNQYSATEHLKHV 384

Query: 305 -GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
             G   G+PG++F YE+SP+   + EK K      T     + G Y    LV+
Sbjct: 385 HDGTSRGLPGVYFYYEVSPVQALVEEKRKGFLAFLTGACGVVGGVYTILGLVN 437


>gi|403215743|emb|CCK70242.1| hypothetical protein KNAG_0D05030 [Kazachstania naganishii CBS
           8797]
          Length = 422

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 123/421 (29%), Positives = 187/421 (44%), Gaps = 66/421 (15%)

Query: 9   GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
            LDAF+K  E+   +T  GG ++++C +    L+  +   +  V+T   L +D      L
Sbjct: 10  ALDAFSKTEEEARVRTSGGGLISLLCVVSAVVLLWREWAQFRAVTTDPMLVIDRDHELPL 69

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIY-KRRLDLDGKPI----QEPQKEVV 123
            + LDI  P + C  L LD +D SG   L V  + + K R+D++G  +     EP K   
Sbjct: 70  KLTLDITFPAMPCALLGLDIMDESGNVQLDVLFDQFTKTRVDVNGNMVGGSASEPYKP-- 127

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAET---------ETRKCCNTCNEVKEAYRY 174
           N++  K+     G    ++ D + CGSCYG++          E R CC TC++V +AY  
Sbjct: 128 NSLSGKRA----GAKDLQM-DADYCGSCYGSKNQENNAELPPEQRICCQTCDDVHDAYLE 182

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV- 233
             WA  +   I QC++E   ++++    EGC + G   +NR+ G+ H APG  Y      
Sbjct: 183 AGWAFFDGANIEQCESEGYVKRIQEQLHEGCNVKGTALLNRIQGNLHFAPGKPYQQLAAG 242

Query: 234 -------HVHDIQPY-TSAAFNTTHHIRHLSFGIKLQDD-----DERRKPLDGTVAKAEE 280
                  H HD+  Y  +   N  H I    FG   Q +      +R  PL+ TVA  E 
Sbjct: 243 MPGQGLGHYHDVSLYERNRHMNLNHVINEFRFGEDPQSEIVAQKIQRSAPLEDTVASLEN 302

Query: 281 GA-SMFNYYIKIIPTIYERLDGSK----------------LGG----------GDGGMPG 313
               +FNYY  ++PT YE L  SK                +GG          G GG PG
Sbjct: 303 PHYYIFNYYTNVVPTRYEFLGASKPLDTAQYSATYHDRPIMGGRDADHPTTLHGRGGTPG 362

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
           ++F+ E SPL +   E+       W+ ++ N   T    + V  +    V K  +  IG 
Sbjct: 363 VYFNLEFSPLKIINRERRP---QQWSTLLLNWITTIGGILAVGTVTDKVVYKAQR-SIGA 418

Query: 374 K 374
           K
Sbjct: 419 K 419


>gi|67623967|ref|XP_668266.1| serologically defined breast cancer antigen 84 [Cryptosporidium
           hominis TU502]
 gi|54659454|gb|EAL38030.1| serologically defined breast cancer antigen 84 [Cryptosporidium
           hominis]
          Length = 397

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 180/378 (47%), Gaps = 59/378 (15%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           ++K +D + K +ED+  K+     ++++ ++ + +L   ++  YF+    + + VD++  
Sbjct: 33  KVKKIDIYGKIHEDYCVKSTSRSIISLLVYIIVFFLTLNEIFKYFKGEMIDNIGVDNTIN 92

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           +KL I LDI  P + C+ +++D+VD  GE  +  +  + K  +DL+G+ ++  +    N 
Sbjct: 93  NKLDIMLDITFPRLRCEEISVDSVDYVGENQVDSKEYMAKIPIDLNGQEVRNIKYNQQND 152

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA-LPELDT 184
           +K                   +C SCYGAET    CCN C+ +K AYR K W+ L  +  
Sbjct: 153 LKI------------------ECMSCYGAETNEFLCCNDCDSLKTAYRSKGWSYLDIVSK 194

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY-TS 243
             QC      EK+      GC+I G ++VN+VSG+ H+A G +   N  HVH+      S
Sbjct: 195 APQC-----IEKV------GCRINGRMQVNKVSGNIHVALGTATIKNGKHVHEFNMNDVS 243

Query: 244 AAFNTTHHIRHLSFGIKLQDDDER----RKPLDGTVAKAEEGASMFNYYIKIIPTIY--- 296
             FNT+H I  L FG       +R      PL+       +G  MF+YY+K+IPT Y   
Sbjct: 244 RGFNTSHIIHELRFG------SDRIPFLFSPLENIQKFVHKGTKMFHYYVKLIPTQYFSG 297

Query: 297 --------------ERLDGSKLGGGD-GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
                         ER     +  G+  G+PG+F  Y+  P +++   K   + HL T  
Sbjct: 298 NGEVNLYGNQYAFTERERDVHVQNGELSGLPGVFIVYDFQPFLLQKIYKRVPISHLITSF 357

Query: 342 MCNISGTYITFMLVDALL 359
              + G Y    L+D  +
Sbjct: 358 CAIVGGIYSIMSLLDTFV 375


>gi|72388468|ref|XP_844658.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62360135|gb|AAX80555.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70801191|gb|AAZ11099.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 405

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 112/406 (27%), Positives = 190/406 (46%), Gaps = 60/406 (14%)

Query: 7   LKGLDAFTKPYEDF----HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F K  E F     ++T  GG +++     I++L+  +V  +F      E++VD 
Sbjct: 4   LSRLDVFPKFDERFLRDARQRTALGGVLSMASIFIITFLVVGEVRYFFSSVEQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQKE 121
             G  + + ++I  P + CD +  DA+D+ GE   +V  +  + R++ D   P+ E +  
Sbjct: 64  HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEHVENVLTDTARVRVNPDTLVPLGEARPL 123

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           +     KK+    NG       +  KC SCYGAE+    CC+TC++V+ A+  ++W   E
Sbjct: 124 MD---MKKQPADGNGA------EHGKCPSCYGAESNPGDCCHTCDDVRRAFAERQWEFHE 174

Query: 182 LD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            D +IVQC +E       +  TEGC ++    V RV+G+ H  PG  ++    H+H  + 
Sbjct: 175 DDASIVQCVHERLKMAAASASTEGCNLHASFSVPRVTGNIHFIPGRMFNFFGQHLHSFKG 234

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA------KAEEGASMFNYYIKIIPT 294
            T    N +H +  L FG +      +  P+DG          +E     F+Y++K++PT
Sbjct: 235 ETIQKLNLSHIVHSLEFGERFPG---QSNPMDGMANVRGATDPSEPLIGRFSYFVKVVPT 291

Query: 295 IYERLDGSKLGGG---------------------DGG-----------MPGIFFSYELSP 322
           +Y R++ S +GGG                      GG           +PG+F SY+LSP
Sbjct: 292 VY-RIE-SLVGGGRVVESNQYSVTHHFTPSWETPKGGENNNAKHDPSVVPGVFISYDLSP 349

Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           + V +  T    S+ HL  ++     G Y    L+D+L    ++++
Sbjct: 350 IRVSVKRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHSIRRM 395


>gi|449528843|ref|XP_004171412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Cucumis sativus]
          Length = 355

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 89/241 (36%), Positives = 128/241 (53%), Gaps = 19/241 (7%)

Query: 148 CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQI 207
           CGSC+GAE     CCN+C EV+EAYR K WA+   D I QC+ E   +K+K+   EGC I
Sbjct: 115 CGSCFGAEASDDDCCNSCEEVREAYRKKGWAITNQDLIDQCQREDFIQKVKDEEGEGCNI 174

Query: 208 YGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDER 267
            G LEVN+V+GSFH  PG S+  +  +   +    ++ +N +H I  L+FG      D  
Sbjct: 175 EGSLEVNKVAGSFHFVPGKSFYQSSFNFLGLLALQTSDYNVSHRINRLAFG---NHYDGL 231

Query: 268 RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG---------------GGDGGMP 312
             PLDG   +  E   M  Y++K++PTIY+ + G  +                G    +P
Sbjct: 232 VNPLDGVHWEYNEQNVMHQYFVKVVPTIYKNIRGRTVHSNQYSVTEHFKSVEFGSSQSIP 291

Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEI 371
           G+FF Y+LSP+ V  TE+     H  T I   I G +    ++DA ++   +K+  KVEI
Sbjct: 292 GVFFYYDLSPVKVTYTEEHVPFLHFMTHICAIIGGVFSVAGIIDAFIYHGQRKMKKKVEI 351

Query: 372 G 372
           G
Sbjct: 352 G 352


>gi|330803630|ref|XP_003289807.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
 gi|325080118|gb|EGC33688.1| hypothetical protein DICPUDRAFT_80570 [Dictyostelium purpureum]
          Length = 388

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 178/369 (48%), Gaps = 47/369 (12%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE--LFVD 61
           E++K  D + K  +D   +K+ +GG VT+VC L  +YL+  ++  YF      E  L VD
Sbjct: 34  EKVKLFDFYPKVDDDVPRQKSTFGGVVTVVCLLITAYLLISEI--YFFTFPVREHSLKVD 91

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDS-SGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
            +RG++LPI++DI  P + C  + +D VD   G+      + I K RLD  G P  +   
Sbjct: 92  VTRGNRLPINIDIHFPRLVCTDITIDVVDGIDGKPIKDAAYQIVKERLDSKGVPFAKGV- 150

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
               A+  KK    +  T  E     K  S +  +    KCCN+C++++E YR  +    
Sbjct: 151 ----ALAGKKGIFSSRCTECEFPKQKKGSSVFFRQ----KCCNSCDDLREYYRLNRIPQN 202

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH-VHVHDIQ 239
             D   QC  E   +       EGC+IYG L+V ++ G FHI  GLS   +H  H H + 
Sbjct: 203 FADDAPQCLIERPIQD-----DEGCRIYGSLQVQKMKGDFHILAGLSADESHDGHAHHVH 257

Query: 240 PYTS------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
             T         FN THHI   SFG    D D    PL+G    A+  A   NYYI+++P
Sbjct: 258 RITKENIGRVTQFNITHHIHKFSFG---DDIDGLINPLEGFGIVAQSLAVQ-NYYIQVVP 313

Query: 294 TIYER-------------LDGSKLGGGDGG--MPGIFFSYELSPLMVKITEKSKSLGHLW 338
            IY++              D   +   + G   PGI+F Y++SPLM+++ + SK +  L 
Sbjct: 314 AIYKKNDYVLETNQYSYTYDYRNVNVFNLGRIFPGIYFKYDMSPLMIEVDQTSKPIVELI 373

Query: 339 TKIMCNISG 347
           T I C I G
Sbjct: 374 TSI-CAIGG 381


>gi|298708525|emb|CBJ49158.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 467

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 108/316 (34%), Positives = 164/316 (51%), Gaps = 32/316 (10%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQV-STTEELFVDSSRG 65
           +K LD + +  ED   +T  G AVTI  W+ +  L   +V  Y +V + TE + VDSS G
Sbjct: 44  IKQLDVYARVDEDLQVRTEAGAAVTIGFWVLMVVLCVGEVQAYRKVQAPTERVVVDSSMG 103

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++D+   +I C  + +DA+D +G+  + ++H ++K+RLD DG  I E   EV   
Sbjct: 104 QKLRINIDMTFHSIPCLDVHVDAMDVAGDNQIDIDHGMWKQRLDPDGSAIGEAFMEVPGE 163

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DT 184
           V       ++    +  ED   CGSC+GA+   + CCN C +V +AY  K W++ ++  T
Sbjct: 164 V-------DDDPAQSLPED--YCGSCFGAK---KGCCNMCRDVVDAYTAKGWSVQDIRRT 211

Query: 185 IVQC-KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
             QC ++ +    + N   EGC + G++ VN+VSG+FH+A G        HVH      +
Sbjct: 212 AEQCIRDNHIETPIVN--GEGCNLSGFMSVNKVSGNFHVATGEGVMREGRHVHLYTLEQA 269

Query: 244 AAFNTTHHIRHLSF-----GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-- 296
             FNT+H I  LSF     G+K    D   + +D  V     G   F YYIK++PT++  
Sbjct: 270 VGFNTSHSINLLSFWEPYPGMKPNPLDRTSRIIDEDV-----GTGAFQYYIKLVPTMHSL 324

Query: 297 ---ERLDGSKLGGGDG 309
                  GS L  G G
Sbjct: 325 SPQSEASGSPLPKGKG 340


>gi|157873507|ref|XP_001685262.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128333|emb|CAJ08503.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 467

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 114/412 (27%), Positives = 189/412 (45%), Gaps = 54/412 (13%)

Query: 5   ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
            +LK LD F K    F +    +TV GG +++V  + I +L+  +V  +  V   +E+FV
Sbjct: 63  RQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVIIIWLLVGEVRYFLSVEEHQEMFV 122

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+  G  + + ++I    + CD + LDAVD  G     VE N  K+R+D     +    +
Sbjct: 123 DTKVGGDMQVTVNITFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAAR 182

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            +V+   +KKV T+      +  +   C SCYGAE     CC+TC +V++AY  + W L 
Sbjct: 183 AMVD---EKKVMTK--AIDADGAEKENCPSCYGAERNPGDCCHTCEDVRQAYARRGWKL- 236

Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           ++D  ++ QC  +        +  EGC +Y     +R +GS    PG  Y      +HD+
Sbjct: 237 DIDEISVEQCAEDRINMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 296

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
              T+   + +H +  L FG        ++ PLDGT    A  G +       F+Y++K+
Sbjct: 297 MGSTTRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 353

Query: 292 IPTIYERLD----------------------------GSKLGGGDGGMPGIFFSYELSPL 323
           +PT Y+R                               S+       +PG+F +Y+LSP+
Sbjct: 354 VPTTYQRYSLITGLQDVVESNQYSATHHFTPSEAAKAASQAPKKQEIVPGVFMTYDLSPV 413

Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
            + + E+    SL H   + +C + G  +T   LVD+L     +KI K+  G
Sbjct: 414 RILVQERHPYPSLAHFVLQ-LCAVCGGVLTVAGLVDSLCFHSARKIRKMCTG 464


>gi|72393511|ref|XP_847556.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|62175086|gb|AAX69235.1| hypothetical protein, conserved [Trypanosoma brucei]
 gi|70803586|gb|AAZ13490.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|261330829|emb|CBH13814.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 405

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 104/405 (25%), Positives = 182/405 (44%), Gaps = 54/405 (13%)

Query: 7   LKGLDAF----TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           L  LD F    T+  +D  ++T  GG +++   L I++L+  ++  +       E++VD 
Sbjct: 4   LSRLDVFPKFDTRFEQDARQRTALGGVLSMASILIITFLVVGEIRYFLSTVEQHEMYVDP 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
             G  + + ++I  P + CD +  DA+D+ GE   +V  +  K R+D       +P  + 
Sbjct: 64  HIGGIMHMKVNITFPRVPCDLMTADAIDAFGEYVENVVTDTAKVRVD---SSTLKPLGKA 120

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
              V  KK  T    T  E      C +CYGAE    +CC+TC++V+ A+  ++W   E 
Sbjct: 121 RQLVDLKKQPTNGNETGNE-----NCPTCYGAEKNPGECCHTCDDVRRAFAERQWEFHED 175

Query: 183 D-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
           D +I QC +E       +   EGC ++    V RV+G+ H  PG  ++    H+H  +  
Sbjct: 176 DVSIAQCAHERLKVAADSASAEGCNLHASFSVPRVTGNIHFVPGRMFNFFGQHLHSFKGE 235

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK------AEEGASMFNYYIKIIPTI 295
           T    N +H +  L FG +    +    P+DG V        +E     F Y++K++PT+
Sbjct: 236 TIRKLNLSHIVHALEFGERFPGQN---NPMDGMVNARGVKDPSEPLIGRFTYFVKVVPTL 292

Query: 296 YERLDGSKLGG----------------------GDGG--------MPGIFFSYELSPLMV 325
           Y+ +  +  G                       G+          +PG+F SY++SP+ V
Sbjct: 293 YQVVSMANTGNLVESNQYSVTHHFTPSWAAPKEGETDNPNSDPLVVPGVFISYDISPIRV 352

Query: 326 KITEKS--KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            +T      S+ HL  ++     G Y    L+D+L    +K++ +
Sbjct: 353 SVTRTHPYPSIVHLVLQLCAVGGGVYTVTGLIDSLFFHGIKRVQE 397


>gi|302823246|ref|XP_002993277.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
 gi|302825185|ref|XP_002994225.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
 gi|300137936|gb|EFJ04730.1| hypothetical protein SELMODRAFT_236933 [Selaginella moellendorffii]
 gi|300138947|gb|EFJ05698.1| hypothetical protein SELMODRAFT_431378 [Selaginella moellendorffii]
          Length = 333

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/380 (28%), Positives = 176/380 (46%), Gaps = 69/380 (18%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           ++K ++AF    E   +KTV G  +TIV    I  L   +   Y   +   ++ VD++RG
Sbjct: 4   KMKNINAFAHADEHLTQKTVSGAILTIVGVSIILVLFAYEFKFYLSTNVVHQMSVDTTRG 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             LPIH++I  P++ C  L++DA+D SG+  + ++ NI+K RL  DG  +     E ++ 
Sbjct: 64  QNLPIHINITFPSLPCQILSVDAIDMSGKHEVDLDTNIWKLRLHKDGHILGS---EYLSD 120

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + +K+   +N T              + +  E R      NE+ +A +            
Sbjct: 121 LVEKEHAHDNLT------------GIFHSHEELRSAVKVVNEINKALQDG---------- 158

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSA 244
                            EGC+++G L+V RV+G+FHI+  G+S  I H         +  
Sbjct: 159 -----------------EGCRVFGVLDVERVAGNFHISMHGMSLQIFH---------SVK 192

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N +H I  LSFG K         PLD TV    + A  F Y+IKI+PT Y  L+G KL
Sbjct: 193 EVNVSHIINDLSFGPKYPG---IHNPLDRTVRILRDTAGTFKYFIKIVPTEYRYLNGGKL 249

Query: 305 GGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                            D   P ++F Y+LSP+ V I E+ +S GHL T+    + GT+ 
Sbjct: 250 PTNQFSVGEYYLAARDDDISWPAVYFLYDLSPITVLIKEERRSFGHLLTRFCAIVGGTFS 309

Query: 351 TFMLVDALLHSCVKKISKVE 370
              ++D  ++  V+ I++ +
Sbjct: 310 LTGMLDRWIYRLVESITRAK 329


>gi|448086324|ref|XP_004196073.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
 gi|359377495|emb|CCE85878.1| Piso0_005514 [Millerozyma farinosa CBS 7064]
          Length = 405

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 129/401 (32%), Positives = 191/401 (47%), Gaps = 46/401 (11%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  LDAF K  ED   KT  GG +T+VC L +  LI  +  +Y  V    EL VD    
Sbjct: 7   KLLSLDAFAKTVEDAKVKTASGGIITLVCVLVVLLLIRNEYSEYTSVVNRPELVVDRDVN 66

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            KL I++DI  P + CD + LD +D SG+    V +    K RL      I    +EV++
Sbjct: 67  RKLDINIDITFPYLPCDLVTLDILDVSGDTQADVLKSGFEKYRL------IPSSNEEVLD 120

Query: 125 --AVKKKKVTTENGTTTTELEDPNKCGSCYGA--ETETRKCCNTCNEVKEAYRYKKWALP 180
              V +  ++ E+       E    CGSCYGA  + +   CCN C  V+ AY  + WA  
Sbjct: 121 NAPVLRNDLSLEDIARNPNKEGGGYCGSCYGALPQGDNEFCCNDCETVRVAYAERMWAFY 180

Query: 181 ELDTIVQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           +   I QC+NE    +L       EGC+I G  ++NRVSG+ H APG + +    H+HD+
Sbjct: 181 DGANIEQCENEGYVTRLNQRIEQKEGCRIKGTAQINRVSGNMHFAPGYAKTSPGRHIHDL 240

Query: 239 QPYTS--AAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIP 293
             Y      F+  H I HLSFG+    +D   +   PLDG      + + + +YY+K++ 
Sbjct: 241 SLYEKHFDKFSFDHVINHLSFGLDPAKEDPNHQSTHPLDGYRLILNDKSRVISYYLKVVA 300

Query: 294 TIYERLDGSKL---------------GGGD----------GGMPGIFFSYELSPLMVKIT 328
           T +E L+GS +               GG D          GG+PG+FF +++SP+  KI 
Sbjct: 301 TRFEFLNGSSMETNQFSAIPHHRPYRGGKDEDHRHTMHAKGGIPGVFFHFDISPM--KII 358

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
            K +     W+  +  +  +    + V A+L   V    KV
Sbjct: 359 NKEQ-YAKTWSGFVLGVISSIAGVLTVGAVLDRSVWAAEKV 398


>gi|146095510|ref|XP_001467598.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398020411|ref|XP_003863369.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134071963|emb|CAM70660.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322501601|emb|CBZ36681.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 467

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 114/412 (27%), Positives = 193/412 (46%), Gaps = 54/412 (13%)

Query: 5   ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
            +LK LD F K    F +    +TV GG +++V  + I +L+  +V  +  V   +E+FV
Sbjct: 63  RQLKRLDVFPKFDRKFEQDARHRTVSGGVLSVVAIVVIIWLLVGEVRYFLSVEEHQEMFV 122

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+  G  + + +++    + CD + LDAVD  G     VE N  K+R+D     +    +
Sbjct: 123 DTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDAATGQVISAAR 182

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            +V+   +KKV T+      +  +   C SCYGAE     CC+TC +V++AY  + W L 
Sbjct: 183 AMVD---EKKVMTK--AIDADGAEKENCPSCYGAERNPGDCCHTCEDVRQAYARRGWKL- 236

Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           ++D  ++ QC  +        +  EGC +Y     +R +GS    PG  Y      +HD+
Sbjct: 237 DIDEISVEQCAEDRIKMAAAASGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 296

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
              T+   + +H +  L FG        ++ PLDGT    A  G +       F+Y++K+
Sbjct: 297 MGSTTRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 353

Query: 292 IPTIYER---LDG-------------------------SKLGGGDGGMPGIFFSYELSPL 323
           +PT Y+R   + G                         S+       +PG+F +Y+LSP+
Sbjct: 354 VPTTYQRYSLITGLQDAVESNQYSATHHFTPSEAAKAVSQTPKKQEIVPGVFMTYDLSPV 413

Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
            + + E+    SL H   + +C + G  +T + LVD++    V+KI K+  G
Sbjct: 414 RILVQERHPYPSLVHFVLQ-LCAVCGGVLTVVGLVDSMCFHSVRKIRKMCTG 464


>gi|291001965|ref|XP_002683549.1| predicted protein [Naegleria gruberi]
 gi|284097178|gb|EFC50805.1| predicted protein [Naegleria gruberi]
          Length = 391

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 114/404 (28%), Positives = 184/404 (45%), Gaps = 65/404 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  D ++K      +KT  GG V+I+  + I +L+   +  Y  ++  + L VD     
Sbjct: 9   IRSFDLYSKTDSIATKKTSLGGVVSILALIIIIFLVGSALIRYLSINRRDTLSVDIQVED 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ I  +I  P + C  L +D+VD+SG+  + V H+I+K  +D  G+         +  +
Sbjct: 69  RVVIFFNISFPDLKCYDLHVDSVDASGDAAIDVAHHIHKVPVDSSGR---------ITHL 119

Query: 127 KKKKVTTENGTTTTELE-DPNK-------CGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
           +  K  T+ GT   + + DP K       CG+CY  E    +CCNTC +V E Y+     
Sbjct: 120 ESPKHKTKLGTEMPQDKYDPTKDPHSIMYCGTCY-VEQRRGECCNTCQDVMEVYKRNGLP 178

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV----H 234
            P ++ + QC  + S          GC IYG L+V +V+G+FH  PG S+S  +     H
Sbjct: 179 APRVEDVEQCLFDASKNH------PGCNIYGTLDVQKVNGNFHFLPGRSFSQEYETRVHH 232

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLD---GTVAKAEEG------ASMF 285
           +H+  P     +N+TH I  LSFG+++        PLD   G + K EE        ++F
Sbjct: 233 IHEFNPILVDRYNSTHIIHSLSFGLRIP---HVTYPLDETVGIIPKIEESDAQAPKTALF 289

Query: 286 NYYIKIIPTIY---------------------ERLDGSKLGGGDGGMPGIFFSYELSPLM 324
            Y+IK +PT Y                        D SK+      +PG+FF Y   P+ 
Sbjct: 290 KYFIKAVPTTYIGSSYFSSTINTYQFSFTKHVMPFDSSKM----MMLPGVFFVYNFEPIR 345

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +   E      H    +M   +G ++    +DALL   V K+ K
Sbjct: 346 ITYEENGMPFTHFIVDLMAVCAGIFVVLNYIDALLEGVVHKLRK 389


>gi|449705731|gb|EMD45722.1| endoplasmic reticulumgolgi intermediate compartment protein,
           putative [Entamoeba histolytica KU27]
          Length = 272

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 93/286 (32%), Positives = 146/286 (51%), Gaps = 32/286 (11%)

Query: 99  VEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETET 158
           +E N+ K R+  DG  + E + + + +             + E  DP +C SCYGAET  
Sbjct: 4   IEQNVTKIRIHHDGSLVTENEMKAIQS-----------KLSIETPDPKECRSCYGAETPE 52

Query: 159 RKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG 218
           +KCC TC++VKEAY+ + W L +L+ + QC+N    +  K T  EGC++ G   +N++ G
Sbjct: 53  KKCCFTCDDVKEAYKKRGWRL-DLNIVSQCQNHEKIQMAKLTKDEGCRLIGDFLLNKIGG 111

Query: 219 SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
           +FHIAPG S  +   H H+++       + +H    LSFG       E  K    T  K 
Sbjct: 112 NFHIAPGSSEQLWGRHSHNLEWTGKTQIDLSHKWNELSFG-------ENSKKFT-TEKKD 163

Query: 279 EEGASMFNYYIKIIP----------TIYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKI 327
            +  SMF YY+ IIP          T Y+      +  G+G G PG+F  Y++SP+++++
Sbjct: 164 TQMNSMFQYYLTIIPIKNNFINGTSTFYDYSIQENIRSGEGEGQPGVFIYYDVSPMVLEV 223

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
           TE +    H    I   + G + TF L DA++   +  +  KVE+G
Sbjct: 224 TESNHGFLHFLIGICSIVGGIFTTFQLFDAIVFESIHTLKKKVELG 269


>gi|385302035|gb|EIF46185.1| erv46p [Dekkera bruxellensis AWRI1499]
          Length = 266

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 93/262 (35%), Positives = 133/262 (50%), Gaps = 20/262 (7%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF K  ++   KT  GG +T++C   I  L+  +  DY  +    EL VD      L 
Sbjct: 10  FDAFAKTLDEAKVKTTSGGILTLICSFTIFILLINEYRDYRTLIMRPELVVDRDHDKTLG 69

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNAVKK 128
           ++LDI  P + CD L++D +D +G+    + E N  + RLD DGK I   +   VN  K+
Sbjct: 70  LNLDITFPNMPCDLLSMDIMDLTGDVQADILEGNFLRTRLDRDGKEIATDEPFKVN--KE 127

Query: 129 KKVTTENGTTTTELEDPNKCGSCYGA--------ETETRK--CCNTCNEVKEAYRYKKWA 178
             V +E  T     ED   CGSCYGA        E++  K  CCN+C  VK AY    W 
Sbjct: 128 DXVKSELST-----EDSQYCGSCYGAIDQSGNEKESDPTKWVCCNSCEAVKLAYSKAAWK 182

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
             + + I QC+ E   +++     EGC++ G  ++NR+ G+ H APG S ++N  HVHD+
Sbjct: 183 FYDGEGIEQCEKEGYVDRINKRLDEGCRVKGTAQLNRIGGNLHFAPGSSITMNDRHVHDL 242

Query: 239 QPYT--SAAFNTTHHIRHLSFG 258
             +      FN  H I H SFG
Sbjct: 243 SLFDKHQDKFNFDHVINHFSFG 264


>gi|389602486|ref|XP_001567299.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|322505471|emb|CAM42729.2| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 541

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 114/413 (27%), Positives = 187/413 (45%), Gaps = 52/413 (12%)

Query: 5   ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
            +LK LD F K    F +    +TV GG  ++V  + I +L+  +V  +  +    E+FV
Sbjct: 137 RQLKRLDVFPKFDRKFEQDARHRTVSGGIFSVVAIVVILWLLVGEVRYFLSIEEHHEMFV 196

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+  G  + + +++    + CD + LDAVD  G     VE N  K+R+D     +    +
Sbjct: 197 DTEVGGDMRVTVNVTFNHVPCDLITLDAVDVFGVFANDVEDNTVKQRIDAATGQVISAAR 256

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            VV+  +KK +T        E E+   C SCYGAE     CC+TC +V++AY  K W L 
Sbjct: 257 AVVD--EKKVITKAIDADGVEKEN---CPSCYGAERSPGDCCHTCEDVRQAYAQKGWRLN 311

Query: 181 ELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
             D ++ QC  +           EGC +Y     +R +GS    PG  Y +    +HD+ 
Sbjct: 312 VDDISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRMYQMLGRRMHDLM 371

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKII 292
              +   + +H +  L FG +      ++ PLDGT    A  G +       F+Y++K+I
Sbjct: 372 GSAARKLDLSHTVHTLEFGERFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKVI 428

Query: 293 PTIYERLD------------------------GSKLGGGDGGM----PGIFFSYELSPLM 324
           PT Y+R                           +K       M    PG+F +Y+LSP+ 
Sbjct: 429 PTTYQRYSLITGLQDTVESNQYTATHHFTPSAATKAASQTPTMQEIVPGVFMTYDLSPVR 488

Query: 325 VKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIGGK 374
           +   E+    S+ H   + +C + G  +T + LVD++    V+K+ K+  G +
Sbjct: 489 ILAQERHPYPSVIHFVLQ-LCAVCGGVLTVVGLVDSMCFHSVRKVRKMCTGKQ 540


>gi|342183032|emb|CCC92512.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 401

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 107/408 (26%), Positives = 181/408 (44%), Gaps = 60/408 (14%)

Query: 5   ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           +R   LD F K      +D  ++T  GG ++I   + I+ LI  +V  +       E++V
Sbjct: 2   KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMYV 61

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
           D   G  + + ++I  P + CD +  DA+D+ GE    +  +  K R+D D   P+ E  
Sbjct: 62  DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           + +VN  KK               D + C SCYGAE     CC+TC++V+ A+  ++W  
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168

Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            E D +I+QC  E           EGC ++    V RV+G+ H  PG  ++    H+H  
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSF 228

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV------AKAEEGASMFNYYIKII 292
           +  T    N +H I  L FG +      ++ PLDG V        +E+    F Y++K++
Sbjct: 229 KGETIQRLNLSHIIHTLEFGERFPG---QKNPLDGMVNTRGVENPSEDLIGRFAYFVKVV 285

Query: 293 PTIYE---------------------------RLDGSKLGGGDGG---MPGIFFSYELSP 322
           PT+Y+                             D +     D     +PG+F SY++SP
Sbjct: 286 PTLYQVRTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISP 345

Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           + V +  T    S+ HL  ++     G Y    L+D++    ++++ +
Sbjct: 346 IRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQE 393


>gi|342183042|emb|CCC92522.1| unnamed protein product [Trypanosoma congolense IL3000]
 gi|343474271|emb|CCD14057.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 401

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 107/408 (26%), Positives = 181/408 (44%), Gaps = 60/408 (14%)

Query: 5   ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           +R   LD F K      +D  ++T  GG ++I   + I+ LI  +V  +       E++V
Sbjct: 2   KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVTIALLIIGEVRYFLTTVEQHEMYV 61

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
           D   G  + + ++I  P + CD +  DA+D+ GE    +  +  K R+D D   P+ E  
Sbjct: 62  DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           + +VN  KK               D + C SCYGAE     CC+TC++V+ A+  ++W  
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168

Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            E D +I+QC  E           EGC ++    V RV+G+ H  PG  ++    H+H  
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFSVPRVTGNIHFVPGRMFNFFGQHLHSF 228

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV------AKAEEGASMFNYYIKII 292
           +  T    N +H I  L FG +      ++ PLDG V        +E+    F Y++K++
Sbjct: 229 KGETIQRLNLSHIIHTLEFGERFPG---QKNPLDGMVNTRGVENPSEDLIGRFAYFVKVV 285

Query: 293 PTIYE---------------------------RLDGSKLGGGDGG---MPGIFFSYELSP 322
           PT+Y+                             D +     D     +PG+F SY++SP
Sbjct: 286 PTLYQVKTLMSSGRVVESNQYSVTHHFTASWDAADQNNQTNRDANPRVVPGVFVSYDISP 345

Query: 323 LMVKI--TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           + V +  T    S+ HL  ++     G Y    L+D++    ++++ +
Sbjct: 346 IRVSVKRTHPYPSVVHLVLQLCAVGGGVYTVVGLIDSMFFHSIRRVQE 393


>gi|384501765|gb|EIE92256.1| hypothetical protein RO3G_17063 [Rhizopus delemar RA 99-880]
          Length = 291

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 89/266 (33%), Positives = 132/266 (49%), Gaps = 26/266 (9%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  D + K  ++F  KT  G +V       +S L+  +   +        L VD SR  
Sbjct: 11  LRQFDGYAKTLDEFRIKTTSGASV-------LSELMTYNTSVW-----KPSLVVDKSRKE 58

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+PI  +I  P + C  L++D +D SGEQ      ++ K RLD  G         ++ + 
Sbjct: 59  KMPIDFNITFPNMPCHMLSIDIMDESGEQSSGYSQDVTKIRLDTLGN--------IIESG 110

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAET-ETRKCCNTCNEVKEAYRYKKWALPELDTI 185
              K+          LE+  +CGSCYGA+      CC++C +V+EAY  + W L     I
Sbjct: 111 HTVKLGDHTNDAKKALEEAPECGSCYGAKPLREDGCCHSCQDVREAYVKQGWGLVNTKEI 170

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC  E    KL+N   EGC ++G+L VN+V G+FH APG ++    +HVHD+Q YT  A
Sbjct: 171 EQCIREGWLAKLENQSNEGCNVHGHLLVNKVRGNFHFAPGGAFQAGSMHVHDLQEYTQGA 230

Query: 246 -----FNTTHHIRHLSFGIKLQDDDE 266
                F+ +H I  L FG   +D +E
Sbjct: 231 PNGHSFDMSHRIHKLKFGPDTKDQNE 256


>gi|444314203|ref|XP_004177759.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
 gi|387510798|emb|CCH58240.1| hypothetical protein TBLA_0A04460 [Tetrapisispora blattae CBS 6284]
          Length = 406

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/418 (27%), Positives = 170/418 (40%), Gaps = 67/418 (16%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S  L  LDAF++  ED   +T  G  +T+ C      L+  +   +  + T  EL +D  
Sbjct: 3   SSTLLSLDAFSRTEEDVRVRTKTGALITLGCMGITFLLLLNEWLRFGIIETRPELVIDRE 62

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEV 122
           R  KL + LD+  P + CD + LD +D +GE  L +      K RLD  G          
Sbjct: 63  RHLKLDLDLDVTFPNMPCDLINLDLMDDAGEIQLDILSSGFTKTRLDSRG---------- 112

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCNEVKEAY 172
            N +           +    +D   CG CYGA  ++            CC TC +V++AY
Sbjct: 113 -NELGTFDFDLSKDISEYPPDDDKYCGPCYGALDQSNNKDDMPMDEKVCCQTCADVRQAY 171

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
               WA  +   I QC+ E   +++ +   EGC+I G   +NR+ G+ H APGL++    
Sbjct: 172 LNAGWAFFDGKDIEQCEREGYVQRINDHLNEGCRIQGNARLNRIHGNVHFAPGLAFQNRR 231

Query: 233 VHVHDIQPY---TSAAFNTTHHIRHLSF------GIKLQDDDERRKPLDG--TVAKAEEG 281
            H HD   Y   T   FN  H I HLSF      GI  +       PLDG   +   +  
Sbjct: 232 GHYHDTSLYDKKTELTFN--HIINHLSFGKHVKPGIGSKFSAASVSPLDGHQMILNDDPH 289

Query: 282 ASMFNYYIKIIPTIYERLDGSKLGGGD-------------------------GGMPGIFF 316
              F Y+ KI+PT YE LD   +                              G PG++ 
Sbjct: 290 NVQFIYFAKIVPTRYEYLDKDVIETAQFSTTTHSKALNNLADDKTTPKPSRRSGTPGLYI 349

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLHSCVKKISKVE 370
           +YE+SPL V   E+       W   + N    I G      ++D + +   + I   +
Sbjct: 350 NYEMSPLKVINREQHV---QTWVSFILNCLTSIGGVLAVGTVIDKIFYRAQRTIQSTK 404


>gi|366996541|ref|XP_003678033.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
 gi|342303904|emb|CCC71687.1| hypothetical protein NCAS_0I00190 [Naumovozyma castellii CBS 4309]
          Length = 409

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 120/408 (29%), Positives = 185/408 (45%), Gaps = 57/408 (13%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  +DAF+K  ED   +T  G  +TI C +    L+  +   Y  + +   L +D  R  
Sbjct: 8   LLSIDAFSKTQEDVRIRTKSGAIITICCIVITLILLLNEYIQYTHIVSRPTLVIDRERNL 67

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV--EHNIYKRRLDLDGKPIQEPQKEVVN 124
           KL ++LDI  P+I CD L LD +D SGE  L +  E +  K R+D +G           N
Sbjct: 68  KLELNLDITFPSIPCDLLNLDILDDSGELQLDLLQEGSFTKTRVDSNG-----------N 116

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
           A+   K   ++       +D N CGSCYGA  ++           CC  C +V+ AY   
Sbjct: 117 ALDSMKFKLDDEVGEYPPQDDNYCGSCYGALDQSNNDNLPKDEKVCCQDCEQVRNAYLTA 176

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            WA  +   I QC+ E    ++ +   EGC++ G + +NR+ G+ H APG ++     H 
Sbjct: 177 GWAFFDGKKIEQCEREGYVARINSHLNEGCRVKGDVLLNRIHGNIHFAPGRAFQNTKGHF 236

Query: 236 HDIQPY-TSAAFNTTHHIRHLSFGIKLQDDDERR------KPLDGTVAKAEEGASM--FN 286
           HD   Y  + + N  H I HLSFG  ++   E R       PLDG        + +  ++
Sbjct: 237 HDTSLYEQTLSLNFNHIINHLSFGKSVEQLAEVRGASVSTSPLDGQQVSPSFDSHLYRYS 296

Query: 287 YYIKIIPTIYERLDG--------------SKLGGG-----------DGGMPGIFFSYELS 321
           Y+ KI+PT YE LDG              S + G              G+PG+F  +E+S
Sbjct: 297 YFTKIVPTRYEWLDGVVAETAQFSATFHESPVNGAMDPEHPHIRHSRTGLPGVFIYFEMS 356

Query: 322 PLMVKITEKS-KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           PL V   E+  KS   ++   + ++ G      ++D + +   + I K
Sbjct: 357 PLKVINQEQHFKSWSGVFLHGITSMGGILAVGTVLDKIFYRAQRTIQK 404


>gi|401426616|ref|XP_003877792.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494038|emb|CBZ29334.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 406

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/412 (27%), Positives = 189/412 (45%), Gaps = 54/412 (13%)

Query: 5   ERLKGLDAFTKPYEDFHE----KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
            +LK LD F K    F +    +TV GG  ++V  + I +L+  +V  +  V   +E+FV
Sbjct: 2   RQLKHLDVFPKFDRKFEQDARHRTVSGGVFSVVAVVVIIWLLVGEVRYFLSVEEHQEMFV 61

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D+  G  + + +++    + CD + LDAVD  G     VE N  K+R+D     +    +
Sbjct: 62  DTKVGGDMQVTVNVTFNHVPCDLITLDAVDIFGVFANDVEGNTVKQRIDTATGQVISAAR 121

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            +V+  +KK VT        E E+   C SCYGAE     CC+TC +V++AY  + W L 
Sbjct: 122 AIVD--EKKVVTKAIDADGAEKEN---CPSCYGAERHPGDCCHTCEDVRQAYVRRGWKL- 175

Query: 181 ELD--TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           ++D  ++ QC  +           EGC +Y     +R +GS    PG  Y      +HD+
Sbjct: 176 DIDEISVEQCAEDRIKMATAAFGKEGCNLYATFAASRATGSLQFIPGRIYETLGRRMHDL 235

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS------MFNYYIKI 291
               +   + +H +  L FG        ++ PLDGT    A  G +       F+Y++K+
Sbjct: 236 MGSATRKLDLSHTVHTLEFGDPFPG---QQNPLDGTAQGSALSGDAKDAMNGRFSYFVKL 292

Query: 292 IPTIYER---------------------LDGSKLGGGDGG-------MPGIFFSYELSPL 323
           +PT Y+R                        S+    +         +PG+F +Y+LSP+
Sbjct: 293 VPTTYQRYSLITGLQDTVESNQYSATHHFTPSEAAKAESQAPKKQEIVPGVFMTYDLSPV 352

Query: 324 MVKITEKS--KSLGHLWTKIMCNISGTYITFM-LVDALLHSCVKKISKVEIG 372
            + + E+    SL H   ++ C + G  +T + LVD+L    V+KI K+  G
Sbjct: 353 RILVQERHPYPSLAHFVLQV-CAVCGGVLTVVGLVDSLCFHSVRKIRKMCTG 403


>gi|328875761|gb|EGG24125.1| DUF1692 family protein [Dictyostelium fasciculatum]
          Length = 1172

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/394 (28%), Positives = 177/394 (44%), Gaps = 61/394 (15%)

Query: 5    ERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
            E+LK  D + K  E  H+ K++YGG  T++C +   +L+  ++  Y        L VD S
Sbjct: 808  EKLKLFDFYPKLDESVHQTKSIYGGIATVICIIVTVFLLTSELYYYTFPIRDHSLRVDVS 867

Query: 64   RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
            RG+++ I+ D+  P++ C  + +++VD                   +DGKPI++   ++V
Sbjct: 868  RGNRMNINFDVHFPSLICSDIIVESVDG------------------VDGKPIKDAAHQIV 909

Query: 124  NAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-------TETRKCCNTCNEVKEAYRYKK 176
                 K+     G+    L       SC   E        E RKCCN+C +++  YR  K
Sbjct: 910  -----KERLNRRGSPLERLHARAGLFSCTKCELPPKYQLLEKRKCCNSCEDLRTFYRTNK 964

Query: 177  WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV--- 233
                  D   QC     T     T  EGC+++G L V ++ G  HI  G  +  +H    
Sbjct: 965  VPQHLADESPQC-----TIGKPVTEDEGCRVFGILSVQKMKGDIHIIAGRPHEESHDGHS 1019

Query: 234  -HVHDIQPYTSA---AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
             HVH + P  +     FN +HHI   SFG   QD +    PL+G       G  +  YY+
Sbjct: 1020 HHVHKLTPEIAQRIHKFNISHHIHKFSFG---QDVEGLINPLEGFGIVVPMGLGLQTYYL 1076

Query: 290  KIIPTIYER-------------LDGSKLGGGDGG--MPGIFFSYELSPLMVKITEKSKSL 334
            +++PTIY++              +   +   + G   PGI+F Y+LSPLM+++ + SK  
Sbjct: 1077 QVVPTIYKQNNYILETNQYSYTREYKSINYNNLGYLFPGIYFKYDLSPLMIEVDQSSKPF 1136

Query: 335  GHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
              L T I     G Y+ F L   +    V KI K
Sbjct: 1137 SELITSICAIGGGMYVAFGLFYHVTARIVGKIKK 1170


>gi|367004394|ref|XP_003686930.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
 gi|357525232|emb|CCE64496.1| hypothetical protein TPHA_0H02930 [Tetrapisispora phaffii CBS 4417]
          Length = 439

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 125/439 (28%), Positives = 198/439 (45%), Gaps = 84/439 (19%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQ---VSTTEELFVD 61
           + L   D FTK  ED   +T  GG +T++C + +++L+ +   ++FQ   V +  EL +D
Sbjct: 8   DNLLAYDVFTKVEEDIRIRTRTGGLITLIC-IGVTFLLLI--SEWFQFKKVISKPELVID 64

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-----HNIYKRRLDLDGKPIQ 116
               SKL +++D+  P I CD L LD +D SG   L ++      N  K RL+  G    
Sbjct: 65  RDYQSKLELNIDVTFPYIPCDLLNLDILDDSGNVQLDIDLEEASSNFVKTRLNNRG---- 120

Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK----------CCNTCN 166
               EV+   KK K+T + G    E +  N CGSCYG++ +T+           CCN+C 
Sbjct: 121 ----EVIGKAKKFKITDDLGEYAPE-DKENYCGSCYGSKDQTKNEDIEKITDKVCCNSCE 175

Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGL 226
           +V++AY    WA  +   I QC+ E   + +    +EGC++ G   +N++ G+ H APG 
Sbjct: 176 DVRQAYSEAGWAFFDGKNIEQCEREGYVKTINERLSEGCRVKGEALLNKIHGNLHFAPGK 235

Query: 227 SYSINHVHVHDIQPYTS-AAFNTTHHIRHLSFG--------IKLQD---DDERRK--PLD 272
           ++     H HD   +      N  H I HLSFG           QD   D  R +  P+D
Sbjct: 236 AFQNRRGHFHDTSLFNQHKNLNFQHVINHLSFGKPIRQLVTSNFQDTMSDSLRAQTAPID 295

Query: 273 GTVAKAEEGAS--------------MFNYYIKIIPTIYERLDGS--------------KL 304
           G  A  ++                  F YY +II T +E L G               K+
Sbjct: 296 GHQAFIQDNTGDSDSASTTIAAHDYQFIYYAEIISTRFEYLKGDLEETSQLTVTSHYKKI 355

Query: 305 GGGDG-----------GMPGIFFSYELSPLMVKITEK-SKSLGHLWTKIMCNISGTYITF 352
           G  +G           G+PG++  +E+SPL V   E+ S S      K + +I G     
Sbjct: 356 GYQNGQDYMQGMQSRSGIPGLYIDFEVSPLKVINKEQYSTSWSGYLLKTITSIGGILAVG 415

Query: 353 MLVDALLHSCVKKISKVEI 371
            ++D ++++    + +  I
Sbjct: 416 TVIDKVVYATQTALKQASI 434


>gi|430811512|emb|CCJ31046.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 264

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 99/277 (35%), Positives = 139/277 (50%), Gaps = 23/277 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +  DAF+K  ED   KT  GG +TI+  + I  L+  +  DY +V    EL +D +R  
Sbjct: 7   FRRFDAFSKTIEDAQIKTTNGGLITIISIIIIFILVSFEWHDYRRVVVLPELTIDRTRSE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I+L++  P I C  L+LD +D SGE    V HN+ K RLD +G  I       +N  
Sbjct: 67  KLQINLNLTFPKIPCSILSLDIMDVSGELQTDVSHNVVKNRLDKNGIFINSTSINTLNFQ 126

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +  KV              + CGSCYGA+     CCNTC +V  AY    W +P   T  
Sbjct: 127 QPIKVLPS-----------DYCGSCYGAK---EGCCNTCEDVINAYIANNWPIPNKRTFE 172

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHVHVHDIQPYTSAA 245
           QCK+  + +       EGC   G +EVN+V G+FH APG  S +I   HVHDI  Y + +
Sbjct: 173 QCKDSNNMDGPD----EGCNFVGRIEVNKVIGNFHFAPGHSSQTITGGHVHDIYDYLTDS 228

Query: 246 F--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
              + +H I  LSFG +++     + PLD      ++
Sbjct: 229 LPHDFSHMINKLSFGPEIE--GSLQNPLDNVKKDTDD 263


>gi|410078101|ref|XP_003956632.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
 gi|372463216|emb|CCF57497.1| hypothetical protein KAFR_0C05060 [Kazachstania africana CBS 2517]
          Length = 414

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 122/413 (29%), Positives = 184/413 (44%), Gaps = 71/413 (17%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQ--VSTTEELFVD 61
           S  L  +DAF++  +D   +T  G  +TI C + ++ ++ ++    FQ  +ST   L VD
Sbjct: 3   SSTLLSIDAFSRAQDDIRIRTKSGAIITISC-IAVTVILLINQWLQFQYSISTITNLVVD 61

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE---HNIYKRRLDLD-GKPIQE 117
             R  KL +  DI    + C+ + +D +D +      ++    +  K R+D   GKPI  
Sbjct: 62  RERNLKLNLDFDITFTNLPCNLINIDILDDASFLQSIIDPDSSSFTKIRIDRSSGKPISS 121

Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAET-----------ETRKCCNTCN 166
            +    N  +K        T     +D N CG CYGA+            E R CC TC+
Sbjct: 122 SE---FNLNEK--------TYEYPPDDENYCGPCYGAKDQSINDKEGIKKEDRVCCQTCS 170

Query: 167 EVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY-LEVNRVSGSFHIAPG 225
           +VK +Y    WA  +   I QC+ E   EK+ +   EGCQI G  + +NRV+G+ H APG
Sbjct: 171 DVKNSYLDAGWAFFDGKNIEQCEREGYIEKINSQLNEGCQIKGSNVLINRVNGNLHFAPG 230

Query: 226 LSYSINHVHVHDIQPYT-SAAFNTTHHIRHLSFGIKLQDDDE------RRKPLDGT--VA 276
            +Y   + H HD   Y      N  H I H SFG    D D          PLDGT  + 
Sbjct: 231 EAYHNPNGHYHDTSFYDLKPQLNFNHIINHFSFGNGAVDRDATHDTTLMNSPLDGTQVLP 290

Query: 277 KAEEGASMFNYYIKIIPTIYERLDGSKL---------------GGGD----------GGM 311
           + +  A  F Y+ KI+ T YE L+   L               GG D          GG+
Sbjct: 291 EYDSHAYAFTYFNKIVSTRYEYLERDPLETVQFTSMFHDRQINGGNDIHDEKIKHARGGI 350

Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN----ISGTYITFMLVDALLH 360
           PG+F  +++SP+  KI  K +   + W+  + N    I G      ++D + +
Sbjct: 351 PGLFIYFDISPM--KIINKEQHTVN-WSTFVLNCITSIGGILAVGTVIDKIFY 400


>gi|413949705|gb|AFW82354.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein,
           partial [Zea mays]
          Length = 202

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 79/186 (42%), Positives = 111/186 (59%), Gaps = 3/186 (1%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F  RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  ST  +L VD
Sbjct: 3   AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI  P+I C  L++D  D SGEQH  + H+I KRRL+  G  I E +KE
Sbjct: 63  TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C EV+EAY+ K WAL  
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEEVREAYKKKGWALTN 179

Query: 182 LDTIVQ 187
            D I Q
Sbjct: 180 PDLIDQ 185


>gi|123389547|ref|XP_001299739.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121880652|gb|EAX86809.1| hypothetical protein TVAG_100310 [Trichomonas vaginalis G3]
          Length = 351

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 167/367 (45%), Gaps = 53/367 (14%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELF-VDSSRGS- 66
           LD F K  +         GA   +  +  +  +C+ ++  Y + +  E+L  V   RG+ 
Sbjct: 3   LDFFPKFIDSAMTHKTACGAFNSILMIACALALCISEIYAYAKPALHEQLVSVSDLRGAL 62

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +L I  +  V ++ C  L LD  D  G  +   +  +YK R+D +G PI  PQ ++   
Sbjct: 63  DQLSISFNFTV-SVPCVLLHLDVFDMMGSGNRPDQKTLYKVRVDQNGNPI--PQTQIA-- 117

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                            ED   CG CYGAE+  RKCC TC +V  AY+ K W +  L + 
Sbjct: 118 -----------------ED---CGPCYGAESSQRKCCQTCEDVVAAYQEKGWGIGNLSSW 157

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            QC+ E      K    E CQ YG L VN + G FH+APG++      HVHD  P     
Sbjct: 158 AQCRAEGVMFDGK----ERCQAYGNLHVNAIEGGFHLAPGINVFSRFGHVHDFSPLVD-T 212

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGT-VAKAEEGASMFNYYIKIIPTIYE------- 297
            N TH I H+SFG  +      + PLD T V + + G   + Y +K +PT+ E       
Sbjct: 213 LNLTHEIEHISFGAPID-----KSPLDNTRVVQKKPGQIHYRYNLKAVPTVKEVNGKVHR 267

Query: 298 ----RLDGSKLGGGDGGM--PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                ++ +++     G   PGIFF Y  +P+ +  T    ++  L  +++    G+++ 
Sbjct: 268 FFRFTVNYAEIPVTARGRYGPGIFFVYSFAPVAITSTYDRPNITVLLARLISIFGGSFML 327

Query: 352 FMLVDAL 358
             L+D+ 
Sbjct: 328 ARLIDSF 334


>gi|397564627|gb|EJK44287.1| hypothetical protein THAOC_37187 [Thalassiosira oceanica]
          Length = 506

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 152/315 (48%), Gaps = 26/315 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVS--TTEELFVDSSR 64
           ++ LD F K   D   +T  GG +T   ++ +  LI  +   +  ++  + E + VD+S 
Sbjct: 58  VRKLDFFNKIEVDHIVRTERGGQLTAAGYVIMLILILAEYLTWSGMNGESIEHVVVDTSL 117

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G ++ ++L+I  P++ C+ L L+ +D +G+  L V   ++K+RLDLDG P   P  ++  
Sbjct: 118 GKRMKVNLNITFPSLHCEDLHLNIIDVAGDSQLEVSDKMFKQRLDLDGTP--RPLAKISA 175

Query: 125 AVKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
               K +  +      E    P+ CG CYGA+   + CCNTC++V E Y+ K+W    + 
Sbjct: 176 EANAKALEDKKRREVVEKSVGPDYCGPCYGAQENAQDCCNTCDDVIERYKKKRWNDNAVQ 235

Query: 184 TIV-QCKNEYS---TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            +  QC  E     +E  +    EGC + G+  VNRV+G+FHIA G     +  H+H   
Sbjct: 236 PLAEQCIREGRAGVSEPKRMAGGEGCNLSGHFTVNRVAGNFHIAMGEGVERDGRHIHQFL 295

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDE--------------RRKPLDGTV---AKAEEGA 282
           P     F   H I  LSF      D E                + ++G+V    +     
Sbjct: 296 PEDRVNFIANHVIHELSFLDDEYGDIEGEGFLNLMSKAGVNGERSMNGSVKTVTEETGTT 355

Query: 283 SMFNYYIKIIPTIYE 297
            +F Y+IK++PT Y+
Sbjct: 356 GLFQYFIKVVPTKYK 370



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 19/55 (34%), Positives = 31/55 (56%)

Query: 311 MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
           +PG+FF YE+ P MV+++       HLW +IM  + G +     +D  LH+  K+
Sbjct: 448 LPGVFFVYEIYPFMVEVSRNRVPFMHLWIRIMATVGGVFTMMSWIDGALHARDKR 502


>gi|297830940|ref|XP_002883352.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329192|gb|EFH59611.1| hypothetical protein ARALYDRAFT_479742 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 354

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 66/391 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + L+ +DAF +  +   +KT  G  V+IV  L ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I    +
Sbjct: 61  DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118

Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
            + + V+K         K    E     TE E  N  G    AET  +K           
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
               K AL +                     EGC++YG L+V RV+G+FHI+    + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
            ++V  +    S   N +H I  LSFG K         PLD T     + +  F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257

Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           +PT Y  L    L                 D   P ++F Y+LSP+ V I E+ +S  HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYYTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            T++   + GT+    ++D  +   ++  +K
Sbjct: 318 ITRLCAVLGGTFALTGMLDRWMFRLIESFNK 348


>gi|30686584|ref|NP_188868.2| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
 gi|13877821|gb|AAK43988.1|AF370173_1 unknown protein [Arabidopsis thaliana]
 gi|51969000|dbj|BAD43192.1| unknown protein [Arabidopsis thaliana]
 gi|51970108|dbj|BAD43746.1| unknown protein [Arabidopsis thaliana]
 gi|51970556|dbj|BAD43970.1| unknown protein [Arabidopsis thaliana]
 gi|51970734|dbj|BAD44059.1| unknown protein [Arabidopsis thaliana]
 gi|62319967|dbj|BAD94071.1| hypothetical protein [Arabidopsis thaliana]
 gi|332643097|gb|AEE76618.1| Endoplasmic reticulum vesicle transporter protein [Arabidopsis
           thaliana]
          Length = 354

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 112/391 (28%), Positives = 176/391 (45%), Gaps = 66/391 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + L+ +DAF +  +   +KT  G  V+IV  L ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I    +
Sbjct: 61  DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118

Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
            + + V+K         K    E     TE E  N  G    AET  +K           
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
               K AL +                     EGC++YG L+V RV+G+FHI+    + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
            ++V  +    S   N +H I  LSFG K         PLD T     + +  F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257

Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           +PT Y  L    L                 D   P ++F Y+LSP+ V I E+ +S  HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYFTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            T++   + GT+    ++D  +   ++  +K
Sbjct: 318 ITRLCAVLGGTFALTGMLDRWMFRFIESFNK 348


>gi|168004249|ref|XP_001754824.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693928|gb|EDQ80278.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 347

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 104/378 (27%), Positives = 170/378 (44%), Gaps = 72/378 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF +  E   +KT  G AV+ +    +  L   ++  Y +  T  E+ VD  RG 
Sbjct: 9   IKNLDAFPRAEEHLLQKTSSGAAVSAIGLFIMGVLFFHELRFYLETVTVHEMSVDVKRGE 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KLPIH+++  P + C+ L+LDA+D SG+  + ++ NI+K R+  DG  +     E VN +
Sbjct: 69  KLPIHINMTFPALPCEVLSLDAIDMSGKHEVDLDTNIWKLRIHRDGYVLG---SEFVNDL 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            + +   E      + +D +K G     +   +      NEVK+A          +D   
Sbjct: 126 VEGEHRKEE--PKADKKDEHKDG-----DHRKKDPQKVINEVKKA----------IDD-- 166

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                           EGCQI+G L+V RV+G+FHI+           +H +  Y ++  
Sbjct: 167 ---------------GEGCQIFGVLDVERVAGNFHIS-----------MHGLSLYVASKI 200

Query: 247 -------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
                  N +H I  LSFG           PLDG+     + +  F Y++KI+PT Y  L
Sbjct: 201 FEAGYEVNVSHVIHDLSFGPTYPG---HHNPLDGSERILHDTSGTFKYFLKIVPTEYHYL 257

Query: 300 DG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
            G               +    D   P ++F Y+LSP++V I E  ++ GH  T++   +
Sbjct: 258 HGEVMPTNQFSVTEYYQRTKPSDRSYPAVYFVYDLSPIVVTIREHRRNFGHFITRLCAVL 317

Query: 346 SGTYITFMLVDALLHSCV 363
            GT+    ++D  +   +
Sbjct: 318 GGTFAVTGMLDRWMSRII 335


>gi|225446891|ref|XP_002284045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3 [Vitis vinifera]
 gi|296086333|emb|CBI31774.3| unnamed protein product [Vitis vinifera]
          Length = 351

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 179/384 (46%), Gaps = 57/384 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K L AF +  E   +KT  G  V+I+  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQFIKSLHAFPRAEEHLLQKTQSGAVVSIIGLVIMATLFLHELRYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+ DG  I    +
Sbjct: 61  DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNRDGFIIG--TE 118

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            + + V+K+    ++        D ++    +  + +     N   +VK+A         
Sbjct: 119 YLSDLVEKEHADHKHDHNKDHHGDSDQKLHAHSFDQDAE---NMVKKVKQA--------- 166

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
                           L N   EGC++YG L+V RV+G+FHI      S++ +++   Q 
Sbjct: 167 ----------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIFVAQM 202

Query: 241 YTSAAF--NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
               A   N +H I  LSFG K         PLDGTV      +  F YYIKI+PT Y  
Sbjct: 203 IFDGAIHVNVSHIIHDLSFGPKYPG---LHNPLDGTVRILRGASGTFKYYIKIVPTEYRY 259

Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T++   
Sbjct: 260 ISKEVLPTNQFSVMEYFSPMNEFDRTWPAVYFLYDLSPVTVTIKEERRSFLHFITRLCAV 319

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           + GT+    ++D  ++  ++ ++K
Sbjct: 320 LGGTFALTGMLDRWMYRFLEMLTK 343


>gi|255637400|gb|ACU19028.1| unknown [Glycine max]
          Length = 347

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 179/384 (46%), Gaps = 58/384 (15%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  +   +KT  G  V+++  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I     
Sbjct: 61  DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG---T 117

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           E V+ + +K+ T           D NK                        +  +K  L 
Sbjct: 118 EYVSDLVEKEHTHHK-------HDDNK---------------------NHEHSEQKIHLQ 149

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            LD   +   +   E LKN   EGC++YG L+V RV+G+FHI      S++ ++++  Q 
Sbjct: 150 NLDESTENIIKKVKEALKNG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIYVAQM 201

Query: 241 YTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
               A   N +H I  LSFG K         PLD T     + +  F YYIK++PT Y  
Sbjct: 202 IFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVPTEYRY 258

Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T++   
Sbjct: 259 ISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFFHFITRLCAV 318

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           + GT+    ++D  ++  ++ ++K
Sbjct: 319 LGGTFAVTGMLDRWMYRLLETLTK 342


>gi|123449396|ref|XP_001313417.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121895300|gb|EAY00488.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 361

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 170/370 (45%), Gaps = 43/370 (11%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
           ++  D F K   ++   T+ GG ++++  +F + ++C  +V  Y    T + LFVD+ R 
Sbjct: 3   IRKFDVFPKLANEYRIGTISGGILSLIS-VFAAIVLCFYEVAAYLNAPTRQFLFVDTRRP 61

Query: 66  S-------------KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH-NIYKRRLDLD 111
           +             +L + + +  P   C  + LD +DS  +  + +E+ N    RLD  
Sbjct: 62  TGPDGVTIDQNSQPRLDVKVSVTFPKAPCFLIHLDVIDSVTQLAMPLENINSKFMRLDSQ 121

Query: 112 GKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
           GKPI+              ++T   TT  E     KCGSCY A+   R CC +C EV +A
Sbjct: 122 GKPIE-----------ALDLSTLVNTTVQE-----KCGSCYNAKDPKRICCRSCQEVFDA 165

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           YR   +  P L  I QCK     EK+     EGC++    +  RV+   HIAPG S++  
Sbjct: 166 YRDAAFKPPVLTEIEQCKP--VAEKVAKMEGEGCKVDASFKALRVASEMHIAPGYSWNSE 223

Query: 232 HVHVHDIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYY 288
             HVHD+  +T   A+ N TH I +LSF  K  D      PL+     + E GA    Y 
Sbjct: 224 GWHVHDLSLFTKEFASLNLTHTIHYLSFSEKEGD-----YPLNNLNNVQTENGAWRVVYT 278

Query: 289 IKIIPTIYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
             I+   Y      ++        G+FF Y++SP+       S+ + HL T+I+  + G 
Sbjct: 279 ADILEGNYSA-SKYQMYNPKSFASGLFFKYDVSPISAVTYTDSEPVFHLLTRILTVLGGV 337

Query: 349 YITFMLVDAL 358
                L+DA+
Sbjct: 338 LGLCRLIDAI 347


>gi|356547537|ref|XP_003542168.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3-like [Glycine max]
          Length = 351

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 175/384 (45%), Gaps = 54/384 (14%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  +   +KT  G  V+++  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHKMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I    +
Sbjct: 61  DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG--TE 118

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
            + + V+K+    E+                   +       N   +VKEA         
Sbjct: 119 YISDLVEKEHTNQEHDDNKDHDHHHEHSEQKIHLQNLDESTENIIKKVKEA--------- 169

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
                           LKN   EGC++YG L+V RV+G+FHI      S++ ++++  Q 
Sbjct: 170 ----------------LKN--GEGCRVYGVLDVQRVAGNFHI------SVHGLNIYVAQM 205

Query: 241 YTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
               A   N +H I  LSFG K         PLD T     + +  F YYIK++PT Y  
Sbjct: 206 IFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVPTEYRY 262

Query: 299 LDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T++   
Sbjct: 263 ISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAV 322

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           + GT+    ++D  ++  ++ ++K
Sbjct: 323 LGGTFAVTGMLDRWMYRLLEALTK 346


>gi|356575088|ref|XP_003555674.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Glycine max]
          Length = 347

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 105/389 (26%), Positives = 176/389 (45%), Gaps = 68/389 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  +   +KT  G  V+++  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGMKQVIKNLDAFPRAEDHLLQKTQSGALVSVIGLIIMATLFVHELGYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I     
Sbjct: 61  DLKRGETLPIHINMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSYGHIIG---T 117

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           E ++ + +K+ T           D NK                        +  +K  L 
Sbjct: 118 EYISDLVEKEHTHHK-------HDDNK---------------------NHEHSEQKIHLQ 149

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            LD   +   +   E LKN   EGC++YG L+V RV+G+FHI+           VH +  
Sbjct: 150 NLDESTENIIKKVKEALKNG--EGCRVYGVLDVQRVAGNFHIS-----------VHGLNI 196

Query: 241 YTSAAF-------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           Y +          N +H I  LSFG K         PLD T     + +  F YYIK++P
Sbjct: 197 YVAQMIFDGAKNVNVSHFIHDLSFGPKYPG---LHNPLDDTTRILHDTSGTFKYYIKVVP 253

Query: 294 TIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           T Y  +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T
Sbjct: 254 TEYRYISKEVLPTNQFSVSEYYSPINQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFIT 313

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
           ++   + GT+    ++D  ++  ++ ++K
Sbjct: 314 RLCAVLGGTFAVTGMLDRWMYRLLETLTK 342


>gi|11036454|dbj|BAB17274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 333

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 168/372 (45%), Gaps = 66/372 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + L+ +DAF +  +   +KT  G  V+IV  L ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQALRSIDAFPRAEDHLLQKTQSGAVVSIVGLLIMATLFLHELSYYLNTLTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH+++  P++ CD L++DA+D SG+  + ++ NI+K RL+  G  I    +
Sbjct: 61  DLKRGETLPIHVNMTFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGHIIG--TE 118

Query: 121 EVVNAVKK---------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEA 171
            + + V+K         K    E     TE E  N  G    AET  +K           
Sbjct: 119 YISDLVEKGHEHGHSPHKHDGKEEHKNETETEALNILGFDQAAETMIKKV---------- 168

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
               K AL +                     EGC++YG L+V RV+G+FHI+    + +N
Sbjct: 169 ----KQALAD--------------------GEGCRVYGVLDVQRVAGNFHIS---VHGLN 201

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
            ++V  +    S   N +H I  LSFG K         PLD T     + +  F YYIKI
Sbjct: 202 -IYVAQMIFGGSKNVNVSHMIHDLSFGPKYPG---IHNPLDDTNRILHDTSGTFKYYIKI 257

Query: 292 IPTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           +PT Y  L    L                 D   P ++F Y+LSP+ V I E+ +S  HL
Sbjct: 258 VPTEYRYLSKDVLSTNQYSVTEYFTPMTEFDRTWPAVYFLYDLSPITVTIKEERRSFLHL 317

Query: 338 WTKIMCNISGTY 349
            T++   + GT+
Sbjct: 318 ITRLCAVLGGTF 329


>gi|224137484|ref|XP_002322569.1| predicted protein [Populus trichocarpa]
 gi|222867199|gb|EEF04330.1| predicted protein [Populus trichocarpa]
          Length = 351

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 110/393 (27%), Positives = 181/393 (46%), Gaps = 67/393 (17%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  E   +KT  G  V+++  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH++I  P++ CD L++DA+D SG+  + ++ NI+K RL+  G        
Sbjct: 61  DLQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHG---HITGT 117

Query: 121 EVVNAVKKKKVTTENGTTTTEL-----EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
           E ++ + +K+    N     +      E+ +  G    AET  +K       VK+A    
Sbjct: 118 EYLSDLVEKEHEAHNHDHDKDHHKDSHEEQHTHGFDDAAETMIKK-------VKQA---- 166

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
                                L N   EGC++YG L+V RV+G+FHI      S++ +++
Sbjct: 167 ---------------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNI 197

Query: 236 HDIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
              Q     A   N +H I  LSFG K         PLDGT     E + +F YYIKI+P
Sbjct: 198 FVAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTARILRETSGIFKYYIKIVP 254

Query: 294 TIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           T Y  +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T
Sbjct: 255 TEYRYISKDVLPTNQFSVTEYFSPITDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFIT 314

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           ++   + GT+    ++D  ++  ++ ++K   G
Sbjct: 315 RLCAILGGTFALTGMLDRWMYRLLEALTKPNRG 347


>gi|66813156|ref|XP_640757.1| DUF1692 family protein [Dictyostelium discoideum AX4]
 gi|60468793|gb|EAL66793.1| DUF1692 family protein [Dictyostelium discoideum AX4]
          Length = 421

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 172/380 (45%), Gaps = 46/380 (12%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           E++K  D + K  +D    K+ +GG  T++C L  +YL+  ++  Y        L VD +
Sbjct: 52  EKVKLFDFYPKVNDDVPRHKSTFGGVATMICILITTYLLVSEIYFYTFPIREHSLKVDIT 111

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDS-SGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           RG++LPI++DI  P + C  + +D VD   G       + I K+RLD  G+P  +     
Sbjct: 112 RGNRLPINIDIHFPRLVCTDITIDVVDGIDGNPIKDAAYQIVKQRLDSYGEPFAQGV--- 168

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
             A+  KK       T  E     +  S +  +    KCCN+C ++++ YR  +      
Sbjct: 169 --ALAGKKGIFSRSCTECEFPKSKRVSSVFYKQ----KCCNSCEDLRQYYRLNRIPQNLA 222

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ--- 239
           D   QC  E   +       EGC+IYG L V ++ G FHI  G     +H          
Sbjct: 223 DDSPQCLIERPVQD-----DEGCRIYGSLSVQKMKGDFHILAGTGIDQSHDGHVHHAHHI 277

Query: 240 PYTSAA----FNTTHHIRHLSFGIKLQDDDERRKPLD--GTVAKAEEGASMFNYYIKIIP 293
           P  +      FN THHI   SFG   +D +    PL+  G VA++    ++  YY++++P
Sbjct: 278 PRENIGRIKHFNITHHIHKFSFG---EDIEGLINPLEDFGIVAQS---LAVQTYYLQVVP 331

Query: 294 TIYERLDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
            IY++ D                  +       PGI+F Y+LSPLM+++ + SK L  L 
Sbjct: 332 AIYKKNDFVLETNQYSYTYDYRIVNMFNLGQLFPGIYFKYDLSPLMIEVDQTSKPLVELI 391

Query: 339 TKIMCNISGTYITFMLVDAL 358
           T I     G Y+   LV  L
Sbjct: 392 TSICAIGGGMYVVLGLVVRL 411


>gi|123438593|ref|XP_001310077.1| MGC83277 protein [Trichomonas vaginalis G3]
 gi|121891831|gb|EAX97147.1| MGC83277 protein, putative [Trichomonas vaginalis G3]
          Length = 355

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 153/322 (47%), Gaps = 39/322 (12%)

Query: 51  QVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL 110
           Q+       +D+    K+ I+ DI++  I C YL +D +D+  E     E ++   R D 
Sbjct: 45  QLPFINNRIIDTEHLPKMDINFDIMMKHIPCSYLHVDVIDNIKESDESYEGHVRMERFDE 104

Query: 111 DGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKE 170
            G PI             KK   +N + T   +DP  CG+CYG ++    CCNTC EV++
Sbjct: 105 KGNPIL------------KKSYPKNSSVT---KDPGYCGNCYGQKS---GCCNTCKEVRK 146

Query: 171 AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSI 230
           A++      P +  I QC +E   E+L     E C+++G L V+R  G+FH+APG SY+I
Sbjct: 147 AFKANNRPPPPIIHIQQCVDEGYKEELIAMKGEACRVHGTLTVHRAPGTFHVAPGESYNI 206

Query: 231 N--HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNY 287
           N  H H ++         N +H I H S G+   +      PLDG T  + + G     Y
Sbjct: 207 NGEHDHYYEDLGINIDEMNFSHTINHFSIGMPTANS---YYPLDGHTEIQQKTGRMKMIY 263

Query: 288 YIKIIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           +++ +P     LDG              G      PG+FFSY++S L+  ++ ++ SL  
Sbjct: 264 FLRAVPI---NLDGRVFSFGASSYQNYRGSNSTKYPGVFFSYDVS-LIGIVSSQNSSLMD 319

Query: 337 LWTKIMCNISGTYITFMLVDAL 358
           L T++M  + G +     +D L
Sbjct: 320 LVTELMSILGGVFAIATFLDML 341


>gi|169603005|ref|XP_001794924.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
 gi|111067148|gb|EAT88268.1| hypothetical protein SNOG_04508 [Phaeosphaeria nodorum SN15]
          Length = 351

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 103/343 (30%), Positives = 156/343 (45%), Gaps = 88/343 (25%)

Query: 115 IQEPQKEVVNAVKKKKVTTE-NGTT---TTELE---------DPNKCGSCYGAETETRK- 160
           +++ Q  V + + K +++ E  G+T   TT L+          P+ CG CYGA + T   
Sbjct: 6   LEQLQMGVTHGINKVRLSPEIEGSTVLSTTALDLHKDEAQHLAPDYCGECYGAPSPTNAI 65

Query: 161 ---CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS 217
              CCNTC+EV++AY    W+    + + QC+ E+  E L     EGC++ G + VN+V 
Sbjct: 66  KAGCCNTCDEVRDAYASISWSFGRGEGVEQCEREHYAEHLDQQRQEGCRLEGSIRVNKVV 125

Query: 218 GSFHIAPGLSYSINHVHVHDIQPYTSAAFNT--THHIRHLSFGIKLQD---DDERRK--- 269
           G+FHIAPG S+S  ++HVHD++ Y    ++   TH I HL FG +L +    D ++K   
Sbjct: 126 GNFHIAPGKSFSTGNMHVHDLENYFKDEYSHTFTHKIHHLRFGPQLSNAVIADMQKKHQN 185

Query: 270 ------------PLDGTVAKAEEGASMFNYYIKIIPTIY---------------ERLDGS 302
                       PLD T  +  E A  F Y++K++ T Y               + L GS
Sbjct: 186 TGPGGWTSHHINPLDNTEQQTSEKAYNFMYFVKVVSTAYLPLGWEKEAPRLTKHDELLGS 245

Query: 303 KL-----------------------GGGD------------GGMPGIFFSYELSPLMVKI 327
            +                       GG D            GG+PG+FFSY++SP+ V  
Sbjct: 246 TIEGNYKGSIETHQYSVTSHKRSLAGGNDEKEGHKERIHAKGGIPGVFFSYDISPMKVIN 305

Query: 328 TE-KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
            E + K+       +   I GT      VD  L+  V KI K+
Sbjct: 306 REVRDKTFSGFLVGLCAVIGGTLTVAAAVDRALYEGVNKIKKI 348


>gi|322792514|gb|EFZ16472.1| hypothetical protein SINV_10246 [Solenopsis invicta]
          Length = 153

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 76/161 (47%), Positives = 101/161 (62%), Gaps = 14/161 (8%)

Query: 5   ERLKGLDAFTKPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           + L+ LD   K  E  D   +T  G  VTI+  + +  L   +V  Y   S +EELFVD+
Sbjct: 2   QMLRQLDVHPKVREEADILVRTFSGAIVTIISTIIMGILFLSEVNYYLTPSMSEELFVDT 61

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           SRGSKL I+LDI+VP +SCD+    A+D++GEQHLH+EHNI+KRRLDL+GKPI++PQ+  
Sbjct: 62  SRGSKLRINLDIIVPAVSCDH----AMDTTGEQHLHIEHNIFKRRLDLNGKPIEDPQRTN 117

Query: 123 VNAVKKKKVTTENGT---TTTELEDPNKCGSCYGAETETRK 160
           +   K    TTE      +TTE      CG CYGA T+T K
Sbjct: 118 ITDAKAVSKTTEKAVEIGSTTE-----TCGDCYGAATDTMK 153


>gi|449016424|dbj|BAM79826.1| hypothetical protein, conserved [Cyanidioschyzon merolae strain
           10D]
          Length = 499

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 116/448 (25%), Positives = 186/448 (41%), Gaps = 95/448 (21%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           +++ L+ LD + K  ED   ++V GG + +  ++ I  L+  +   + Q      + VD+
Sbjct: 38  WNDWLRKLDVYPKTVEDVRLRSVTGGIIALFSYICIGILVVSEFLRWLQPQLHSNVLVDA 97

Query: 63  SR---GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
                   + + L I +  + CD  +LDA+ ++G Q  +    + KR LD  G+P+  P+
Sbjct: 98  RSILDTEPITVDLGIDLLAVGCDEFSLDALTANGAQLPNSVVELRKRPLDASGQPVIFPR 157

Query: 120 ---------KEVVNAVKKKKVTTENGTTTTELEDP------------------------- 145
                     E        +  TE+   T +LE                           
Sbjct: 158 GAFGRSRLRNERGGVAPAPQALTEDPPNTQQLEGRVSQEVRAQLKQYREEAIAFRDRLAA 217

Query: 146 -NK-----CGSCYGAETETRK-----------CCNTCNEVKEAYRYKKWALPE-LDTIVQ 187
            NK     CGSCYGA  +T +           CCNTC+E++  Y  + WA  + L T  Q
Sbjct: 218 LNKTGVAYCGSCYGAVPQTDQVGEANQITSGVCCNTCDEIRVLYEERNWAFDQVLRTAEQ 277

Query: 188 C-KNEYST--EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS---INHVHVHDIQPY 241
           C +  Y T   +     + GC++   L++ RV+G+FH APG  ++    +HVH  D Q  
Sbjct: 278 CAEKRYLTLLHEAGRVQSGGCRVSARLQLPRVAGNFHFAPGKGHTHRMGHHVHSVDDQ-L 336

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEG------ASMFNYYIKIIPTI 295
               +N +H IRHL FG        ++ PLDG +   E+        +M  YY K+IPT 
Sbjct: 337 LHRTYNFSHRIRHLRFGPLF---PHQQNPLDGAMRILEQPPPGSPFGNMVLYYCKLIPTT 393

Query: 296 YER-------LDGSKLGGGD----------------GGMPGIFFSYELSPLMVKITE-KS 331
           Y R       L   +    D                G +PGIFF YE  PL +   E + 
Sbjct: 394 YRRDRQRGDALRSMEYAAADLTQSSEQDRVGITHSTGALPGIFFFYEPQPLQIAYFEGRM 453

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALL 359
             L H   ++   + G +    ++D  +
Sbjct: 454 YGLLHFIVQLCAIVGGVFTVSSMIDRFV 481


>gi|224086657|ref|XP_002307923.1| predicted protein [Populus trichocarpa]
 gi|222853899|gb|EEE91446.1| predicted protein [Populus trichocarpa]
          Length = 351

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 109/388 (28%), Positives = 179/388 (46%), Gaps = 65/388 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  E   +KT  G  V+I+  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGMKQAIKKLDAFPRAEEHLLQKTQSGALVSIIGLVTMATLFYHELAYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +RG  LPIH++I  P++ CD L++DA+D SG+  + ++ +I+K RL+  G       +
Sbjct: 61  DLTRGETLPIHINITFPSLPCDVLSVDAIDMSGKHEVDLDTSIWKLRLNSYGHITG--TE 118

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYG----AETETRKCCNTCNEVKEAYRYKK 176
            + + V+K+     +       ED +     +G    AET  +K       VK+A     
Sbjct: 119 YLSDLVEKEHEAHNHDHNKDHHEDSHAKQHTHGFDDAAETMVKK-------VKQA----- 166

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
                               L N   EGC++YG L+V RV+G+FHI      S++ +++ 
Sbjct: 167 --------------------LANG--EGCRVYGVLDVQRVAGNFHI------SVHGLNIF 198

Query: 237 DIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
             Q     A   N +H I  LSFG K         PLDGT     E +  F YYIKI+PT
Sbjct: 199 VAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTTRILHETSGTFKYYIKIVPT 255

Query: 295 IYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
            Y  +                S +   D   P ++F Y+LSP+ V I E+ +S  H  T+
Sbjct: 256 EYRYISKEVLPTNQFSVTEYFSPMTDFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITR 315

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
           +   + GT+    ++D  +   ++ ++K
Sbjct: 316 LCAVLGGTFALTGMLDRWMCRLLEALTK 343


>gi|118482697|gb|ABK93267.1| unknown [Populus trichocarpa]
          Length = 366

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 105/399 (26%), Positives = 178/399 (44%), Gaps = 64/399 (16%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M   + +K LDAF +  E   +KT  G  V+++  + ++ L   ++  Y    T  ++ V
Sbjct: 1   MGVKQAIKSLDAFPRAEEHLLQKTQSGALVSVIGLVIMATLFYHELAYYLTTYTVHQMSV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D  RG  LPIH++I  P++ CD L++DA+D SG+  + ++ NI+K+ L            
Sbjct: 61  DLQRGEILPIHVNITFPSLPCDVLSVDAIDMSGKHEVDLDTNIWKKLL------------ 108

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR------- 173
                          G   T +E      + +G  T T    +   +  EA+        
Sbjct: 109 --------------FGMLLTRIEFLQLRLNSHGHITGTEYLSDLVEKEHEAHNHDHDKDH 154

Query: 174 ----YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYS 229
               +++      D   +   +   + L N   EGC++YG L+V RV+G+FHI      S
Sbjct: 155 HKDSHEEQHTHGFDDAAETMIKKVKQALANG--EGCRVYGVLDVQRVAGNFHI------S 206

Query: 230 INHVHVHDIQPYTSAA--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
           ++ +++   Q     A   N +H I  LSFG K         PLDGT     E + +F Y
Sbjct: 207 VHGLNIFVAQMIFDGAKHVNVSHIIHDLSFGPKYPG---IHNPLDGTARILRETSGIFKY 263

Query: 288 YIKIIPTIYERLDG--------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
           YIKI+PT Y  +                S +   D   P ++F Y+LSP+ V I E+ +S
Sbjct: 264 YIKIVPTEYRYISKDVLPTNQFSVTEYFSPITDFDRTWPAVYFLYDLSPITVTIKEERRS 323

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
             H  T++   + GT+    ++D  ++  ++ ++K   G
Sbjct: 324 FLHFITRLCAILGGTFALTGMLDRWMYRLLEALTKPNRG 362


>gi|410083920|ref|XP_003959537.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
 gi|372466129|emb|CCF60402.1| hypothetical protein KAFR_0K00470 [Kazachstania africana CBS 2517]
          Length = 417

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 112/397 (28%), Positives = 172/397 (43%), Gaps = 74/397 (18%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  ED   +T  GG ++I C +   +L+  +   + ++ T  +L VD    
Sbjct: 5   KLLVFDAFNKTEEDVRVRTNTGGLISIGCVVLTCFLLLREWYQFNEIITRPKLVVDRDHD 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVN 124
            +L ++ DI  P+ISCD L LD +D +G+  L + E  + K R+D +G  +      + N
Sbjct: 65  LELDLNFDITFPSISCDLLTLDILDDAGDLQLDLLESGLTKTRVDSNGVSLTTESFNIGN 124

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
               K+   ++            CGSCYGA  + +          CC TC +V +AY   
Sbjct: 125 EALIKRDFPQD-----------YCGSCYGALDQGKNDELNANEKVCCQTCEDVHDAYLNI 173

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY------S 229
            WA  +   I QC+ E   +++     EGC++ G   +NRV G+ H APG SY      +
Sbjct: 174 GWAFYDGKNIEQCETEGYVDRINEHLNEGCRVQGSARLNRVQGNIHFAPGKSYQDYSRRN 233

Query: 230 INHVHVHDIQPYT---SAAFNTTHHIRHLSFGIKLQDDDERR----------KPLDGTVA 276
               H HD   Y    S +FN  H I H SFG  +++                PLDG   
Sbjct: 234 SFATHFHDTSLYDKTHSLSFN--HIIHHFSFGKPIENSYVNNHNEGLSKISTNPLDGRKV 291

Query: 277 KAEEGASM--FNYYIKIIPTIYERLDGSK-----------------LGGGD--------- 308
             +  +    ++Y+ +I+PT YE L+                     GG D         
Sbjct: 292 FPDRDSHFIQYSYFAEIVPTRYEYLNNKSDPVETTQFSATFHSRPLRGGRDEDHPTTLHQ 351

Query: 309 -GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
            GG+PG+F  +E SPL V   E+       W+  + N
Sbjct: 352 RGGIPGLFIYFETSPLKVINKEQ---YSQAWSTFLLN 385


>gi|242059085|ref|XP_002458688.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
 gi|241930663|gb|EES03808.1| hypothetical protein SORBIDRAFT_03g038260 [Sorghum bicolor]
          Length = 350

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 68/382 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK L+AF    E   +KT  G  VTI+  L +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKSLNAFPHAEEHLLKKTYSGAVVTILGLLVMITLFVHELQFYLTTYTVHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I    + + + V
Sbjct: 67  TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG--TEYLSDLV 124

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +K      +     E  D            E +K   T NE             E + ++
Sbjct: 125 EKGHGAHHDHDHGQEHHD------------EQKKPEQTFNE-------------EAEKMI 159

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
           +   +     L N   EGC++YG L+V RV+G+FHI+           VH +  +     
Sbjct: 160 KSVKQ----ALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIFVAEKI 202

Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
              S+  N +H I  LSFG K         PLD T     + +  F YYIK++PT Y+ L
Sbjct: 203 FEGSSHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVVPTEYKYL 259

Query: 300 DGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
               L                 D   P ++F Y+LSP+ V I E+ ++  H  T++   +
Sbjct: 260 SKKVLPTNQFSVTEYFLPIRPSDRAWPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVL 319

Query: 346 SGTYITFMLVDALLHSCVKKIS 367
            GT+    ++D  ++  ++ ++
Sbjct: 320 GGTFAMTGMLDRWMYRLIESVT 341


>gi|357112836|ref|XP_003558212.1| PREDICTED: probable endoplasmic reticulum-Golgi intermediate
           compartment protein 3-like [Brachypodium distachyon]
          Length = 349

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 69/382 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  +AF    +   +KT  G  VTI   + +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMFTLFVHELKFYLTTYTMHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I     E ++ +
Sbjct: 67  TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGTIIG---TEYLSDL 123

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +K+    +     E  D            E +K  +T NE             + D +V
Sbjct: 124 VEKEHGAHHHDNGHEHHD------------EEKKPEHTFNE-------------DADKMV 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
           +   +     L+N   EGC++YG L+V RV+G+FHI+           VH +  Y     
Sbjct: 159 KSVRQ----ALENG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIYVAEKI 201

Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
              S+  N +H I  LSFG K         PLD T     + +  F YYIK++PT Y  L
Sbjct: 202 FEGSSHVNVSHVIHELSFGPKYPGI---HNPLDDTTRILHDASGTFKYYIKVVPTEYRYL 258

Query: 300 DGSKLG--------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
               L                 D   P ++F Y+LSP+ V I E+ ++  H  T++   +
Sbjct: 259 SKQVLPTNQFSVTEYFVPIRPADRSWPAVYFLYDLSPITVTIKEERRNFLHFITRLCAVL 318

Query: 346 SGTYITFMLVDALLHSCVKKIS 367
            GT+    ++D  ++  ++ +S
Sbjct: 319 GGTFAMTGMLDRWMYRIIESVS 340


>gi|115455745|ref|NP_001051473.1| Os03g0784400 [Oryza sativa Japonica Group]
 gi|14718311|gb|AAK72889.1|AC091123_8 unknown protein [Oryza sativa Japonica Group]
 gi|108711422|gb|ABF99217.1| Serologically defined breast cancer antigen NY-BR-84, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113549944|dbj|BAF13387.1| Os03g0784400 [Oryza sativa Japonica Group]
 gi|215737170|dbj|BAG96099.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625918|gb|EEE60050.1| hypothetical protein OsJ_12848 [Oryza sativa Japonica Group]
          Length = 350

 Score =  131 bits (329), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 168/384 (43%), Gaps = 70/384 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  +AF    +   +KT  G  VTI   + +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKNFNAFPHAEDHLLKKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I     E +N +
Sbjct: 67  TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---TEYLNDL 123

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +K+  T N     E ED  K            K  +T NE  E           + ++ 
Sbjct: 124 VEKEHGTHNHDHDHEHEDEQK------------KQEHTFNEDAEKM---------VKSVK 162

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
           Q               EGC++YG L+V RV+G+FHI+           VH +  +     
Sbjct: 163 QAMEN----------GEGCRVYGVLDVQRVAGNFHIS-----------VHGLNIFVAEKI 201

Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
              S+  N +H I  LSFG K         PLD T     + +  F YYIKI+PT Y  L
Sbjct: 202 FDGSSHVNVSHIIHDLSFGPKYPGI---HNPLDETTRILHDTSGTFKYYIKIVPTEYRYL 258

Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
                             K        P ++F Y+LSP+ V I E+ ++  H  T++   
Sbjct: 259 SKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAV 318

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           + GT+    ++D  ++  ++ ++K
Sbjct: 319 LGGTFAMTGMLDRWMYRLIESVTK 342


>gi|326490247|dbj|BAJ84787.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326493774|dbj|BAJ85349.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 348

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 169/376 (44%), Gaps = 58/376 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  +AF    +   +KT  G  VTI+  + +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKNFNAFPHAEDHLLKKTYSGAIVTILGLIVMVTLFAHELTFYLTTYTMHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G+ I           
Sbjct: 67  TLPIHINVSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGQIIG---------- 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                 TE  +   E E               ++  +T NE             + D +V
Sbjct: 117 ------TEYLSDLVEKEHGTHDHDHGHGHDVQKQPEHTFNE-------------DADKMV 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
           +      + KL     EGC++YG L+V RV+G+FHI+  GL+  + +  + D     S+ 
Sbjct: 158 K------SVKLAMENGEGCRVYGALDVQRVAGNFHISVHGLNIFVAN-QIFD----GSSH 206

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I  LSFG +         PLD T     + +  F YYIK++PT Y  L    L 
Sbjct: 207 VNVSHVIHRLSFGPEYPGI---HNPLDDTSRILHDTSGTFKYYIKVVPTEYRYLSKGVLP 263

Query: 306 GG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                           D   P ++F Y+LSP+ V I E+ ++  H  T++   + GT+  
Sbjct: 264 TNQFSVTEYFVPIRPTDRSWPAVYFLYDLSPITVTIREERRNFLHFITRLCAVLGGTFAM 323

Query: 352 FMLVDALLHSCVKKIS 367
             ++D  ++  ++ IS
Sbjct: 324 TGMLDRWMYRIIESIS 339


>gi|194708090|gb|ACF88129.1| unknown [Zea mays]
 gi|195607866|gb|ACG25763.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|195619788|gb|ACG31724.1| serologically defined breast cancer antigen NY-BR-84 [Zea mays]
 gi|413952088|gb|AFW84737.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 350

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 171/382 (44%), Gaps = 68/382 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK L+AF    E   +KT  G  VTI   L +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKSLNAFPHAEEHLLKKTYSGAVVTIFGLLIMITLFVHELQFYLTTYTVHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I           
Sbjct: 67  TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---------- 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                       T  L D  + G  +GA  +     +  +E K   ++++    E + ++
Sbjct: 117 ------------TEYLSDLVEKG--HGAHHDHDHDHDHHDEQK---KHEQTFNEEAEKMI 159

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
           +   +     L N   EGC++YG L+V RV+G+FHI+           VH +  +     
Sbjct: 160 KSVKQ----ALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLNIFVAEKI 202

Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
              S   N +H I  LSFG K         PLD T     + +  F YYIK++PT Y+ L
Sbjct: 203 FEGSNHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVVPTEYKYL 259

Query: 300 DGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
               L                 D   P ++F Y+LSP+ V I E+ ++  H  T++   +
Sbjct: 260 SKKVLPTNQFSVTEYFLPIRPTDRAWPAVYFLYDLSPITVTIKEERRNFLHFVTRLCAVL 319

Query: 346 SGTYITFMLVDALLHSCVKKIS 367
            GT+    ++D  ++  +K ++
Sbjct: 320 GGTFAMTGMLDRWMYQLIKTVT 341


>gi|218193856|gb|EEC76283.1| hypothetical protein OsI_13786 [Oryza sativa Indica Group]
          Length = 350

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 106/384 (27%), Positives = 167/384 (43%), Gaps = 70/384 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  +AF    +    KT  G  VTI   + +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKNFNAFPHAEDHLLPKTYSGAIVTIFGLIIMVTLFAHELKFYLTTYTVHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I     E +N +
Sbjct: 67  TLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG---TEYLNDL 123

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +K+  T N     E ED  K            K  +T NE  E           + ++ 
Sbjct: 124 VEKEHGTHNHDHDHEHEDEQK------------KQEHTFNEDAEKM---------VKSVK 162

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT---- 242
           Q               EGC++YG L+V RV+G+FHI+           VH +  +     
Sbjct: 163 QAMEN----------GEGCRVYGVLDVQRVAGNFHIS-----------VHGLNIFVAEKI 201

Query: 243 ---SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
              S+  N +H I  LSFG K         PLD T     + +  F YYIKI+PT Y  L
Sbjct: 202 FDGSSHVNVSHIIHDLSFGPKYPGI---HNPLDETTRILHDTSGTFKYYIKIVPTEYRYL 258

Query: 300 DGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
                             K        P ++F Y+LSP+ V I E+ ++  H  T++   
Sbjct: 259 SKQVLPTNQFSVTEYFVPKRATDRSAWPAVYFLYDLSPITVTIKEERRNFLHFLTRLCAV 318

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           + GT+    ++D  ++  ++ ++K
Sbjct: 319 LGGTFAMTGMLDRWMYRLIESVTK 342


>gi|412992535|emb|CCO18515.1| predicted protein [Bathycoccus prasinos]
          Length = 428

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 178/398 (44%), Gaps = 74/398 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---- 62
           +  LDA+ K  ED+   +  G A+T++C+L    L   +   +       EL VD+    
Sbjct: 32  IASLDAYPKVKEDYARGSTLGAAITLICFLACLCLFFSEYRTHLVSKIESELDVDTMGVN 91

Query: 63  ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPI--- 115
              S   +L +++D+   +++C+ + LD++D++GE H  V + +I KRRLD DGKPI   
Sbjct: 92  KFESNAERLHVYVDVTFHSLACELITLDSLDAAGEVHHDVHDGHITKRRLDRDGKPIPRR 151

Query: 116 ------------QEPQKE--VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKC 161
                       ++P K   +   V++K+   E      E E   +  +    + + RK 
Sbjct: 152 DSSAKDDVAVTREKPNKHKHIEKLVREKEKEEEGKKNEGEQEQEQQEQNHEQHDEKRRKL 211

Query: 162 CNTCNEVKEAYRYKKWALPELDTIV--QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGS 219
            NT      A         +++ ++  Q  N    E  KN   EGC++ GYLEVNRV GS
Sbjct: 212 QNT------ALAGFGGGFFDINALIHEQFPNGLE-EAFKNKNKEGCEVMGYLEVNRVPGS 264

Query: 220 FHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG------IKLQDDDERRKPLDG 273
           F I+PG S  I   H   IQ    +  N +H I  L+FG      + L D + R  P + 
Sbjct: 265 FSISPGKSLQIGMSH---IQLNVVSHLNMSHTINRLAFGEAFPGALNLLDKNTRYLPPN- 320

Query: 274 TVAKAEEGASMFNYYIKIIPTIYERLDGSKL-------------------GGGDGGMP-G 313
                    ++  Y++K++PT + RL  + L                   G G  G P G
Sbjct: 321 ---------AVHQYFLKVVPTSFARLKDTTLATNQYSVTESSSSAKQSFFGMGSSGKPSG 371

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
           I+F YELSP+ +   E+  S G     + C+I G   T
Sbjct: 372 IYFHYELSPIRIDFKERRNSFGEFMLSV-CSIIGGVAT 408


>gi|194689880|gb|ACF79024.1| unknown [Zea mays]
 gi|413949702|gb|AFW82351.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 176

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 70/174 (40%), Positives = 101/174 (58%), Gaps = 3/174 (1%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            F  RLK LDA+ K  EDF+++T+ GG VT+V  + +  L   +   YF  ST  +L VD
Sbjct: 3   AFLHRLKRLDAYPKVNEDFYKRTLSGGIVTLVAAVVMLLLFISETRSYFYSSTETKLVVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +SRG +L ++ DI  P+I C  L++D  D SGEQH  + H+I KRRL+  G  I E +KE
Sbjct: 63  TSRGERLRVNFDITFPSIPCTLLSVDTTDISGEQHHDIRHDIEKRRLNSHGNVI-EARKE 121

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
            +   K ++   ++G    + E    CG+CYGAE    +CCN+C E  +  R K
Sbjct: 122 GIGGAKVERPLQKHGGRLDKGE--QYCGTCYGAEESDEQCCNSCEESGKHIRRK 173


>gi|340055752|emb|CCC50073.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 404

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 105/399 (26%), Positives = 175/399 (43%), Gaps = 59/399 (14%)

Query: 6   RLKGLDAFTK--PY--EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
           R++  D F++  P   E   E+T  GG ++ +  L ++  I +++  Y  V    E++VD
Sbjct: 3   RIRRFDMFSRFDPALEEAGRERTTCGGLLSFLFILLVALFIKIELYRYLSVVELREMYVD 62

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
              G  + I ++I  P I CD +A+D +   GE       +I K R+     P Q+P   
Sbjct: 63  PHVGGDMHITINITFPHIHCDLMAVDVIGPFGEYMTGAVRSITKVRV-----PTQDPA-- 115

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
               V +    ++   +T  L   NK   C SCYGAE     CCN+C++V  A+R   W 
Sbjct: 116 ---PVSEALPQSDRSVSTAALPVSNKMGGCVSCYGAEESPGDCCNSCDDVHAAFRRNGWE 172

Query: 179 LPELDT-IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
           + E D  + QC           + +EGC I+    V ++ G+ H  PG   +     ++ 
Sbjct: 173 IDENDIKLSQCTEGQLHNVGPVSPSEGCNIHSKFSVRKIKGNIHFVPGRRLNHRGQPMYV 232

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGAS-----MFNYYIKI 291
           ++       N +H    L FG +      +  PL+G   A+    AS      F+YY+++
Sbjct: 233 VRREAIKKMNLSHVFHSLEFGERFPG---QVNPLNGIANARGVRNASEVVSGRFSYYVQV 289

Query: 292 IPTIYERLD--GSKL-------------------------GGGDGGM-PGIFFSYELSPL 323
           +PT Y+ +   GS++                         G  D  +  G+F  Y++SP+
Sbjct: 290 LPTEYQFVPALGSRVRLETNQYSVKQHFTESWYTTDRRYPGWSDPTLVAGVFIVYDVSPV 349

Query: 324 --MVKITEKSKSLGHLWTKIMCNISGTYITFM-LVDALL 359
             +V  T    SL HL  + MC + G   T   ++D+LL
Sbjct: 350 KTLVMRTSPYPSLIHLLLR-MCAVGGGAFTVASMIDSLL 387


>gi|407859749|gb|EKG07137.1| hypothetical protein TCSYLVIO_001725 [Trypanosoma cruzi]
          Length = 393

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 98/387 (25%), Positives = 166/387 (42%), Gaps = 54/387 (13%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
           +++  +D F KP ED+     Y GA V++V  + I  L+  +VC Y   + + T EL VD
Sbjct: 22  KKVAAVDLFPKPKEDYSRSQTYHGALVSLVTVVVIGLLVFWEVCSYIFGRDAYTTELSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S  +++  +LDI  P + C  ++LD +D +G  +L+V  NI+K  +D  G         
Sbjct: 82  TSLSTEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGN-------- 133

Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
               +  ++   E G+   + +D    P  CG C+ +E +        +CCNTCN+V  A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFISEHQLSMMDNKNRCCNTCNDVLNA 192

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           Y  +    P+ + + QC  E S          GC   G L V +  G    AP       
Sbjct: 193 YDQQGLPRPQKNEVEQCIYELS------LINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
              + D+       F+++H I  LS G +      RR    PL+G    A+   +   Y+
Sbjct: 245 GFLIKDVM-----QFDSSHIINKLSIGDERVTRFSRRGVQHPLNGHEFVAQRRFTEIRYF 299

Query: 289 IKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSK 332
           +K++PT+Y     S                    G G  P +   ++  P+ V    +  
Sbjct: 300 LKVVPTMYFSGKNSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFRRS 359

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALL 359
           S  H   ++   + G ++   L+D L+
Sbjct: 360 SFPHFIVQLCGIVGGLFVVLGLIDGLV 386


>gi|308808274|ref|XP_003081447.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
 gi|116059910|emb|CAL55969.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
          Length = 406

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/378 (28%), Positives = 157/378 (41%), Gaps = 85/378 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD- 61
            S  LK LDA  K  ED+  ++  G  +T+VC      L   +   Y       EL V+ 
Sbjct: 30  MSALLKSLDANPKLKEDYARQSTSGVIITLVCGALCLLLFLGEFFAYRTTKVVSELRVNP 89

Query: 62  ------SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKP 114
                 +    +L I +DI   +++C+ + LD  D +GEQH  V + +I KRR+D DGKP
Sbjct: 90  MGVHSVTPNAERLKIDIDITFHSMACNLITLDTSDKAGEQHYDVHDGHIEKRRVDKDGKP 149

Query: 115 I------QEPQK--EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCN 166
           I      ++P K  E+V A                LE  N+  S  G ET  +K      
Sbjct: 150 IDATFTSEKPNKHKEMVQA----------------LEKMNQTDSVVGNETALQKQ----- 188

Query: 167 EVKEAYRYK---------KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVS 217
               A+R+          K A PE       +N +     +N   EGC++ GYLEVNRV 
Sbjct: 189 --DRAHRFAGVFGFESMLKEAFPE-----GIENAF-----RNEAREGCEVKGYLEVNRVP 236

Query: 218 GSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK 277
           G   I+PG    +  + +   +       N TH I  LSFG +         PLDGT   
Sbjct: 237 GRISISPG---RVVMMGMQQFKLNVHTDLNLTHTIHRLSFGERFPG---LVSPLDGTHRS 290

Query: 278 AEEGASMFNYYIKIIPTIYERLDGS-------------------KLGGGDGGM-PGIFFS 317
               A    Y++ ++ T ++ L G                     LGG   G  PG+FF+
Sbjct: 291 LPPNAVQ-QYFLNVVATTFQPLRGDARISTHQYSVTETFTTSQRSLGGSSNGRDPGVFFT 349

Query: 318 YELSPLMVKITEKSKSLG 335
           YE+ P+ V   E   + G
Sbjct: 350 YEIEPIRVDFKETRTTFG 367


>gi|449445069|ref|XP_004140296.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 388

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 114/425 (26%), Positives = 186/425 (43%), Gaps = 100/425 (23%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKT---VYGGAVTIVCW---------------------- 35
           M   + +K LDAF +  E   +KT    +G    I CW                      
Sbjct: 1   MGLKQTIKSLDAFPRAEEHLLQKTQTGAFGNMRGICCWISHNGHTISARTEILSLHIYCS 60

Query: 36  ---------LFISYL--ICVD----VCDYFQVSTTE--ELFVDSSRGSKLPIHLDIVVPT 78
                    LF  YL  I +D    + D+      +   + VD  RG  LPIH+++  P+
Sbjct: 61  SVGKQQMWPLFFLYLRIIPLDWGEGMSDFGDPVLWKGFHMSVDLKRGETLPIHINMTFPS 120

Query: 79  ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT 138
           + CD L++DA+D SG+  + ++ NI+K RL+  G+ I    + + + V+K+ V  ++   
Sbjct: 121 LPCDVLSVDAIDMSGKHEVDLDTNIWKLRLNSHGQIIG--TEYLSDLVEKEHVDHKHDHD 178

Query: 139 TTELED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
             + +D P+  G    AE       N   +VK+A       L E                
Sbjct: 179 HDKEKDHPHIHGFDQAAE-------NLVKKVKQA-------LEE---------------- 208

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
                +GC++YG L+V RV+G+FHI+    + +N + V  +    S   N +H I  LSF
Sbjct: 209 ----AQGCRVYGVLDVQRVAGNFHIS---VHGLN-IFVAQMIFGGSKHVNVSHMIHDLSF 260

Query: 258 GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SK 303
           G K         PLDGTV    + +  F YYIKI+PT Y+ +                S 
Sbjct: 261 GPKYPGI---HNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSVTEYFSP 317

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
           +   D   P ++F Y+LSP+ V I E+ +S  H  T++   + GT+    ++D  +   +
Sbjct: 318 MTDSDRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMFRFL 377

Query: 364 KKISK 368
           + ++K
Sbjct: 378 EALTK 382


>gi|145476255|ref|XP_001424150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391213|emb|CAK56752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 339

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 164/389 (42%), Gaps = 82/389 (21%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           RL+ LD + K   D  E T  G  ++++  + I  L   ++  Y +V  + E+FVD +RG
Sbjct: 8   RLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFITELQAYIEVDNSSEMFVDINRG 67

Query: 66  S-KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             ++ ++LDI      CD L+LD  D  G   ++VE  + K+R                 
Sbjct: 68  GEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVEGRLIKKR----------------- 110

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            +K  KV +E   +  E           G E   +   +    +++A++ K         
Sbjct: 111 -IKNGKVISEEVHSNHE-----------GHEHHNQPSIDFA-RIEQAFKEK--------- 148

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                             EGCQI GY+ VN+V G+FH++      I H      Q     
Sbjct: 149 ------------------EGCQIAGYIIVNKVPGNFHVSAHAFGGILH---QVFQRSQIQ 187

Query: 245 AFNTTHHIRHLSFG-------IKLQDDDERRKPLDGT--VAKAEEGASM-FNYYIKIIPT 294
             + +H I H+SFG       IK Q       PLD T  VA+ + G  M F YYI ++PT
Sbjct: 188 TLDLSHTINHISFGEEDDLMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPT 247

Query: 295 IYERLDGS-----KLGGGDG-----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
            Y  + G+     +            +P  +F Y+LSP+ VK  +  +S  H   +I   
Sbjct: 248 TYVDVSGNEYYVHQFTANSNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAI 307

Query: 345 ISGTYITFMLVDALLH-SCVKKISKVEIG 372
           + G +    +VD ++H S V  + K E+G
Sbjct: 308 LGGVFTIASIVDGMIHKSVVALLKKYEMG 336


>gi|71409973|ref|XP_807304.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70871276|gb|EAN85453.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 393

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 96/389 (24%), Positives = 164/389 (42%), Gaps = 58/389 (14%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
           +++  +D F KP ED+     Y GA V++V  + I  L+  +V  Y   + + T EL VD
Sbjct: 22  KKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIFGRDAYTTELSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S   ++  +LDI  P + C  ++LD +D +G  +L+V  NI+K  +D  G         
Sbjct: 82  TSLSKEVEFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNIFKTPVDAQGN-------- 133

Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETE------TRKCCNTCNEVKEA 171
               +  ++   E G+   + +D    P  CG C+ +E +        +CCNTCN+V  A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFISEHQLSMSENKNRCCNTCNDVLNA 192

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           Y  +    P+ + + QC  + S          GC   G L V +  G    AP       
Sbjct: 193 YDQQGLPRPQKNEVEQCIYDLS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
              + D+       F+++H I  LS G +      RR    PL+G     +   +   Y+
Sbjct: 245 GFLIRDVM-----QFDSSHIINKLSIGDERVTRFSRRGVQHPLNGHEFDTQRRFTEIRYF 299

Query: 289 IKIIPTIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEK 330
           +K++PT+Y  L G                       G G  P +   ++  P+ V    +
Sbjct: 300 LKVVPTMY--LSGKNSASFNATYEYSVQWSHRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALL 359
             S  H   ++   + G ++   L+D L+
Sbjct: 358 RSSFPHFLVQLCGIVGGLFVVLGLIDGLV 386


>gi|123472317|ref|XP_001319353.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121902134|gb|EAY07130.1| hypothetical protein TVAG_342940 [Trichomonas vaginalis G3]
          Length = 358

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 100/386 (25%), Positives = 165/386 (42%), Gaps = 63/386 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR-- 64
           LK  D F K +ED   KT + G VT+VC   +SYL+      +      ++L VD ++  
Sbjct: 3   LKDFDFFPKVFEDHSRKTDFSGTVTVVCLAIMSYLLVFQTLGFIASPPKQKLVVDQAKLP 62

Query: 65  -----------GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
                        KL I++DI  P++ C  +    +D   E        +  +R+  DGK
Sbjct: 63  VNEDNVLDWPFVPKLQIYIDIEFPSLPCPVIDFQVLDRFEEIQSDSFSKVKLKRIGPDGK 122

Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
                       +K KK            E P  CGSCYGA +    CCNTC +VK A++
Sbjct: 123 -----------IIKNKKT-----------EKPEVCGSCYGAAS---GCCNTCKDVKNAFK 157

Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
            K    P L TI QC++  +     +   E C +YG + V    G+  +  G SY     
Sbjct: 158 KKGRVPPSLSTIRQCRD--AVIDYNHIRNESCHVYGTVIVPPTHGTIVMNSGDSYGAQMN 215

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN--YYIKI 291
                   +   FN TH I  +  G    ++D    PL G + K ++    +   Y+I+ 
Sbjct: 216 TTTSSLGISIDDFNFTHKINDIYIG----ENDLGDHPLKG-IKKVQKEVGRYKGLYFIR- 269

Query: 292 IPTIYERLDGSKL------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
             T+ E+    ++             G  G  PG++F+Y++SP++V + ++  ++ +   
Sbjct: 270 --TLREQKGSLQVYRATSSHYDRYREGTTGKFPGLYFNYDVSPIIV-MYKRDTTVLNFVI 326

Query: 340 KIMCNISGTYITFMLVDALLHSCVKK 365
           ++M  + G Y    L+D L    +K+
Sbjct: 327 ELMAILGGIYSLGSLLDHLSLITIKR 352


>gi|123451578|ref|XP_001313964.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121895945|gb|EAY01112.1| hypothetical protein TVAG_442240 [Trichomonas vaginalis G3]
          Length = 375

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 105/387 (27%), Positives = 174/387 (44%), Gaps = 39/387 (10%)

Query: 7   LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           LK  D F K  + D   KT  G  ++++    +S L   ++  +      E++ VDSSR 
Sbjct: 4   LKKFDIFPKYTDPDVKVKTNGGAILSLIAMTLMSILFLHELYRFIFPRIYEDIAVDSSRV 63

Query: 66  S---KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           S    + I+ +I +  + C  L + A D+ G       ++I ++R+D +G  I     + 
Sbjct: 64  SLARTMNINFNISI-QVPCGKLFISAYDAEGNAQSTDVNDIKQQRIDENGFAI-----DS 117

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           VN ++ K+          + +    CG CYGA  +  KCCN+C +V  A++ K W +  +
Sbjct: 118 VNWIRLKRAAKSKKQKKEQPQ--QYCGKCYGALPQG-KCCNSCEDVINAFKAKGWGIDGI 174

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
           D   QC +E   +  K    E C +YG + V  +SG  + A    Y +   H  DI    
Sbjct: 175 DRWQQCIDEGYADLGK----ESCNVYGDINVAHISGFLYFALE-DYKVGDKHPKDIS-RL 228

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTIYERLDG 301
           S  +N TH I +L FG ++  +     PLDG TV + E G   +NY ++++PT +    G
Sbjct: 229 SHKYNLTHTINYLEFGPRVSHEP---GPLDGLTVLQEEPGLMQYNYDLEVVPTKWFSSRG 285

Query: 302 SKLG---------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
             +                  + G+PGIF +Y L+P+ +   E   S   L T +   + 
Sbjct: 286 FPVSTYKFHPMITQKNFTEKVNRGVPGIFLNYNLAPISLVQYEVISSPWKLITSVCAIVG 345

Query: 347 GTYITFMLVDALLHSCVKKI-SKVEIG 372
           G +    L D +    +  I  K +IG
Sbjct: 346 GCFTCVSLADQIFFRTLSSIEGKRQIG 372


>gi|428171090|gb|EKX40010.1| hypothetical protein GUITHDRAFT_154283 [Guillardia theta CCMP2712]
          Length = 331

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 105/385 (27%), Positives = 166/385 (43%), Gaps = 83/385 (21%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E LK  D F K  +D  E +V GG V++V   F+  L+  +   + + +T  E+ VD+ R
Sbjct: 13  EWLKNFDVFPKTVDDAKEASVSGGTVSVVVLFFMFLLLFTETSIFLKTNTKFEMEVDTMR 72

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  L I+ DI  P + C  L+LD++D SGE  L                       ++V+
Sbjct: 73  GGMLQINFDISFPGLPCSVLSLDSMDVSGEHEL-----------------------DIVH 109

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            V K+ + ++           N  G     + +  +   + + +KE              
Sbjct: 110 DVYKRAMDSKG----------NALGPVISEKVKLARDALSISHIKEQLERH--------- 150

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                             EGC IYG L   +VSG+FH    LS      HV        A
Sbjct: 151 ------------------EGCNIYGTLNAQKVSGNFH----LSLHAQDFHVLAQVFPDRA 188

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             NT+H + HLSFG   +D    + PLDG +   ++G+  F YYIKI+PT +  LDG+ +
Sbjct: 189 TVNTSHIVNHLSFG---RDYPGLKNPLDGEMKVLDQGSGTFEYYIKIVPTKFHHLDGTII 245

Query: 305 GGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
                           G P ++F Y++SP+MV++ +  +S  H  T++     G Y+   
Sbjct: 246 DTNQYSVTDHFRKLQDGFPAVYFIYDISPIMVRVKQWKQSFSHYATQLCAITGGMYV--- 302

Query: 354 LVDALLHSCVKKI-SKVEIGGKTVT 377
            V   LH+  K + +K  IG K+ +
Sbjct: 303 -VTGQLHALSKFLWTKYYIGRKSFS 326


>gi|221114903|ref|XP_002155889.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Hydra magnipapillata]
          Length = 399

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 98/386 (25%), Positives = 165/386 (42%), Gaps = 69/386 (17%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S+  K LDAF K  E + E +  GG V+I+ +LFIS L+  +   Y     T +  VD  
Sbjct: 15  SKGFKDLDAFPKIPESYQETSASGGTVSILVFLFISMLVISEFIYYSGSILTYKYEVDKE 74

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPIQEP 118
             +K  I++DI V  + CD +  D +D SG      ++LH+    +          +   
Sbjct: 75  ADNKFRINIDITV-AMECDDIGADVLDLSGGNVDTGENLHLTPAHFS---------MSSN 124

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
           QK+  +A +  + + E                 Y +  +  +      +V   Y      
Sbjct: 125 QKQWWDAFRSARKSDEG----------------YRSINKVTQIDMIFGDVMPTY------ 162

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
           +P+     + ++E+  ++      +GC+IYG +EVN+V+G+FHI  G S      H H  
Sbjct: 163 MPD-----EIESEFEGKEF-----DGCRIYGNIEVNKVAGNFHITAGKSIPHPRGHAHLS 212

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
              +   +N +H I  LSFG   +       PLDG +        M+ YYI I+PT  + 
Sbjct: 213 ALVSELNYNFSHRIDMLSFG---EPHPGIINPLDGDLMITTTPYHMYQYYIAIVPTTIQT 269

Query: 299 LDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           L  +                L  G  G+PGIFF Y+ + + V + E+ +S      ++  
Sbjct: 270 LKNTIKTNQYSVTQRSRQLNLNSGSQGVPGIFFKYDFNAISVSVNEERRSFNEFLIRLCG 329

Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
            I G + T      +LHS +  ++ +
Sbjct: 330 IIGGVFAT----SGMLHSAIGALADI 351


>gi|229594330|ref|XP_001024169.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila]
 gi|225566928|gb|EAS03924.3| hypothetical protein TTHERM_00455630 [Tetrahymena thermophila
           SB210]
          Length = 348

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/397 (25%), Positives = 166/397 (41%), Gaps = 90/397 (22%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  D + K   D  E T+ G  V+IV  L +  L   +   Y  V    E+FVD ++G
Sbjct: 9   KLKSFDMYRKLPSDLTEPTLSGAIVSIVSTLIMLILFISEFNGYLSVEENSEMFVDVAQG 68

Query: 66  -SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             K+ ++LDI  P   CD  +LD  D  G   ++VE ++ K RL   G  +++ ++    
Sbjct: 69  GQKIRVNLDIDFPQFPCDIFSLDVQDIMGSHSVNVEGDLVKTRLSSTGTYLEKIKQNTGG 128

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                     +G  + +LE                        VK+A+  +         
Sbjct: 129 DHGHGGHGHGHGDVSLDLE-----------------------RVKKAFNDR--------- 156

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP-YTS 243
                             EGC+I G++ VN+V G+FHI+       +H + + +Q  +  
Sbjct: 157 ------------------EGCKISGFMLVNKVPGNFHIS-------SHAYGNYLQRIFQD 191

Query: 244 AAFNT---THHIRHLSFGIKLQDDDERR----------KPLDGTVAKAEEGASMF----N 286
           A  NT   +H I HLSFG   +++D  R          +PLD T     E          
Sbjct: 192 ARINTLDLSHVINHLSFG---EENDLNRIKKTFQQGILQPLDHTKKIKPENLRTVGVTHQ 248

Query: 287 YYIKIIPTIYERLDGSK-----LGGGDGGM-----PGIFFSYELSPLMVKITEKSKSLGH 336
           YYI ++PT Y+ L   K            M     P +FF Y+LSP+ V+ ++  +S  H
Sbjct: 249 YYINVVPTTYKDLSNRKYHVYQFVANSNEMTTQHLPAVFFRYDLSPVTVQFSQTRESFLH 308

Query: 337 LWTKIMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
              ++   I G +    ++D+++H S V  + K E+G
Sbjct: 309 FLVQVCAIIGGVFTVAGIIDSIVHRSVVHILKKAEMG 345


>gi|340058906|emb|CCC53277.1| conserved hypothetical protein [Trypanosoma vivax Y486]
          Length = 394

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 97/398 (24%), Positives = 162/398 (40%), Gaps = 62/398 (15%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVT----------IVCWLFISYLICVDVCDYFQV 52
           F ++ +  D F KP ED+       GA+           +V W  ++Y+   D       
Sbjct: 21  FLKKFEAFDFFPKPKEDYRRSQTTVGALVSVVTLALILLLVLWEGVAYIYGRD------- 73

Query: 53  STTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           +   EL VD+S   ++  ++DI  P   C+ L LD  D++G    +V  N++K  LD  G
Sbjct: 74  AYRTELAVDTSLTKEVVFNIDISFPQERCNELFLDVFDATGSTRFNVTMNVHKTPLDASG 133

Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSC-------YGAETETRKCCNTC 165
           K +   ++           T        +   P  CG C       Y  + ET  C NTC
Sbjct: 134 KSVFVGERHF-----HTDYTVPQYNAKFDPTSPKFCGKCFVGRKYSYLQQPET-PCRNTC 187

Query: 166 NEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG 225
            +V E +  +K A P   T+ QC  E S E        GC   G L++ + SG+   AP 
Sbjct: 188 EQVMEEFERRKLAKPSKSTVEQCIGELSEE------NPGCNYRGSLKLKKASGTLIFAPK 241

Query: 226 LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRK---PLDGTVAKAEEGA 282
           +  ++    ++D+       FN +H I  LS G  L     +R    PL+       +  
Sbjct: 242 MFENV--FRINDLM-----QFNASHVINKLSIGDDLVRRFSKRGVYFPLNNQRFVTTKQF 294

Query: 283 SMFNYYIKIIPTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
           +   Y++KI+PT Y                 + D  ++  G G +P + FS++ S + V 
Sbjct: 295 AQVRYFMKIVPTTYISDNTANPVASTYEYSVQWDHRQVPLGSGEIPSVVFSFDFSSMQVN 354

Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
              +  S  H    +   + G ++   +VD L+   ++
Sbjct: 355 NYFQRPSFCHFIVSLCGIVGGLFVVLGMVDGLVARVLR 392


>gi|407424942|gb|EKF39210.1| hypothetical protein MOQ_000571 [Trypanosoma cruzi marinkellei]
          Length = 393

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 97/389 (24%), Positives = 162/389 (41%), Gaps = 58/389 (14%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
           +++  +D F KP ED+     Y GA V++V  + I  L+  +V  Y   + + T EL VD
Sbjct: 22  KKVAAVDFFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVYSYIVGRDAYTTELSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S  +++  +LDI  P I C  ++LD +D +G  +L+V  NI+K  +D  G         
Sbjct: 82  TSLSTEVEFNLDITFPRIRCHDVSLDILDVTGTVNLNVTRNIFKTPVDAQGN-------- 133

Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
               +  ++   E G+   + +D    P  CG C+  E +        +CCNTC++V  A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPNSPQFCGRCFINEHQVSVKENKNRCCNTCDDVLNA 192

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           Y  +    P    + QC  + S          GC   G L V +  G    AP       
Sbjct: 193 YDQQGLPRPRKSEVEQCIYDLS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVSG 244

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
              + D+       F+++H I  LS G +      RR    PL+G     +   +   Y+
Sbjct: 245 GFLIKDVM-----QFDSSHVINKLSIGDERVTRFSRRGVQHPLNGHKFDTQRRITEIRYF 299

Query: 289 IKIIPTIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEK 330
           +KI+PT+Y  L G                       G G  P +   ++  P+ V    +
Sbjct: 300 LKIVPTMY--LSGKNSAPFNATYEYSVQWSQRLTPIGFGHFPSVSLGFDFHPMQVNNYFR 357

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALL 359
             S  H   ++   + G ++   L+D L+
Sbjct: 358 RSSFPHFIVQLCGIVGGLFVVLGLIDGLV 386


>gi|145351005|ref|XP_001419879.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580112|gb|ABO98172.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 373

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 154/374 (41%), Gaps = 54/374 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---- 62
           LK LDA  K  ED+  ++  G   T+VC      L   +   Y       EL V+     
Sbjct: 5   LKALDANPKLKEDYVSESTSGVITTLVCAALCLILFFGEFFSYKTTKIVSELRVNPLGVH 64

Query: 63  ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEP 118
                  +L I +DI   +++C+ + LD  D +GE+H  V + +I KRR+D  GK     
Sbjct: 65  QTVPNAERLKIDVDITFHSLACNLITLDTSDKAGEEHYDVHDGHIEKRRIDKHGK----- 119

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
              V++A    +   ++      L+  N+  S + A++   +       +       +  
Sbjct: 120 ---VIDAAFTSEKPNKHKEIEQALQKMNETDSAHAADSHAMEHVQPFGGMFGLQSLLQEV 176

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            PE                +N   EGC++ GYLEVNRV G F I+PG S  +    +  +
Sbjct: 177 FPE----------GVEHAFRNENQEGCEVKGYLEVNRVPGRFSISPGRSLMMG---MQMV 223

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
           +     A N TH I  LSFG   +       PLDGT       A    Y++ ++ T +E 
Sbjct: 224 KLNVQTALNLTHTIHRLSFG---ESFPGLVSPLDGTHRSLPPNAVQ-QYFLNVVSTTFEP 279

Query: 299 LDGSK--------------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           L  +K                    +G  +G  PG+ F+YE+SP+ V   E   S G   
Sbjct: 280 LGENKIISTHQYSVTETFTSSQRSIMGTSNGRDPGVIFTYEISPIRVDFKETRTSFGAFV 339

Query: 339 TKIMCNISGTYITF 352
             I C++ G  +T 
Sbjct: 340 LGI-CSVIGGVVTM 352


>gi|291232448|ref|XP_002736170.1| PREDICTED: MGC81917 protein-like [Saccoglossus kowalevskii]
          Length = 395

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 155/383 (40%), Gaps = 84/383 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E++ E T  GG V+I+ +  I+ L+  ++  Y + +   E  VD+   S
Sbjct: 13  VKELDAFPKIPENYQETTATGGTVSILTFSLIAILVISEIQYYSETTMKYEYEVDTDLTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSG-----------EQHLHVEHNIYKRRLDLDGKPI 115
           KL +++DI V  + CDY+  D +D +G           EQ +H E     RR     K +
Sbjct: 73  KLRLNIDITV-AMKCDYIGADVLDMTGDTVSASFGSLKEQAVHFE---LSRRQKQWQKKL 128

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
           Q  +  + N    + +  + G   +    P +     GA                     
Sbjct: 129 QAVRSALANEHAIQDLLFKVGFDGSPTSMPEREDKPAGAPN------------------- 169

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
                                        C+I+G + +N+V+G+FHI  G S      H 
Sbjct: 170 ----------------------------SCRIHGSMSLNKVAGNFHITLGKSIPHPRGHA 201

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT- 294
           H     + + +N +H I H SFG+          PLDG     +E A M+ Y+I+I+PT 
Sbjct: 202 HLAAFISQSQYNFSHRIDHFSFGVPTPGI---VNPLDGDQRVTQENARMYQYFIQIVPTR 258

Query: 295 --------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
                         + ER        G  G+ GIFF Y+LS + VK+TE+ +       +
Sbjct: 259 VNTRRASADTHQYAVTERDRVISHSSGSHGVAGIFFKYDLSSVSVKVTEEYQPYWQFLVR 318

Query: 341 IMCNISGTYITFMLVDALLHSCV 363
           +   I G + T      +LHS +
Sbjct: 319 LCGIIGGVFAT----SGMLHSLI 337


>gi|71755761|ref|XP_828795.1| hypothetical protein [Trypanosoma brucei brucei strain 927/4
           GUTat10.1]
 gi|70834181|gb|EAN79683.1| hypothetical protein, conserved [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 391

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 156/390 (40%), Gaps = 49/390 (12%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
            ++  +D FTKP ED+   +T  G  ++I+    +  L   +V  Y     +   EL VD
Sbjct: 23  RKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAYKTELSVD 82

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S    +  ++DI      C  L LD  D SG   ++V  N+ K  +D+ G         
Sbjct: 83  TSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGN-------- 134

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCY---GAETETRKCCNTCNEVKEAYRYK 175
           +     ++  T       T   DPN    CG C+    A    + CCNTC EV   +  K
Sbjct: 135 LAYLGTRRFFTDPRSPLYTRRNDPNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRK 194

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
               P  + + QC  E S E        GC   G L V +VSG     P +    N + +
Sbjct: 195 GLPRPNKNVVEQCIGELSLEN------PGCNYRGALNVRKVSGVIFFTPKVIK--NTIKM 246

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMF---NYYIKII 292
            D+       F+ +H I   S G +      RR  L+    +   G+  F    YY+ I+
Sbjct: 247 EDL-----LKFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIV 301

Query: 293 PTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           PT Y                   +  ++  G GG P + FS++  P+ V    K + + H
Sbjct: 302 PTTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYH 361

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKI 366
              ++   I G ++   LVD+++    + +
Sbjct: 362 FLVQLCGIIGGLFVVLGLVDSVVARLTRLV 391


>gi|198422133|ref|XP_002131157.1| PREDICTED: similar to ptx1 [Ciona intestinalis]
          Length = 391

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/395 (24%), Positives = 162/395 (41%), Gaps = 73/395 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E   E +  GG +T++    I++L+  ++  YF V+   +  VD    S
Sbjct: 15  VKSLDAFPKVPELCIETSTRGGTITLITTAVITFLVLSEIIYYFNVTFRYDYQVDVDFDS 74

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ ++ DI V T  C  +  D +D +G+  +  E+ +Y+             Q++ +  +
Sbjct: 75  KVWLNFDITVAT-PCTLIGADVLDVTGQATV-FENEVYEELTFFRQSNTAAAQRKALLRM 132

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K++ +T ENG   +E+                                       L +  
Sbjct: 133 KEELLTPENGKKMSEIT--------------------------------------LQSNF 154

Query: 187 QCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                +   KL N     + C+ YG L +N+V+G+FHI  G    +   H H    ++  
Sbjct: 155 NPNLMFKNRKLDNVGIKMDACRFYGNLPLNKVAGNFHIVAGKPIQMFGGHAHLSMMFSPI 214

Query: 245 AFNTTHHIRHLSFG------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---- 294
            +N +H I H SFG      I   D DER       V  +E  + +F YY+ ++ T    
Sbjct: 215 PYNFSHRIDHFSFGNMKTGFINALDGDER-------VTSSE--SYIFQYYLDVVSTKINS 265

Query: 295 -----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                      + E+        G  G PG+FF Y  SPL V ITE+      L  ++  
Sbjct: 266 RRITTDTFQFSVSEQSRALDHASGSHGQPGVFFKYNFSPLSVMITEQKMPFYRLLVRLCS 325

Query: 344 NISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            + G + T  +++ALL  C+   +K     K +T 
Sbjct: 326 IVGGIFATSHVLNALL-GCLPGFTKQSESSKLITN 359


>gi|261334705|emb|CBH17699.1| hypothetical protein, conserved [Trypanosoma brucei gambiense
           DAL972]
          Length = 391

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 95/390 (24%), Positives = 155/390 (39%), Gaps = 49/390 (12%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
            ++  +D FTKP ED+   +T  G  ++I+    +  L   +V  Y     +   EL VD
Sbjct: 23  RKVAAVDLFTKPKEDYCRSQTRAGAIISIITVFAVGLLASWEVMSYTLGWNAYKTELSVD 82

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S    +  ++DI      C  L LD  D SG   ++V  N+ K  +D+ G         
Sbjct: 83  TSPEKNITFNIDITFMQEPCHDLFLDVSDVSGTFSINVTENLLKTPVDVGGN-------- 134

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNK---CGSCYGAETET---RKCCNTCNEVKEAYRYK 175
           +     ++  T       T   DPN    CG C+         + CCNTC EV   +  K
Sbjct: 135 LAYLGTRRFFTDPRSPLYTRRNDPNSPDFCGRCFTGNKAIAGGKNCCNTCEEVMAEHDRK 194

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
               P  + + QC  E S E        GC   G L V +VSG     P +    N + +
Sbjct: 195 GLPRPNKNVVEQCIGELSLEN------PGCNYRGALNVRKVSGVIFFTPKVIK--NTIKM 246

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMF---NYYIKII 292
            D+       F+ +H I   S G +      RR  L+    +   G+  F    YY+ I+
Sbjct: 247 EDL-----LKFDASHVINKFSIGDESVRRHSRRGVLNPLEKQRFNGSGRFMKVRYYLNIV 301

Query: 293 PTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           PT Y                   +  ++  G GG P + FS++  P+ V    K + + H
Sbjct: 302 PTTYGSGASSGLHPPTYEYSANWNSREVAIGYGGFPSVEFSFDFFPMQVNNNFKREPIYH 361

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKI 366
              ++   + G ++   LVD+++    + +
Sbjct: 362 FLVQLCGIVGGLFVVLGLVDSVVARLTRLV 391


>gi|449704125|gb|EMD44426.1| endoplasmic reticulumgolgi intermediate compartment protein,
           putative [Entamoeba histolytica KU27]
          Length = 185

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 11/173 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  D + K  ED   +  +GG +TI+C + I  L   +   Y Q     +L VD  R S
Sbjct: 1   MKRFDTYGKVPEDLRTRHCFGGFLTIICVVIIIVLSIAEFAFYLQREVVPQLLVDRERSS 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+P+H DI  P  SC   ++D +  SGE  + +E N+ K R+  DG  + E + + + + 
Sbjct: 61  KIPVHFDITFPYSSCPITSVDILTKSGESMIGIEQNVTKIRIHHDGSLVTENEMKAIQS- 119

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
                       + E  DP +C SCYGAET  +KCC TC++VKEAY+ + W L
Sbjct: 120 ----------KLSIETPDPKECRSCYGAETPEKKCCFTCDDVKEAYKKRGWRL 162


>gi|212275606|ref|NP_001131002.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
 gi|194690678|gb|ACF79423.1| unknown [Zea mays]
 gi|413952089|gb|AFW84738.1| DUF1692 domain, endoplasmic reticulum vescicle transporter protein
           [Zea mays]
          Length = 293

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 148/329 (44%), Gaps = 68/329 (20%)

Query: 60  VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           VD  RG  LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K RLD  G  I    
Sbjct: 3   VDLKRGETLPIHINMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWKLRLDKYGHIIG--- 59

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
                              T  L D  + G  +GA  +     +  +E K   ++++   
Sbjct: 60  -------------------TEYLSDLVEKG--HGAHHDHDHDHDHHDEQK---KHEQTFN 95

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
            E + +++       + L N   EGC++YG L+V RV+G+FHI+           VH + 
Sbjct: 96  EEAEKMIKS----VKQALGNG--EGCRVYGMLDVQRVAGNFHIS-----------VHGLN 138

Query: 240 PYT-------SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
            +        S   N +H I  LSFG K         PLD T     + +  F YYIK++
Sbjct: 139 IFVAEKIFEGSNHVNVSHVIHELSFGPKYPGI---HNPLDETSRILHDTSGTFKYYIKVV 195

Query: 293 PTIYERLDGSKLGGG--------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           PT Y+ L    L                 D   P ++F Y+LSP+ V I E+ ++  H  
Sbjct: 196 PTEYKYLSKKVLPTNQFSVTEYFLPIRPTDRAWPAVYFLYDLSPITVTIKEERRNFLHFV 255

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS 367
           T++   + GT+    ++D  ++  +K ++
Sbjct: 256 TRLCAVLGGTFAMTGMLDRWMYQLIKTVT 284


>gi|348505737|ref|XP_003440417.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oreochromis niloticus]
          Length = 374

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 155/371 (41%), Gaps = 66/371 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E T  GG V+++ + F++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVESTASGGTVSLIAFTFMAVLAFLEFFVYRHTWMKYEYEVDRDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D            + +  +  DG      Q E VN  
Sbjct: 73  KLRINVDITV-AMRCQYIGADVLD------------LAETMVASDGL-----QYEPVN-- 112

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETR-KCCNTCNEV--KEAYRYKKWALPELD 183
                         EL    +         + R +  +   +V  K A +    ALP   
Sbjct: 113 -------------FELPPQQRIWHMTLLHIQERLRVEHALQDVIFKAAIKGAPPALPP-- 157

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
                ++E ST  L       C+I+G+L VN+V+G+FHI  G S      H H       
Sbjct: 158 -----RSEDSTASLS-----ACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVAH 207

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
            ++N +H I HLSFG  L        PLDGT   A +   MF Y+I I+PT         
Sbjct: 208 DSYNFSHRIDHLSFGEPLPGII---SPLDGTEKIATDSNHMFQYFITIVPTKLNTYKVSA 264

Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                 + ER        G  G+ GIF  Y++S LMVK+TE+   L     ++   I G 
Sbjct: 265 ETHQYSVTERERVINHAAGSHGVSGIFMKYDISSLMVKVTEQHMPLWQFLVRLCGIIGGI 324

Query: 349 YITFMLVDALL 359
           + T  ++  L+
Sbjct: 325 FSTTGMIHGLV 335


>gi|403371798|gb|EJY85783.1| hypothetical protein OXYTRI_16231 [Oxytricha trifallax]
          Length = 333

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/394 (24%), Positives = 157/394 (39%), Gaps = 98/394 (24%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LD F +  +D  E T  G  +T +C+  +  L   +V  Y  V T  ++ VD S   
Sbjct: 7   LARLDIFKRVPKDLTEPTFCGALLTSICFFVLVGLSLSEVARYLNVETKTDMLVDISHSD 66

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            KL I++DI  P   C+ L+LD  D  G  H+++E  + K+R+                 
Sbjct: 67  DKLEINIDITFPRFPCEILSLDVQDVMGTHHVNIEGGLVKQRI----------------- 109

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                  T NG    E        S +  +  +     T +EVK                
Sbjct: 110 -------TANGEVILEY-------SAHTKQDRSHVASQTRDEVKAQ-------------- 141

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
                            EGC IYG + +NRV G+FHI+   +++ N + +  +Q      
Sbjct: 142 -----------------EGCHIYGNILINRVPGNFHIS---THAFNDILMGLMQ--EGHH 179

Query: 246 FNTTHHIRHLSFGIKLQDDDERRK--------PLDGTVAKAEEGASMF------NYYIKI 291
           F+ ++ I H+SFG +   D  RRK        PLDG    A      F      N+Y+  
Sbjct: 180 FDFSYKIDHISFGKRNNFDMIRRKFRDHQLISPLDGKSETAPRDNKNFPKSLEGNFYLIA 239

Query: 292 IPTIYERLDG-------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +P+ ++ + G             +  G G+  +    F+YELSP+ V  ++  +S+    
Sbjct: 240 VPSYFKDVSGGVYQVYQLTANDHTNFGTGNNILK---FNYELSPITVGFSQDRESIALFL 296

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
             I   I G +    ++DA++H     + K  IG
Sbjct: 297 VHICAIIGGVFTAVSIIDAIIHKSFSLLFKKRIG 330


>gi|390337315|ref|XP_792272.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Strongylocentrotus purpuratus]
 gi|390337317|ref|XP_003724529.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Strongylocentrotus purpuratus]
          Length = 388

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 156/368 (42%), Gaps = 58/368 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  ED+ + T  GG V+IV ++ I+ L+  +   Y          VD+   +
Sbjct: 13  VKELDAFPKIPEDYVKTTSTGGTVSIVTFIVIAGLVISEFMYYLDSRMKYGYDVDTDFNT 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + CDY+  D +DS+G+  +              GK  +EP        
Sbjct: 73  KLQINIDITV-AMKCDYIGADVLDSAGDSAM----------FKFSGKLKEEP-------- 113

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                        T  E   +  S +      RK  +  + +++      ++    +   
Sbjct: 114 -------------TSFEMTPQQRSWHKTLQTVRKALSEEHAIQDLLFQTGFSSKPTN--- 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           Q +   S +KL     + C+++G L  N+V+G+FH+  G S      H H         +
Sbjct: 158 QPQRVDSGKKL-----DACRLHGSLTTNKVAGNFHVTIGKSIPHPRGHAHLALMIDPNNY 212

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I H S+G  +        PLDG +    E   ++ Y+I+I+PT            
Sbjct: 213 NFSHRIDHFSYGTPVPG---IVNPLDGDLKVTNESLQIYQYFIQIVPTKVKTRAAKAHTH 269

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER      G G  G+ GIFF YELS L++ + E       L  ++   + G + T
Sbjct: 270 QYAVTERERVINHGAGSHGVTGIFFKYELSSLVISVEEVYDPFWKLLVRLCGIVGGVFAT 329

Query: 352 FMLVDALL 359
             ++++L+
Sbjct: 330 SGIINSLM 337


>gi|340372649|ref|XP_003384856.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Amphimedon queenslandica]
          Length = 347

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 147/371 (39%), Gaps = 66/371 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K  ED+ + T  GG  +IV    I +LI  ++  +       E  VD+   S
Sbjct: 10  VKEFDAFPKVSEDYIKPTTRGGLFSIVSITIILFLIVSELSYFKDSEILYEYMVDTDMTS 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +  DI V  + C++L  D VD++G            + L       QE  KE     
Sbjct: 70  TLKLRFDITV-AMPCEFLGADVVDAAGS----------SKSLQ------QEVHKE----- 107

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                      T  EL    K       E   R          E  R  +  +   D+  
Sbjct: 108 ----------PTIFELNKEQKAWLAAKQEVIRRH---------EGLRLLRDVM--FDSHP 146

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSA 244
           Q    +      +     C+++G+++VN+VSG+FHI  G  + +   H H+    P  + 
Sbjct: 147 QQYIPFPEHPQHSAPLTSCRVHGHIQVNKVSGNFHITAGQAVPHPQGHAHLSAFVP--TN 204

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             N +H I    FG+      +   PL+GT   A E   +F YYI+I+PT  +   GS L
Sbjct: 205 MINFSHRIDSFGFGVSTPGMVD---PLEGTYVIARESNRLFQYYIQIVPTTLQMRGGSDL 261

Query: 305 ----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                             G  G+PG+FF YE+  LMV + E  + L     ++   + G 
Sbjct: 262 HTNQYSVTERNRAISHKAGSHGLPGLFFKYEIYSLMVLMKEVDRPLSLFLVRLCAIVGGV 321

Query: 349 YITFMLVDALL 359
           + T  ++   L
Sbjct: 322 FATLGMISQFL 332


>gi|270003406|gb|EEZ99853.1| hypothetical protein TcasGA2_TC002635 [Tribolium castaneum]
          Length = 380

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 70/378 (18%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           ++K +D F K  E F EK+  GG  ++  ++ I++L+ +++  Y       +   D+   
Sbjct: 18  KIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTDFD 77

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
           +KL I++DI V  + C  L  D +DS+ +       N YK   LD +    +    + ++
Sbjct: 78  AKLKINVDITV-AMPCSNLGADILDSTNQ-------NAYKFGSLDEEDTWFEMAPNQQIH 129

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL--PEL 182
              KK+  +                                  V+E Y   K  L     
Sbjct: 130 FHNKKQFNSY---------------------------------VREEYHALKDVLWKSRF 156

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
            T+ + + E ST    N   + C+I+G L +N+VSG+FHI  G S ++   H+H     +
Sbjct: 157 STMFRHRPERST--YPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIHISAFMS 214

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
              +N +H I   SFG           PL+G       G ++FNY+I+++PT        
Sbjct: 215 ERDYNFSHRIDTFSFG---DSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN 271

Query: 295 ----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
                     +   +D  K   G  GMPGIFF Y++S L V ++++   LG    ++   
Sbjct: 272 VNTYQYSVKELNRPIDHDK---GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSI 328

Query: 345 ISGTYITFMLVDALLHSC 362
           I G ++    V++ +  C
Sbjct: 329 IGGIFVCSGFVNSFVQFC 346


>gi|189235693|ref|XP_966630.2| PREDICTED: similar to AGAP005044-PA [Tribolium castaneum]
          Length = 373

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 161/378 (42%), Gaps = 70/378 (18%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           ++K +D F K  E F EK+  GG  ++  ++ I++L+ +++  Y       +   D+   
Sbjct: 11  KIKKIDIFPKIEETFKEKSSVGGTFSVFSFILITWLVFLEINYYLDSKFIFKFSPDTDFD 70

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVN 124
           +KL I++DI V  + C  L  D +DS+ +       N YK   LD +    +    + ++
Sbjct: 71  AKLKINVDITV-AMPCSNLGADILDSTNQ-------NAYKFGSLDEEDTWFEMAPNQQIH 122

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL--PEL 182
              KK+  +                                  V+E Y   K  L     
Sbjct: 123 FHNKKQFNSY---------------------------------VREEYHALKDVLWKSRF 149

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
            T+ + + E ST    N   + C+I+G L +N+VSG+FHI  G S ++   H+H     +
Sbjct: 150 STMFRHRPERST--YPNRPHDACRIHGSLILNKVSGNFHITAGKSLNLPRGHIHISAFMS 207

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
              +N +H I   SFG           PL+G       G ++FNY+I+++PT        
Sbjct: 208 ERDYNFSHRIDTFSFG---DSSPGIIHPLEGDELITHNGMTLFNYFIEVVPTNVKTFLAN 264

Query: 295 ----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
                     +   +D  K   G  GMPGIFF Y++S L V ++++   LG    ++   
Sbjct: 265 VNTYQYSVKELNRPIDHDK---GSHGMPGIFFKYDMSALKVTVSQERDHLGMFLARLCSI 321

Query: 345 ISGTYITFMLVDALLHSC 362
           I G ++    V++ +  C
Sbjct: 322 IGGIFVCSGFVNSFVQFC 339


>gi|343476464|emb|CCD12449.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 224

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/230 (29%), Positives = 108/230 (46%), Gaps = 19/230 (8%)

Query: 5   ERLKGLDAFTKP----YEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           +R   LD F K      +D  ++T  GG ++I   + I+ LI  +V  +       E++V
Sbjct: 2   KRFSRLDVFPKFDARFEQDARQRTALGGVLSIASMVAIALLIIGEVRYFLTTVEQHEMYV 61

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG-KPIQEPQ 119
           D   G  + + ++I  P + CD +  DA+D+ GE    +  +  K R+D D   P+ E  
Sbjct: 62  DPRIGGTMHVVINITFPRVPCDLMTADAIDAFGEYVEDMGRDTVKMRVDSDTLAPLGE-A 120

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           + +VN  KK               D + C SCYGAE     CC+TC++V+ A+  ++W  
Sbjct: 121 RPLVNMNKKAT------------SDTHDCPSCYGAEKNPGDCCHTCDDVRRAFAERQWEF 168

Query: 180 PELD-TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
            E D +I+QC  E           EGC ++    V RV+ + H  PG  +
Sbjct: 169 HEDDVSIMQCAKERLQMAASTASREGCNLHSSFRVPRVTENIHFVPGRMF 218


>gi|303290895|ref|XP_003064734.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226453760|gb|EEH51068.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 363

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 107/386 (27%), Positives = 164/386 (42%), Gaps = 64/386 (16%)

Query: 7   LKGLDAFT--KPYEDFHEKT-VYGGAVTIVCWLFISYLICVDVCDYFQVSTT---EELFV 60
           L+ +D ++  K  EDF + + + GG +T  C L    L    V +YF   T      L V
Sbjct: 5   LRRMDVYSSSKVIEDFRQSSSMSGGIITCACALLCFVLF---VNEYFYHRTPVVKSSLTV 61

Query: 61  D--------SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE--QHLHVEHNIYKRRLDL 110
           D        S+  ++L + +DI    + CD + +D +D +GE    +H  H + KRRLD 
Sbjct: 62  DATGLDAKTSANSNRLHVEIDITFHQLPCDIINMDTMDQAGEAFHDVHSGH-LKKRRLDS 120

Query: 111 DGKPIQEPQK-EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK 169
           DGKP++   K E  NA K+ +   E+       ++  K                     +
Sbjct: 121 DGKPLEGVFKHEKANAHKEIREDIESHALALSGDEEYKTSE------------------E 162

Query: 170 EAYRYKKWALPELDTIVQCKNEYSTEK-LKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           +    +   +  L  ++  +     EK  KN   EGC++ GYLEVNRV GSF ++PG S 
Sbjct: 163 DLMPEEGLTMFNLKQLLDKQFPGGIEKAFKNEAREGCEVIGYLEVNRVPGSFSVSPGKSI 222

Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
            +   HV   Q    +  N +H I   +FG           PLDG  A+  +   +  Y+
Sbjct: 223 RLGMEHV---QLNVQSRLNMSHTINRFAFGKSFPG---FVSPLDGN-ARDLDPNYVHQYF 275

Query: 289 IKIIPTIYERLDGSKLGGGD----------------GGMP-GIFFSYELSPLMVKITEKS 331
           +KI+PT +  L G  L                    G  P G++F+Y+LSPL V   E  
Sbjct: 276 LKIVPTSFTPLRGEYLQSNQYSVTEASAPAKALNVVGSKPSGVYFNYDLSPLRVDYVESR 335

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDA 357
            S+    T +   + G      LV A
Sbjct: 336 NSMTEFITSVCAIVGGVASMSGLVQA 361


>gi|443716796|gb|ELU08142.1| hypothetical protein CAPTEDRAFT_19918 [Capitella teleta]
          Length = 403

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 95/370 (25%), Positives = 154/370 (41%), Gaps = 63/370 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++ LDAF K  E + E +  GG+++I+  +  + LI  ++  Y       +  VD     
Sbjct: 12  VRELDAFPKVPEGYQECSASGGSISILVLVLSAILIISEIRYYTATEFKYDYEVDKHFEG 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C  +  D +D +G Q++     + +  +  +  P Q    + ++A+
Sbjct: 72  KLSINIDITV-AMKCHQVGADVLDITG-QNVASFGKLTEEEVHFELSPNQRKHLKSMSAI 129

Query: 127 KKKKVTTENGTTTTELEDPNKC--GSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                   N     E    +K    S +G                    Y     P  D 
Sbjct: 130 --------NEYIRNEYHSIHKFLWRSGFGG-------------------YLAQMPPREDH 162

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN-HVHVHDIQPYTS 243
               KN             GC+ YG L+VN+V+G+FHI  G S  +N   H H       
Sbjct: 163 PQTPKN-------------GCRFYGTLDVNKVAGNFHITAGKSVPLNIGGHAHMAMMVKE 209

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
           + +N TH I H SFG K+     R  PLDG      +   M+ Y+I+++PT         
Sbjct: 210 SDYNFTHRIEHFSFGDKVSG---RINPLDGEEKNTNDNYHMYQYFIQVVPTHVKTLFTDI 266

Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                 + E+      G G  G+PGIF  Y+L+P+MVK+ E  K    L  ++   I G 
Sbjct: 267 NTYQFSVTEQNRTISHGKGSHGIPGIFVKYDLAPMMVKVIESHKPFSQLLIRLCGIIGGL 326

Query: 349 YITFMLVDAL 358
           + T  ++  +
Sbjct: 327 FATSGMLHGM 336


>gi|307188057|gb|EFN72889.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Camponotus floridanus]
          Length = 386

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 95/370 (25%), Positives = 153/370 (41%), Gaps = 58/370 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + +KT  GG  +I   L I+YLI  +   +       +   D+   +
Sbjct: 12  VKELDAFPKVPELYVDKTAVGGTFSIFTMLIIAYLIIAETSYFLDSRLQFKFEPDTEIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C  +  D +DS+ +  +  +                          
Sbjct: 72  KLQINIDITV-AMPCGRIGADVLDSTNQNMISYD-------------------------- 104

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                T E   T  EL    +      A  E  K  N+   ++E Y      L + + I 
Sbjct: 105 -----TLEEEDTWWELTQEQR------AHFEALKHMNSY--LREEYHAIHELLWKSNQIT 151

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                       +  T  C+I+G L VN+V+G+FHI  G S S+   H+H     T   +
Sbjct: 152 LYSEMPMRSHKPDYATNACRIHGSLVVNKVAGNFHITAGKSLSLPRGHIHISAYMTDQDY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL- 304
           N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L  SK  
Sbjct: 212 NFTHRINRFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLSTSKTY 268

Query: 305 -------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                          G  G+PGIFF Y++S L +K+T++  ++     K+   + G ++T
Sbjct: 269 QYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFVT 328

Query: 352 FMLVDALLHS 361
             LV  ++ S
Sbjct: 329 SGLVKNIVQS 338


>gi|119596606|gb|EAW76200.1| ERGIC and golgi 3, isoform CRA_b [Homo sapiens]
          Length = 239

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 87/158 (55%), Gaps = 20/158 (12%)

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           +HD+Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT
Sbjct: 85  IHDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPT 141

Query: 295 IYERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  
Sbjct: 142 VYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFL 201

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           T +   I G +    L+D+L++   + I  K+++G  T
Sbjct: 202 TGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTT 239


>gi|326911226|ref|XP_003201962.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Meleagris gallopavo]
          Length = 377

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 160/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  I++L  ++   Y       E  VD    S
Sbjct: 13  MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  IY+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEPVV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S E       + C+I+G+L VN+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --REDNSLES-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAETH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y++S LMV +TE+         ++   I G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   + +++V
Sbjct: 329 ----TGILHGFGRFVAEV 342


>gi|169731514|gb|ACA64886.1| endoplasmic reticulum-Golgi intermediate compartment protein 3
           (predicted) [Callicebus moloch]
          Length = 237

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 58/156 (37%), Positives = 84/156 (53%), Gaps = 19/156 (12%)

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
           HD+Q +     N TH+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+
Sbjct: 84  HDLQSFGLDNINMTHYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTV 140

Query: 296 YERLDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           Y ++DG  L                  GD G+PG+F  YELSP+MVK+TEK +S  H  T
Sbjct: 141 YMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMMVKLTEKHRSFTHFLT 200

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            +   I G +    L+D+L++   + I K    GKT
Sbjct: 201 GVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKT 236



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 39/113 (34%), Positives = 55/113 (48%), Gaps = 3/113 (2%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEP 118
            KL I++D++ P + C   A   + S G  ++++ H I       D   I  P
Sbjct: 66  DKLKINIDVLFPHMPC---AFHDLQSFGLDNINMTHYIQHLSFGEDYPGIVNP 115


>gi|313661438|ref|NP_001186332.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Gallus gallus]
          Length = 377

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 160/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  I++L  ++   Y       E  VD    S
Sbjct: 13  MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  IY+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEPVV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S E       + C+I+G+L VN+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --REDNSLES-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELIPG---IINPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAETH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y++S LMV +TE+         ++   I G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   + +++V
Sbjct: 329 ----TGILHGFGRFVAEV 342


>gi|384253563|gb|EIE27037.1| DUF1692-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 327

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/380 (22%), Positives = 154/380 (40%), Gaps = 89/380 (23%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
             A+ +      ++T +G  VT++  +    L   ++ +Y    + + + VD+SR   + 
Sbjct: 9   FSAYARAESHLVQRTYFGAIVTVLGVILAIVLFANELREYTTPFSIQTMSVDTSRAHYIR 68

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEH----NIYKRRLDLDGKPIQEPQKEVVNA 125
           ++ +   P++ C  L+LDA D SGE+     H     I+K RL+  G+ I          
Sbjct: 69  MNFNFTYPSMPCQVLSLDATDMSGEKSGDSGHAANGEIHKVRLNEAGEKI---------- 118

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                          E   P + G   G   +  +     N+  +A+             
Sbjct: 119 ------------GLGEYIPPRRWGFMMGKPRQ--QEVMEVNQAMDAH------------- 151

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT--- 242
                            EGC I+G+L++ RV+G+F ++         VHV D    T   
Sbjct: 152 -----------------EGCNIFGWLDLQRVAGNFRVS---------VHVEDFFALTRLQ 185

Query: 243 --SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
             +   N++H I  +SFG        +  PLDG     ++ +  F Y++K++PT Y+   
Sbjct: 186 ADTTGINSSHIIHRVSFGPTFPG---QVNPLDGAERILDKESGTFKYFLKVVPTEYQWSA 242

Query: 301 GSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
           G++              +  G+  MP ++FSY++SP+ V I+E  KS  HL  +    + 
Sbjct: 243 GTRTTTNQYSVTEYDTVVHKGEMQMPSVWFSYDISPISVTISEIRKSFAHLLVRFCAVVG 302

Query: 347 GTYITFMLVDALLHSCVKKI 366
           G +    + D  +H  V  I
Sbjct: 303 GVFAVTGMFDRWVHRIVTAI 322


>gi|156553212|ref|XP_001600226.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Nasonia vitripennis]
          Length = 391

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/377 (24%), Positives = 162/377 (42%), Gaps = 72/377 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAFTK  ED+ +++  GG  ++  +  I YLI  +   +       +   D    S
Sbjct: 11  VKELDAFTKIPEDYRKQSAVGGTFSLASFCIIVYLIYAETSYFLDSRLQFKFEPDVEYDS 70

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L +++DI V T  CD +  D +DS+ +  +                             
Sbjct: 71  QLQMNIDITVAT-PCDRIGADILDSTNQNLM----------------------------- 100

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                T+EN      LED     + +    + R        +   +R +  AL EL   +
Sbjct: 101 -----TSEN----FHLED-----TWWDLTPDQRAHFEALKHMNYYFREEYHALHEL---L 143

Query: 187 QCKNE--YSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
              N+  +S E  K  +     +  C+IYG L+VN+V+G+FH+  G S  +   H H   
Sbjct: 144 WKSNQLTFSNEMPKRDYIPSYPSNACRIYGSLDVNKVAGNFHVTSGKSVILPRGHFHFTS 203

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYER 298
            ++S A+N TH I   SFG   +       PL+G      +   +F Y+I+++ T I   
Sbjct: 204 FHSSTAYNFTHRINRFSFG---KPSPGIIHPLEGDEKITTDNMMLFQYFIEVVSTDINML 260

Query: 299 LDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           +  SK                 G  G+PGIFF Y+ S L +K++++  S+G    K+   
Sbjct: 261 MHKSKTYQYSVKDHQRPINHAKGSHGIPGIFFKYDTSALKIKVSQERDSIGQFLVKLCAT 320

Query: 345 ISGTYITFMLVDALLHS 361
           +   ++T  ++++++ +
Sbjct: 321 VGCIFVTNGILNSIVQN 337


>gi|332020071|gb|EGI60517.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Acromyrmex echinatior]
          Length = 390

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 155/371 (41%), Gaps = 60/371 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + +KT  GG  +I   L I YL+  +   Y           D+   +
Sbjct: 12  VKELDAFPKVPEVYVDKTAVGGTFSIFTVLIIMYLVIAETSYYLDSRLQFTFEPDTDIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++D+ V  + C  +  D +DS+  QH+          +D D    ++   E+    
Sbjct: 72  KLQINIDVTV-AMPCGRIGADVLDSTN-QHM----------IDFDSLTEEDTWWEL---- 115

Query: 127 KKKKVTTENGTTTTELEDPNK-CGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                T E  T    L+  N      Y A  E     N      E        +P     
Sbjct: 116 -----TQEQRTHFEALKHMNSYLREEYHAIHELLWKSNQVTLYSE--------MP----- 157

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
              K  Y  +   N     C+++G L +N+V+G+FHI  G S S+ H H+H     T   
Sbjct: 158 ---KRSYVPDYAPN----ACRVHGSLNINKVAGNFHITAGKSLSVPHGHIHISAFMTDRD 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL 304
           +N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L  SK 
Sbjct: 211 YNFTHRINKFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVVPTDIRTLLTTSKT 267

Query: 305 --------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                           G  G+PGIFF Y++S L +K+T++  ++     K+   + G ++
Sbjct: 268 YQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQFLVKLCATVGGIFV 327

Query: 351 TFMLVDALLHS 361
           T  LV  ++ S
Sbjct: 328 TSGLVKNVVQS 338


>gi|320170541|gb|EFW47440.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864]
          Length = 408

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 94/381 (24%), Positives = 164/381 (43%), Gaps = 32/381 (8%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E +K LD F K    + E +  GG VT+VC + I +L+  ++ +YF         VD   
Sbjct: 17  EFVKQLDIFPKVASTYKETSSSGGTVTLVCLVLIVFLVGAELGEYFNQQAAFSYGVDPVV 76

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSG-EQHLHV-EHNIYKRRLDLDGKPIQEPQKEV 122
              L +  DIVV  + CD L  D + ++G  +H H   H+             QEP+  +
Sbjct: 77  DGSLKLTYDIVV-AMPCDLLGADVLQATGTSKHGHDHSHDDAAPVKPAPPPSPQEPRNRL 135

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
            N +++ + T ++G      ++  K    +      R+      E ++    +  +L   
Sbjct: 136 FNVMRQSRDTGDDGRDDHGHDEMRKEPVVFALSAAQREWLA---ENRKPLTREHLSLS-- 190

Query: 183 DTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
            T  + K  +     + +  EG    C+++G +  ++++G+FHI  G +  +   H H  
Sbjct: 191 GTTRKAKKNFQAMPRELSSQEGTPDACRLHGSVSADKIAGNFHIIAGAAVEVPGGHAHMG 250

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
           Q     A N TH I HLSFG ++   +    PLDG           + Y+I+++PT+Y R
Sbjct: 251 QMIPQHALNFTHRINHLSFGEEMPGME---FPLDGDEWITTSHTMAYQYFIQVVPTVYTR 307

Query: 299 L--DGSKLGGGD-----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
              D  +L  G              +PG+FF Y+  P++V +     S  HL  ++   I
Sbjct: 308 HANDPEQLRSGQFSVTRHESPNSNRLPGLFFKYDTFPILVTVQYSPYSFWHLLIRLSGII 367

Query: 346 SGTYITFMLVDALLHSCVKKI 366
            G + T       +H  V+ +
Sbjct: 368 GGVFAT----SGFIHQVVRFV 384


>gi|123499008|ref|XP_001327531.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121910461|gb|EAY15308.1| hypothetical protein TVAG_394520 [Trichomonas vaginalis G3]
          Length = 357

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 157/374 (41%), Gaps = 52/374 (13%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD----- 61
           L+  D F K    +   T +GG ++I        L   ++  Y      +   VD     
Sbjct: 3   LRKFDVFPKLDRQYRVSTSFGGILSIASITVTIILFFSEIHTYLNPPIRQRFIVDNTKPM 62

Query: 62  -----SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH--NIYKRRLDLDGKP 114
                SS   KL ++LDI  P + C  L +D VD   +  L +E   N + R LD  GK 
Sbjct: 63  GISGKSSNQRKLSVNLDIEFPNVPCYLLHIDVVDPISQLDLPMESISNNFAR-LDKTGKN 121

Query: 115 IQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
           I +   E       K +  +N  T+          SCY A     K C TC +V +A++ 
Sbjct: 122 IGDFHPE-------KFLEPDNAKTS-------DSTSCYAANNT--KVCKTCKDVVQAHKN 165

Query: 175 KKWALPELDTIVQCKNEYST-EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
           ++   P L TI QC +  +  +++K+   EGC++    +  R++  FH+APG +Y     
Sbjct: 166 QELLPPPLSTIAQCASTAAIIQEMKD---EGCKLTSAFQTVRLASEFHVAPGYNYLYKGW 222

Query: 234 HVHD--IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIK 290
           H H+  I    S   N TH IR   F     +  + + PLD  T  +  +G+    Y   
Sbjct: 223 HSHNTTILGSESKDLNLTHIIRSFRF-----NRVDGKFPLDNVTSIQTGKGSWRVVYSAD 277

Query: 291 IIPTI-----YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           I+        YE +D  K         G++F Y ++P+       ++   HL T+++  I
Sbjct: 278 IMDNTYTANKYELMDPPKFSS------GVYFRYAINPVSAIDYYDTEPFLHLCTRLLTVI 331

Query: 346 SGTYITFMLVDALL 359
                 F L+D+ L
Sbjct: 332 GAVLAAFRLLDSFL 345


>gi|449278843|gb|EMC86582.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Columba livia]
          Length = 377

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 159/379 (41%), Gaps = 65/379 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  I++L  ++   Y       E  VD    S
Sbjct: 13  MKELDAFPKVPESYVETSATGGTVSLIAFTTIAFLTIMEFTVYRDTWMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  IY+  +  +  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMRCQYVGADVLDLAETMVASADALIYEPVV-FELSPQQKEWQRMLQVI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPPREDNS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           +Q               + C+I+G+L VN+V+G+FHI  G +      H H     +  +
Sbjct: 164 LQSP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT           
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y++S LMV +TE+         ++   I G + 
Sbjct: 268 HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   + +++V
Sbjct: 328 T----TGILHGFGRFVAEV 342


>gi|302853436|ref|XP_002958233.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
           nagariensis]
 gi|300256421|gb|EFJ40687.1| hypothetical protein VOLCADRAFT_69193 [Volvox carteri f.
           nagariensis]
          Length = 337

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 91/396 (22%), Positives = 162/396 (40%), Gaps = 93/396 (23%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L  L A+ KP     ++TV+G  VT+   L  + L   ++  +++     ++ VD +R 
Sbjct: 4   KLSSLSAYVKPEAHLVQQTVHGALVTLCGILLAAMLFVHELGSFYRQHRVTQMSVDLARR 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSG--------EQHLHVEHNIYKRRLDLDGKPI-- 115
           + L I++D+  P I C  L++D +D +G          H+H    I+K RLD  GKPI  
Sbjct: 64  NALTINIDLTFPAIPCAVLSIDVLDIAGTAENDASYAHHMH----IHKLRLDGAGKPIGK 119

Query: 116 ---QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
                PQ + +     +++ + N                                ++EA 
Sbjct: 120 AEYHTPQSQQIMDTGAEQLVSVN--------------------------------IQEAM 147

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
           ++          +V  + E           EGC +YG ++V RV+G  H      +S++ 
Sbjct: 148 QH----------LVDMEEEAEHH-------EGCHVYGTMDVKRVAGRLH------FSVHQ 184

Query: 233 VHVHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN 286
             V  + P    A       N +H I+HL FG        +  PLDG V   +     F 
Sbjct: 185 NMVFQMLPQLLGAHRIPKVANISHTIKHLGFGPHYPG---QLNPLDGYVRMVKGPPQSFK 241

Query: 287 YYIKIIPTIYERLDGSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSL 334
           Y++K++PT Y    G                   G +P +   Y+LSP+++ I E+  SL
Sbjct: 242 YFLKVVPTEYYNRLGRVTETHQYSVTEYTQPLEPGYVPTLDVHYDLSPIVMTINERPPSL 301

Query: 335 GHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
            H   ++   + G +    + D  +   V+ ++K++
Sbjct: 302 LHFVVRLCAVVGGAFAITRMTDRWVDWFVRLVTKLK 337


>gi|123425245|ref|XP_001306773.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121888365|gb|EAX93843.1| hypothetical protein TVAG_177510 [Trichomonas vaginalis G3]
          Length = 353

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 155/360 (43%), Gaps = 59/360 (16%)

Query: 7   LKGLDAFTKPYED-FHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-- 63
           ++  D + K  +D F+ +TV GG VTI+ +LF+  +   +   + +V   +   V S   
Sbjct: 1   MRKFDIYPKVQDDSFNIRTVSGGVVTIITFLFMIIVAIKEGSSFHRVEIKQHAVVQSQYI 60

Query: 64  -RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              +++ I +DI V    C  L L+ +D+SG    +   +I ++RLD+  KP+++    +
Sbjct: 61  KESNEIEIFMDITV-AYPCHMLQLNVIDASGNPQPNARQDISRQRLDVHFKPLEQ----L 115

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           ++    K V                CG+C GA     KCC TC ++  ++R  +  +P L
Sbjct: 116 ISDSDPKSVF-------------QTCGNCLGANVS--KCCLTCTDIANSFRQMEEFIPNL 160

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG------LSYSINHVHVH 236
             + QC  +    + K    E C+I   L  +   G   I  G      ++Y  +  H  
Sbjct: 161 QNVEQCNRDKKAIEDK----ETCRIVAKLNTHFTKGKLTIMAGGIVPTPVNYKFDLSHFG 216

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGASMFNYYIKIIPTI 295
           D         N TH I  L FG   +D +  + PLD  T  + ++   M+NY I ++PTI
Sbjct: 217 D-------NVNLTHTIHTLRFG---RDFEGLKNPLDNYTNNQLKKSQFMYNYKIDLVPTI 266

Query: 296 Y----ERLDGSKLGGGDGGM----------PGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
                 ++   +                  PGI F ++ +P+  +   + +SL    T++
Sbjct: 267 TNDVENQIPAHQYSASSSSKEITKMITKKHPGITFDFDTAPVAARFIVEKQSLSSFLTQL 326


>gi|148674216|gb|EDL06163.1| ERGIC and golgi 3, isoform CRA_c [Mus musculus]
          Length = 261

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 60/172 (34%), Positives = 91/172 (52%), Gaps = 26/172 (15%)

Query: 227 SYSINHVHVHDIQ------PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEE 280
           S S+  + +HD+Q      P      N TH+I+HLSFG   +D      PLD T   A +
Sbjct: 93  SNSLMCMVIHDLQSFGLDNPSDCLQINMTHYIKHLSFG---EDYPGIVNPLDHTNVTAPQ 149

Query: 281 GASMFNYYIKIIPTIYERLDGSKLGG----------------GDGGMPGIFFSYELSPLM 324
            + MF Y++K++PT+Y ++DG  L                  GD G+PG+F  YELSP+M
Sbjct: 150 ASMMFQYFVKVVPTVYMKVDGEVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELSPMM 209

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
           VK+TEK +S  H  T +   I G +    L+D+L++   + I  K+++G  T
Sbjct: 210 VKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQKKIDLGKTT 261


>gi|340709072|ref|XP_003393139.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Bombus terrestris]
          Length = 392

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 154/374 (41%), Gaps = 66/374 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LD F K  E + +KT  GG  +I     I+YLI  +   Y       +   D+   +
Sbjct: 12  VKELDGFPKVPELYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  ++C  ++ D +DS+ +  +                             
Sbjct: 72  KLKINIDITV-AMTCSRISADVLDSTNQNMI----------------------------- 101

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
                    G  + E ED     + +    E R       +V    R +  A+ EL    
Sbjct: 102 ---------GHESLEQED-----TWWELTQEQRSHFEALKDVNSYLREEYHAIHELLWKS 147

Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
             V   +E      + ++    C+I+G L VN+V+G+FHI  G S S    H+H +   T
Sbjct: 148 NQVTLYSEMPKRTHQPSYPPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIHILTFMT 207

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
              +N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L  
Sbjct: 208 DKDYNFTHRINKFSFG---GPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264

Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           SK                 G  G PGIFF Y++S L +K+T++  ++     K+   + G
Sbjct: 265 SKTYQYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGG 324

Query: 348 TYITFMLVDALLHS 361
            ++T  +V +++ S
Sbjct: 325 IFVTSGMVKSIVQS 338


>gi|383865060|ref|XP_003707993.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Megachile rotundata]
          Length = 392

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 92/373 (24%), Positives = 150/373 (40%), Gaps = 66/373 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LD F K  E + +KT  GG  +I     I+YLI  +   Y       +  +D+   +
Sbjct: 12  VKELDGFPKVPEPYVDKTAVGGTFSIFTICIIAYLIIAETSYYLDSRLQFKFELDTDIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C  +  D +DS+ +  +                             
Sbjct: 72  KLKINIDITV-AMPCGRIGADVLDSTNQNMV----------------------------- 101

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
                    G  + E ED     + +    E R        +    R +  A+ EL    
Sbjct: 102 ---------GHESLEEED-----TWWELTQEQRSHFEALKHMNSYLREEYHAIHELLWKS 147

Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
             V   +E      + ++    C+I+G L VN+VSG+FHI  G S SI   H+H      
Sbjct: 148 NQVTLHSEMPKRSHQPSYPPNACRIHGSLNVNKVSGNFHITAGKSLSIPRGHIHISAFMI 207

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
              +N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L  
Sbjct: 208 DRDYNFTHRINKFSFG---GPSPGVVHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264

Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           SK                 G  G+PGIFF Y++S L +K+T++  ++     K+   + G
Sbjct: 265 SKTYQYSVKDYQRPIDHQKGSHGVPGIFFKYDMSALKIKVTQQRDTVSQFLVKLCATVGG 324

Query: 348 TYITFMLVDALLH 360
            ++T  LV  ++ 
Sbjct: 325 IFVTSGLVKNIVQ 337


>gi|431908425|gb|ELK12022.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Pteropus alecto]
          Length = 377

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSSSTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + E S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REEDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   AE+   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAEDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|224093106|ref|XP_002193654.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Taeniopygia guttata]
          Length = 377

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 151/361 (41%), Gaps = 61/361 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  I++L  ++   Y       E  VD    S
Sbjct: 13  MKELDAFPKVPESYVETSASGGTVSLIAFTTIAFLTIMEFMVYRDTWMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  IY+  +  +  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMRCQYVGADVLDLAETMVASADGLIYEP-VPFELTPQQKELQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPPREDNS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           +Q               + C+I+G+L VN+V+G+FHI  G +      H H     +  +
Sbjct: 164 LQSP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT           
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFITVVPTKLHTYKISAET 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y++S LMV +TE+         ++   I G + 
Sbjct: 268 HQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIFS 327

Query: 351 T 351
           T
Sbjct: 328 T 328


>gi|350419069|ref|XP_003492060.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Bombus impatiens]
          Length = 392

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 152/374 (40%), Gaps = 66/374 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LD F K  E + +KT  GG  +I     I+YLI  +   Y       +   D+   +
Sbjct: 12  VKELDGFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  ++C  ++ D +DS+ +  +                             
Sbjct: 72  KLKINIDITV-AMTCSRISADVLDSTNQNMI----------------------------- 101

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL---D 183
                    G  + E ED     + +    E R        V    R +  A+ EL    
Sbjct: 102 ---------GHESLEQED-----TWWELTQEQRSHFEALKNVNSYLREEYHAIHELLWKS 147

Query: 184 TIVQCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
             V   +E      + ++    C+I+G L VN+V+G+FHI  G S S    H+H +   T
Sbjct: 148 NQVTLYSEMPKRTHQPSYPPNSCRIHGSLNVNKVAGNFHITAGKSLSFPMGHIHILTFMT 207

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDG 301
              +N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L  
Sbjct: 208 DKDYNFTHRINKFSFG---GPSPGIIHPLEGDEKIADNNMILYQYFVEVVPTDIQTLLST 264

Query: 302 SKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           SK                 G  G PGIFF Y++S L +K+T++  ++     K+   + G
Sbjct: 265 SKTYQYSVKDHQRPIDHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGG 324

Query: 348 TYITFMLVDALLHS 361
            ++T  ++  ++ S
Sbjct: 325 IFVTSGMIKNIVQS 338


>gi|156402826|ref|XP_001639791.1| predicted protein [Nematostella vectensis]
 gi|156226921|gb|EDO47728.1| predicted protein [Nematostella vectensis]
          Length = 413

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 93/385 (24%), Positives = 159/385 (41%), Gaps = 86/385 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K  E++ + T  GG+V++V +LFI  L+  +   Y    T     VD+   S
Sbjct: 13  IKEFDAFPKIPENYQQTTASGGSVSLVSFLFIFVLVISEFWYYRATETKFSYEVDTDADS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++D+ +  + C+ +  D +D SG                                 
Sbjct: 73  KLQINVDLTI-AMKCEDIDADVLDLSG--------------------------------- 98

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTC----NEVKEAYRYKKWALPEL 182
                      +T +L D  K    +   T  ++   T     +   E YR    +L E+
Sbjct: 99  -----------STMQLGDSIKLEPTFFKLTPEQEMWLTMFRDFHFFYEGYR----SLGEM 143

Query: 183 DTI------VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHVH 234
           D           K E S +       + C++YG  +VN+V+G+FHI  G S  +   H H
Sbjct: 144 DEFNGDIPTYMPKREESKDAANTKEHDACRVYGSFKVNKVAGNFHITSGKSIHHPRGHAH 203

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           +  + P  S   N +H I  LSFG ++        PLDG +   E+   M+ YYI+++PT
Sbjct: 204 LSSMVPVES--LNFSHRIDMLSFGKRVPG---IVHPLDGEMQITEKRRMMYQYYIQVVPT 258

Query: 295 IYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
             + L+  ++                  G  G+ G+FF Y++S +MV++  +  S+    
Sbjct: 259 SIKSLNSEEIKTNQYSMTQRIREISHDSGSHGIAGLFFKYDMSSIMVRVKHQHHSMVGFL 318

Query: 339 TKIMCNISGTYITFMLVDALLHSCV 363
            ++   + G + T      +LH  +
Sbjct: 319 VRLCGIVGGIFAT----SGMLHDFI 339


>gi|307206941|gb|EFN84785.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Harpegnathos saltator]
          Length = 396

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 153/375 (40%), Gaps = 68/375 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + +KT  GG  +I    FI+YLI  +   +       +   D+   +
Sbjct: 12  VKELDAFPKVPELYVDKTAVGGTFSIFTVCFIAYLIIAETSYFLDSRLQFKFETDTDIDA 71

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C  +  D +DS       +E N++                      
Sbjct: 72  KLQINIDITV-AMPCGRIGADVLDS-------MEENVF---------------------- 101

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                    G  + E ED     + +    E R        +    R +  A+ EL    
Sbjct: 102 ---------GYDSLEQED-----TWWELTPEQRAHFEALKHMNSYLREEYHAIHELLWKS 147

Query: 187 QCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
                YS E  K ++        C+I+G L VN+V+G+FHI  G S S+   H+H     
Sbjct: 148 NQITLYS-EMPKRSYEPDYPPNACRIHGSLNVNKVAGNFHITTGKSLSVPRGHIHISAFM 206

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLD 300
           T   +N TH I   SFG           PL+G    A+    ++ Y+++++PT I   L 
Sbjct: 207 TDRDYNFTHRINRFSFG---GPSPGIVHPLEGDEKIADYNMMLYQYFVEVVPTDIRTLLS 263

Query: 301 GSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
            SK                 G  G+PGIF  Y +S L +K+T++  ++     K+   + 
Sbjct: 264 TSKTYQYSVKDYQRPINHNEGSHGVPGIFIKYNMSALKIKVTQQRDTIFQFLVKLCATVG 323

Query: 347 GTYITFMLVDALLHS 361
           G ++T  L+  ++ S
Sbjct: 324 GIFVTSGLIKNIVQS 338


>gi|380787459|gb|AFE65605.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
 gi|383418929|gb|AFH32678.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
 gi|384941148|gb|AFI34179.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Macaca mulatta]
          Length = 377

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|62897157|dbj|BAD96519.1| CDA14 variant [Homo sapiens]
          Length = 377

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 156/379 (41%), Gaps = 65/379 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            Q  N              C+I+G+L VN+V+G+FHI  G +      H H        +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT           
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + 
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342


>gi|332233018|ref|XP_003265701.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 isoform 1 [Nomascus leucogenys]
          Length = 377

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|397517363|ref|XP_003828883.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Pan paniscus]
 gi|410259224|gb|JAA17578.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410298004|gb|JAA27602.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410334949|gb|JAA36421.1| ERGIC and golgi 2 [Pan troglodytes]
 gi|410334951|gb|JAA36422.1| ERGIC and golgi 2 [Pan troglodytes]
          Length = 377

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|50959176|ref|NP_057654.2| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Homo sapiens]
 gi|108935982|sp|Q96RQ1.2|ERGI2_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|22760017|dbj|BAC11037.1| unnamed protein product [Homo sapiens]
 gi|38173702|gb|AAH00887.2| ERGIC and golgi 2 [Homo sapiens]
 gi|78070782|gb|AAI07795.1| ERGIC and golgi 2 [Homo sapiens]
 gi|119616998|gb|EAW96592.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
 gi|119617000|gb|EAW96594.1| ERGIC and golgi 2, isoform CRA_a [Homo sapiens]
 gi|167773797|gb|ABZ92333.1| ERGIC and golgi 2 [synthetic construct]
          Length = 377

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 156/379 (41%), Gaps = 65/379 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            Q  N              C+I+G+L VN+V+G+FHI  G +      H H        +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT           
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + 
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342


>gi|75075986|sp|Q4R5C3.1|ERGI2_MACFA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|67970720|dbj|BAE01702.1| unnamed protein product [Macaca fascicularis]
          Length = 377

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++   
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAV-FDLSPQQKEWQRMLQ-- 128

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                      T + L++ +                      K A++    ALP      
Sbjct: 129 ----------LTQSRLQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|348562091|ref|XP_003466844.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Cavia porcellus]
          Length = 377

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 156/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPQSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDVAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREANSSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|323310251|gb|EGA63441.1| Erv46p [Saccharomyces cerevisiae FostersO]
          Length = 189

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 59/184 (32%), Positives = 91/184 (49%), Gaps = 20/184 (10%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDAF K  ED   +T  GG +T+ C L   +L+  +   +  V T  +L VD  R +
Sbjct: 6   LLSLDAFAKTEEDVRVRTRAGGLITLSCILTTLFLLVNEWXQFNSVVTRPQLVVDRDRHA 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL +++D+  P++ CD + LD +D SGE  L + +      RL+ +G+P+ +  +  V  
Sbjct: 66  KLELNMDVTFPSMPCDLVNLDIMDDSGEMQLDILDAGFTMSRLNSEGRPVGDATELHVGG 125

Query: 126 VKKKKVTTENGTTTTEL-EDPNKCGSCYGAETETRK---------CCNTCNEVKEAYRYK 175
                    NG  T  +  DPN CG CYGA+ +++          CC  C+ V+ AY   
Sbjct: 126 ---------NGDGTAPVNNDPNYCGPCYGAKDQSQNENLAQEEKVCCQDCDAVRSAYLEA 176

Query: 176 KWAL 179
            WA 
Sbjct: 177 GWAF 180


>gi|71409118|ref|XP_806922.1| hypothetical protein [Trypanosoma cruzi strain CL Brener]
 gi|70870803|gb|EAN85071.1| hypothetical protein, conserved [Trypanosoma cruzi]
          Length = 310

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 134/302 (44%), Gaps = 38/302 (12%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGA-VTIVCWLFISYLICVDVCDYF--QVSTTEELFVD 61
           +++  +D F KP ED+     Y GA V++V  + I  L+  +VC Y   + + T EL VD
Sbjct: 22  KKVAAVDLFPKPKEDYSRSQTYRGALVSLVTVVVIGLLVFWEVCSYIFGRDAYTTELSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           +S  +++  +LDI  P + C  ++LD +D +G  +L+V  N++K  +D  G         
Sbjct: 82  TSLSTEVDFNLDITFPRVPCHEVSLDVLDVTGTVNLNVTRNLFKTPVDAQGN-------- 133

Query: 122 VVNAVKKKKVTTENGTTTTELED----PNKCGSCYGAETET------RKCCNTCNEVKEA 171
               +  ++   E G+   + +D    P  CG C+  E +        +CCNTCN+V  A
Sbjct: 134 -FAFIGTRQGVGEYGSFREQSKDDPSSPQFCGRCFINEHQVSMMENKNRCCNTCNDVLNA 192

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           Y  +    P+ + + QC  E S          GC   G L V +  G    AP       
Sbjct: 193 YDQQGLPRPQKNEVEQCIYELS------RINPGCNYKGTLIVKKFGGRLVFAP--KRVPG 244

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
              + D+       F+++H I  LS G +      RR    PL+G    A+   +   Y+
Sbjct: 245 GFLIRDV-----MRFDSSHIINKLSIGDEHVTRFSRRGVQHPLNGHEFDAQRRFTEIRYF 299

Query: 289 IK 290
            +
Sbjct: 300 FE 301


>gi|15010925|gb|AAK77355.1|AF302767_1 PTX1 protein [Homo sapiens]
          Length = 377

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 155/379 (40%), Gaps = 65/379 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  +    Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMKFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
            Q  N              C+I+G+L VN+V+G+FHI  G +      H H        +
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT           
Sbjct: 211 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISAYT 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + 
Sbjct: 268 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 327

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   K I ++
Sbjct: 328 T----TGMLHGIGKFIVEI 342


>gi|426225295|ref|XP_004006802.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Ovis aries]
          Length = 377

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|123430864|ref|XP_001307985.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121889642|gb|EAX95055.1| hypothetical protein TVAG_428580 [Trichomonas vaginalis G3]
          Length = 358

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 102/387 (26%), Positives = 162/387 (41%), Gaps = 54/387 (13%)

Query: 7   LKGLDAFTK-PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           L+ LD F K  +++    T YG  V+I+  +  S LI  +V  Y       +L V  S  
Sbjct: 5   LEFLDLFDKNTHDELKMTTKYGSVVSILLTVVSSILIITNVALYINPRIYRDLSVKPSVT 64

Query: 66  SK---LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           S    + I L I +  + C +L +D +DS G Q  ++++ +  RRL+  G+ I       
Sbjct: 65  SASETINISLTIKI-AMPCYFLHIDYMDSLGFQRSYIKNTVTFRRLNNLGRVI------- 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                        G T   L D   C  CY   T   +CCN+C +V+     +   + + 
Sbjct: 117 -------------GYTNDTLSD--VCEPCYNLSTNPDECCNSCLKVQLLSLMQNKPV-DF 160

Query: 183 DTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
                C N    EK  N + +E C + G L VNR+ GSFHIAPG +      ++HD+   
Sbjct: 161 SKYRVCNNY---EKKPNVSLSEKCLVKGKLTVNRIPGSFHIAPGTNVP-QSAYLHDLSS- 215

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDG--TVAKAEEGASMFNYYIKIIPTIYERL 299
                + TH I+ L FG  +        PLD   +  +       + Y + I P I+ R 
Sbjct: 216 MQMFHDMTHSIQRLRFGPHIP---RTSNPLDNFKSFQQIPTHDRTYFYNLLITPVIFYRD 272

Query: 300 DGSKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
               L G +              G  PG+FF Y+ +P  + ++   ++     +     I
Sbjct: 273 GVEYLKGYEYTAFSEAIDTFQLFGISPGLFFQYQFTPYTIVVSANRQNFLQFISNTFGVI 332

Query: 346 SGTYITFMLVDALLHSCVKKISKVEIG 372
           SG Y    ++D L+   +   + VEIG
Sbjct: 333 SGIYACLSILDKLIGEDIGS-NVVEIG 358


>gi|301783747|ref|XP_002927289.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Ailuropoda melanoleuca]
          Length = 377

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMAILTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|355686514|gb|AER98081.1| ERGIC and golgi 2 [Mustela putorius furo]
          Length = 365

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 150/356 (42%), Gaps = 60/356 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
              + ER        G  G+ GIF  Y+LS LMV +TE+       + + +C I G
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVR-LCGIVG 323


>gi|327273481|ref|XP_003221509.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Anolis carolinensis]
          Length = 377

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 149/362 (41%), Gaps = 63/362 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  MKELDAFPKVPESYIETSASGGTVSLIAFTTMALLTIMEFTVYRDTWMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D + E  +     +    +  +  P+Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYIGADVLDLA-ETMVASADGLSYEPVIFELSPLQREWQRMLQII 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP  +   
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKTAFKSASTALPPRE--- 160

Query: 187 QCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                       NT    + C+I+G+L VN+V+G+FHI  G +      H H     +  
Sbjct: 161 -----------DNTLQPPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHE 209

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
           ++N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT          
Sbjct: 210 SYNFSHRIDHLSFGELIPG---IINPLDGTEKVASDHNQMFQYFITVVPTKLHTHKISAE 266

Query: 295 -----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                + ER        G  G+ GIF  Y++S LMV +TE+         ++   I G +
Sbjct: 267 THQFSVTERERVINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIIGGIF 326

Query: 350 IT 351
            T
Sbjct: 327 ST 328


>gi|345441780|ref|NP_001230861.1| ERGIC and golgi 2 [Sus scrofa]
          Length = 377

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAI-FDLSPQQKEWQRMLQRI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|21312962|ref|NP_080444.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           isoform 1 [Mus musculus]
 gi|81903633|sp|Q9CR89.1|ERGI2_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|12835992|dbj|BAB23451.1| unnamed protein product [Mus musculus]
 gi|12843481|dbj|BAB25998.1| unnamed protein product [Mus musculus]
 gi|12844310|dbj|BAB26318.1| unnamed protein product [Mus musculus]
 gi|13905198|gb|AAH06895.1| ERGIC and golgi 2 [Mus musculus]
 gi|17390417|gb|AAH18188.1| ERGIC and golgi 2 [Mus musculus]
 gi|20072972|gb|AAH26558.1| ERGIC and golgi 2 [Mus musculus]
 gi|26326029|dbj|BAC26758.1| unnamed protein product [Mus musculus]
 gi|40353061|gb|AAH64749.1| ERGIC and golgi 2 [Mus musculus]
 gi|74191314|dbj|BAE39481.1| unnamed protein product [Mus musculus]
 gi|148678796|gb|EDL10743.1| ERGIC and golgi 2, isoform CRA_c [Mus musculus]
          Length = 377

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|149048933|gb|EDM01387.1| rCG29652, isoform CRA_c [Rattus norvegicus]
          Length = 377

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSSSTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|12841082|dbj|BAB25070.1| unnamed protein product [Mus musculus]
          Length = 377

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 154/378 (40%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|410964074|ref|XP_003988581.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Felis catus]
          Length = 377

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 159/378 (42%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSDSTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|291392459|ref|XP_002712727.1| PREDICTED: PTX1 protein [Oryctolagus cuniculus]
 gi|291416214|ref|XP_002724342.1| PREDICTED: PTX1 protein-like [Oryctolagus cuniculus]
          Length = 377

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPHQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP  +   
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSPSTALPPRED-- 161

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   + L++   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 162 --------DSLQSP--DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I I+PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITIVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|7341109|gb|AAF61208.1|AF216751_1 CDA14 [Homo sapiens]
          Length = 378

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 159/382 (41%), Gaps = 70/382 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDI-QPYT 242
            Q  N              C+I+G+L VN+V+G+FHI  G  + +   H H+    QP+ 
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCQPWN 210

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
              F  +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT        
Sbjct: 211 LTIF--SHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKIS 265

Query: 295 -------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                  + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G
Sbjct: 266 ADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGG 325

Query: 348 TYITFMLVDALLHSCVKKISKV 369
            + T      +LH   K I ++
Sbjct: 326 IFST----TGMLHGIGKFIVEI 343


>gi|417399911|gb|JAA46936.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein 2 isoform 1 [Desmodus rotundus]
          Length = 376

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 154/378 (40%), Gaps = 64/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D + E  +   + +    +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLA-ETMVASANGLVYEPVIFDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +            T L++ +                      K A++      P  D   
Sbjct: 131 Q------------TRLQEEHSLQDVL---------------FKSAFKSSTALPPREDDSS 163

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           Q               + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 164 QPP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 210

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 211 NFSHRIDHLSFGELVPG---IVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADTH 267

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 268 QFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 327

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 328 ----TGMLHGIGKFIVEI 341


>gi|403269250|ref|XP_003926667.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Saimiri boliviensis boliviensis]
          Length = 377

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LD F K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDVFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSYGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|9963759|gb|AAG09679.1|AF183410_1 cd002 protein [Homo sapiens]
          Length = 387

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 159/381 (41%), Gaps = 68/381 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 22  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 82  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 172

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTS 243
            Q  N              C+I+G+L VN+V+G+FHI  G  + +   H H+      T 
Sbjct: 173 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLGSTCS-TM 218

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
            ++N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT         
Sbjct: 219 ESYNFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISA 275

Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                 + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G 
Sbjct: 276 DTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGI 335

Query: 349 YITFMLVDALLHSCVKKISKV 369
           + T      +LH   K I ++
Sbjct: 336 FST----TGMLHGIGKFIVEI 352


>gi|57106442|ref|XP_534852.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 isoform 1 [Canis lupus familiaris]
          Length = 377

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGEVVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|149713890|ref|XP_001502984.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Equus caballus]
          Length = 377

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYDVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPVI-FDLSPQQKEWQRMLQVI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|12846043|dbj|BAB27008.1| unnamed protein product [Mus musculus]
          Length = 377

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 153/378 (40%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I  +
Sbjct: 329 ----TGMLHGIGKFIVDI 342


>gi|417399168|gb|JAA46612.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein 2 isoform 1 [Desmodus rotundus]
          Length = 337

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 90/356 (25%), Positives = 146/356 (41%), Gaps = 61/356 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D + E  +   + +    +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLA-ETMVASANGLVYEPVIFDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +            T L++ +                      K A++      P  D   
Sbjct: 131 Q------------TRLQEEHSLQDVL---------------FKSAFKSSTALPPREDDSS 163

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           Q               + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 164 QPP-------------DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 210

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 211 NFSHRIDHLSFGELVPG---IVNPLDGTEKIAVDHNRMFQYFITVVPTKLHTYKISADTH 267

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
              + ER        G  G+ GIF  Y+LS LMV +TE+       + + +C I G
Sbjct: 268 QFSVTERERVVNHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVR-LCGIVG 322


>gi|395537817|ref|XP_003770886.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Sarcophilus harrisii]
          Length = 378

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 156/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K    + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPVSYVETSAIGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVAPADGLVYEPVI-FDLSPQQREWQRMLQTI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S +       + C+I+G+L VN+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --REDNSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVPTKLNTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+         ++   I G + T
Sbjct: 269 QFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|344267803|ref|XP_003405755.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Loxodonta africana]
          Length = 377

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A +    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAIKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|432954843|ref|XP_004085560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Oryzias latipes]
          Length = 122

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 48/110 (43%), Positives = 70/110 (63%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DA+ K  EDF  KT  G  VTI+  + +  L   ++  +       EL+VD+SRG
Sbjct: 6   KLKQFDAYPKTLEDFRVKTWGGATVTIISGVIMLILFVSELQYFLTKEVHPELYVDTSRG 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
            KL I++D++ P + C YL++DA+D +GEQ L VEHN++KRRLD D K +
Sbjct: 66  DKLKINIDVIFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKRRLDKDLKAV 115


>gi|395839293|ref|XP_003792530.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Otolemur garnettii]
          Length = 377

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 155/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKTASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDNPSQSPDACRISGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|300123494|emb|CBK24766.2| unnamed protein product [Blastocystis hominis]
          Length = 235

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 71/222 (31%), Positives = 106/222 (47%), Gaps = 14/222 (6%)

Query: 90  DSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK-EVVNAVKKKKVTTEN----GTTTTELED 144
           D+ G     +E+ I K  LD++G PI +  K +V   V  K+   EN          ++D
Sbjct: 8   DALGNDRADIENEILKTNLDVNGNPIGKTDKSQVTVTVPTKEEVLENTKHDDDEIVVIDD 67

Query: 145 PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV-QCKNEYSTEKLKNTFTE 203
             +CG C+GA+ E  +CCNTC E+  AYR K W +  +     QC      +K KN    
Sbjct: 68  KKECGDCFGAK-EKSECCNTCEELIAAYRKKNWDVDRIKAQAPQCAGFNYLQKWKNGVER 126

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC++ G L + +V G   I PG    IN +  +      + + N TH I H S G  +  
Sbjct: 127 GCRLEGKLSITKVQGHVFIIPG---RINDLLSNSEIRQIANSLNVTHTIHHFSLGEAIP- 182

Query: 264 DDERRKP-LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
             E++ P +D     A + ASM+ Y++  IPT Y    G +L
Sbjct: 183 --EQKNPFVDHRGVMAVDHASMYQYFVNAIPTTYINKSGKEL 222


>gi|126339088|ref|XP_001363644.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Monodelphis domestica]
          Length = 378

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 149/360 (41%), Gaps = 59/360 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K    + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPVSYVETSASGGTVSLIAFTTMALLTIMEFSVYRDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I+++I V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++  +
Sbjct: 73  KLRININITV-AMKCQYVGADVLDLAETMVAAADGLVYEPVI-FDLSPQQREWQRMLQTI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S +       + C+I+G+L VN+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --REDNSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIANDHNQMFQYFITVVPTKLNTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+         ++   I G + T
Sbjct: 269 QFSVTERERAINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFLVRLCGIIGGIFST 328


>gi|387015774|gb|AFJ50006.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2-like
           [Crotalus adamanteus]
          Length = 377

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 158/379 (41%), Gaps = 65/379 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKIPDSYIETSTSGGTVSLIAFTTMALLTIMEFMVYRDTWMKYEYEVDKDYTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C ++  D +D +       +  +Y+  +  +  P+Q   + ++  +
Sbjct: 73  KLRINVDITV-AMKCQHIGADVLDLAETMVATADGLVYEPVI-FELSPLQREWQRILQNI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL-DTI 185
           + +            L++ +                      K A++    ALP   D  
Sbjct: 131 QSR------------LQEEHSLQDII---------------FKSAFKSASTALPPREDNP 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           VQ               + C+I+G+L VN+V+G+FH+  G +      H H     +  +
Sbjct: 164 VQS-------------ADACRIHGHLYVNKVAGNFHVTVGKAIPHPRGHAHLAALVSHES 210

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG  +        PLDGT   A +   MF Y++ ++PT           
Sbjct: 211 YNFSHRIDHLSFGELIPGII---NPLDGTEKIASDHNQMFQYFVTVVPTKLQTHKISAET 267

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y++S LMV +TE+         ++   + G + 
Sbjct: 268 HQFAVTERERIINHAAGSHGVSGIFMKYDISSLMVTVTEEHMPFWQFLVRLCGIVGGIFS 327

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LHS  + I ++
Sbjct: 328 T----TGILHSIGRFIVEI 342


>gi|82074366|sp|Q5EHU7.1|ERGI2_GECJA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
          Length = 377

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 158/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASTDGLVYEPAI-FDLSPQQKEWQRMLQRI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K  ++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSTFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITVVPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFSVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|115497448|ref|NP_001069031.1| endoplasmic reticulum-Golgi intermediate compartment protein 2 [Bos
           taurus]
 gi|113912114|gb|AAI22616.1| ERGIC and golgi 2 [Bos taurus]
 gi|296487341|tpg|DAA29454.1| TPA: PTX1 protein [Bos taurus]
          Length = 377

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 155/378 (41%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++   
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPAI-FDLSPQQREWQRMLQLF 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K  ++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVV---------------FKSVFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S++       + C+I G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 159 --REDDSSQP-----PDACRIRGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I I+PT            
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIALDHNQMFQYFITIVPTKLQTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + T
Sbjct: 269 QFAVTERERVINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|148224086|ref|NP_001087666.1| ERGIC and golgi 2 [Xenopus laevis]
 gi|51950053|gb|AAH82468.1| MGC81917 protein [Xenopus laevis]
          Length = 377

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 150/367 (40%), Gaps = 59/367 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +   G V+++ +  +  L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRDTRMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ +++DI V  + C Y+  D +D + E  +     +  + +  D  P Q   + ++  +
Sbjct: 73  KIRLNIDITV-AMKCQYVGADVLDLA-ETMVTSAQGLAYQPVIFDLSPQQRQWQRMLQQI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A R    +LP      
Sbjct: 131 QGR------------LQEEHSLQDLL---------------FKSAMRTSVLSLP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
               E S  +  N     C+I+G+L++N+V+G+FHI  G +      H H     +  ++
Sbjct: 158 --PREDSPMEQPN----ACRIHGHLDINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I H SFG  L        PLDGT   AE+   M+ Y+I I+PT            
Sbjct: 212 NFSHRIDHFSFGEPL---PAIINPLDGTEKIAEDSNQMYQYFITIVPTKLNTNKVYCDTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y++S LMV +TE    L     ++   I G + T
Sbjct: 269 QFSVTERERVINHATGSHGVSGIFMKYDISSLMVTVTEDHMPLWKFLVRLCGIIGGIFTT 328

Query: 352 FMLVDAL 358
             ++  L
Sbjct: 329 TGMIHGL 335


>gi|89272944|emb|CAJ82943.1| ptx1 [Xenopus (Silurana) tropicalis]
          Length = 377

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 150/367 (40%), Gaps = 59/367 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +   G V+++ +  +  L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASRGTVSLMAFSIMGILTIMEFLVYRNTRMKYEYEVDKDFTS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           K+ +++DI V  + C Y+  D +D +       +  +Y+  +  +  P Q   + ++  +
Sbjct: 73  KIRLNIDITV-AMKCQYVGADVLDLAETMVTSAQGLVYEPVI-FELSPQQRLWQRMLQQI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A R    +LP      
Sbjct: 131 QGR------------LQEEHSLQDLL---------------FKSAMRTSVMSLPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + +  TE         C+I+G+LE+N+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --REDSPTEP-----PNACRIHGHLEINKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I H SFG  L        PLDGT   AE+   M+ Y+I I+PT            
Sbjct: 212 NFSHRIDHFSFGEPLPGI---VNPLDGTEKIAEDSNQMYQYFITIVPTKLHTNKVDCDTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y++S LMV +TE    L     ++   + G + T
Sbjct: 269 QFSVTERERVINHASGSHGVSGIFMKYDISSLMVMVTEDHMPLWKFLVRLCGIVGGIFTT 328

Query: 352 FMLVDAL 358
             ++  L
Sbjct: 329 TGMIHGL 335


>gi|408393109|gb|EKJ72376.1| hypothetical protein FPSE_07400 [Fusarium pseudograminearum CS3096]
          Length = 376

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 159/362 (43%), Gaps = 61/362 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+   +    LI  ++  +++ + +    V++    
Sbjct: 23  VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++ I+LDIVV  +SCD + ++  D+SG++      LH +  ++ +  D   K + +  ++
Sbjct: 83  EMQINLDIVV-KMSCDDIHVNVQDASGDRIMAAKRLHTDKTLWGQWAD--NKGVHKLGRD 139

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                 + +V T  G    + ED       +G E       +  + V    +  KWA   
Sbjct: 140 -----DQGRVNTGQGYNDPKYED-----EGFGEE-------HVHDIVALGKKRAKWA--- 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
                       T + +    + C+IYG L++N+V G FHI A G  Y  +  H+     
Sbjct: 180 -----------KTPRFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL----- 222

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
              + FN +H I  LS+G           PLDGTV  A+     F YY+ ++PT+Y    
Sbjct: 223 -DHSKFNFSHIISELSYGPFYP---SLENPLDGTVNTADGNFHKFQYYLSVVPTVYSVNS 278

Query: 301 GSKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
            S L              D  +PGIFF Y++ P+++ + E    +  L+ KI+  ISG  
Sbjct: 279 RSILTNQYAVTEQSKAVDDRYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVL 338

Query: 350 IT 351
           + 
Sbjct: 339 VA 340


>gi|146079597|ref|XP_001463805.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|398011570|ref|XP_003858980.1| hypothetical protein, conserved [Leishmania donovani]
 gi|134067893|emb|CAM66174.1| conserved hypothetical protein [Leishmania infantum JPCM5]
 gi|322497192|emb|CBZ32265.1| hypothetical protein, conserved [Leishmania donovani]
          Length = 368

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 86/383 (22%), Positives = 155/383 (40%), Gaps = 63/383 (16%)

Query: 13  FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG--SKLP 69
           F KP ED+  E+T +G  +++    F+ +L+  +   Y +     +  V   +G    +P
Sbjct: 2   FPKPKEDYQREQTRWGALLSVFTVFFVIFLVLWEGAAYLRGRDAYDTDVSLDKGLSEDMP 61

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           +H D++ P + C+ L++D VD++G    +    ++K    LDG        EVV     K
Sbjct: 62  VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDG--------EVVYKGSLK 113

Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
            +  +N   T E     KC  C      G   E R     KCC+TC  V + Y+     +
Sbjct: 114 DL--DNEMETREGRAGKKCRPCPPSAFDGVPAEVRSAAELKCCDTCESVLDLYKELGKGI 171

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
           P  + I QC  +            GC + G L++ +V  +    P   G  YS+  V   
Sbjct: 172 PGTEYIPQCLEQLYQR------ASGCTVMGSLDLKKVPVTVIFGPRRTGHFYSLKDV--- 222

Query: 237 DIQPYTSAAFNTTHHIRHLSFG---IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
                     +T+H IR L  G   ++    +   +PL G  + ++   S   Y +K++P
Sbjct: 223 -------IRLDTSHFIRKLRIGDETVERFSKNGVAEPLSGHKSSSKT-YSETRYLVKVVP 274

Query: 294 TIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           T Y +                      + G  G +P + F +E +P+ V    + +   H
Sbjct: 275 TTYRKTKTKNAKASTYEYSAQWSRRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQPFSH 334

Query: 337 LWTKIMCNISGTYITFMLVDALL 359
              ++   + G ++    +D ++
Sbjct: 335 FLVQLCGIVGGLFVVLGFIDNVV 357


>gi|398412138|ref|XP_003857398.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
 gi|339477283|gb|EGP92374.1| hypothetical protein MYCGRDRAFT_66006 [Zymoseptoria tritici IPO323]
          Length = 407

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 97/388 (25%), Positives = 160/388 (41%), Gaps = 86/388 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    + ++T  GG  T+V     + L   +V  ++  +TT    V+   G 
Sbjct: 23  IKAFDAFPKTKPSYTQQTSSGGVWTLVLIALSTVLAYTEVTRWWSGTTTHSFSVEQGVGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+VV  + C+ + ++  D++G++ L                        V  AV
Sbjct: 83  DLQINVDLVV-AMKCEDIHINVQDAAGDRVL------------------------VDKAV 117

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K+         T   L   N      GA  + R   +  N + +A  Y++  + +  ++ 
Sbjct: 118 KEDP-------TLFRLWGENHGAHTLGASLKDRLEVD-GNRIVQA-EYEEEDVHDYLSLA 168

Query: 187 QC--KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
           +   + +Y+    +N   + C+IYG +  N+V G FHI A G  Y     H+        
Sbjct: 169 RGGKRYQYTPRTPRNEEADSCRIYGSMHSNKVQGDFHITARGHGYMAYSQHL------DH 222

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY------- 296
           +AFN +HHI  LSFG       +   PLD T A+ E     F YY+ ++PTIY       
Sbjct: 223 SAFNFSHHINELSFGPYYP---KLVNPLDSTYARTEAHFHKFQYYLSVVPTIYTVDVNAL 279

Query: 297 ERLDG--SKLGGGDGGM-------------------------------PGIFFSYELSPL 323
           +R+D        GD G+                               PGIFF Y++ PL
Sbjct: 280 KRMDSKYETPSSGDDGLNQHPRRVTQHSVFTNQYAVTEQSHSVPENHVPGIFFKYDIEPL 339

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
            + I E+  S+  L  +I+  +SG  + 
Sbjct: 340 QLTIAEEWTSVPALLLRIVNVVSGLLVA 367


>gi|321479391|gb|EFX90347.1| hypothetical protein DAPPUDRAFT_309719 [Daphnia pulex]
          Length = 369

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 81/378 (21%), Positives = 151/378 (39%), Gaps = 75/378 (19%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           LDAF K  + + EKT  GG ++++C   I YL+  +V D+           D    +++ 
Sbjct: 15  LDAFPKVPDTYKEKTTSGGTISLICIFIILYLVFSEVNDFIHSGVKFHFVPDDDLDTRMD 74

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           +++D+ V  + C Y+  D +DS+G+  +   H                            
Sbjct: 75  LNVDMTV-AMPCRYIGADVLDSTGQSVVSFGH---------------------------- 105

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
            +T EN              + +      R        +    R K   + +L      +
Sbjct: 106 -LTEEN--------------TWFELSPRQRNHFEAAQRLNSILRDKPHGIQQLLWKSGYQ 150

Query: 190 NEY----STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN-HVHVHDIQPYTSA 244
           N +    S E + +  ++ C+++G L++ +V+G+FHI  G    +    H H        
Sbjct: 151 NLFGEMPSREFVPSQPSDACRLHGTLQLTKVAGNFHITAGKVLPLPMRAHAHLSPMMDDE 210

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----- 299
            FN +H I   SFG          +PL+G     ++GA +F Y++  +PT  E L     
Sbjct: 211 RFNYSHRIDKFSFG----HSSTLIQPLEGDEVITDKGAMLFQYFVTAVPTEIESLVSASS 266

Query: 300 --------------DGSKLGG---GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
                         + S++ G   G  G+PGI+F Y+++PL V++   +  L     ++ 
Sbjct: 267 GIHGSMKTWQYSVRNQSRIIGHQKGSHGIPGIYFKYDVAPLRVRVVPDAPPLLRFVLRLC 326

Query: 343 CNISGTYITFMLVDALLH 360
             + G Y +  +V  ++ 
Sbjct: 327 AIVGGVYTSAGIVHKVIQ 344


>gi|303278158|ref|XP_003058372.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459532|gb|EEH56827.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 399

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 133/311 (42%), Gaps = 79/311 (25%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M  + R+K  DA+ KP     +KT  GGAVT+   L ++++   ++  Y          V
Sbjct: 1   MGLAARVKLFDAYHKPERHLTKKTAAGGAVTLSSLLLMAFVFVFELRSYLATERVTTTGV 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI----- 115
           D +R   L I++D+   ++ C  L+LDA+D+SG+    V   ++K R+D  G+ I     
Sbjct: 61  DVTRDEMLAINVDVTFTSLPCQTLSLDALDASGKHDQDVGGELHKTRVDRFGRAIATYES 120

Query: 116 -QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
            +E    VVN +             TEL         YG ETE  K     +E+K A   
Sbjct: 121 HRENDDGVVNLI-------------TEL--------FYGFETEGHKAH--VDEIKTAL-- 155

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
                                    +  EGC+++G L+V RV+G+FH++         VH
Sbjct: 156 -------------------------SAGEGCRVHGRLKVQRVAGNFHVS---------VH 181

Query: 235 VHDIQPYTSAAF------NTTHHIRHLSFGIKLQDDDERRKPLDG---TVAKAEEGASMF 285
             D +    A F      N +H +  LSFG      ++   PL G   T   A E  + +
Sbjct: 182 GEDARTL-RATFEHPRNVNMSHAVHRLSFGKSFPRKED---PLSGFTRTTRHANETGT-Y 236

Query: 286 NYYIKIIPTIY 296
            Y++K++P  Y
Sbjct: 237 KYFLKVVPVTY 247



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 18/55 (32%), Positives = 32/55 (58%)

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
           G +P ++F Y+LSP+ V I++  KS GH   + +  + G Y    L+D ++H  +
Sbjct: 337 GSLPAVYFIYDLSPIAVTISDARKSFGHFLARTVAGVGGAYAIAGLIDRMIHHSL 391


>gi|12857352|dbj|BAB30984.1| unnamed protein product [Mus musculus]
          Length = 377

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 153/378 (40%), Gaps = 63/378 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------ 294
           N +H I H SFG  +        PLDGT   A +   MF Y+I ++PT            
Sbjct: 212 NFSHRIDHCSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVMPTKLHTYKISADTH 268

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T
Sbjct: 269 QFSVTERESIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST 328

Query: 352 FMLVDALLHSCVKKISKV 369
                 +LH   K I ++
Sbjct: 329 ----TGMLHGIGKFIVEI 342


>gi|449303002|gb|EMC99010.1| hypothetical protein BAUCODRAFT_120300 [Baudoinia compniacensis
           UAMH 10762]
          Length = 387

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 153/369 (41%), Gaps = 67/369 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K    + +KT  GG  T+V      +L   +V  ++   TT E  V+   G 
Sbjct: 23  VRAFDAFPKTKPSYTQKTNNGGIWTVVLVCASLWLAWTEVMRWWWGHTTHEFSVEQGVGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I+LD+VV  + CD L ++  D+SG++              L G+ +Q          
Sbjct: 83  DLQINLDVVV-KMRCDDLHVNVQDASGDR-------------ILAGETLQRDATLWSQWG 128

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYG-----AETETRKCCNTCNEVKEAYRYKKWALPE 181
             +K+ T   T    LE      S YG     AE +        +  K   ++KK   P 
Sbjct: 129 ANRKLHTLGATRDERLEMTGY--SSYGDAREYAEDDVHDYLGAASSTK---KFKK--TPR 181

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
           +               K+   + C+IYG +  N+V G FHI      +  H ++   Q  
Sbjct: 182 VP--------------KSKEADSCRIYGSMHGNKVQGDFHIT-----ARGHGYMEFGQHL 222

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY----- 296
             ++FN +HHI  LSFG           PLD T+A  E     F YY+ ++PTIY     
Sbjct: 223 EHSSFNFSHHINELSFGPFYP---SLTNPLDNTLAATEFNFFKFQYYLSVVPTIYTTNAK 279

Query: 297 --ERLDGSKLGGG------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              ++  S +               +  +PG+F  Y++ P+++ I E+  S   L+ +++
Sbjct: 280 ALRKITKSTVFTNQYAVTEQSRPVPENQVPGVFVKYDIEPILLMIAEERNSFPALFIRLV 339

Query: 343 CNISGTYIT 351
             ISG  + 
Sbjct: 340 NVISGVLVA 348


>gi|123435131|ref|XP_001308935.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121890639|gb|EAX96005.1| hypothetical protein TVAG_369150 [Trichomonas vaginalis G3]
          Length = 353

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 84/311 (27%), Positives = 137/311 (44%), Gaps = 44/311 (14%)

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           S  S + I L I+V  + C YL  D  DS G    +V + +   R D +   I       
Sbjct: 65  SGDSLVNISLGILV-DLPCYYLHFDLTDSLGFTQNYVNNTLRFYRYDFNYSLI------- 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                        G T   + D  KC  C+  +     CCN C+ +KE Y+      PE 
Sbjct: 117 -------------GLTNQTMVD--KCYPCFKVQFHNYTCCNGCDRLKENYKLNNLT-PEP 160

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH-VHVHD-IQP 240
           +   QC+   +  +     +E C + G + VNRV GSFHIA G +  +N   H+H+ +  
Sbjct: 161 EKWPQCQ---TNARPDINSSEKCLVKGKVSVNRVRGSFHIAAGRNIYLNDGSHIHELLDD 217

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIY--- 296
           + + AF  +H I H+ FG ++      ++PL   V +A+E  ++  +Y + + P I+   
Sbjct: 218 FPNLAF--SHAIEHIRFGPRII---TAKQPLQNLVMRAKENLTVTHDYSLLVTPVIFVAD 272

Query: 297 -ERLDGS-----KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
            + ++ S      L       PGI+F Y+ +P  ++IT  S+S            +G Y 
Sbjct: 273 NQFIEKSFEYTVYLHPVQDKDPGIYFDYQFTPYTIQITWISRSFRGFLISTAGFTAGLYA 332

Query: 351 TFMLVDALLHS 361
              ++D L HS
Sbjct: 333 IASIIDQLFHS 343


>gi|46137745|ref|XP_390564.1| hypothetical protein FG10388.1 [Gibberella zeae PH-1]
          Length = 376

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 159/362 (43%), Gaps = 61/362 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+   +    LI  ++  +++ + +    V++    
Sbjct: 23  VAAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGELGRWWRGAESHNFEVEAGVSR 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++ I+LDIVV  ++CD + ++  D+SG++      LH +  ++ +  D   K + +  ++
Sbjct: 83  EMQINLDIVV-KMNCDDIHVNVQDASGDRIMAAKRLHTDKTLWGQWAD--NKGVHKLGRD 139

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                 + +V T  G    + ED       +G E       +  + V    +  KWA   
Sbjct: 140 -----DQGRVNTGQGYNDPKYED-----EGFGEE-------HVHDIVALGKKRAKWA--- 179

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
                       T + +    + C+IYG L++N+V G FHI A G  Y  +  H+     
Sbjct: 180 -----------KTPRFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYMGHGEHL----- 222

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
              + FN +H I  LS+G           PLDGTV  A+     F YY+ ++PT+Y    
Sbjct: 223 -DHSKFNFSHIISELSYGPFYP---SLENPLDGTVNTADGNFHKFQYYLSVVPTVYSVNS 278

Query: 301 GSKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
            S L              D  +PGIFF Y++ P+++ + E    +  L+ KI+  ISG  
Sbjct: 279 RSILTNQYAVTEQSKAVDDRYIPGIFFKYDIEPILLTVHESRDGIISLFVKIINIISGVL 338

Query: 350 IT 351
           + 
Sbjct: 339 VA 340


>gi|71013590|ref|XP_758634.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
 gi|46098292|gb|EAK83525.1| hypothetical protein UM02487.1 [Ustilago maydis 521]
          Length = 415

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 85/360 (23%), Positives = 154/360 (42%), Gaps = 63/360 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++  DAF K    + +++  GG +TI+  + +  L+  ++  Y          VDS   
Sbjct: 13  KIRQFDAFPKTQSIYTQRSSKGGLLTIIATVTLLALLWTELSSYLYGERGYSFSVDSRLQ 72

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           S + I++D+ V  + C YL +D  D+ G++ LHV  + + +    DG             
Sbjct: 73  STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DG------------- 113

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                       TT E+   ++  +    E   +K  N     K  YR K    P     
Sbjct: 114 ------------TTFEIGHADRLDALPMQEVSVQKTINQARR-KPVYRKK----PRNK-- 154

Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            +   + + +K  +   +G  C+IYG +EV RV+G+ HI      ++ H ++  ++    
Sbjct: 155 -KFSRQVAFQKTAHIVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SVEHTDH 207

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +V   E+  ++F Y++  +PT++    G K
Sbjct: 208 KLMNLSHVIHEFSFGPYF---PEISQPLDSSVETTEKHFTVFQYFVSAVPTLFIDARGRK 264

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                  G+PGIF  Y++ PL + I ++S SL     ++   + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPLQMTIRQRSTSLFQFLVRLAGVLGGVWV 324


>gi|449479952|ref|XP_004155757.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 266

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 82/301 (27%), Positives = 139/301 (46%), Gaps = 58/301 (19%)

Query: 83  YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
           +L++DA+D SG+  + ++ NI+K RL+  G+ I    + + + V+K+ V  ++     + 
Sbjct: 3   FLSVDAIDMSGKHEVDLDTNIWKLRLNSHGQIIG--TEYLSDLVEKEHVDHKHDHDHDKE 60

Query: 143 ED-PNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTF 201
           +D P+  G    AE   +K               K AL E                    
Sbjct: 61  KDHPHIHGFDQAAENLVKKV--------------KQALEE-------------------- 86

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
            +GC++YG L+V RV+G+FHI+    + +N + V  +    S   N +H I  LSFG K 
Sbjct: 87  AQGCRVYGVLDVQRVAGNFHIS---VHGLN-IFVAQMIFGGSKHVNVSHMIHDLSFGPKY 142

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SKLGGG 307
                   PLDGTV    + +  F YYIKI+PT Y+ +                S +   
Sbjct: 143 PGI---HNPLDGTVRILRDTSGTFKYYIKIVPTEYKYISKAVLPTNQFSVTEYFSPMTDS 199

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           D   P ++F Y+LSP+ V I E+ +S  H  T++   + GT+    ++D  +   ++ ++
Sbjct: 200 DRSWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAVTGMLDRWMFRFLEALT 259

Query: 368 K 368
           K
Sbjct: 260 K 260


>gi|242006215|ref|XP_002423949.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
 gi|212507219|gb|EEB11211.1| Endoplasmic reticulum-golgi intermediate compartment protein,
           putative [Pediculus humanus corporis]
          Length = 349

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 164/376 (43%), Gaps = 70/376 (18%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + +K LDAF K      E +  GG ++I+ ++ + +++  ++  Y     T +   D   
Sbjct: 14  KSVKVLDAFPKVDNSCRESSPVGGTLSIISYILMLWILYSEITYYTNSKITYKFLPDVDF 73

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLD-GKPIQEP 118
             K+ I+LD+ V  + C  ++ D +DS+ +       LH E+  +    DL+  + I   
Sbjct: 74  DQKVKIYLDMTV-AMPCSAVSADILDSTQQSVFNFGELHEENTWF----DLEPSQKINFD 128

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
           Q + VNA+ ++                                    +EV E Y +K  +
Sbjct: 129 QIKNVNALLRQDY----------------------------------HEVHE-YLWKSAS 153

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
              ++  V  KN      L N   + C+IYG L +N+V+G+FHI+ G S  +   H+H  
Sbjct: 154 PSFINVYVPRKN------LPNRPYDACRIYGELVLNKVAGNFHISAGKSLQLPRGHIHIA 207

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
              +   FN +H + + SFG           PL+G    A +    + Y+I+++PT  + 
Sbjct: 208 TFMSDKEFNFSHRLNYFSFG---DYSPGIVHPLEGDEKIATDAMMSYQYFIEVVPTEVKT 264

Query: 299 LDGSKL---------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
              ++L                 G  G+PGIFF Y++S L V + ++  S  +   K+  
Sbjct: 265 FLTNQLTYQYSVKDYQRPINHNTGSHGIPGIFFKYDMSALKVIVMQERDSPINFAVKLCA 324

Query: 344 NISGTYITFMLVDALL 359
           +I G +IT  LV+ ++
Sbjct: 325 SIGGIHITSGLVNNII 340


>gi|123408947|ref|XP_001303296.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884664|gb|EAX90366.1| hypothetical protein TVAG_036780 [Trichomonas vaginalis G3]
          Length = 364

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/389 (24%), Positives = 157/389 (40%), Gaps = 67/389 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           R   LD F K   +    T  GG ++++    I  L  +++  +      + L V + R 
Sbjct: 2   RFSKLDLFEKLDNNHRTGTTTGGILSLITIGLIISLFVIEIKSFLNPPLRQRLSVVNKRP 61

Query: 66  S-------------KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-EHNIYKRRLDLD 111
           +             K  ++ DI  P   C  L  D +D+  +  L     NI   R   D
Sbjct: 62  TEADGVTITKESQEKTKVNFDIFFPNAPCYLLHFDLIDAVSQLDLFTYNQNITYTRFSSD 121

Query: 112 GKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE--TETRKCCNTCNEVK 169
           GK I +            KVT              +CG C   +   +  KCCNTC +V 
Sbjct: 122 GKIIGDFDHSA--RFNTSKVT--------------ECGFCNATKGLKDKYKCCNTCQQVL 165

Query: 170 E-AYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY 228
           E A  ++   +P      QC ++   ++LK    EGC+I G  E  ++   FHI+PG  Y
Sbjct: 166 EVAQVFRVVDIP------QCSDK--VKELKKMQNEGCRIKGNFETIKIKAEFHISPG--Y 215

Query: 229 SI---NHVHVHDIQPYTS--AAFNTTHHIRHLSFGIKLQDDDERRKPLDG-TVAKAEEGA 282
           S+   + VH HD+  +    +  N ++ + H  FG      D+    LDG +  + + G 
Sbjct: 216 SVIDEDGVHAHDVSSFIDDVSELNLSYKLNHCRFG------DQNHSQLDGFSTIQKQIGY 269

Query: 283 SMFNYYIKI-----IPTIY-ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
               Y I +       T Y E++D   L      +PGI F Y+   +  K       L H
Sbjct: 270 FYAVYTIDVSENNDYSTAYMEQVDNGTL------VPGIVFKYDFGIITAKSFPDRPPLIH 323

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKK 365
           L++ ++    G  + F ++D  L S +K+
Sbjct: 324 LFSNLVSMAGGVAMIFYILDYALFSSIKQ 352


>gi|401416963|ref|XP_003872975.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322489202|emb|CBZ24457.1| conserved hypothetical protein [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 368

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 85/391 (21%), Positives = 153/391 (39%), Gaps = 71/391 (18%)

Query: 13  FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG--SKLP 69
           F KP ED+  E+T +G  +++     +  L+  +   Y +     +  +   RG    +P
Sbjct: 2   FPKPKEDYQREQTRWGAVLSVATVSIVIILVLWEGAAYLRGRDAYDTDISLDRGLSEDMP 61

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           +H D+  P + C+ L++D VD++G    +    ++K    LDG+ +           K  
Sbjct: 62  VHFDVFFPFMPCNRLSIDVVDTTGMAKFNYTGTLHKLPTALDGRVL----------YKGS 111

Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
               +N   T E  +  KC  C      G   E R     KCC+TC  V + Y+     +
Sbjct: 112 LKDLDNAMETEEARNGTKCRPCPPSAFDGVAAEVRSAAVSKCCDTCESVLDLYKELGKGI 171

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
           P  + + QC  +   +        GC + G L++ +V  +    P   G  YS+  V   
Sbjct: 172 PGTEYLPQCLEQLYQQ------ASGCNVVGSLDLKKVHVTVIFGPRRTGRFYSLKDV--- 222

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFN-------YYI 289
                     +T+H IR L  G    D+   R   +G VA+   G   F+       Y +
Sbjct: 223 -------IRLDTSHSIRKLRIG----DEAVERFSKNG-VAEPLSGHKSFSKTYSETRYLV 270

Query: 290 KIIPTIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSK 332
           K++PT Y +                      + G  G +P + F +E +P+ V    + +
Sbjct: 271 KVVPTTYRKTKKRNAKASTYEYSAQWSKRTIVVGFAGAVPAVLFEFEPAPIQVNNVFERQ 330

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
              H   ++   + G ++    +D ++   V
Sbjct: 331 PFSHFVVQLCGIVGGLFVVLGFIDNVVDWAV 361


>gi|322791472|gb|EFZ15869.1| hypothetical protein SINV_02690 [Solenopsis invicta]
          Length = 403

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 158/384 (41%), Gaps = 72/384 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGA--VTIV------------CWLFISYLICVDVCDYFQV 52
           +K LDAF K  E + +KT  GG   +T++                I+YLI  +   Y   
Sbjct: 12  VKELDAFPKVPELYVDKTAVGGTCELTVINKIFSIIHISIFTIFIIAYLIIAETSYYLDS 71

Query: 53  STTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
               +   D+   +KL I++D+ V  + C  +  D +DS+  QH+          +D D 
Sbjct: 72  RLQFKFEPDTEIDAKLQINIDVTV-AMPCGRIGADVLDSTN-QHM----------IDFD- 118

Query: 113 KPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
                              + +   T  EL    +      A  E  K  N+   ++E Y
Sbjct: 119 -------------------SLKEEDTWWELTAEQR------AHFEALKHMNSY--LREEY 151

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINH 232
                 L + + ++            +     C+++G L VN+V+G+FHI  G S S+ H
Sbjct: 152 HAIHELLWKSNQVILYSEMPKRTSEPDYAPNACRVHGSLNVNKVAGNFHITAGKSLSVPH 211

Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
            H+H     T   +N TH I   SFG           PL+G    A+    ++ Y+++++
Sbjct: 212 GHIHISAFMTDRDYNFTHRINRFSFG---GPSPGIVHPLEGDEKIADNNMMLYQYFVEVV 268

Query: 293 PT-IYERLDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           PT I   L  SK                 G  G+PGIFF Y++S L +K+T++  ++   
Sbjct: 269 PTDIRTLLSTSKTYQYSVKDHQRPIDHHKGSHGIPGIFFKYDMSALKIKVTQERDTIFQF 328

Query: 338 WTKIMCNISGTYITFMLVDALLHS 361
             K+   + G ++T  L+  ++ S
Sbjct: 329 LVKLCATVGGIFVTSGLIKNIVQS 352


>gi|395744111|ref|XP_003780425.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Pongo abelii]
          Length = 387

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 155/379 (40%), Gaps = 64/379 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 22  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C  +  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 82  KLRINIDITV-AMKCQCIGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 166

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 167 ------PREDDSSQSPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESY 220

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA-EEGASMFNYYIKIIPT----------- 294
           N +H I HLSFG  +        PLDGT   A +    MF Y+I ++PT           
Sbjct: 221 NFSHRIDHLSFGELVP---AIINPLDGTEKIAIDRKHQMFQYFITVVPTKLHTYKISADT 277

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + 
Sbjct: 278 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 337

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   K I ++
Sbjct: 338 T----TGMLHGIGKFIVEI 352


>gi|162852511|emb|CAO03348.2| ERGIC and golgi 3 [Homo sapiens]
          Length = 118

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 51/109 (46%), Positives = 69/109 (63%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DA+ K  EDF  KT  G  VTIV  L +  L   ++  Y       EL+VD SRG 
Sbjct: 1   LKQFDAYPKTLEDFRVKTCGGATVTIVSGLLMLLLFLSELQYYLTTEVHPELYVDKSRGD 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
           KL I++D++ P + C YL++DA+D +GEQ L VEHN++K+RLD DG P+
Sbjct: 61  KLKINIDVLFPHMPCAYLSIDAMDVAGEQQLDVEHNLFKQRLDKDGIPV 109


>gi|310800159|gb|EFQ35052.1| hypothetical protein GLRG_10196 [Glomerella graminicola M1.001]
          Length = 377

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 149/369 (40%), Gaps = 74/369 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   +   +L C +V  +++ S T    V+   G 
Sbjct: 23  VSAFDAFPKAKPQYVTRTEGGGKWTVAMAVISFFLFCTEVGRWWRGSETHTFAVEKGVGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD-----LDGKPIQEPQKE 121
           ++ I+LDIVV  + CD L ++  D++G++ L    ++ KR        +D K I    K+
Sbjct: 83  EMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSKGIHRLGKD 139

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                 K KV T  G    E          +G E       +  + V    +  KW    
Sbjct: 140 -----SKGKVVTGAGWQEEE---------GFGEE-------HVHDIVSLGKKKAKWG--- 175

Query: 182 LDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
                         K    + EG  C+IYG L+VNRV G FHI A G  Y     H+   
Sbjct: 176 --------------KTPRLWGEGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGAHL--- 218

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                AAFN +H I  LSFG           PLD TV  A      F YY+ ++PT+Y  
Sbjct: 219 ---DHAAFNFSHIISELSFGPFYP---SLVNPLDRTVNLARINFHKFQYYLSVVPTVYTV 272

Query: 299 LDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              +                     D  +PGIFF Y++ P+++ + E       L  KI+
Sbjct: 273 GKSASSSNTIFTNQYAVTEQSKETDDHNIPGIFFKYDIEPILLSVEESRDGFLQLLMKIV 332

Query: 343 CNISGTYIT 351
             +SG  + 
Sbjct: 333 NIVSGVLVA 341


>gi|388858415|emb|CCF48009.1| uncharacterized protein [Ustilago hordei]
          Length = 415

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 157/360 (43%), Gaps = 63/360 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++  DAF K    + +++  GG +TI+  + + +L+  ++  Y          VDS   
Sbjct: 13  KIRQFDAFPKTQSIYTQRSSKGGLLTIISTVTLLFLLWTELSSYLYGERAYSFAVDSQLS 72

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           S + I++D+ V  + C YL +D  D+ G++ LHV  + + +    DG        + ++A
Sbjct: 73  STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFDIGHADRLDA 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + +++++ +                             T N+ ++   Y+K    +    
Sbjct: 127 MPREELSVQK----------------------------TINQARKKPLYRKKPKNK---- 154

Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            +   + +  K  +   +G  C+IYG +EV RV+G+ HI      ++ H ++  ++    
Sbjct: 155 -KFSRQVAFHKTAHIVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SLEHTDH 207

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +V   ++  ++F Y+I  +PT++    G K
Sbjct: 208 KLMNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFISAVPTLFVDARGRK 264

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                  G+PGIF  Y++ P+ + I E+S +      ++   + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPIQMTIRERSSTFVQFLVRLAGVLGGVWV 324


>gi|258573091|ref|XP_002540727.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900993|gb|EEP75394.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 398

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 99/413 (23%), Positives = 167/413 (40%), Gaps = 94/413 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   +  GG  T+  +LF   L+  ++  +++ +      V+     
Sbjct: 24  LRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGSLVFSELVSWYRGTENHHFSVEKGVSQ 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
           ++ I+LD+VV  + C+ L ++  D+ G+  L  E         + + R L+   K    P
Sbjct: 84  EIQINLDMVV-HMPCEALRMNMQDAVGDFILAAELLHKDDTSWDAWNRELNYASKG-GSP 141

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
           Q + +NA        E+ T   E E+    G   G             EV+ +++ K   
Sbjct: 142 QYQTLNA--------EDDTRLAEQEEDQHVGHVLG-------------EVRRSWKRKFPK 180

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
            P+L             K K+   + C+IYG LE N+V G+FHI A GL Y        D
Sbjct: 181 GPKL-------------KSKDAM-DSCRIYGSLEGNKVQGNFHITARGLGY-------WD 219

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
              +     N TH I  LSFG +         PLD TVA  ++    + YY+ ++PTIY 
Sbjct: 220 PSGFHLEGLNFTHLITELSFGPRYS---TLLNPLDKTVAGTKDAFYKYQYYLSVVPTIYT 276

Query: 298 RL-----------DGSKLGGGD-----------------------GGMPGIFFSYELSPL 323
           R            D S +                             +PGIFF +++ P+
Sbjct: 277 RAGTVDPYNQELPDPSTITSRQRKNTIFTNQYAVTSQSHAIPQNVRAVPGIFFKFDIEPI 336

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT----FMLVDALLHSCVKKISKVEIG 372
           ++ ++E+  SL  L  +++  +SG  +     F L    L    ++   + +G
Sbjct: 337 LLVVSEERGSLLALLVRLVNVVSGVLVAGGWVFQLATWALEVWGRRRKGMSLG 389


>gi|123361353|ref|XP_001295947.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121875215|gb|EAX83017.1| hypothetical protein TVAG_111750 [Trichomonas vaginalis G3]
          Length = 338

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 92/313 (29%), Positives = 130/313 (41%), Gaps = 53/313 (16%)

Query: 81  CDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTT 140
           C+ L LD +DS G + L V   +  RR++         +K  +    KKK          
Sbjct: 57  CEVLHLDILDSIGHKQLLVNDTLKWRRVN--------QEKGFMELYNKKK---------- 98

Query: 141 ELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK-KWALPELDTIVQCKNEYSTEKLKN 199
                 +C SCY    + R CCN C ++KE Y    K A PE     QCK E    K K 
Sbjct: 99  ------QCHSCYDF-YDNRFCCNGCEKLKEIYHSNNKTATPE--NWTQCKPE---NKQKF 146

Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLS---YSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
              E C + G + VNRV GSFH+A G S   Y   H+ + D   Y +  F+  H I  L 
Sbjct: 147 DPNEKCHVKGKISVNRVPGSFHLAIGQSIEDYGHQHILLDD---YQTITFD--HDIIDLR 201

Query: 257 FGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGG------ 310
           FG  +        PL GT  K+        Y + I P ++   DG  +  G         
Sbjct: 202 FGANI---PMTSHPLRGTHIKSTGEPLATEYNLIITPIVF-YADGQYIEKGFEYVYFYSM 257

Query: 311 ----MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
               +PGI+F Y  +P  + +T +S+S           +SG Y  F +V   L    +K 
Sbjct: 258 TYHLVPGIYFYYSFTPYTIAVTWQSRSFRSFLISTGGLLSGIYAIFSMVSTFLEKSDQKK 317

Query: 367 SKVEIGGKTVTKR 379
            KVE   + V ++
Sbjct: 318 KKVETKAEAVAEK 330


>gi|322792517|gb|EFZ16475.1| hypothetical protein SINV_13267 [Solenopsis invicta]
          Length = 110

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 53/109 (48%), Positives = 64/109 (58%), Gaps = 17/109 (15%)

Query: 281 GASMFNYYIKIIPTIYERLDGS----------------KLGGGDGGMPGIFFSYELSPLM 324
           GA MF +YIKI+PT Y R DGS                 L  G+ GMPGIFFSYELSPLM
Sbjct: 1   GAMMFYHYIKIVPTTYVRADGSTLLTNQFSVTRHAKQVSLLTGESGMPGIFFSYELSPLM 60

Query: 325 VKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIG 372
           VK TEK+KS GH  T     I G +    L+D+LL+  V+ I  K+E+G
Sbjct: 61  VKYTEKAKSFGHFATNTCAIIGGVFTVAGLIDSLLYHSVRAIQRKIELG 109


>gi|343427702|emb|CBQ71229.1| conserved hypothetical protein [Sporisorium reilianum SRZ2]
          Length = 412

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 79/360 (21%), Positives = 157/360 (43%), Gaps = 63/360 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++  DAF K    + +++  GG +TIV  + +  L+  ++  Y          VD    
Sbjct: 13  KIRQFDAFPKTQSIYTQRSSKGGILTIVSTVTLLALLWTELSSYLYGERGYSFAVDQQLQ 72

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           S + I++D+ V  + C YL +D  D+ G++ LHV  + + +    DG   +    + ++A
Sbjct: 73  STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDSEFTK----DGTTFEIGHADRLDA 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + +++V+ +                             T N+ ++   Y+K    +    
Sbjct: 127 MPREEVSVQK----------------------------TINQARKKPLYRKKPKNK---- 154

Query: 186 VQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
            +   + +  K  +   +G  C+IYG +EV RV+G+ HI      ++ H ++  ++    
Sbjct: 155 -KFSRQVAFHKTAHVVPDGPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SMEHTDH 207

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +V   ++  ++F Y++  +PT++    G K
Sbjct: 208 KLMNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFVSAVPTLFVDARGRK 264

Query: 304 LGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                  G+PGIF  Y++ PL + I E+S +L     ++   + G ++
Sbjct: 265 LHTHQYSVTDYTRQIEHGKGVPGIFIKYDIEPLQMTIRERSTTLLQFLVRLAGVLGGVWV 324


>gi|157865526|ref|XP_001681470.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68124767|emb|CAJ02321.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 365

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 85/390 (21%), Positives = 154/390 (39%), Gaps = 69/390 (17%)

Query: 13  FTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQV--STTEELFVDSSRGSKLP 69
           F KP ED+  E+T +G  +++     +  L+  +   Y +   + + ++ +D      +P
Sbjct: 2   FPKPKEDYQREQTRWGAVLSVSTVSIVILLVLWEGAAYLRGRDAYSTDVSLDKGLSEDMP 61

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           +H D++ P + C+ L++D VD++G    +    ++K    LDG        EV+     K
Sbjct: 62  VHFDVLFPFMPCNRLSIDVVDTTGMAKFNYTGRLHKLPTALDG--------EVLYKGSLK 113

Query: 130 KVTTENGTTTTELEDPNKCGSCY-----GAETETR-----KCCNTCNEVKEAYRYKKWAL 179
            +  +N   T E+    KC  C      G   E R     KCC+TC  V   Y+     +
Sbjct: 114 DL--DNEMETEEVRTGKKCRQCPPSAFDGVAAEVRSAAASKCCDTCESVLGLYKELGRGV 171

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSINHVHVH 236
           P  + I QC  +            GC + G L++ +V  +    P   G  YS+  V   
Sbjct: 172 PGTEYIPQCLEQLYQR------ASGCAVMGSLDLKKVPVTVIFGPRRTGQFYSLKDV--- 222

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAK------AEEGASMFNYYIK 290
                     +T+H IR L  G    D+   R   +G   +      + +  S   Y +K
Sbjct: 223 -------IRLDTSHFIRKLRIG----DETVERFSKNGVAERLSGHKSSSKTYSETRYLVK 271

Query: 291 IIPTIYERLDGSK-----------------LGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
           ++PT Y +                      L G  G +P + F +E +P+ V    + + 
Sbjct: 272 VVPTTYRKTKTKNAKASTYEYSAQWSRRTILVGFAGAVPAVLFEFEPAPIQVNNVFERQP 331

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCV 363
             H   ++   + G ++    +D ++   V
Sbjct: 332 FSHFLVQLCGIVGGLFVVLGFIDNVVDWVV 361


>gi|340502903|gb|EGR29544.1| hypothetical protein IMG5_153610 [Ichthyophthirius multifiliis]
          Length = 342

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 90/393 (22%), Positives = 170/393 (43%), Gaps = 88/393 (22%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK +D + K   D  E TV G  ++I   L +  L   +   Y  ++ T E+++D  R 
Sbjct: 9   KLKSIDMYRKLPTDLTESTVSGAMISIASSLIMLILFISEFNGYLSITETSEMYIDEKRY 68

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            K+ I++DI  P + CD ++           L VE        DL G             
Sbjct: 69  DKIRINIDIDYPRLPCDVIS-----------LDVE--------DLKGT------------ 97

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                   E     T + + N+         +T+K  ++ +E  + +             
Sbjct: 98  ---HSYQLEGNIQITRISNTNQY-------FDTQKYDDSHSENNQEF------------- 134

Query: 186 VQCKNEYSTEKLKNTF--TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
               +E    +LK+ F   EGC+I G++ VN+  G+FH++   ++S + + +H I  + +
Sbjct: 135 ----SEARLNRLKSAFLDQEGCKIQGHIFVNKAPGNFHVS---AHSFDRI-LHQIASHVN 186

Query: 244 -AAFNTTHHIRHLSFG-----IKLQDDDERR---KPLDGTVA-KAEEGASM---FNYYIK 290
            +  + +H I H+SFG     I+++   + +    PLD T   K E+  ++   + YYI 
Sbjct: 187 ISTIDVSHIINHISFGDETDIIRIKRQFKSQGILDPLDRTRKIKTEDQKNISISYQYYIN 246

Query: 291 IIPTIYERLDGS-----KLGGGDG-----GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           ++ T Y  +        +    +       +P  FF Y+LSP++V+ ++   S  H   +
Sbjct: 247 VVHTTYVNIQKKEYSVYQFTANNNELLSDRLPACFFRYDLSPVIVRFSQSRMSFLHFIVQ 306

Query: 341 IMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
           +   I G +    ++D+++H S V  + K E+G
Sbjct: 307 VCAIIGGVFTVAGIIDSIIHKSVVHILKKAEMG 339


>gi|209877186|ref|XP_002140035.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555641|gb|EEA05686.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 384

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/401 (21%), Positives = 171/401 (42%), Gaps = 68/401 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-RG 65
           ++  DAF+KP  +F  KT +GG +TI+  L + +L   ++  Y +V+  +E+ VD +  G
Sbjct: 1   MQRFDAFSKPIAEFRIKTAFGGYLTILSILTMLFLFYSELRYYLKVNRNDEITVDKTLAG 60

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + I + +  P + C+ + L  +++                   D      P+  ++  
Sbjct: 61  GNVNIKMLVEFPKLPCEVVGLRILNTQ------------------DNTEFSHPKDSII-Y 101

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +    +  E+   ++       CGSCY   ++   CCNTC+EV  +Y+     LP+    
Sbjct: 102 IPINPLNEESNIGSS-------CGSCYNP-SKKNHCCNTCSEVIRSYQEDNIKLPQKINF 153

Query: 186 VQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
            QCK +   E+L+   +      GC+I   + + +V G   I+     + N +   DI  
Sbjct: 154 EQCKFD-PRERLEKAISAPLNISGCKIKVDINIPKVKGRIEISHKRWMNYNEMTNLDIS- 211

Query: 241 YTSAAFNTTHHIRHLSFGIKLQ------DDDERRKPLDGTVAKAEEGASMFNYYIKI--- 291
             +  +N ++ +++L +G  L       ++ E  +    T  K  +   + + ++ I   
Sbjct: 212 -EAHLYNFSYIVKYLHYGDDLPGINNIWNNQEYIQTAKFTHNKESDNLFLEDAHLDIDMH 270

Query: 292 -IPTIYERLDGSK------------------LGGG----DGGMPGIFFSYELSPLMVKIT 328
            IPT +  ++  K                  L  G    +  +PGI+ +Y+ +P +VKIT
Sbjct: 271 CIPTQFNSINSKKTKIGHQFSVRKQSKQVNVLNNGRFVPETSLPGIYINYDFTPFIVKIT 330

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           E  +S     T+    I G +    ++D  +      ++++
Sbjct: 331 ESRRSFLSFLTECCAIIGGIFAFSSMIDIFMFKLSSFLNRI 371


>gi|342878666|gb|EGU79974.1| hypothetical protein FOXB_09504 [Fusarium oxysporum Fo5176]
          Length = 376

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 154/361 (42%), Gaps = 59/361 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+   +    LI  ++  +++ + +    V++    
Sbjct: 23  VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLVLIWGELGRWWRGAESHNFEVEAGVSR 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLD----GKPIQEPQKEV 122
           +L I++DIVV  ++CD + ++  D+SG+      H +  +RL  D     + +       
Sbjct: 83  ELQINMDIVV-KMNCDDIHVNVQDASGD------HILAAKRLKADRTLWSQWVDNKGMHK 135

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +    + +V T +G      ED       +G E       +  + V    +  KWA    
Sbjct: 136 LGRDSQGRVNTGSGYNELGYED-----EGFGEE-------HVHDIVALGKKRAKWA---- 179

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
                      T K +    + C+IYG L++N+V G FHI A G  Y  N  H+      
Sbjct: 180 ----------KTPKFRGN-ADSCRIYGSLDLNKVQGDFHITARGHGYRGNGEHL------ 222

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
             + FN +H I  LS+G           PLDGTV  A +    F YY+ ++PT+Y     
Sbjct: 223 DHSKFNFSHIISELSYGPFYP---SLVNPLDGTVNTAPDNFHKFQYYLSVVPTVYSVNSK 279

Query: 302 SKLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           S L              +  +PGIFF Y++ P+++ + E    +  L  K++  +SG  +
Sbjct: 280 SILTNQYAVTEQSKAVDERYIPGIFFKYDIEPILLTVHESRDGIISLLVKVINIMSGVLV 339

Query: 351 T 351
            
Sbjct: 340 A 340


>gi|403330686|gb|EJY64240.1| hypothetical protein OXYTRI_24846 [Oxytricha trifallax]
          Length = 345

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 160/398 (40%), Gaps = 94/398 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            SER+K  D +    +D  E +  G  V++     +  LI      + Q   T E+ +D 
Sbjct: 9   LSERIKFFDFYKDLPQDLAEPSWSGATVSMFVMGLMVALIISQTYSFMQFQRTSEILIDV 68

Query: 63  SRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           + G SKL I+++I +    C  L+LD VD +G   + V   ++K  LD DG  +      
Sbjct: 69  NSGNSKLNININITMHKAPCHVLSLDIVDVTGVHVMDVGGKLHKHSLDKDGFYL------ 122

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                         G   T  E P           E ++  +  N++   YR        
Sbjct: 123 --------------GHHDTMDEGP-----------EFKQASSDVNDI---YR-------- 146

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
            DTI    ++           EGC + G + +N+V G+FH++   ++S   V V  I   
Sbjct: 147 -DTIKAMDDQ-----------EGCMVEGTVIINKVPGNFHLS---THSFGEV-VQKIY-M 189

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKP------------LDGTVAKAEE----GASMF 285
                + TH + HLSFG     DD++ K             +DGT     +    G  + 
Sbjct: 190 NGKKLDFTHTVNHLSFG-----DDKQMKSIQSKYNEKYTFDMDGTYVDQNQHLYQGQLLA 244

Query: 286 NYYIKI--------IPTIYERLDG-----SKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
           NYY+ I            Y+ L G     SK      G+P IFF YELSP+ ++ T   K
Sbjct: 245 NYYLDINQVDYLDATGIFYKLLQGFKYKSSKSIMAQMGLPAIFFRYELSPVKLQYTMTYK 304

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
           S    + +I   I G Y+   ++++ L + +   S  E
Sbjct: 305 SWSEFFIEISAIIGGMYVVAGIIESFLRNSLSIFSSDE 342


>gi|429862433|gb|ELA37083.1| copii-coated vesicle protein [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 375

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 147/364 (40%), Gaps = 64/364 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   +   +L   +V  +++ S T    V+   G 
Sbjct: 23  VSAFDAFPKAKPQYVTRTSGGGKWTVAMAVISLFLFWTEVGRWWRGSETHTFAVEKGVGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ I+LDIVV  + CD L ++  D++G++                          ++ A 
Sbjct: 83  EMQINLDIVV-RMHCDDLHINVQDAAGDR--------------------------ILAAS 115

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K K+    + T  ++  D NK     G +T+ R       + +E +  +       D + 
Sbjct: 116 KLKR----DKTNWSQWVD-NKGIHRLGRDTKGRIVTGEGWQEEEGFGEEH----VHDIVA 166

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
             K      K    + EG  C+IYG L+VNRV G FHI A G  Y     H+        
Sbjct: 167 IGKKRAKWAKTPKLWGEGDSCRIYGNLDVNRVQGDFHITARGHGYMEFGEHL------DH 220

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
           AAFN +H I  +SFG           PLD TV  A      F YY+ ++PT+Y     + 
Sbjct: 221 AAFNFSHIISEMSFGPFYP---SLVNPLDRTVNAARINFHKFQYYLSVVPTVYTVGKSAS 277

Query: 304 LGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                               D  +PGIFF Y++ P+++ + E          KI+  +SG
Sbjct: 278 TSNTIFTNQYAVTEQSKEVDDHNVPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSG 337

Query: 348 TYIT 351
             + 
Sbjct: 338 VLVA 341


>gi|169778245|ref|XP_001823588.1| COPII-coated vesicle protein (Erv41) [Aspergillus oryzae RIB40]
 gi|83772325|dbj|BAE62455.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 390

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K   D+   +  GG  T++  L  S     +   +F+ S      V+     
Sbjct: 24  LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + CD L ++  D+SG++              L G+ +++         
Sbjct: 84  DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K+    E  T + E  +P++      A+ E     +   EV+   R K    P+L    
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                      +    + C+IYG LE N+V G FHI A G  Y     H+        + 
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
           FN +H I  LSFG           PLD T+A  E     + Y++ ++PTIY +     LD
Sbjct: 224 FNFSHMITELSFGTHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280

Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
                                    G++L      +PGIFF Y + P+++ I+E+  S  
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340

Query: 336 HLWTKIMCNISGTYIT 351
            L  +++  +SG  +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356


>gi|328725267|ref|XP_003248406.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like, partial [Acyrthosiphon pisum]
          Length = 129

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 53/115 (46%), Positives = 72/115 (62%), Gaps = 10/115 (8%)

Query: 7   LKGLDAFTKPYEDFHEKTV-YGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           LK  DAF KP E    KTV +    +  C+L +S     +  +Y  +  TEELF D+S+ 
Sbjct: 13  LKQFDAFAKPLEGVQMKTVCFFALFSNHCFLMVS-----NSVEY--LDNTEELFADTSQN 65

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
            KL I+ DIVV  ISCD+L  +AV++SG  +L V+HNIYK RL+L G+PI  P+K
Sbjct: 66  KKLQINFDIVVLKISCDFL--NAVENSGVTNLQVDHNIYKWRLNLGGQPISNPEK 118


>gi|303313533|ref|XP_003066778.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240106440|gb|EER24633.1| hypothetical protein CPC735_060030 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320036232|gb|EFW18171.1| COPII-coated vesicle protein [Coccidioides posadasii str. Silveira]
          Length = 399

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/390 (22%), Positives = 157/390 (40%), Gaps = 93/390 (23%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   +  GG  T+  +LF   L+  ++  +   +      V+     
Sbjct: 24  LRTFDAFPKTKPTYTTASRRGGQWTVFTFLFCGILVLSELISWHGGTENHHFSVEKGVSE 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
           ++ ++LD+VV  + CD L ++  D++G+  L  E         + + R ++  GK     
Sbjct: 84  EIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAELLHKTPTSWDAWNREMNFAGKG---- 138

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
                 + + + ++ E+     E E+    G   G             EV+ +++ +   
Sbjct: 139 -----GSRQYQTLSAEDNVRLAEQEEDQHVGHVLG-------------EVRRSWKRQFPP 180

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY--SINHVHV 235
            P+L               +    + C+IYG LE N+V G+FHI A GL Y      V+V
Sbjct: 181 GPKLK--------------RKDVVDSCRIYGSLEGNKVQGNFHITAKGLGYYDPTGMVNV 226

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
           +D+        N TH I  LSFG           PLD TVA  ++    + YY+ ++PTI
Sbjct: 227 NDM--------NFTHLITELSFGPHYP---TLLNPLDKTVAATKDKFYKYQYYLSVVPTI 275

Query: 296 YERL----------------------------------DGSKLGGGDGGMPGIFFSYELS 321
           Y R                                       +  G   +PGIFF +++ 
Sbjct: 276 YTRAGTVDPYSQRLPDPSTITPSQRKNTIFTNQYAVTSQSRTISQGPYSVPGIFFKFDIE 335

Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
           P+++ ++E+  SL  L  +++  +SG  + 
Sbjct: 336 PILLVVSEERGSLLALLVRLVNVVSGVLVA 365


>gi|412991249|emb|CCO16094.1| predicted protein [Bathycoccus prasinos]
          Length = 409

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 99/438 (22%), Positives = 157/438 (35%), Gaps = 126/438 (28%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LDA+ K  +    +T  G  V+++    +  L   ++ +Y      +++ VD ++  
Sbjct: 17  LSSLDAYKKIEDHLMVRTTSGAIVSLLGIALMCILGASEILNYITPPVVKQMAVDGTQNE 76

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + + +DI  P + C  L++DA D SG+    V   ++K RL+ DGK +    K      
Sbjct: 77  LMTVRMDITFPRVPCSVLSVDAYDQSGKNDQDVRGELHKERLNKDGKSLGSYDK------ 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
               VT E      +L+         G +   +K      EVK A   K           
Sbjct: 131 AGGGVTDEEDALIQDLQQFFGG----GMKVVFQKRAEHSREVKHAVEKK----------- 175

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA- 245
                           EGC++YG + V RV G+FHI+        H   ++   +   A 
Sbjct: 176 ----------------EGCRLYGRMHVQRVGGNFHISA-------HAEEYETLQHAFGAV 212

Query: 246 --FNTTHHIRHLSFG-------------IKLQDDDE------------------------ 266
              N +H I HLSFG              +   DDE                        
Sbjct: 213 NKINISHTITHLSFGAGYPGLVNPLDGVARSGSDDEFHYDESSKDSRSSDRKNIEKEKEE 272

Query: 267 -----------RRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG---------- 305
                      R + +D T    E G+ ++ Y++K++PT Y       LG          
Sbjct: 273 EEKRKKKEQVRRSRLMDLTW--DENGSGVYKYFLKLVPTFYRTHRSVFLGLFSWTKSVST 330

Query: 306 -------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
                           G +P ++F Y+ SP+ V I  K     +  T+ +C + G    F
Sbjct: 331 NQYSVTEYFRKTDAWSGSLPAVYFLYDFSPIAVTIDTKRPHFVYFLTR-LCAVCGGVFAF 389

Query: 353 M-----LVDALLHSCVKK 365
                 LVDALL    KK
Sbjct: 390 AHMISNLVDALLTIITKK 407


>gi|405119686|gb|AFR94458.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Cryptococcus neoformans var. grubii H99]
          Length = 431

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 90/384 (23%), Positives = 159/384 (41%), Gaps = 49/384 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  K+  GG +T V  L I  L+  D+ +Y   +      VDS    
Sbjct: 32  IKSFDAFPKVESTYTIKSRRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDIQK 91

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D+ V  + C YL +D  D+ G++ LH+ ++  K     DG      +   +   
Sbjct: 92  DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFAK-----DGTHFNVGKATCIKNS 144

Query: 127 KKKKVTTENGT-TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +   + + +   +++    PN+  S  G              +K  + +   +     T 
Sbjct: 145 RSTAIPSASEIISSSRRRTPNQQSSFSG--------------IKRLFGFSSSSSSNRRT- 189

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTS 243
            Q    Y     K      C+IYG +EV +V+ + HI   G  Y S  H   H       
Sbjct: 190 GQGHTAYRPTYDKVEDGPACRIYGSVEVKKVTANLHITTLGHGYMSFQHTDHH------- 242

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ER 298
              N +H +   SFG          +PLD +    E+  ++F Y+++++PT Y      +
Sbjct: 243 -LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRK 298

Query: 299 LDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L  S+    D         G+PG+FF Y+L P+ V I E++ SL     ++   + G + 
Sbjct: 299 LITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVVIRERTTSLYQFLIRLAGVVGGVWT 358

Query: 351 TFMLVDALLHSCVKKISKVEIGGK 374
                  + +   +++SK  +G K
Sbjct: 359 VAAFALRVFNRAQREVSKAVVGEK 382


>gi|321258600|ref|XP_003194021.1| ER to Golgi transport-related protein [Cryptococcus gattii WM276]
 gi|317460491|gb|ADV22234.1| ER to Golgi transport-related protein, putative [Cryptococcus
           gattii WM276]
          Length = 444

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 88/383 (22%), Positives = 154/383 (40%), Gaps = 46/383 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  K+  GG +T V  L I  L+  D+ +Y   +      VDS    
Sbjct: 33  IKSFDAFPKVESTYMIKSKRGGVLTAVVGLIIFLLVLNDLGEYLYGAPDYAFQVDSDVQK 92

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D+ V  + C YL +D  D+ G++ LH+ ++  K     D              +
Sbjct: 93  DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFVKDGTHFD--------------I 136

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K      N ++TT    P+       +   T    ++ + +K  +     +        
Sbjct: 137 GKATSIKNNPSSTT----PSASEIISSSRRRTPNQQSSFSGIKRLFSSSPSSSSSNRRTA 192

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTSA 244
           Q    Y     K      C+IYG ++V +V+ + HI   G  Y S  H   H        
Sbjct: 193 QDHTAYRPTYDKVQDGPACRIYGSVQVKKVTANLHITTLGHGYMSFQHTDHH-------- 244

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ERL 299
             N +H +   SFG          +PLD +     +  ++F Y+++++PT Y      +L
Sbjct: 245 LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITLQPFTIFQYFLRVVPTTYIDASRRKL 301

Query: 300 DGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
             S+    D         G+PG+FF Y+L P+ V I E++ SL     ++   + G +  
Sbjct: 302 ITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVVIRERTTSLFQFLIRLAGVVGGVWTV 361

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
                 + +    ++SK  +G K
Sbjct: 362 AAFALRVFNRATMEVSKAVVGEK 384


>gi|391872305|gb|EIT81439.1| COPII vesicle protein [Aspergillus oryzae 3.042]
          Length = 390

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K   D+   +  GG  T++  L  S     +   +F+ S      V+     
Sbjct: 24  LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + CD L ++  D+SG++              L G+ +++         
Sbjct: 84  DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K+    E  T + E  +P++      A+ E     +   EV+   R K    P+L    
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                      +    + C+IYG LE N+V G FHI A G  Y     H+        + 
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
           FN +H I  LSFG           PLD T+A  E     + Y++ ++PTIY +     LD
Sbjct: 224 FNFSHMITELSFGPHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280

Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
                                    G++L      +PGIFF Y + P+++ I+E+  S  
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340

Query: 336 HLWTKIMCNISGTYIT 351
            L  +++  +SG  +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356


>gi|238495520|ref|XP_002378996.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
           NRRL3357]
 gi|220695646|gb|EED51989.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus flavus
           NRRL3357]
          Length = 390

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 95/376 (25%), Positives = 152/376 (40%), Gaps = 74/376 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K   D+   +  GG  T++  L  S     +   +F+ S      V+     
Sbjct: 24  LKTFDAFPKTKPDYTAPSRRGGQWTVLILLICSVFSISEFKTWFKGSENHHFSVEKGVSH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + CD L ++  D+SG++              L G+ +++         
Sbjct: 84  DLQLNLDIVV-QMPCDALHVNIQDASGDR-------------ILAGELLKKDPTSWKLWT 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K+    E  T + E  +P++      A+ E     +   EV+   R K    P+L    
Sbjct: 130 DKRNYDHEYQTLSRE--EPSRLE----AQEEDAHVRHVLGEVRHNPRRKFPKGPKLR--- 180

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                      +    + C+IYG LE N+V G FHI A G  Y     H+        + 
Sbjct: 181 -----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYRDMGGHL------DHST 223

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-----LD 300
           FN +H I  LSFG           PLD T+A  E     + Y++ ++PTIY +     LD
Sbjct: 224 FNFSHMITELSFGPHYP---TLLNPLDKTIAATESHYYKYQYFLSVVPTIYSKGHQAALD 280

Query: 301 -------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
                                    G++L      +PGIFF Y + P+++ I+E+  S  
Sbjct: 281 STLYTSKPSHSKNVIFTNQYAATSQGAELPENPYYIPGIFFKYNIEPILLMISEERSSFL 340

Query: 336 HLWTKIMCNISGTYIT 351
            L  +++  +SG  +T
Sbjct: 341 SLLIRLVNTVSGVMVT 356


>gi|443897407|dbj|GAC74748.1| CDK9 kinase-activating protein cyclin T [Pseudozyma antarctica
           T-34]
          Length = 414

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 81/349 (23%), Positives = 151/349 (43%), Gaps = 59/349 (16%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +++  DAF K    + +++  GG +TI+  L + +L+  ++  Y          VDS   
Sbjct: 13  KIRQFDAFPKTQSIYTQRSSKGGVLTIISALALVFLLWTELSTYLYGERGYSFAVDSQLQ 72

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           S + I++D+ V  + C YL +D  D+ G++ LHV    +K+    DG        + ++A
Sbjct: 73  STMQINMDMTV-AMKCHYLTIDVRDAVGDR-LHVSDTEFKK----DGTTFDIGHADRLDA 126

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           + ++ +  + G T ++     +    Y  +   +K        K A+      +P+    
Sbjct: 127 LPQEAL--DVGKTISK----ARKKPLYRRKPRNKKFSRQVAFHKTAH-----LVPD---- 171

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
                              C+IYG +EV RV+G+ HI      ++ H ++  ++      
Sbjct: 172 ----------------GPACRIYGSMEVKRVTGNLHIT-----TLGHGYL-SMEHTDHKL 209

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I   SFG       E  +PLD +V   ++  ++F Y++  IPT++    G +L 
Sbjct: 210 MNLSHVIHEFSFGPYFP---EISQPLDSSVETTDKHFTVFQYFVSAIPTLFIDARGRRLH 266

Query: 306 GGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
                            G+PGIF  Y++ PL + I E+S SL     ++
Sbjct: 267 THQYSVTDYARPIEHGKGVPGIFIKYDIEPLQMTIRERSVSLVQFLVRL 315


>gi|119191516|ref|XP_001246364.1| hypothetical protein CIMG_00135 [Coccidioides immitis RS]
 gi|392864406|gb|EAS34753.2| COPII-coated vesicle protein [Coccidioides immitis RS]
          Length = 399

 Score = 92.4 bits (228), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 88/390 (22%), Positives = 157/390 (40%), Gaps = 93/390 (23%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   +  GG  T+  +LF   L+  ++  +   +      V+     
Sbjct: 24  LRTFDAFPKTKPTYTTASRRGGQWTVFIFLFCGMLVLSELISWHGGTENHHFSVEKGVSE 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE--------HNIYKRRLDLDGKPIQEP 118
           ++ ++LD+VV  + CD L ++  D++G+  L  E         + + R ++  GK     
Sbjct: 84  EIQLNLDLVV-RMPCDSLRVNMQDAAGDFILAAELLHKTPTSWDAWNREMNFAGKG---- 138

Query: 119 QKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
                 + + + ++ E+     E E+    G   G             EV+ +++ +   
Sbjct: 139 -----GSRQYQTLSAEDDVRLAEQEEDQHVGHVLG-------------EVRRSWKRQFPP 180

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY--SINHVHV 235
            P+L               +    + C+IYG LE N+V G+FHI A GL Y      V+V
Sbjct: 181 GPKLK--------------RKDVVDSCRIYGSLEGNKVQGNFHITAKGLGYYDPTGMVNV 226

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
           +D+        N TH I  LSFG           PLD TVA  ++    + YY+ ++PTI
Sbjct: 227 NDM--------NFTHLITELSFGPHYP---TLLNPLDKTVAATKDKFYKYQYYLSVVPTI 275

Query: 296 YERL----------------------------------DGSKLGGGDGGMPGIFFSYELS 321
           Y R                                       +  G   +PGIFF +++ 
Sbjct: 276 YTRAGTVDPYSQRLPDPSTITVSQRKNTIFTNQYAVTSQSRTISQGPYSVPGIFFKFDIE 335

Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
           P+++ ++E+  SL  L  +++  +SG  + 
Sbjct: 336 PILLVVSEERGSLLALLVRLVNVVSGVLVA 365


>gi|402885549|ref|XP_003906216.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Papio anubis]
          Length = 364

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 150/379 (39%), Gaps = 78/379 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPT-ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           KL I++DI V     C Y                  N+       D  P Q+  + ++  
Sbjct: 73  KLRINIDITVAMKCQCKY----------------TFNLLNPHAVFDLSPQQKEWQRMLQL 116

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           ++ +            L++ +                      K A++    ALP     
Sbjct: 117 IQSR------------LQEEHSLQDVI---------------FKSAFKSASTALPP---- 145

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
              + + S++       + C+I+G+L VN+V+G+FHI  G +      H H        +
Sbjct: 146 ---REDDSSQS-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHES 197

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------- 294
           +N +H I HLSFG   +       PLDGT   A +   MF Y+I ++PT           
Sbjct: 198 YNFSHRIDHLSFG---ELVPAIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADT 254

Query: 295 ----IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
               + ER        G  G+ GIF  Y+LS LMV +TE+       + ++   + G + 
Sbjct: 255 HQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFS 314

Query: 351 TFMLVDALLHSCVKKISKV 369
           T      +LH   K I ++
Sbjct: 315 T----TGMLHGIGKFIVEI 329


>gi|425765498|gb|EKV04175.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
           digitatum PHI26]
 gi|425783511|gb|EKV21358.1| COPII-coated vesicle protein (Erv41), putative [Penicillium
           digitatum Pd1]
          Length = 396

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/385 (24%), Positives = 159/385 (41%), Gaps = 85/385 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   T  GG  T++  L  +     ++  +++ +      V+     
Sbjct: 23  LKTFDAFPKTKASYTTPTRSGGQWTVLILLICTVFSWSELKTWWRGTENYHFSVEKGVSH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L ++LD+VV  + CD L ++  D++G++              L G+ ++      +  +
Sbjct: 83  ELQLNLDMVV-HMPCDQLRVNIQDAAGDR-------------ILAGELLKRDDTNWLLWM 128

Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETET-RKCCNTCNEVKEAYRYKKWALPEL 182
           +K+   T +G     T   E+ ++      AE E      +   EV+   R K    P +
Sbjct: 129 QKRNYETNDGAHEYQTLSHEESDRL-----AEQEADAHVGHVLGEVRHNPRRKFPKGPRM 183

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
                          +    + C+IYG LE N+V G FHI A G  Y  N  H+      
Sbjct: 184 R--------------RGVVPDACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------ 223

Query: 242 TSAAFNTTHHIRHLSFGIK---LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
             +AFN +H I  LSFG     LQ+      PLD T+A+ EE    F Y++ I+PT+Y R
Sbjct: 224 DHSAFNFSHMITELSFGPHYPTLQN------PLDKTIAETEEHYYKFQYFLSIVPTLYSR 277

Query: 299 ----LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVK 326
               LD                             S +      +PGIFF Y++ P+++ 
Sbjct: 278 GKSALDLYTRSPETLAARHGRNTVFTNQYAATSQSSAIPESPMVVPGIFFKYDIEPILLL 337

Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
           ++E+      L  +++  +SG  +T
Sbjct: 338 VSEERAGFLSLLIRVINTVSGVLVT 362


>gi|123437985|ref|XP_001309782.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121891523|gb|EAX96852.1| hypothetical protein TVAG_470170 [Trichomonas vaginalis G3]
          Length = 344

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 83/310 (26%), Positives = 125/310 (40%), Gaps = 58/310 (18%)

Query: 79  ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT 138
           + C  +++D  D  G        +IYK RLD +  PI  P  +V                
Sbjct: 74  LPCILVSIDIYDVLGTLTDPNSKSIYKLRLDNNRNPI--PYSQV---------------- 115

Query: 139 TTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
                    CGSCYG E  E  +CCNTC +V   +      L  + T  QC N    EK 
Sbjct: 116 ------SQNCGSCYGTEFAEGSRCCNTCEDVVSHHIKAGRPLTNVTTWQQCIN----EKY 165

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
             T  E CQI+G   V+ + G   I P  S         + +P+T    N TH+I H++F
Sbjct: 166 DFTGKEKCQIFGNHHVSAIDGGIRILPRFS--------SNEEPFTK-LLNLTHYIDHITF 216

Query: 258 GIKLQDDDERRKPL-DGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-----GDGGM 311
           G          +PL D  + ++E G   + Y +K +PT+    DGS   G         +
Sbjct: 217 GTSFGP-----QPLDDALIVQSEPGQFHYRYDLKAVPTVMHNQDGSITHGFQYAVDSAKI 271

Query: 312 P---------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
           P         GIFF+Y  + + V       ++  L +++ C   G +    L+D+  +  
Sbjct: 272 PITDRTRLGEGIFFNYYFATVAVVGKPDRFTIYILISRLFCIFGGGFFLARLIDSFGYRI 331

Query: 363 VKKISKVEIG 372
                K+ IG
Sbjct: 332 HTMEGKMRIG 341


>gi|326427137|gb|EGD72707.1| hypothetical protein PTSG_04435 [Salpingoeca sp. ATCC 50818]
          Length = 357

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 90/397 (22%), Positives = 168/397 (42%), Gaps = 82/397 (20%)

Query: 3   FSERLKGLDAFTK--PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
             E++K LD F+K  P     + +  G  VT+V    +  L+  ++ +Y  +    + FV
Sbjct: 10  LQEQVKSLDVFSKVEPDTGITQSSTSGALVTLVTAAIVCVLVWSEISEYNTLKIKYDYFV 69

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPI 115
           D+     + + +D+ V  + CD++  D ++ SGE     ++L +E   ++         +
Sbjct: 70  DTDLRRDMNMTVDMTV-AMQCDHIGADYINLSGESTDGSKYLKLEPAHFE---------L 119

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
              Q E + A  K  V +E G+               G ++ +R    +  E        
Sbjct: 120 SPNQLEWLEAWAK--VKSEEGSR--------------GLDSLSRFLHGSMREPMPT---- 159

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHV 233
             A PE+D+                  + C+++G L V +V+ +FHI  G S  +S  H 
Sbjct: 160 --AAPEIDS----------------EPDACRLHGVLPVAKVAANFHITAGKSVHHSRGHS 201

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           HV+ + P    A N +H I   SF     ++      LDG +   ++   +F Y+++++P
Sbjct: 202 HVNSMVP--PDAVNFSHRIDRFSF----SEEPRGAMALDGDLRTTDQPRQVFQYFLEVVP 255

Query: 294 TIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +  +RL   +               L  G  G+PGI+F +++  + V ++E+   L  L 
Sbjct: 256 STTQRLGQRQPFRSNQYSVTEQHRVLKEGARGIPGIYFKFDIESIGVSVSEEHPPLSRLL 315

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKT 375
            + +C I G  +       +LHS +  I +   G KT
Sbjct: 316 IR-LCGIVGGIVA---ASGMLHSFIGWIIRTVSGNKT 348


>gi|58261152|ref|XP_567986.1| ER to Golgi transport-related protein [Cryptococcus neoformans var.
           neoformans JEC21]
 gi|134115843|ref|XP_773404.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|50256029|gb|EAL18757.1| hypothetical protein CNBI2490 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gi|57230068|gb|AAW46469.1| ER to Golgi transport-related protein, putative [Cryptococcus
           neoformans var. neoformans JEC21]
          Length = 431

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 93/383 (24%), Positives = 156/383 (40%), Gaps = 47/383 (12%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  K+  GG +T +  L I  L+  D+ +Y   +      VDS    
Sbjct: 32  IKRFDAFPKVESTYTIKSRRGGVLTALVGLIIFLLVLNDLGEYLYGAPDYAFQVDSEVQK 91

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D+ V  + C YL +D  D+ G++ LH+ ++  K     DG             V
Sbjct: 92  DLQLNVDLTV-AMPCRYLTIDLRDAVGDR-LHLSNSFAK-----DGTHFN---------V 135

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N ++TT    P+       +   T    ++ + +K  +     A     T  
Sbjct: 136 GTATFIKNNPSSTT----PSASEIISSSRRRTPNQQSSFSGIKRLFGLDSSASSNRRT-S 190

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQPYTSA 244
           Q    Y     K      C+IYG +EV +V+ + HI   G  Y S  H   H        
Sbjct: 191 QGHTAYRPTYDKVQDGPACRIYGSVEVKKVTANLHITTLGHGYMSFQHTDHH-------- 242

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-----ERL 299
             N +H +   SFG          +PLD +    E+  ++F Y+++++PT Y      +L
Sbjct: 243 LMNLSHVVHEFSFGPFFP---AIAQPLDQSYEITEQPFTIFQYFLRVVPTTYIDASRRKL 299

Query: 300 DGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
             S+    D         G+PG+FF Y+L P+ V I E++ SL     ++   + G +  
Sbjct: 300 ITSQYAVTDYSRSFEHGKGVPGLFFKYDLEPMSVIIRERTTSLYQFLIRLAGVVGGVWTV 359

Query: 352 FMLVDALLHSCVKKISKVEIGGK 374
                 + +   K +SK  +G K
Sbjct: 360 AAFALRVFNRAQKHVSKAVMGEK 382


>gi|340914937|gb|EGS18278.1| hypothetical protein CTHT_0063020 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 388

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 149/368 (40%), Gaps = 66/368 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
            +  DAF K    +  +T  GG  T+   L    L   ++  +++ +      V+     
Sbjct: 29  FQAFDAFPKTKSQYTTRTSGGGKWTVAMSLIALILFWAELSRWWRGTEEHTFAVEKGVAR 88

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL---DGKPIQEPQKEVV 123
            L I+LDIVV  + C  L ++  D++G++ L  E       + +   DGK +    ++V 
Sbjct: 89  TLDINLDIVV-RMRCADLHVNVQDAAGDRILAAERLTRDPTMWVQWVDGKGVHRLGRDV- 146

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
               + +V T  G    E          +G E       +  + V    +  KWA     
Sbjct: 147 ----QGRVVTGEGWVEDE---------GFGEE-------HVHDIVALGRKKAKWA----- 181

Query: 184 TIVQCKNEYSTEKL--KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
                     T KL  +    + C+IYG LE+N+V G FHI A G  Y    +   + Q 
Sbjct: 182 ---------KTPKLPPRGGQADSCRIYGSLELNKVQGDFHITARGHGY----LEGGNAQH 228

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
              +AFN +H I  LSFG  L        PLD TV  A      F Y++ I+PT Y    
Sbjct: 229 LDHSAFNFSHIISELSFGPFLP---SLSNPLDRTVNLASHHFHRFQYFLSIVPTTYSVGR 285

Query: 301 GSKLGG-----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             ++G                   +  +PGIFF Y++ P+++ I E   S+     K++ 
Sbjct: 286 PGEMGSQSIFTNQYAVTEQSHPVSERNIPGIFFKYDIEPILLNIVETRDSVFKFLVKVVN 345

Query: 344 NISGTYIT 351
            +SG  + 
Sbjct: 346 IVSGVLVA 353


>gi|190346055|gb|EDK38054.2| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 407

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 99/398 (24%), Positives = 158/398 (39%), Gaps = 89/398 (22%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
            FS R+K  DAF K       ++  GG  TI+  +FI +++ V +  +       +  VD
Sbjct: 64  AFSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFVVD 123

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
               S L I+LD+ V  + C++L  + +D + ++ L  E       L+  G     P   
Sbjct: 124 DQVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLASE------VLNFQGSYFFVP--- 173

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
             + ++    TT+  T                                          PE
Sbjct: 174 --DLIRMNDATTDYET------------------------------------------PE 189

Query: 182 LDTIVQCKNEYSTEKLKNTFTE---GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
           L+ I+     Y  ++      E    C I+G + VN+VSG FHI A G+ Y  +  HV D
Sbjct: 190 LEEIMLEAGRYEFDREGYHEAESAPACHIFGSIPVNQVSGDFHITAKGMGYR-DRAHV-D 247

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
            Q     A N +H I   SFG   +     + PLD T    ++    + YY K++PT+YE
Sbjct: 248 PQ-----ALNFSHIIAEFSFG---EFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYE 299

Query: 298 RL----DGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           R+    D ++               G   G+PGIFF YE   + + +++K         +
Sbjct: 300 RMGLQVDTNQYSITESHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTSFVAR 359

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
           +   I G +I    V   L    +K+ K+  G K  TK
Sbjct: 360 LATIIGGVFI----VAGYLFRLYEKLLKILFGKKYATK 393


>gi|302882273|ref|XP_003040047.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256720914|gb|EEU34334.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 376

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 153/360 (42%), Gaps = 57/360 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+   +    LI  +   +++ + +    V++  G 
Sbjct: 23  VSAFDAFPKSKPQYIQRTSGGGKWTVAVSIISLILIWGEAARWWRGAESHNFEVEAGVGR 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD---LDGKPIQEPQKEVV 123
           +L I+LDIVV  + CD + ++  D+SG++ +  +   + + L    +D K + +  ++  
Sbjct: 83  ELQINLDIVV-RMQCDDIHVNVQDASGDRIMAAKRLRHDKTLWSQWVDSKGMHKLGRD-- 139

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
               + +V T++G                  E E     +  + V    +  KWA     
Sbjct: 140 ---SQGRVVTQSGWNDLG------------YEEEGFGEEHVHDIVALGRKKAKWA----- 179

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
                     T K+K    + C++YG L +N+V G FHI A G  Y  N  H+       
Sbjct: 180 ---------KTPKVKGR-ADSCRVYGSLHLNKVQGDFHITARGHGYMGNGEHL------D 223

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
              FN +H I  LS+G           PLDGTV  A +    F YY+ I+PT+Y     S
Sbjct: 224 HKNFNFSHIISELSYGPFYP---SLVNPLDGTVNAASDNFHKFQYYLSIVPTVYSVGSRS 280

Query: 303 KLGG-----------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
            L              +  +PGIFF Y++ P+++ + E    +     KI+  +SG  + 
Sbjct: 281 ILTNQYAVTEQSKSVNEHYIPGIFFKYDIEPILLTVHESRDGILTFLVKIINIVSGVLVA 340


>gi|85101064|ref|XP_961083.1| hypothetical protein NCU04293 [Neurospora crassa OR74A]
 gi|11611445|emb|CAC18610.1| conserved hypothetical protein [Neurospora crassa]
 gi|28922621|gb|EAA31847.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 379

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 58/361 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L    L   +   +++ S +    V+     
Sbjct: 22  VSAFDAFPKSKPQYVTRTTAGGKWTVFVGLISFILFWSEASRWWRGSESHTFAVEKGVSH 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
            L I+LDIVV  + C  + ++  D++G++      LH +  +++  +D   K I +  ++
Sbjct: 82  ALDINLDIVV-KMKCQDIHINVQDAAGDRILAASRLHRDPTVWQHWVD--NKGIHKLGRD 138

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                 + KV T  G    +  D       +G E       +  + V    R  KWA   
Sbjct: 139 A-----QGKVVTGEGYMQGQGHDEG-----FGEE-------HVHDIVSLGRRKAKWA--- 178

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
                       T +L     + C+++G LE+N+V G FHI      +  H ++   Q  
Sbjct: 179 -----------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQHL 222

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
             +AFN +H I  LSFG  L        PLD TV  A      F Y+I ++PT+Y     
Sbjct: 223 DHSAFNFSHIISELSFGPFLP---SLVNPLDQTVNIASANFHKFQYFISVVPTVYSSSGK 279

Query: 302 SKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           S +              +  +PGIF  Y++ P+++ I E+  S      K++  ISG  +
Sbjct: 280 SIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLHIDEERDSFLVFIIKVVNVISGALV 339

Query: 351 T 351
            
Sbjct: 340 A 340


>gi|348529156|ref|XP_003452080.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oreochromis niloticus]
          Length = 379

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/384 (25%), Positives = 153/384 (39%), Gaps = 86/384 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y +     E  VD    S
Sbjct: 13  VKELDAFPKVSESYVETSASGGTVSLLAFSAMALLAVLEFFVYRETWMKYEYSVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVD-------SSGEQH------LHVEHNIYKRRLDLDGK 113
           KL I++DI V  + C ++  D +D       S+G Q+      L  +  +++R L L   
Sbjct: 73  KLRINIDITV-AMKCQHVGADILDLAETMITSNGLQYEPVIFELTPQQRLWQRTLLLIQN 131

Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
            ++E  +  +  V  K +                     GA T                 
Sbjct: 132 RLRE--EHALQEVLYKTLLK-------------------GAPT----------------- 153

Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV 233
               ALP        + + S E L       C+I+G++ VN+V+G+ HI  G        
Sbjct: 154 ----ALPP-------REDASMEPLN-----ACRIHGHVYVNKVAGNLHITVGKPIHHPQG 197

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           H H     +   +N +H I HLSFG +L        PLDGT         MF Y+I ++P
Sbjct: 198 HAHIAAFVSHETYNFSHRIDHLSFGEELPGII---NPLDGTEKITYNNNQMFQYFITVVP 254

Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           T               + ER        G  G+ GIF  Y+ S LMV ++E+   L    
Sbjct: 255 TKLNTYKISADTHQFSVTERERVINHAAGSHGVSGIFVKYDTSSLMVTVSEQHMPLWQFL 314

Query: 339 TKIMCNISGTYITFMLVDALLHSC 362
            ++   I G + T  ++  L+  C
Sbjct: 315 VRLCGIIGGIFSTTGMLHGLVGFC 338


>gi|146421059|ref|XP_001486481.1| hypothetical protein PGUG_02151 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 407

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/397 (24%), Positives = 158/397 (39%), Gaps = 89/397 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS R+K  DAF K       ++  GG  TI+  +FI +++ V +  +       +  VD 
Sbjct: 65  FSTRVKTFDAFPKLNSQHAVRSQRGGLSTIMTVVFILFVMWVQIGGFLGGYVDHQFVVDD 124

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I+LD+ V  + C++L  + +D + ++ L  E       L+  G     P    
Sbjct: 125 QVRSDLRINLDMKV-AMPCEFLHTNVMDITDDRFLASE------VLNFQGSYFFVP---- 173

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
            + ++    TT+  T                                          PEL
Sbjct: 174 -DLIRMNDATTDYET------------------------------------------PEL 190

Query: 183 DTIVQCKNEYSTEKLKNTFTE---GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           + I+     Y  ++      E    C I+G + VN+VSG FHI A G+ Y  +  HV D 
Sbjct: 191 EEIMLEAGRYEFDREGYHEAESAPACHIFGSIPVNQVSGDFHITAKGMGYR-DRAHV-DP 248

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
           Q     A N +H I   SFG   +     + PLD T    ++    + YY K++PT+YER
Sbjct: 249 Q-----ALNFSHIIAEFSFG---EFYPLIKNPLDFTGKTTDDHFQAYKYYAKVVPTLYER 300

Query: 299 L----DGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
           +    D ++               G   G+PGIFF YE   + + +++K         ++
Sbjct: 301 MGLQVDTNQYSITELHRKYELNTNGRIQGVPGIFFKYEFEAIKLIVSDKRIPFTLFVARL 360

Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
              I G +I    V   L    +K+ K+  G K  TK
Sbjct: 361 ATIIGGVFI----VAGYLFRLYEKLLKILFGKKYATK 393


>gi|336472105|gb|EGO60265.1| hypothetical protein NEUTE1DRAFT_56465 [Neurospora tetrasperma FGSC
           2508]
 gi|350294686|gb|EGZ75771.1| DUF1692-domain-containing protein [Neurospora tetrasperma FGSC
           2509]
          Length = 379

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 149/361 (41%), Gaps = 58/361 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L    L   +   +++ S +    V+     
Sbjct: 22  VSAFDAFPKSKPQYVTRTTAGGKWTVFVALVSFILFWSEASRWWRGSESHTFAVEKGVSH 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
            L I+LDIVV  + C  + ++  D++G++      LH +  +++  +D   K I +  ++
Sbjct: 82  ALDINLDIVV-KMKCQDIHINVQDAAGDRILAASRLHRDPTVWQHWVD--NKGIHKLGRD 138

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                 + KV T  G    +  D       +G E       +  + V    R  KWA   
Sbjct: 139 A-----QGKVVTGEGYMQGQGHDEG-----FGEE-------HVHDIVSLGRRKAKWA--- 178

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY 241
                       T +L     + C+++G LE+N+V G FHI      +  H ++   Q  
Sbjct: 179 -----------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQHL 222

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
             +AFN +H I  LSFG  L        PLD TV  A      F Y+I ++PT+Y     
Sbjct: 223 DHSAFNFSHIISELSFGPFLP---SLVNPLDQTVNIASANFHKFQYFISVVPTVYSSSGK 279

Query: 302 SKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           S +              +  +PGIF  Y++ P+++ I E+  S      K++  ISG  +
Sbjct: 280 SIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLNIEEERDSFLVFIIKVVNVISGALV 339

Query: 351 T 351
            
Sbjct: 340 A 340


>gi|255944653|ref|XP_002563094.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211587829|emb|CAP85889.1| Pc20g05600 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 396

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/382 (24%), Positives = 155/382 (40%), Gaps = 79/382 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   T  GG  T++  +  +     +   +++ +      V+     
Sbjct: 23  LKTFDAFPKTKAAYTTPTRSGGQWTVLILIICTIFSWSEFKTWWRGTENYHFSVEKGVSH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L ++LD+VV  + CD L ++  D++G++ L  E  + KR        +Q+   E  + V
Sbjct: 83  ELQLNLDMVV-HMPCDQLRVNIQDAAGDRILAGE--LLKRDDTNWLLWMQKRNHETSDGV 139

Query: 127 KK-KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            + + ++ E      E E     G   G             EV+   R K    P L   
Sbjct: 140 HEYQTLSHEEADRLAEQEADAHVGHVLG-------------EVRRNPRRKFEKGPRLR-- 184

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
                       +    + C+IYG LE N+V G FHI A G  Y  N  H+        +
Sbjct: 185 ------------RGVVADACRIYGSLEGNKVQGDFHITARGHGYRENAPHL------DHS 226

Query: 245 AFNTTHHIRHLSFGIK---LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
           +F+ +H I  LSFG     LQ+      PLD T+A+ EE    F Y++ ++PT+Y R  G
Sbjct: 227 SFDFSHMITELSFGPHYPTLQN------PLDKTIAETEEHYYKFQYFLSVVPTLYSRGKG 280

Query: 302 --------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
                                           S +      +PGIFF Y + P+++ ++E
Sbjct: 281 ALDAYTRSPDAAASRYGRDTVFTNQYAATSQSSAIPESPMVVPGIFFKYNIEPILLLVSE 340

Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
           +  S   L  +++  ISG  +T
Sbjct: 341 ERASFLSLLVRVINTISGVLVT 362


>gi|402085784|gb|EJT80682.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 379

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 89/365 (24%), Positives = 149/365 (40%), Gaps = 61/365 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   +  + L   ++  +++   T    V+   G 
Sbjct: 21  VSAFDAFPKSKPQYVTRTAGGGKWTVAMLVISAVLTWSELARWWRGVETHTFAVEKGVGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LD+VV  + CD L ++  D++G++ L         RL +D  P    Q    N V
Sbjct: 81  SMQINLDVVV-HMKCDDLHVNVQDAAGDRILAAS------RLKMD--PTAWAQWVDGNGV 131

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K      N   T E  + +     +G E       +  + V    +  +W         
Sbjct: 132 HKLGRDKHNRLITNEGFEHDGHDEGFGEE-------HVHDIVALGKKRARWG-------- 176

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSA 244
                  T +L  +  + C+++G L++N+V G FHI A G  Y     H+ HD       
Sbjct: 177 ------KTPRLWGSTADSCRLFGSLDLNKVQGDFHITARGHGYMEFGEHLDHD------- 223

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
           AFN TH I   SFG   +       PLD T+  A      F Y++ ++PT+Y  +  S  
Sbjct: 224 AFNFTHIINEFSFG---EFYPSLVNPLDRTINGANTHFHKFQYFLSVVPTVYS-VKSSAG 279

Query: 305 GGG------------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
           G G                  +  +PGIFF Y++ P+++ I E   +      K++  +S
Sbjct: 280 GFGSTIFTNQYAVTEQNAEISERAIPGIFFKYDIEPVLLNIEESRDTFLLFLVKVVNILS 339

Query: 347 GTYIT 351
           G  + 
Sbjct: 340 GAMVA 344


>gi|255563175|ref|XP_002522591.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
 gi|223538182|gb|EEF39792.1| Endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Ricinus communis]
          Length = 191

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 90/182 (49%), Gaps = 25/182 (13%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF--NTTHHIRHLSFGIK 260
           EGC++YG L+V RV+G+FHI      S++ +++   Q     A   N +H I  LSFG K
Sbjct: 13  EGCRVYGVLDVQRVAGNFHI------SVHGLNIFVAQMIFDGAIHVNVSHIIHDLSFGPK 66

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG--------------SKLGG 306
                    PLDGT     + +  F YYIKI+PT Y  +                S +  
Sbjct: 67  FPG---LHNPLDGTARILHDASGTFKYYIKIVPTEYRYISKEVLPTNQFSVTEYFSPMSE 123

Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            D   P ++F Y+LSP+ V I E+ +S  H  T++   + GT+    ++D  ++  ++ +
Sbjct: 124 YDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFALTGMLDRWMYRLLEAV 183

Query: 367 SK 368
           +K
Sbjct: 184 TK 185


>gi|226479782|emb|CAX73187.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Schistosoma japonicum]
          Length = 410

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 85/380 (22%), Positives = 158/380 (41%), Gaps = 58/380 (15%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           LD F K  ++  + T  GG +TI+ +  IS+L+  +  DY          +D     K+ 
Sbjct: 26  LDVFPKLPKECKKSTWGGGLLTILTFCCISWLLVNEFRDYLDPPVKYSYEIDKDISGKIK 85

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           +++DIVV +  C  +++D VD++G                    P+   +K         
Sbjct: 86  VNIDIVVAS-PCHAISMDVVDTTGS-------------------PLFGEEK--------- 116

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
               E  +T  +L  P +      A  + +       E   A ++  W       +    
Sbjct: 117 ---IEYISTVFDLSPPARV-----AFKKRQYVAGALREKHHAIQHWLWKYASDTNVFTNF 168

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAA-FN 247
           NE  T+       + C+I G L V +V G+ HI  G     + ++H+H + P+ S    N
Sbjct: 169 NEPDTQVSGGRNPDACRIVGTLFVKKVEGNIHILLGKPLEGLGNLHLH-VAPFLSKTNLN 227

Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL-- 304
            +H I H SFG  +   + +  PL+   +     ++ F Y++ ++PT +  +   ++   
Sbjct: 228 FSHRINHFSFGDLV---NGQIHPLEAIESITAVASTSFQYFVTMVPTKVVNQFHVTETYQ 284

Query: 305 ------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
                            G+PGIFF Y+  PL+VKIT   + LG  +T++     G + T 
Sbjct: 285 YAATVQNRTIDHASDSHGIPGIFFIYDTFPLVVKITYDRELLGTFFTRLAALAGGIFATI 344

Query: 353 MLVDALLHSCVKKISKVEIG 372
           + +  +L +  + + +  +G
Sbjct: 345 IYLREMLSNLPEILLRTRLG 364


>gi|448521200|ref|XP_003868450.1| Erv41 protein [Candida orthopsilosis Co 90-125]
 gi|380352790|emb|CCG25546.1| Erv41 protein [Candida orthopsilosis]
          Length = 352

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 90/371 (24%), Positives = 144/371 (38%), Gaps = 86/371 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS+R+K  DAF K       ++  GG  T++ + F   ++ V++  Y       +  VD 
Sbjct: 4   FSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFFGLLILWVEIGGYIGGYVDRQFIVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I+LD++V  + C++L  +AVD +G++ L  E       L+ +G     P    
Sbjct: 64  VLRSDLTINLDMIV-AMPCEFLHTNAVDIAGDRFLAGE------TLNFEGLKFFIPSGFS 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N                   +PN                                 P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129

Query: 183 DTIVQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D ++Q        +L     EG   C I+G + VN+V G F I A GL Y        D 
Sbjct: 130 DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGEFRITAKGLGYK-------DR 182

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                 A N +H I+  S+G           PLD T    EE   ++ Y+ K++PT+YE+
Sbjct: 183 SFVPVEALNFSHVIQEFSYGDFFP---FLNNPLDATGKVTEENLQIYLYHSKVVPTLYEK 239

Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           L    D ++    +               G+PGI+F+YE  P+ + I EK         K
Sbjct: 240 LGLEVDTTQYSLTENHHIVKVNPHSKKPQGIPGIYFAYEFEPIKLIIREKRIPFLQFIAK 299

Query: 341 IMCNISGTYIT 351
           +   + G  + 
Sbjct: 300 LGTIVGGIIVA 310


>gi|449476586|ref|XP_004154778.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 3-like [Cucumis sativus]
          Length = 140

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 51/126 (40%), Positives = 75/126 (59%), Gaps = 1/126 (0%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
            +L+ LDA+ K  EDF+ +T  GG +T+    F+ +L   ++  Y    T  +L VD+SR
Sbjct: 6   NKLRNLDAYPKINEDFYRRTFSGGLITLASSFFMLFLFFSELRMYLHAKTETQLVVDTSR 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G +L I+ D+  P I C  L+LDA+D SGEQHL + HNI K+R+D  G  I E + + + 
Sbjct: 66  GGELHINFDLSFPAIPCSILSLDAIDISGEQHLDIRHNIIKKRIDHLGTVI-EARPDGIG 124

Query: 125 AVKKKK 130
           A K  K
Sbjct: 125 APKVSK 130


>gi|145479237|ref|XP_001425641.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124392712|emb|CAK58243.1| unnamed protein product [Paramecium tetraurelia]
          Length = 326

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/329 (25%), Positives = 144/329 (43%), Gaps = 59/329 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQV---STTEELFVDSS 63
           ++ ++ FTK   +  +KT  GG + +V    + +LI  ++   FQ+   ST +   VD  
Sbjct: 1   MQYINLFTKSKVE-TKKTTCGGILALVTIFSVGFLIIGEIIRSFQLEVLSTIDTTNVDE- 58

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
              ++ ++L+I V  ++C  L+LD  D +G     +E+ I+K R+  DG+ I +   E V
Sbjct: 59  ---RIRVNLNITVHDMTCFALSLDQQDVTGTHLEDMEYTIHKLRIR-DGRFINKEYAENV 114

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
                     E         + N+   CYGA+    + C TC +V  AY  + W LP  +
Sbjct: 115 KLF-------EQSLYHWNWHNANEVNDCYGAQLFEGQKCITCQDVLLAYASRDWPLPRKE 167

Query: 184 TIVQCKNEYSTEK---------------------------LKNTFTEGCQIYGYLEVNRV 216
           +I QCK  Y  +                            +  T+ E CQI+G+  + R+
Sbjct: 168 SIQQCKYSYIQQNGRRVLFTEDFGEERRGQQYIDMNDLTAMAFTYGESCQIFGHFYIKRI 227

Query: 217 SGSFHIA-PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR-----KP 270
            G+FHI+  G   +++ +   DIQ         +H I  L F  + Q     R       
Sbjct: 228 PGNFHISFHGKGQAVSLIS-QDIQ--------LSHTINWLEFTPQKQGPTFGRYFKTTNT 278

Query: 271 LDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           LDGT  + ++      YY+K++ + YE L
Sbjct: 279 LDGTTHQLKQKEDT-QYYLKLVESHYETL 306


>gi|346970151|gb|EGY13603.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium dahliae VdLs.17]
          Length = 373

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/388 (21%), Positives = 151/388 (38%), Gaps = 70/388 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+   +    L   ++  +++ S +    V+   G 
Sbjct: 20  VSAFDAFPKSKPQYVQRTSGGGKWTVAMAVISVMLFWSELGRWWRGSESHTFAVEKGVGH 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + C+ L ++  D+SG+                           ++ A 
Sbjct: 80  DLQVNLDIVV-KMRCEDLHVNVQDASGD---------------------------LILAA 111

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K +    +     ++   +K G       ET    +      E + +        D + 
Sbjct: 112 TKLREEITSWHQWADMTGNHKLGRSPSGRIETNSGYHLDEGFGEEHVH--------DIVA 163

Query: 187 QCKNEYS---TEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
           Q K       T +L+    + C+I+G L++N+V G FHI A G  Y     H+       
Sbjct: 164 QSKKRQKWARTPRLRGP-PDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL------D 216

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
             +FN +H +  LSFG    + +    PLD TV  A      F YY+ I+PT+Y     +
Sbjct: 217 HTSFNFSHIVNELSFGAFYPNLE---NPLDRTVNLAPANFHKFQYYLSIVPTVYTVGRSA 273

Query: 303 KLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                               GD  +PG+F  Y++ P+++ + E        W K++  +S
Sbjct: 274 SKANTVYTNQFAVTEQSKEVGDHSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVINVLS 333

Query: 347 GTYIT----FMLVDALLHSCVKKISKVE 370
           G  +     F L +    +  KK  + +
Sbjct: 334 GVLVAGHWGFTLSEWFKENWAKKKERTQ 361


>gi|145349688|ref|XP_001419260.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579491|gb|ABO97553.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 310

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 80/368 (21%), Positives = 144/368 (39%), Gaps = 82/368 (22%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           +DAF +      ++T  G  V++V  +    L  V++ D+   +  +   VD +R + L 
Sbjct: 1   VDAFARAAPHLTKRTRAGACVSVVGVVLACALALVEITDFLTPTRAKTHGVDDARNATLR 60

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           I +D+  P + C  L +DA D SG+  +     + K RLD  G+ I              
Sbjct: 61  IEIDVTFPRMPCQLLYVDAYDESGKHEVDARGLLLKTRLDASGRAIG------------- 107

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
                        E  +  G   G     ++     +EV+EA                  
Sbjct: 108 -------------EYESAGGVDLGGLVLFQRRPEHAHEVREA------------------ 136

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTT 249
                        EGC+++G LE  RV+G+   + G         ++D +P+     +  
Sbjct: 137 ---------KADVEGCRLHGELEARRVAGTLRASTGPESYEFLKEIYD-EPW---EIDMR 183

Query: 250 HHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------------ 297
           H ++  +FG +         P++G V + E  + ++ Y++K++PT Y             
Sbjct: 184 HAVKTFTFGAEFPGAV---NPMNG-VRRMETKSGIYKYFMKVVPTTYSSTRALFGFIPWT 239

Query: 298 -RLDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
            R   ++    +        G +P +FF Y+LS + V IT  SKS+ +  TK +  + G 
Sbjct: 240 VRTRTNQYSVTEHFIETPHWGALPQLFFIYDLSAIAVNITVTSKSIVYFLTKTLATMGGI 299

Query: 349 YITFMLVD 356
           +     VD
Sbjct: 300 FALTRTVD 307


>gi|225717192|gb|ACO14442.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Esox lucius]
          Length = 379

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 144/375 (38%), Gaps = 68/375 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E T  GG V+++ +  ++ L   +   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETTASGGTVSLIAFTAMALLAFFEFFVYRDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C ++  D +D +                                  
Sbjct: 73  KLRINIDITV-AMKCQHVGADILDLA---------------------------------- 97

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK----KWALPEL 182
             + + T NG       +P   G     +   R      N ++E +  +    K  L   
Sbjct: 98  --ETMITSNGIQY----EPVVFGLTPEQKLWHRTLLLIQNRLREEHSLQEVLYKSVLKGA 151

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
            T +  +   ++E L       C+I+G++ VN+V+G+FHI  G        H H     +
Sbjct: 152 PTALPPREVATSEPLG-----ACRIHGHVYVNKVAGNFHITVGKPIHHPRGHAHIAAFVS 206

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-------- 294
              +N +H I H SFG   ++      PLDGT         MF Y+I ++PT        
Sbjct: 207 HDTYNFSHRIDHFSFG---EEIPGIINPLDGTEKVTTNNNHMFLYFITVVPTKLHTSKVS 263

Query: 295 -------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                  + ER        G  G+ GIF  Y+ S LMV ++E+   L     ++   I G
Sbjct: 264 ADTHQFSVTERERVINHAAGSHGVSGIFMKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGG 323

Query: 348 TYITFMLVDALLHSC 362
            + T  ++   +  C
Sbjct: 324 IFSTTGMIHGFVGFC 338


>gi|380016475|ref|XP_003692209.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Apis florea]
          Length = 392

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 149/378 (39%), Gaps = 74/378 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + +KT  GG  +I     I+YLI  +             ++DS    
Sbjct: 12  VKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAET----------SYYLDSRLQF 61

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLD---GKPIQEPQKEVV 123
           K     DI            DA                K ++++D     P      +V+
Sbjct: 62  KFETDTDI------------DA----------------KLKINIDITVAMPCGRIGADVL 93

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           ++  +  V    G  + E ED     + +    E R             R +  A+ EL 
Sbjct: 94  DSTNQNMV----GHESLEQED-----TWWELTQEQRSHFEALKHTNSYLREEYHAIHELL 144

Query: 184 TIVQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI 238
                   YS E  K T         C+I+G L VN+V+G+FHI  G S SI   H+H  
Sbjct: 145 WKSNQVTLYS-EMPKRTHQPIYAPNACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHIS 203

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE 297
              T   +N TH I   SFG           PL+G    A+    ++ Y+++++PT I  
Sbjct: 204 AFMTEKDYNFTHRINKFSFG---GPSPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQT 260

Query: 298 RLDGSKL--------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            L  SK                 G  G PGIFF Y++S L +K+T++  ++     K+  
Sbjct: 261 LLSTSKTYQYSVKDHQRPINHQKGSHGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCA 320

Query: 344 NISGTYITFMLVDALLHS 361
            + G ++T  LV  ++ S
Sbjct: 321 TVGGIFVTSGLVKNIVQS 338


>gi|326672443|ref|XP_003199668.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Danio rerio]
          Length = 365

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 56/178 (31%), Positives = 84/178 (47%), Gaps = 18/178 (10%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
            C+I+G + VN+V+G+FHI  G     +  H H         +N +H I HLSFG    D
Sbjct: 170 ACRIHGKIYVNKVAGNFHITLGKPIETHKGHAHYASFIKDEVYNFSHRIDHLSFG---ND 226

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
                 PLDG      E  ++F Y+I ++PT               + ER        G+
Sbjct: 227 VPGHINPLDGMEKTTLEQNTLFQYFITVVPTKLHTSNVSVDMHQFSVTERERVVSNEKGN 286

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G+ GIFF Y+LSPLMV+++E+   L     ++   + G + T  L+  L+ S V  I
Sbjct: 287 QGVSGIFFKYKLSPLMVRVSEEHMPLAAFLVRLCGIVGGIFSTSDLLHRLIGSFVDII 344



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 43/87 (49%), Gaps = 1/87 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E +   + +GG VT+  ++ ++ L   +   Y       E  VD    S
Sbjct: 15  IKNLDAFPKVPESYVATSAFGGTVTLTVFILMALLTISEFFVYQDTWMKYEYEVDRDFTS 74

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSG 93
           KL I +DI V  + C+ L  D +D +G
Sbjct: 75  KLKIKIDITV-AMKCERLGADVLDIAG 100


>gi|66500700|ref|XP_395190.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Apis mellifera]
          Length = 389

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 54/173 (31%), Positives = 84/173 (48%), Gaps = 18/173 (10%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
            C+I+G L VN+V+G+FHI  G S SI   H+H     T   +N TH I   SFG     
Sbjct: 169 ACRIHGSLNVNKVAGNFHITAGKSLSIPKGHIHISAFMTEKDYNFTHRINKFSFG---GP 225

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERLDGSKL--------------GGGD 308
                 PL+G    A+    ++ Y+++++PT I   L  SK                 G 
Sbjct: 226 SPGIVHPLEGDEKIADNNMLLYQYFVEVVPTDIQTLLSTSKTYQYSVKDHQRPINHQKGS 285

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
            G PGIFF Y++S L +K+T++  ++     K+   + G ++T  LV  ++ S
Sbjct: 286 HGSPGIFFKYDMSALKIKVTQQRDTVCQFLVKLCATVGGIFVTSGLVKNIVQS 338



 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 44/88 (50%), Gaps = 1/88 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + +KT  GG  +I     I+YLI  +   Y       +   D+   +
Sbjct: 12 VKELDAFPKVPEPYVDKTAVGGTFSIFTICTIAYLIIAETSYYLDSRLQFKFETDTDIDA 71

Query: 67 KLPIHLDIVVPTISCDYLALDAVDSSGE 94
          KL I++DI V  + C  +  D +DS+ +
Sbjct: 72 KLKINIDITV-AMPCGRIGADVLDSTNQ 98


>gi|281206876|gb|EFA81060.1| DUF1692 family protein [Polysphondylium pallidum PN500]
          Length = 344

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 62/231 (26%), Positives = 107/231 (46%), Gaps = 40/231 (17%)

Query: 22  EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISC 81
           +KTVYGG +T +C +F  +L+C ++  Y        L VD +RG++L I++DI  P++ C
Sbjct: 116 QKTVYGGVITAICMIFTMFLLCSELYYYTFPIRDHSLKVDVTRGNRLLINIDIHFPSLIC 175

Query: 82  DYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTE 141
             + ++++D                   +DG+PI++   ++V     ++    NG     
Sbjct: 176 SDINVESIDG------------------IDGRPIKDASYQIV-----RERLDRNGVVIDP 212

Query: 142 LEDPN---KCGSCYGAETE------TRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEY 192
              P    +C SC             ++CCN C++++E YR  K      D   QC    
Sbjct: 213 SNPPPGFFECVSCRLPANSKYAVLYPQRCCNKCDDLREFYRTNKIPQHYADQSPQCM--I 270

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---HDIQP 240
           S  + ++   EGC+IYG L V ++ G  HI  G+    N   +   +D+ P
Sbjct: 271 SDPEAED---EGCRIYGTLWVQKMKGDIHILAGIRPGYNAPGIYFKYDLSP 318



 Score = 38.5 bits (88), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 17/36 (47%), Positives = 24/36 (66%), Gaps = 1/36 (2%)

Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           PGI+F Y+LSPLM+++ + SK    L T + C I G
Sbjct: 308 PGIYFKYDLSPLMIEVDQSSKPFVELVTSV-CAIGG 342


>gi|66360024|ref|XP_627190.1| ERV41 like membrane associated protein involved in vesicular
           transport with a transmembrane region near the
           C-terminus [Cryptosporidium parvum Iowa II]
 gi|46228832|gb|EAK89702.1| ERV41 like membrane associated protein involved in vesicular
           transport with a transmembrane region near the
           C-terminus [Cryptosporidium parvum Iowa II]
          Length = 403

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 86/405 (21%), Positives = 164/405 (40%), Gaps = 71/405 (17%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SR 64
           ++K  DAF+KP  +F  KT +GG +TI+  + +  L   ++  Y  ++  +E+ VD  S 
Sbjct: 15  KMKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSS 74

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
              + + + +  P + CD L +  ++    + +++           DG            
Sbjct: 75  NRNINLRMQLEFPKLPCDILGVRIINLQENKEIYLP----------DG------------ 112

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWALP 180
            ++  K+    G+  +     + CG CY A    +     CCNTC ++   Y  K   LP
Sbjct: 113 GIEFVKI----GSNESNANSSSGCGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKLP 168

Query: 181 ELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            + +  QC  + S +++ N       +EGC+I     + +V G   I+      + +  +
Sbjct: 169 HVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKEM 225

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SMF 285
            D++   S  FN ++ + +L FG +L     R K  +   +   E            +  
Sbjct: 226 TDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAYI 285

Query: 286 NYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSPL 323
           ++ +  IPT Y  ++                     L  G    D  +PGI  +Y+ +P 
Sbjct: 286 DFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTPF 345

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           +VKITE  +S     T+    I G +    ++D      +  ++K
Sbjct: 346 LVKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 390


>gi|307105802|gb|EFN54050.1| hypothetical protein CHLNCDRAFT_136126 [Chlorella variabilis]
          Length = 319

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 83/388 (21%), Positives = 147/388 (37%), Gaps = 103/388 (26%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTI--VCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           +L  L AF+   E    +T++G  VTI  VC   + ++  V  C    V   +++ VD+S
Sbjct: 7   KLSHLTAFSHAQEHLRVQTIHGAIVTIIGVCVALVLFISEVQQC--MVVKRVQDMRVDTS 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSG----EQHLHVEHN--IYKRRLDLDGKPIQE 117
           R  +L +  ++  P + C+ L +DA D SG    E  + V  N  ++K  +D+ G+ +  
Sbjct: 65  RREELHVSFNVTFPALPCEALLMDAGDVSGKWQTESRMKVAKNGEVHKHSVDISGRWL-- 122

Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
                       ++      +  E ++P +     GA  +  + CN              
Sbjct: 123 ------------RLAEYTAPSEGEWDNPFEMNEI-GAALKRHEGCN-------------- 155

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA---PGLSYSINHVH 234
                                        I+G+LEV RV+G+ H A     L  S+N   
Sbjct: 156 -----------------------------IHGWLEVQRVAGNVHFAVRPEALFLSMNAEA 186

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           +  + P  ++  N +H                   PL+G          +  Y++K++PT
Sbjct: 187 IMQLHP-DASKLNISH-----------------ANPLEGVAQIDRTATGIDKYFVKVVPT 228

Query: 295 IYERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
            +  L G K                GG+   P ++  Y+ SP+MV I E    L  L  +
Sbjct: 229 DFYTLWGRKTHTYQYSVTEYYHQFRGGEEQPPAVYLLYDASPIMVDIREMRPGLLRLLVR 288

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISK 368
           +   + G +    L D ++H  V  + +
Sbjct: 289 VCAVVGGAFALTGLFDKMVHRAVVAVKR 316


>gi|380492334|emb|CCF34678.1| hypothetical protein CH063_01185 [Colletotrichum higginsianum]
          Length = 377

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/361 (24%), Positives = 144/361 (39%), Gaps = 58/361 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   +   +L   +V  +++ S T    V+   G 
Sbjct: 23  VSAFDAFPKAKPQYVTRTSGGGKWTVAMTVISVFLFWTEVGRWWRGSETHTFAVEKGIGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ I+LDIVV  + CD L ++  D++G++ L    ++ KR      + +       +   
Sbjct: 83  EMQINLDIVV-RMHCDDLHINVQDAAGDRIL--AGSMLKRDKTNWSQWVDSKGIHRLGRD 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K K+ T  G    E          +G E       +  + V    +  KW         
Sbjct: 140 SKGKIVTGAGWQEEE---------GFGEE-------HVHDIVSLGKKKAKWG-------- 175

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                  T +L     + C++YG L+VNRV G FHI      +  H ++   +    AAF
Sbjct: 176 ------KTPRLWGD-GDSCRVYGNLDVNRVQGDFHIT-----ARGHGYMEFGEHLDHAAF 223

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
           N +H +  LSFG           PLD TV  A      F YY+ I+PT+Y     +    
Sbjct: 224 NFSHIVSELSFGPFYP---SLVNPLDRTVNLARINFHKFQYYLSIVPTVYTVGKSASSSN 280

Query: 307 ----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                            D  +PGIFF Y++ P+++ + E          KI+  +SG  +
Sbjct: 281 TIFTNQYAVTEQSKETDDHNIPGIFFKYDIEPILLSVEESRDGFLQFLMKIVNVVSGVLV 340

Query: 351 T 351
            
Sbjct: 341 A 341


>gi|256052432|ref|XP_002569774.1| ptx1 protein [Schistosoma mansoni]
 gi|353229921|emb|CCD76092.1| putative ptx1 protein [Schistosoma mansoni]
          Length = 460

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/387 (24%), Positives = 168/387 (43%), Gaps = 58/387 (14%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S+ +  LD F K   +  + T  GG VTI+ +  IS+L+ ++   Y          +D S
Sbjct: 68  SQIVNELDVFPKLPRECKKSTWSGGLVTILTFGCISWLLIMEFRSYLDPPVNYSYELDKS 127

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
              K+ +++DIVV +  C  +++D VD+SG   L  E NI       +  P         
Sbjct: 128 TTGKVKVNIDIVVAS-PCHAVSMDVVDTSGSS-LSDEENIQYLPTSFELTPSARA----- 180

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
            A K ++                     Y AET   K     +   + + +K  +   + 
Sbjct: 181 -AFKYRQ---------------------YIAETLRAK-----HHTIQHWLWKYTSGTNVF 213

Query: 184 TIVQCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHIAPGLSYS-INHVHVHDIQPY 241
           TI +     + EK+ +   ++ C+I G L V +V G+ HI  G   +   ++H+H + P+
Sbjct: 214 TIFEVP--VADEKVSDDRNSDACRIVGTLFVKKVGGNIHILFGKPLNGFGNLHLH-VVPF 270

Query: 242 TSAAF-NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
           +  +  N +H I H SFG  +   + +  PL+   +  +   + F Y++ ++PT      
Sbjct: 271 SGQSLQNFSHRINHFSFGDLV---NGQIHPLEAVESVTDIAFTSFQYFVTMVPTKVVNHF 327

Query: 295 -IYERLDGSKL--------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
            I E    +            G  G+PGIFF Y++ PL+VKIT   + LG  +T++    
Sbjct: 328 HITETYQYAATLQNRTIDHDAGSHGIPGIFFVYDIFPLVVKITYDRELLGTFFTRLAALA 387

Query: 346 SGTYITFMLVDALLHSCVKKISKVEIG 372
            G + T   +  +L +    + +  +G
Sbjct: 388 GGIFATVAYLREILSNLPDILLRTRLG 414


>gi|41055383|ref|NP_956701.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Danio rerio]
 gi|82188148|sp|Q7T2D4.1|ERGI2_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 2
 gi|32451749|gb|AAH54593.1| ERGIC and golgi 2 [Danio rerio]
 gi|182890474|gb|AAI64472.1| Ergic2 protein [Danio rerio]
          Length = 376

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/180 (31%), Positives = 84/180 (46%), Gaps = 18/180 (10%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           N     C+I+G+L VN+V+G+FHI  G +      H H     +   +N +H I HLSFG
Sbjct: 163 NQPLNACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHETYNFSHRIDHLSFG 222

Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
              ++      PLDGT   + +   MF Y+I I+PT               + ER     
Sbjct: 223 ---EEIPGILNPLDGTEKVSADHNQMFQYFITIVPTKLQTYKVYADTHQYSVTERERVIN 279

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
              G  G+ GIF  Y++S LMVK+TE+         ++   I G + T  ++  L+  CV
Sbjct: 280 HAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWQFLVRLCGIIGGIFSTTGMLHNLVGFCV 339



 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          ++ LDAF K  E + E T  GG V+++ +  ++ L   +   Y       E  VD    S
Sbjct: 13 VRELDAFPKVPESYVETTASGGTVSLLAFTAMALLAFFEFFVYRDTWMKYEYEVDKDFTS 72

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI V  + C ++  D +D
Sbjct: 73 KLRINIDITV-AMRCQFVGADVLD 95


>gi|452822342|gb|EME29362.1| hypothetical protein Gasu_31910 [Galdieria sulphuraria]
          Length = 170

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 91/178 (51%), Gaps = 14/178 (7%)

Query: 41  LICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE 100
           LI  +V  Y++   T  L VD +R     I+LDI  P I C  L LD +D++G+  L V 
Sbjct: 4   LIISEVGRYWKPQVTTHLVVDYNREESFEIYLDITFPHIGCGALGLDTMDATGDSQLEVV 63

Query: 101 HN-IYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTT-TTELEDPNKCGSCYGAETET 158
           ++ + K R+  +G  +          +  + +  ++G   +  LE+   C SCYGA+  T
Sbjct: 64  NSKLSKFRVFQNGSQV----------LWNQSIVEKDGKVHSFVLEEATNCKSCYGAQIST 113

Query: 159 RKCCNTC-NEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNR 215
            +CCNTC  EV  AY +  W+  +++   QC  E   + +++  ++GC   G +EV +
Sbjct: 114 DQCCNTCEEEVLLAYEWIGWSY-QVEQFEQCHMEGVVQWVQSVLSQGCHFQGTIEVAK 170


>gi|323509323|dbj|BAJ77554.1| cgd8_2900 [Cryptosporidium parvum]
 gi|323510503|dbj|BAJ78145.1| cgd8_2900 [Cryptosporidium parvum]
          Length = 388

 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/406 (21%), Positives = 165/406 (40%), Gaps = 75/406 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SS 63
           +K  DAF+KP  +F  KT +GG +TI+  + +  L   ++  Y  ++  +E+ VD   S+
Sbjct: 1   MKQFDAFSKPISEFRIKTAFGGYLTILSMIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
           R   L + L+   P + CD L +  ++    + +++           DG           
Sbjct: 61  RNINLRMQLEF--PKLPCDILGVRIINLQENKEIYLP----------DG----------- 97

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWAL 179
             ++  K+    G+  +     + CG CY A    +     CCNTC ++   Y  K   L
Sbjct: 98  -GIEFVKI----GSNESNANSSSGCGPCYDASIINDLGAVNCCNTCKDIFNEYDKKGIKL 152

Query: 180 PELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           P + +  QC  + S +++ N       +EGC+I     + +V G   I+      + +  
Sbjct: 153 PHVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKE 209

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SM 284
           + D++   S  FN ++ + +L FG +L     R K  +   +   E            + 
Sbjct: 210 MTDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFEDAY 269

Query: 285 FNYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSP 322
            ++ +  IPT Y  ++                     L  G    D  +PGI  +Y+ +P
Sbjct: 270 IDFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTP 329

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            +VKITE  +S     T+    I G +    ++D      +  ++K
Sbjct: 330 FLVKITESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375


>gi|393231429|gb|EJD39021.1| DUF1692-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 518

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/349 (22%), Positives = 137/349 (39%), Gaps = 61/349 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +  +   GG +T+   L    L+  D+ +Y       E  VD SR S
Sbjct: 20  LKQFDAFPKVPATYKSRRGEGGLLTLFACLLSVVLVLNDIAEYMWGWPDHEFSVDKSRQS 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            +PI++D++V  + C YL++D  D+ G++ LH+  N                       V
Sbjct: 80  YMPINVDLIV-NMPCHYLSVDIRDAVGDR-LHLSDN-----------------------V 114

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K++    + G  T          S      ++RK     +  + + +             
Sbjct: 115 KREGTVWDVGQATRMANHSQTMMSATEVVRQSRKSRGLFSIFQRSSK------------P 162

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSAA 245
           Q K  Y+   +       C+++G + V +V+ + HI   G  YS N    H +       
Sbjct: 163 QFKPTYNHPNMGKAVGSACRVFGSMFVKKVTANLHITTAGHGYSSNAHTDHTM------- 215

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I   SFG  + D  +   PLD     A+E  + + Y++ ++PT Y       + 
Sbjct: 216 MNLSHIISEFSFGPFMPDISQ---PLDNLFEVAKEPFTAYQYFLTVVPTTYVAPRSYPMR 272

Query: 306 GGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
                              PGIFF +++ P+ + + +++ +   L  +I
Sbjct: 273 TNQYSVTNYKRVFEHGRATPGIFFKFDIDPMQLTVIQRTTTFTQLIIRI 321


>gi|154418008|ref|XP_001582023.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121916255|gb|EAY21037.1| hypothetical protein TVAG_172950 [Trichomonas vaginalis G3]
          Length = 371

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 161/380 (42%), Gaps = 53/380 (13%)

Query: 23  KTVYGGAVTIVCWLFISYLICVDV--CDYFQVSTTEEL---FVDSSRGSKLPIHLDIVVP 77
           +T  GG ++ +  L++ +L+   +    Y ++ ++  L    VD  R  K  I+ DI + 
Sbjct: 19  QTFTGGLISFLTTLWVCFLLVGKIHGLIYPEIKSSVVLDKEHVDGQR--KTFINFDITIG 76

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
           +  C  L +D  +  G Q  ++  NI   R    G+ I +  ++ V +  KK+       
Sbjct: 77  S-PCTMLHIDLFEHDGYQKTNIIENISLTRYAQSGEDINDLLEKRVPSKSKKQDFP---- 131

Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
                  P+ CG+CY   +  +KCCNTC EV + ++ K           QC  E     +
Sbjct: 132 -------PDYCGNCY--LSTDKKCCNTCREVMDVFKAKGLTYYASFRWEQCIRE----GV 178

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV-HVHDIQPYTSAAFNTTHHIRHLS 256
            +   E C+I G L+V + SG+FHIA G + + N+  H HD+     A+    H I  L+
Sbjct: 179 LDFGNETCRIKGKLKVKKQSGNFHIALGANTNDNYKGHSHDLSS-VDASHKLNHVIHSLT 237

Query: 257 FG-------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---TIYERLDGSKLGG 306
           FG        +L D + +   L+G+         M  YY+   P   +  +++D  +   
Sbjct: 238 FGEPVDYYKPQLTDVEMQLPELNGS------NYWMVTYYLHAAPERISTTDKIDSYRYSA 291

Query: 307 GDG----------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVD 356
                        G PGI F Y+ +P++V       S+  +   I   + G +    ++D
Sbjct: 292 FPSRRKVTNKTKKGFPGIVFYYDFAPMIVVYQPTHGSIRSIIVDICGIVGGAFSFAAIID 351

Query: 357 ALLHSCVKKISKVEIGGKTV 376
           AL    +  I    + GK  
Sbjct: 352 ALAFGALSGIRGKTMIGKAA 371


>gi|432862155|ref|XP_004069750.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oryzias latipes]
          Length = 373

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 57/191 (29%), Positives = 89/191 (46%), Gaps = 18/191 (9%)

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
           T V+      T++  ++    C+I+G+L VN+V+G+FHI  G S      H H     + 
Sbjct: 146 TAVKGAQPAKTQRDSSSPPNACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSH 205

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------- 294
            ++N +H I HLSFG  +        PLDGT   A +   MF Y+I I+PT         
Sbjct: 206 DSYNFSHRIDHLSFGEAIPG---LISPLDGTEKIAADYNHMFQYFITIVPTKLNTYKVSA 262

Query: 295 ------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
                 + ER        G  G+ GIF  Y++S LMVK+TE+         ++   + G 
Sbjct: 263 ETHQYSVTERERVINHAAGSHGVSGIFMKYDISSLMVKVTEQHMPFWKFLVRLCGIVGGI 322

Query: 349 YITFMLVDALL 359
           + T  ++  L+
Sbjct: 323 FSTTGMIHGLV 333



 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + E T  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFTLMAVLAFLEFFVYTNTWMKYEYEVDKDFSS 72

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI V  + C Y+  D +D
Sbjct: 73 KLRINVDITV-AMRCQYIGADVLD 95


>gi|225558748|gb|EEH07032.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 401

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 156/393 (39%), Gaps = 83/393 (21%)

Query: 2   VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
            F ER      L+  DAF K    +   T  GG  TI+ +   ++L   ++  +++    
Sbjct: 13  AFGERPGIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVEN 72

Query: 56  EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
               V+     +L ++LDIVV  +SCD L ++  D++G++ L  +         LD +P 
Sbjct: 73  HHFSVEKGVSRELQMNLDIVV-AMSCDALRVNVQDAAGDRILASDL--------LDKQPT 123

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK-CCNTCNEVKEAYRY 174
                    A   +++         E +  N+  S    E E      +   E K +Y+ 
Sbjct: 124 SW-------AAWNRELNGVTSGGGREYQTLNEEDSSRLMEQEADAHVGHALGEAKRSYKR 176

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHV 233
           K    P+L               +    + C+IYG LE N+V G FHI A G  Y     
Sbjct: 177 KFPKGPKLK--------------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEFGE 222

Query: 234 HV-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
           H+ HD       AFN +H +  LSFG           PLD T++        F YY+ ++
Sbjct: 223 HLSHD-------AFNFSHMVTELSFGPHYPS---LLNPLDKTISVTPARFFKFQYYLSVV 272

Query: 293 PTIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSY 318
           PTIY R            D + +   + G                       +PGIFF Y
Sbjct: 273 PTIYTRAGIVDPYNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKY 332

Query: 319 ELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
            + P+++ ++E+   L  L  +++  ++G  + 
Sbjct: 333 NIEPILLVVSEERGGLLALLVRLVNVLAGVVVA 365


>gi|410082748|ref|XP_003958952.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
 gi|372465542|emb|CCF59817.1| hypothetical protein KAFR_0I00360 [Kazachstania africana CBS 2517]
          Length = 354

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/385 (24%), Positives = 158/385 (41%), Gaps = 81/385 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E++ +K+  GG  +++ + F+ ++   +  +YF     E+  VD     
Sbjct: 4   LRTFDAFPKTEEEYQKKSSKGGLSSLLTYFFLIFIAWTEFGNYFGGYIDEQYTVDPEVKE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V  I C +L ++A D + ++ L  E       L L+  P   P    VN +
Sbjct: 64  DIQINMDIFV-NIPCKWLHINARDMTLDRKLAGEE------LKLEDMPFFIPFDTRVNDI 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKKWALPEL 182
            +          T EL+     G    AE     + R+  +  N  +      K  +PE 
Sbjct: 117 TE--------IVTPELD--RILGEAIPAEFREKIDMRQFYDENNHDE-----TKHFVPEF 161

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
           +                    GC ++G + VNRV+G   I A G+ Y        D +  
Sbjct: 162 N--------------------GCHVFGSIPVNRVTGELQITAKGMGYP-------DREKA 194

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLD 300
                N  H I  LSFG      D    PLD +    +E   S + Y++ +IPTIY++L 
Sbjct: 195 PIDEVNFAHVINELSFGDFYPYID---NPLDNSAKFDQENPISAYVYHMNVIPTIYQKL- 250

Query: 301 GSKLGGGD------------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
           G+++                      G +PGIF  Y   PL + +T+K  S      +++
Sbjct: 251 GAEVDTNQYSVSEYHYTEADNAIRKAGRVPGIFLKYNFEPLSIVVTDKRLSFIQFVIRLV 310

Query: 343 CNISG-TYIT---FMLVDALLHSCV 363
             +S   YI    F+LVD  L + +
Sbjct: 311 AILSFIVYIASWLFILVDTALVAAM 335


>gi|145511431|ref|XP_001441642.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408894|emb|CAK74245.1| unnamed protein product [Paramecium tetraurelia]
          Length = 329

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 90/191 (47%), Gaps = 24/191 (12%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---- 258
           EGCQI GY+ VN+V G+FH++      I H      Q       + +H I H+SFG    
Sbjct: 139 EGCQIAGYIIVNKVPGNFHVSAHAFGGILH---QVFQRSQIQTLDLSHTINHISFGEEDD 195

Query: 259 ---IKLQDDDERRKPLDGT--VAKAEEGASM-FNYYIKIIPTIYERLDGSKLGGGD---- 308
              IK Q       PLD T  VA+ + G  M F YYI ++PT Y  + G++         
Sbjct: 196 LMKIKKQFQKGVLNPLDNTKKVAQPQGGTGMMFQYYISVVPTTYVDVSGNEYYVHQFTAN 255

Query: 309 ------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-S 361
                   +P  +F Y+LSP+ VK  +  +S  H   +I   + G +    +VD ++H S
Sbjct: 256 SNEVLTDHLPAAYFRYDLSPVTVKFLQYRESFLHFLVQICAILGGVFTIASIVDGMIHKS 315

Query: 362 CVKKISKVEIG 372
            V  + K E+G
Sbjct: 316 VVALLKKYEMG 326



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/89 (34%), Positives = 47/89 (52%), Gaps = 6/89 (6%)

Query: 6  RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
          RL+ LD + K   D  E T  G  ++++  LFI+ L       Y +V  + E+FVD +RG
Sbjct: 8  RLRKLDIYRKLPADLTEPTTAGALISVIIILFITELQA-----YIEVDNSSEMFVDINRG 62

Query: 66 S-KLPIHLDIVVPTISCDYLALDAVDSSG 93
            ++ ++LDI      CD L+LD  D  G
Sbjct: 63 GEQIRVNLDIEFHKFPCDILSLDVQDYYG 91


>gi|67901384|ref|XP_680948.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
 gi|40742675|gb|EAA61865.1| hypothetical protein AN7679.2 [Aspergillus nidulans FGSC A4]
 gi|259484020|tpe|CBF79887.1| TPA: COPII-coated vesicle protein (Erv41), putative
           (AFU_orthologue; AFUA_2G01530) [Aspergillus nidulans
           FGSC A4]
          Length = 394

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 88/376 (23%), Positives = 146/376 (38%), Gaps = 71/376 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   +  GG  T++  +  +     +   + +   T    V+     
Sbjct: 24  LRTFDAFPKTKPSYTTPSRRGGQWTVLILIICTIFSITEFRTWLKGHETHHFTVEKGVSH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++ D V+  + CD L ++  D++G++ L  E  + K+          EP    +   
Sbjct: 84  DLQLNFDAVI-HMPCDALHINIQDAAGDRVLASE--MLKK----------EPTSWKLWMD 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           K+   ++E  T +    D  +      A  E     +  NE++   + K    P+L    
Sbjct: 131 KRNYHSSEYQTLSDSRGDEERVA----AMEEDVHAGHVLNELRRNGKRKFAKGPKLR--- 183

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                      +    + C+IYG LE N+V G FHI A G  Y     H+        +A
Sbjct: 184 -----------RGDVVDSCRIYGSLEGNKVQGDFHITARGHGYRDGREHL------DHSA 226

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H I  LSFG           PLD T+A  E     + Y++ I+PTIY R    +L 
Sbjct: 227 FNFSHIITELSFGPHYPS---LHNPLDKTIATTEFHYYKYQYFLSIVPTIYSRNQNLRLD 283

Query: 306 GGDGG------------------------------MPGIFFSYELSPLMVKITEKSKSLG 335
                                              +PGIFF Y + P+M+ I+E+     
Sbjct: 284 ALPSSSSARSNKNLIFTNQYAATSQSDAIPESPYVIPGIFFKYNIEPIMLLISEERTGFL 343

Query: 336 HLWTKIMCNISGTYIT 351
           +L  +I+  +SG  +T
Sbjct: 344 NLLIRIVNTVSGVLVT 359


>gi|358333955|dbj|GAA52416.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Clonorchis sinensis]
          Length = 306

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/165 (32%), Positives = 78/165 (47%), Gaps = 20/165 (12%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAA-FNTTHHIRHLSFGIK 260
           + C I G   V +V+G+ H+ PG  +      HVH I P+   A FN +H I HLSFG +
Sbjct: 86  DACNIVGTFHVQKVAGNMHVLPGRPFDGPGGSHVH-IAPFVRLADFNFSHRINHLSFGAQ 144

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSKL----------GG 306
           + +   R  PLD     +      F YYI I+PT     +  LD  +           G 
Sbjct: 145 VAN---RVNPLDAVEEISYNPMETFRYYISIVPTRVVYAFSSLDTYQYAITVKNRTAEGN 201

Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
               +PGIFFSY+  PL+V++TE  +  G    ++   + G + T
Sbjct: 202 KSDSIPGIFFSYDTFPLLVQVTESRELFGTFLARLAALVGGLFAT 246


>gi|67623433|ref|XP_667999.1| serologically defined breast cancer antigen 84 like (42.9 kD)
           (XQ234) [Cryptosporidium hominis TU502]
 gi|54659178|gb|EAL37768.1| serologically defined breast cancer antigen 84 like (42.9 kD)
           (XQ234) [Cryptosporidium hominis]
          Length = 388

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 89/406 (21%), Positives = 164/406 (40%), Gaps = 75/406 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD---SS 63
           +K  DAF+KP  +F  KT +GG +TI+  + +  L   ++  Y  ++  +E+ VD   S+
Sbjct: 1   MKQFDAFSKPISEFRIKTAFGGYLTILSIIAMIILFYSELKYYLNITRKDEVTVDHLSSN 60

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
           R   L + L+   P + CD L +  ++    + +++           DG           
Sbjct: 61  RNINLRMQLEF--PKLPCDILGVRIINLQENKEIYLP----------DG----------- 97

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETR----KCCNTCNEVKEAYRYKKWAL 179
             ++  K+    G+  +     + CG CY A          CCNTC +V   Y  K   L
Sbjct: 98  -GIEFVKI----GSNESNANSSSGCGPCYDASINNDLGVVNCCNTCKDVFNEYDKKGIKL 152

Query: 180 PELDTIVQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVH 234
           P + +  QC  + S +++ N       +EGC+I     + +V G   I+      + +  
Sbjct: 153 PHVISFKQCDYDKS-KRISNALSSNLNSEGCKIKVNGYIPKVKGKIEISH--KRWVKYKE 209

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA----------SM 284
           + D++   S  FN ++ + +L FG +L     R K  +   +   E            + 
Sbjct: 210 MTDLEIAESHLFNFSYKMNYLDFGEELPGIPNRWKNQEYIQSSRFEKLGYSQDLVFDDAY 269

Query: 285 FNYYIKIIPTIYERLDGS------------------KLGGG----DGGMPGIFFSYELSP 322
            ++ +  IPT Y  ++                     L  G    D  +PGI  +Y+ +P
Sbjct: 270 IDFDMHCIPTQYNTINNKSINSHQFSVRSQYKKVLVSLANGKFIPDTSIPGIHINYDFTP 329

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
            +VK+TE  +S     T+    I G +    ++D      +  ++K
Sbjct: 330 FLVKMTESRRSFLSFITECCAIIGGIFAFSGMIDIFFFKFLSSVNK 375


>gi|148678794|gb|EDL10741.1| ERGIC and golgi 2, isoform CRA_a [Mus musculus]
          Length = 375

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 149/368 (40%), Gaps = 53/368 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 21  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 81  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 138

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 139 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 165

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 166 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 219

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP-----TIYERLDG 301
           N +H I HLSFG  +        PLDGT   A +      +  KI       ++ ER   
Sbjct: 220 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDLVPTKLHTYKISADTHQFSVTERERI 276

Query: 302 SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
                G  G+ GIF  Y+LS LMV +TE+       + ++   I G + T      +LH 
Sbjct: 277 INHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST----TGMLHG 332

Query: 362 CVKKISKV 369
             K I ++
Sbjct: 333 IGKFIVEI 340


>gi|367025937|ref|XP_003662253.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
           42464]
 gi|347009521|gb|AEO57008.1| hypothetical protein MYCTH_2302675 [Myceliophthora thermophila ATCC
           42464]
          Length = 380

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 152/368 (41%), Gaps = 67/368 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF-VDSSRG 65
           +K  DAF K    + + T  GG  T+    FIS ++       +   T E  F V+    
Sbjct: 22  VKAFDAFPKAKPQYVQHTSAGGKWTVAM-AFISLILFWSELARWWRGTEEHTFAVEKGVS 80

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDL-----DGKPIQEPQK 120
             LPI+LD+VV  + C  L ++  D++G++ L    +  +R   L     DGK +    +
Sbjct: 81  HVLPINLDVVV-RMRCADLHVNVQDAAGDRILAA--SALRRDPTLWAHWVDGKGVHRLGR 137

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
           +      + +V T  G T  + ++       +G E       +  + V    +  KW+  
Sbjct: 138 DA-----QGRVITGEGYTGADHDE------GFGEE-------HVHDIVALGRKRAKWS-- 177

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
                        T +L     + C+IYG LE+N+V G FHI      +  H ++   + 
Sbjct: 178 ------------RTPRLWGAEADSCRIYGSLELNKVQGDFHIT-----ARGHGYMEFGEH 220

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---- 296
               AFN +H I  LSFG  L        PLD TV  A      F Y++ ++PT Y    
Sbjct: 221 LDHNAFNFSHIISELSFGPFLP---SLVNPLDRTVNTAPAHFYKFQYFLSVVPTTYSVGH 277

Query: 297 --ERLDGSKLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
             ER   S L              +  +PGIF  Y++ P+++ I E   S      K++ 
Sbjct: 278 PEERGSRSVLTNQYAVTEQSKAVPENTVPGIFVKYDIEPILLNIVETRDSFFVFLIKVIN 337

Query: 344 NISGTYIT 351
            +SG  +T
Sbjct: 338 VVSGVLVT 345


>gi|440801547|gb|ELR22565.1| serologically defined breast cancer antigen 84 isoform 1, putative
           [Acanthamoeba castellanii str. Neff]
          Length = 355

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 56/187 (29%), Positives = 87/187 (46%), Gaps = 24/187 (12%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPG----LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGI 259
           GC+++G  EV +V G+ HIA G     S+  +  HVH I P   A+FN +H I HLSFG 
Sbjct: 151 GCRVFGKAEVQKVKGNLHIAAGSNAPQSHDGHQHHVHHITPEQVASFNVSHFIPHLSFGP 210

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL--------------- 304
                  R  PL  T    E  A   N+ I+++PTIYE   G+ +               
Sbjct: 211 AF---PRRTDPLSWTRV-IEPNAMQVNHMIQLVPTIYEDWGGNVIEGYQYSAQTNYKHIV 266

Query: 305 -GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
            G     +PG+F  +++SP +++  E  +S  H  T++     GT++   L+ + L    
Sbjct: 267 PGASSFPLPGVFIKWDMSPFVIQYRETGRSFAHFLTRLCAITGGTFVVLGLIYSGLTKAF 326

Query: 364 KKISKVE 370
             +  V 
Sbjct: 327 PALRTVR 333



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 12/117 (10%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVST--------- 54
           ++RL+  D F K  ED  E+   G AVTIV  L + +L   +   Y QV T         
Sbjct: 10  AKRLRSFDIFPKSVEDVREQASAGAAVTIVGVLVMLFLFVSEFSSYTQVVTEAWRGGAIW 69

Query: 55  --TEELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
              + +FVD++R   + I+ ++V   ++C  + +D VD+ G+       +I K+ +D
Sbjct: 70  AEADTIFVDTTREKTMWINFELVFLQLACKEVEVDIVDNFGDPQ-RGRRDIQKQAVD 125


>gi|195997845|ref|XP_002108791.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
 gi|190589567|gb|EDV29589.1| hypothetical protein TRIADDRAFT_49706 [Trichoplax adhaerens]
          Length = 324

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 92/187 (49%), Gaps = 25/187 (13%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLS--YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           + C+I+G + +N+V+G+FH+  G+S  + + H HV D+ P  S  F  +H I  L+FG+ 
Sbjct: 137 DACRIHGNIPLNKVAGNFHVTAGMSINHPMGHAHVSDLVPRESVNF--SHRIDLLAFGVA 194

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLG 305
             +      PLDG     +    M+ Y+IKI+PT               + E        
Sbjct: 195 APN---VINPLDGVEFITKITDKMYQYFIKIVPTKVKTFSVAIDTYQYSVTEHFSKVDHM 251

Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV---DALLHSC 362
            G  G+ G+FF Y+LSP+ V++TE     G L  ++   + G + T  ++    +L++  
Sbjct: 252 NGKHGVSGLFFKYDLSPISVQVTEARVPFGQLLIRLCGIVGGIFATSGMIHIFSSLIYEA 311

Query: 363 VKKISKV 369
           V +  K+
Sbjct: 312 VTRRKKL 318



 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 25/86 (29%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 5  ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
          + +K LDAF K  ED  E +  GG  ++  +  I+ ++ +++ DY          VD   
Sbjct: 13 QEVKKLDAFPKIAEDCKESSTSGGTASVTAFFLITIMVIMELVDYSFSGVKYNYSVDKDI 72

Query: 65 GSKLPIHLDIVVPTISCDYLALDAVD 90
           SK+ +HLD+ +  + C  L  D +D
Sbjct: 73 QSKMMLHLDLTI-AMKCRDLGADVLD 97


>gi|240275142|gb|EER38657.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Ajellomyces capsulatus H143]
 gi|325094499|gb|EGC47809.1| COPII-coated vesicle protein [Ajellomyces capsulatus H88]
          Length = 401

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 95/393 (24%), Positives = 157/393 (39%), Gaps = 83/393 (21%)

Query: 2   VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
            F ER      L+  DAF K    +   T  GG  TI+ +   ++L   ++  +++    
Sbjct: 13  AFGERPGIGSGLRTFDAFPKTKPTYTTSTRRGGQWTIIVFALCAFLSLNELRTWYRGVEN 72

Query: 56  EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
               V+     +L ++LDIV   + CD L ++  D++G++ L  +       LD      
Sbjct: 73  HHFSVEKGVSRELQMNLDIVA-AMPCDALRVNVQDAAGDRILASD------LLD------ 119

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRK-CCNTCNEVKEAYRY 174
           ++P        +   VT+  G    E +  N+  S    E E      +   E K +Y+ 
Sbjct: 120 KQPTSWAAWNRELNGVTSGGGR---EYQTLNEEDSSRLMEQEADAHVGHALGEAKRSYKR 176

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHV 233
           K    P+L               +    + C+IYG LE N+V G FHI A G  Y     
Sbjct: 177 KFPKGPKLK--------------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYPEYGE 222

Query: 234 HV-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
           H+ HD       AFN +H +  LSFG           PLD T++        F YY+ ++
Sbjct: 223 HLSHD-------AFNFSHMVTELSFGPHYPS---LLNPLDKTISVTPARFFKFQYYLSVV 272

Query: 293 PTIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSY 318
           PTIY R            D + +   + G                       +PGIFF Y
Sbjct: 273 PTIYTRAGIVDPYNHVLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKY 332

Query: 319 ELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
            + P+++ ++E+  SL  L  +++  ++G  + 
Sbjct: 333 NIEPILLVVSEERGSLLALLVRLVNVLAGVVVA 365


>gi|212527292|ref|XP_002143803.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           marneffei ATCC 18224]
 gi|210073201|gb|EEA27288.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           marneffei ATCC 18224]
          Length = 402

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 151/381 (39%), Gaps = 78/381 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K   ++   +  GG  T++ +   ++L   +  ++++ +  +   V+     
Sbjct: 24  LRTFDAFPKTKPNYTTASRRGGQWTVIIFAICTFLTFGEFVNWYRGTENQHFSVEKGVSR 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L +++D+VV  + C+ L ++  D+SG+      H +    L  DG    E   E +N  
Sbjct: 84  QLQMNIDMVV-KMHCNDLRVNVQDASGD------HIMAGMLLMKDGTNW-ELWNEKLNQQ 135

Query: 127 KKKKV---TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
               V    T N      L D           + TR             R  K   P+  
Sbjct: 136 SSSGVPEYQTLNAEDVKRLMDQEDDAHARHVLSHTR-------------RNPKRKFPK-- 180

Query: 184 TIVQCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
                     T +L + + T+ C+IYG LE N+V G FHI A G  Y  N V  H     
Sbjct: 181 ----------TPRLSSKYPTDSCRIYGSLESNKVHGDFHITARGHGY--NEVGQH----L 224

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG 301
             + FN TH +  LSFG           PLD TVA  E     F Y+I ++PTIY + + 
Sbjct: 225 DHSNFNFTHMVTELSFGPHYPS---LLNPLDKTVASTETHYYKFQYFINVVPTIYAKGNN 281

Query: 302 S-------------------------------KLGGGDGGMPGIFFSYELSPLMVKITEK 330
           +                                L       PGIFF Y + P+++ ++E+
Sbjct: 282 AVEKYTANPAKAFEKSRNTIFTNQYSATSQSHPLPESPFNTPGIFFKYNIEPILLFVSEE 341

Query: 331 SKSLGHLWTKIMCNISGTYIT 351
             S   L  +++  +SG  +T
Sbjct: 342 RGSFLALLVRLVNVVSGVIVT 362


>gi|345325542|ref|XP_001508860.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Ornithorhynchus anatinus]
          Length = 372

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 125/288 (43%), Gaps = 44/288 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  +++L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMAFLTVMEFLVYQDTWMKYEYEVDKDFAS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYIGADVLDLAETMVASADGLVYEPVI-FDLSPQQREWQRMLQMI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QNR------------LQEEHSLQDVI---------------FKSAFKSASTALPP----- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
             + + S +       + C+I+G+L VN+V+G+FHI  G +      H H     +  ++
Sbjct: 159 --RGDLSLQP-----PDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPGIIN---PLDGTEKIAVDHNQMFQYFITVVPT 256


>gi|340514865|gb|EGR45124.1| predicted protein [Trichoderma reesei QM6a]
          Length = 372

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 144/362 (39%), Gaps = 63/362 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  KT  GG  T+   L  S  +  ++  +++ S      V+   G 
Sbjct: 21  VSAFDAFPKAKPQYVTKTAGGGKWTVAMLLVSSIFLWSEIGRWWRGSEHHTFAVEKGIGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LDIVV  +SC  L ++  D+SG++              L G  +          V
Sbjct: 81  DMQINLDIVVK-MSCGDLHVNVQDASGDR-------------ILAGDKLTRDATNWEQWV 126

Query: 127 KKKKV----TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
             K V      ENG   T           +GA  E     +  + V  + +  KWA    
Sbjct: 127 DAKGVHRLGKNENGKLDT-------GAGWHGAHDEGFGEEHVHDIVSLSRKKAKWA---- 175

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQP 240
                      T K +   T+ C++YG L++N+V G FHI A G  YS    H+ HD   
Sbjct: 176 ----------KTPKPRGR-TDSCRMYGSLDLNKVQGDFHITARGHGYSGIGGHLDHD--- 221

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---E 297
                FN +H I  LS+G           PLD TV  A      F YY+ ++PT+Y    
Sbjct: 222 ----KFNFSHIISELSYGPFYP---SLINPLDRTVNTAIVHFHKFQYYLSVVPTVYIASH 274

Query: 298 RLDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
           R+  +             D  +PGIFF Y++ P+M+ + E          K++   SG  
Sbjct: 275 RIVNTNQYAVTEQSKTISDHQVPGIFFKYDIEPIMLSVEETRDGFFAFLLKLVNVFSGVM 334

Query: 350 IT 351
           + 
Sbjct: 335 VA 336


>gi|119497911|ref|XP_001265713.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
           fischeri NRRL 181]
 gi|119413877|gb|EAW23816.1| COPII-coated vesicle protein (Erv41), putative [Neosartorya
           fischeri NRRL 181]
          Length = 397

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 95/381 (24%), Positives = 156/381 (40%), Gaps = 79/381 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   +  GG  T++  L  ++    +   + + +  +   V+     
Sbjct: 24  LKTFDAFPKTKPSYTAPSPRGGQWTVLILLVCTFFSISEFRTWLKGTEKQHFSVEKGISH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + CD L ++  D+SG++ L  E  + KR          EP        
Sbjct: 84  DLQLNLDIVV-HMPCDTLDVNIQDASGDRVLAGE--LLKR----------EP-------- 122

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGA----ETETRKCCNTCNEVKEAYRYKKWALPEL 182
                      T+ +L    +    YG     +T +++  +  +E +EA  +    L E+
Sbjct: 123 -----------TSWQLWMDKRNFEIYGGAHEYQTLSQEHADRLSE-QEADAHVHHVLGEV 170

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
               + K     +  +    + C+IYG LE N+V G FHI A G  Y       H+  P+
Sbjct: 171 RRNPRKKFAKGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGY-------HNSAPH 223

Query: 242 TS-AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-- 298
                FN +H I  LSFG           PLD T+A  E+    + Y++ I+PTIY +  
Sbjct: 224 LEHKTFNFSHMITELSFGPHYP---TLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGN 280

Query: 299 --LD--------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
             LD                           S +      +PGIFF Y + P+++ I+E+
Sbjct: 281 LALDTYANAPPTSRYSKNLIFTNQYAATSQSSAIPENPYFIPGIFFKYNIEPILLMISEE 340

Query: 331 SKSLGHLWTKIMCNISGTYIT 351
             S   L  +++  ISG  +T
Sbjct: 341 RTSFLSLLVRLVNTISGVMVT 361


>gi|410907774|ref|XP_003967366.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Takifugu rubripes]
          Length = 388

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 85/176 (48%), Gaps = 18/176 (10%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           +T    C+I+G+L VN+V+G+FHI  G S      H H     +  ++N +H I HLSFG
Sbjct: 162 STSLHACRIHGHLYVNKVAGNFHITVGKSIPHPRGHAHLAALVSHDSYNFSHRIDHLSFG 221

Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
              +D      PLDGT   + +   +F Y+I I+PT               + E+     
Sbjct: 222 ---EDLPGIISPLDGTEKVSADSNHIFQYFITIVPTKLNTYRVSAETHQYSVTEQDRAIN 278

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
              G  G+ GIF  Y+++ LMVK+TE+   L     ++   I G + T  ++  ++
Sbjct: 279 HAAGSHGVSGIFMKYDINSLMVKVTEQHMPLWQFLVRLCGIIGGIFSTTGMIHGIV 334



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + E T  GG V+++ +  ++ L  ++   Y       E  VD   GS
Sbjct: 13 VKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKDFGS 72

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI V  + C Y+  D +D
Sbjct: 73 KLRINVDITV-AMRCQYIGADVLD 95


>gi|261193579|ref|XP_002623195.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
 gi|239588800|gb|EEQ71443.1| COPII-coated vesicle protein [Ajellomyces dermatitidis SLH14081]
 gi|239613876|gb|EEQ90863.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ER-3]
 gi|327349942|gb|EGE78799.1| COPII-coated vesicle protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 401

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 94/392 (23%), Positives = 159/392 (40%), Gaps = 81/392 (20%)

Query: 2   VFSER------LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTT 55
            F ER      L+  DAF K    +   TV GG  TI+ +   ++L   ++  +++    
Sbjct: 13  AFGERPGIGSGLRTFDAFPKTKPTYTSSTVRGGQWTIIVFALCAFLSINELRTWYRGVEN 72

Query: 56  EELFVDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
               V+     +L ++LDIVV  + CD L ++  D+ G++ L  +         LD +P 
Sbjct: 73  HHFSVEKGISRELQMNLDIVV-AMPCDALRVNVQDAVGDRILASDL--------LDKQPT 123

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
                    A   +++   +   + E +  N+  +    E E  +  +  + + EA R  
Sbjct: 124 SW-------AAWNRELNVVSSGGSREYQTLNEEDAVRLMEQE--EDVHVGHALGEAQRSY 174

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVH 234
           K   P+   + + +N            + C+IYG L  N+V G FHI A G  Y     H
Sbjct: 175 KRKFPKGPKLKRGEN-----------ADSCRIYGSLVGNKVQGDFHITARGHGYFEFGEH 223

Query: 235 V-HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           + HD       +FN +H I  LSFG           PLD T++        + YY+ I+P
Sbjct: 224 LSHD-------SFNFSHMITELSFGPHYS---TLLNPLDKTISTTPAHFHKYQYYMSIVP 273

Query: 294 TIYERL-----------DGSKLGGGDGG-----------------------MPGIFFSYE 319
           TIY R            D S +     G                       +PGIFF Y 
Sbjct: 274 TIYTRAGVVDPYSQALPDPSTITPSQRGNTIFTNQYAVTSRSHELPDAEYDVPGIFFKYT 333

Query: 320 LSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
           + P+++ ++E+  SL  L  +++  ++G  + 
Sbjct: 334 IEPILLVVSEERGSLLALLVRLVNVLAGVVVA 365


>gi|336269097|ref|XP_003349310.1| hypothetical protein SMAC_05593 [Sordaria macrospora k-hell]
 gi|380089883|emb|CCC12416.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 379

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 86/364 (23%), Positives = 147/364 (40%), Gaps = 65/364 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L    L   +   +++ + +    V+     
Sbjct: 23  VSAFDAFPKSKPQYVTRTTAGGKWTVFVTLISFILFWSEASRWWRGTESHTFAVEKGVSH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDG--KPIQEPQ 119
            L I+LDIVV  + C  + ++  D++G++      LH +  +++  +D  G  K  ++ Q
Sbjct: 83  SLDINLDIVV-KMKCQDIHINVQDAAGDRILAASKLHRDPTVWQHWVDNKGIHKLGRDAQ 141

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
            +VV            G    +  D       +G E       +  + V    +  KWA 
Sbjct: 142 GKVVT-----------GEDYLQGHDEG-----FGEE-------HVHDIVALGRKRAKWA- 177

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQ 239
                         T +L     + C+++G LE+N+V G FHI      +  H ++   Q
Sbjct: 178 -------------RTPRLWGATPDSCRVFGSLELNKVQGDFHIT-----AKGHGYMEFGQ 219

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
               +AFN +H I  LS+G  L        PLD TV  A      F Y+I ++PT+Y   
Sbjct: 220 HLDHSAFNFSHIISELSYGPFLP---SLVNPLDQTVNLATSNFHKFQYFISVVPTVYSVS 276

Query: 300 DGSKLGGGDGG------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
            G  +                  +PGIF  Y++ P+++ I E+  S      K++  ISG
Sbjct: 277 GGRSIVTNQYAVTEQSQEVTERIIPGIFVKYDIEPILLNIVEERDSFLLFLIKVVNVISG 336

Query: 348 TYIT 351
             + 
Sbjct: 337 ALVA 340


>gi|388583623|gb|EIM23924.1| DUF1692-domain-containing protein [Wallemia sebi CBS 633.66]
          Length = 396

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 81/358 (22%), Positives = 143/358 (39%), Gaps = 70/358 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +  ++  GG  T++    +  L+  ++ D+       +  VD++  +
Sbjct: 10  LREFDAFPKTQASYKIRSKQGGIATVIVIFALVLLVFHEIGDWLYGHNEYQFSVDTTTET 69

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ +++D+ V  + C YL +D  D+ G++ L +  +I K     DG              
Sbjct: 70  EMQLNVDLTV-AMPCHYLNVDIRDAVGDR-LKLSDSIQK-----DG-------------- 108

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                      TT E E   + GS         K       VK++ + +KW  P      
Sbjct: 109 -----------TTFEPEKYRQIGSA--------KQSTLSRIVKDSKKGRKWFRP-----T 144

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
             +N +   K        C+IYG +E  +V+G+ HI   G  YS        ++      
Sbjct: 145 STRNRFPKTKKLIKDGPACRIYGSVETKKVNGNMHITTLGHGYS-------SLEHTDHKL 197

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I   SFG   Q      +PLD +V   +    ++ Y++ ++PT Y    G  L 
Sbjct: 198 MNLSHTIDEFSFG---QHFPYISQPLDKSVEITDNHFPVYQYFMHVVPTTYVDASGHSLS 254

Query: 306 GGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                             G+PG+FF YEL P+ + ++  + S   L  ++   I G +
Sbjct: 255 TNQYSAREDIKFIHNHQRGIPGLFFRYELEPIHLSLSATTMSFTKLLIRLTALIGGVW 312


>gi|328862174|gb|EGG11276.1| hypothetical protein MELLADRAFT_33547 [Melampsora larici-populina
           98AG31]
          Length = 361

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 144/359 (40%), Gaps = 76/359 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K    + E++  GG +TIV    I  LI  ++ +Y   + T    VD++ G 
Sbjct: 14  IREFDAFPKTIPTYKERSSRGGILTIVVGFLIMILIWHELREYLFGAATYSFSVDNTVGH 73

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++ D+ +  + C YL++D  D+ G++ +H+                           
Sbjct: 74  DLGLNFDVTI-NMPCHYLSIDVRDAVGDR-MHISDEF----------------------- 108

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            KK+ T  +      LE  N  G             +    V++A     W  P      
Sbjct: 109 -KKEGTEFSIGQAARLETNNDAG------------ISASKMVRDAQ--GGWTRPTF---- 149

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                   +K K    EG  C+I+G   V +V+G+ HI      ++ H ++   +     
Sbjct: 150 --------KKTKPLIPEGPACRIFGSTHVKKVTGNLHIT-----TLGHGYL-SWEHTDHQ 195

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-------- 296
             N TH I   SFG    +     +PLD +V   ++   +F Y+I ++PT Y        
Sbjct: 196 LMNLTHVISEFSFGEFFPN---MVQPLDNSVEITDKPFHIFQYFISVVPTTYINSGGRQV 252

Query: 297 -----ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                   D S+      G+PGIFF Y++ P+ + I E++ +L     ++   + G  +
Sbjct: 253 FTNQYSVTDMSRSTEHGRGVPGIFFKYDIEPMYLTIRERTTTLVQFLVRLAGIVGGIVV 311


>gi|440794754|gb|ELR15909.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 306

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 86/190 (45%), Gaps = 35/190 (18%)

Query: 205 CQIYGYLEVNRVSGSFHIA-----PGLSY--SIN-------HVHVHDIQPYTSAAFNTTH 250
           C + G++ V ++ G F I+     P   Y  S+N       H H H   P  S  FN TH
Sbjct: 121 CLLTGHMAVRKIRGQFQISSRRFNPFSIYGSSLNKHTPTEDHPHPH---PEDSLPFNVTH 177

Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDGSKLGGGDG 309
            IR LSFG K+  D     PLDG V    EG  S ++Y+++I+P  Y   DG  +     
Sbjct: 178 RIRELSFGPKVLPD---VGPLDGIVQTMREGERSQYSYFLQIVPASYHYADGRVVESYSF 234

Query: 310 GM-----------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDAL 358
                        PG+F+ Y+ SP    + E  KS  H  T+    I GT++ F L+ AL
Sbjct: 235 AFTMHTESRSELAPGVFWKYDFSPYATSLREVPKSFSHFITRCCAVIGGTFVVFGLLSAL 294

Query: 359 ---LHSCVKK 365
              L +  KK
Sbjct: 295 ASRLETAAKK 304



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 40/74 (54%), Gaps = 3/74 (4%)

Query: 23 KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGSKLPIHLDIVVPTI 79
          KTV  GAV+I+C+  + YL   +V +Y +   T ++ VD++       L + L +  P +
Sbjct: 25 KTVSSGAVSILCFFLLGYLFLQEVAEYQKAEVTSQVSVDTTIRNEFDSLLVSLTVEFPNL 84

Query: 80 SCDYLALDAVDSSG 93
           C+   +DA D +G
Sbjct: 85 GCEDFGVDAADYTG 98


>gi|123483410|ref|XP_001324018.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121906894|gb|EAY11795.1| conserved hypothetical protein [Trichomonas vaginalis G3]
          Length = 384

 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 135/320 (42%), Gaps = 60/320 (18%)

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           + LD+ V  + C +L LD +D+ G   L++       RL                + ++K
Sbjct: 72  VSLDVKV-NMPCYFLHLDVIDNLGFNQLNINTTAKFIRL----------------SAQEK 114

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV-------KEAYRYKKWALPEL 182
           ++   N T ++       C SCYG   E   CCN+C +         +A   K W     
Sbjct: 115 ELGYANETISS------ICHSCYGLLPEG-SCCNSCEQTLLLHIMNGKAANTKDWP---- 163

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYT 242
               QC+ +   +  +N   E C+I G + +N+  G+FHIAPG +    + HVHD+    
Sbjct: 164 ----QCQGKNPGKVYEN---EKCRIKGKVCLNKAQGNFHIAPGTNMKERYGHVHDLSGQL 216

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYE---R 298
              F+ +H I+ +  G K+        PL      +      ++ Y + + P +Y+   R
Sbjct: 217 -PNFDLSHVIQGMRVGPKI---PLTYNPLRYVQQIQNPNQPVVYRYDLVVTPAVYKSGNR 272

Query: 299 LDGSK----------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
           + G              G  GG PGI+F Y  +P  V +     ++  ++T I   +SG 
Sbjct: 273 ILGKGYDYTAMINRFFVGNSGGAPGIYFHYSFTPYGVTVNATYLTIAQIFTSIFGFMSGA 332

Query: 349 YITFMLVDALLHSCVKKISK 368
           Y  F ++D  +    K+++K
Sbjct: 333 YAIFSIIDESMFKDDKRMAK 352


>gi|452988546|gb|EME88301.1| hypothetical protein MYCFIDRAFT_25415 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 380

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 87/366 (23%), Positives = 146/366 (39%), Gaps = 68/366 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    + E+T  GG  T+   L   +L   ++  +++ STT    V+   G 
Sbjct: 23  VKAFDAFPKTKPSYQERTSTGGIWTVTLILASLFLTWSELARWWKGSTTHTFSVEQGIGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I+LD+VV  ++C+ L ++  D++G++              L G   Q+         
Sbjct: 83  DLQINLDMVV-MMNCEDLHVNVQDAAGDR-------------ILAGSVFQKDPTIWTRWD 128

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           KK K    +     + E   + G  Y  E       N  +    + R+ K          
Sbjct: 129 KKLKA---HALGHDKQERLGEAGKDYKEE----DVHNYLSVAHHSKRFPK---------- 171

Query: 187 QCKNEYSTEKLKNTFT-EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
                  T K+   +T + C+IYG +  N+V G FHI      +  H ++   +    + 
Sbjct: 172 -------TPKIPRGWTADSCRIYGTMHGNKVQGDFHIT-----ARGHGYLEFAEHLDHSK 219

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H I  LSFG           PLD T A  +     F Y++ ++PT+Y   D   L 
Sbjct: 220 FNFSHRINELSFGPFYP---SLENPLDNTFATTDINYYKFQYFLSVVPTVYT-TDARALR 275

Query: 306 GGDGG--------------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
             D                      +PGIF  +++ P+ + I E+  S   L+ +I+  +
Sbjct: 276 LLDNNFVFTNQYAVTEQSRKVSENFVPGIFIKFDMEPIGLTIAEEWSSFPALFIRIVNVV 335

Query: 346 SGTYIT 351
           SG  + 
Sbjct: 336 SGLLVA 341


>gi|313220803|emb|CBY31643.1| unnamed protein product [Oikopleura dioica]
          Length = 289

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 32/189 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC   G   VN+V G+FH++          H   +QP      +  H I  LSFG  ++
Sbjct: 111 KGCIFGGTFHVNKVPGNFHVS---------THSSQVQPQNP---DMNHEIHELSFGESMK 158

Query: 263 DDDERRK----PLDGTVAKAEEGASMFNYYIKIIPTIYERL---------------DGSK 303
             +        PL+G    AE+ AS  +Y +K++PT+Y+ +               D   
Sbjct: 159 GINSNLPANFIPLNGKKTGAEKMASH-DYTLKVVPTVYQDIKKRTKFGYQFTAVYKDFVA 217

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
            G G   MP I+F YE+SP+ VK TEKSK L H  T     I GT+    ++D+++ S  
Sbjct: 218 FGHGHRVMPAIWFRYEVSPITVKYTEKSKPLYHFLTTFCAIIGGTFTVAGMIDSMIFSAH 277

Query: 364 KKISKVEIG 372
           + + K   G
Sbjct: 278 QMVKKAGEG 286



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 1/104 (0%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           ++  D + K  +D  + T  G  ++I   LFI +L+  +   + +     EL+VD  + G
Sbjct: 5   IRRFDIYRKVPKDLTQPTTAGAVISISSGLFILFLLVSEFLTFMRTDIVSELYVDDPTVG 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
            K+P+++ + +P I C +L +D  D  G   +    N  K  ++
Sbjct: 65  DKIPVNIRMSLPGIECKFLGIDIQDEHGRHEVGYLENTRKDPIN 108


>gi|313230728|emb|CBY08126.1| unnamed protein product [Oikopleura dioica]
          Length = 289

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 88/189 (46%), Gaps = 32/189 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC   G   VN+V G+FH++          H   +QP      +  H I  LSFG  ++
Sbjct: 111 KGCIFGGTFHVNKVPGNFHVS---------THSSQVQPQNP---DMNHEIHELSFGESMK 158

Query: 263 DDDERRK----PLDGTVAKAEEGASMFNYYIKIIPTIYERL---------------DGSK 303
             +        PL+G    AE+ AS  +Y +K++PT+Y+ +               D   
Sbjct: 159 GINSNLPANFIPLNGKKTGAEKMASH-DYTLKVVPTVYQDIKKRTKFGYQFTAVYKDFVA 217

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
            G G   MP I+F YE+SP+ VK TEKSK L H  T     I GT+    ++D+++ S  
Sbjct: 218 FGHGHRVMPAIWFRYEVSPITVKYTEKSKPLYHFLTTFCAIIGGTFTVAGMIDSMIFSAH 277

Query: 364 KKISKVEIG 372
           + + K   G
Sbjct: 278 QMVKKAGEG 286



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 1/104 (0%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           ++  D + K  +D  + T  G  ++I   LFI +L+  +   + +     EL+VD  + G
Sbjct: 5   IRRFDIYRKVPKDLTQPTTTGAVISISSGLFILFLLVSEFLTFMRTDIVSELYVDDPTVG 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
            K+P+++ + +P I C +L +D  D  G   +    N  K  ++
Sbjct: 65  DKIPVNIRMSLPGIECKFLGIDIQDEHGRHEVGYLENTRKDPIN 108


>gi|167523643|ref|XP_001746158.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775429|gb|EDQ89053.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1400

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 86/355 (24%), Positives = 149/355 (41%), Gaps = 82/355 (23%)

Query: 3   FSERLKGLDAFTK--PYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
             E++K LD F K  P  D    ++ G  VTI+  L I  LI  ++  Y  V    E  V
Sbjct: 10  LQEQVKQLDVFPKVEPDMDIQTTSISGAVVTIIVGLAIVGLIFTELMYYRTVDVVYEYAV 69

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPI 115
           D+     + + +D+ +  + C+   +D +D SG      Q + VE   +K         +
Sbjct: 70  DTDLDPHMNLTVDMTI-AMPCENFGVDYIDVSGRSTDALQFMAVEPAHFK---------L 119

Query: 116 QEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK 175
              Q+E ++  +  +V  + G+    L+  ++    YG++ E                  
Sbjct: 120 SPNQQEWLD--QWAEVKAQEGSKG--LDSLHRF--LYGSKREPMPT-------------- 159

Query: 176 KWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLS--YSINHV 233
             A PE+D                   +GC+++G + V RVS +FH + G S  ++  H 
Sbjct: 160 --AAPEIDA----------------EPDGCRVHGTMPVARVSSNFHFSAGKSVHHASGHA 201

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR--KPLDGTVAKAEEGASMFNYYIKI 291
           HV  I P      N +H I   SF        E+R    LDG +  ++    +F Y++K+
Sbjct: 202 HV-PIDP-NQKTINFSHRIDRFSF------SSEQRGAMALDGDMKVSDSNKQLFQYFLKV 253

Query: 292 IPTIYERLDGSK---------------LGGGDGGMPGIFFSYELSPLMVKITEKS 331
           +PT  +R+D ++               L   +  +PGI F YE+ P+ V + E++
Sbjct: 254 VPTTTKRMDEAEPFRSNQYSVTEQHHILAANERKLPGIHFKYEIEPIGVLVHEQA 308


>gi|358388143|gb|EHK25737.1| hypothetical protein TRIVIDRAFT_33251 [Trichoderma virens Gv29-8]
          Length = 370

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 145/361 (40%), Gaps = 63/361 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  KT  GG  T+   L  S  +  ++  +++ +      V+   G 
Sbjct: 21  VSAFDAFPKSKPQYVTKTSGGGKWTVAMLLISSIFLWTEIGRWWRGAEHHTFAVEKGIGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL---HVEHNIYKRRLDLDGKPIQEPQKEVV 123
            + ++LDIVV  + CD L ++  D+SG++ L    +  +       +DGK +    K   
Sbjct: 81  DMQVNLDIVV-KMDCDDLHINVQDASGDRILAGDKLNRDATTWHQWVDGKGMHRLGK--- 136

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYK-KWALPEL 182
                    +ENG   T        G  + A  +        +++    R K KWA    
Sbjct: 137 ---------SENGKLDT--------GEGWLAAHDEGFGEEHVHDIVALSRKKAKWA---- 175

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
                      T   K    + C++YG L++NRV G FHI A G  Y   H+  HD    
Sbjct: 176 ----------KTPSPKGR-PDSCRMYGSLDLNRVQGDFHITARGHGYGGQHLD-HD---- 219

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---ER 298
               FN +H I  +S+G           PLD TV  A      F YY+ ++PT+Y    R
Sbjct: 220 ---KFNFSHIISEMSYGPFYP---SLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYLANNR 273

Query: 299 LDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           +  +             D  +PGIFF Y++ P+M+ + E          KI+   SG  +
Sbjct: 274 IVNTNQYAVTEQSKTISDHQVPGIFFKYDIEPIMLSVEESRDGFFTFLVKIVNIFSGVMV 333

Query: 351 T 351
            
Sbjct: 334 A 334


>gi|57208596|emb|CAI42845.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 129

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 69/129 (53%), Gaps = 23/129 (17%)

Query: 270 PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-------------------KLGGG--- 307
           PLD T   A + + MF Y++K++PT+Y ++DG                    K+  G   
Sbjct: 1   PLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAPLPPQVLRTNQFSVTRHEKVANGLLG 60

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           D G+PG+F  YELSP+MVK+TEK +S  H  T +   I G +    L+D+L++   + I 
Sbjct: 61  DQGLPGVFVLYELSPMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIYHSARAIQ 120

Query: 368 -KVEIGGKT 375
            K+++G  T
Sbjct: 121 KKIDLGKTT 129


>gi|328700149|ref|XP_003241164.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Acyrthosiphon pisum]
 gi|328700151|ref|XP_001951220.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 1 [Acyrthosiphon pisum]
 gi|328700153|ref|XP_003241165.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 3 [Acyrthosiphon pisum]
          Length = 289

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 74/292 (25%), Positives = 123/292 (42%), Gaps = 48/292 (16%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAV-TIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +K LD+F K  E+ +E + Y   + T++  +F  +L+  ++  + Q         D+   
Sbjct: 14  VKELDSFPKVQEEIYEPSTYSNVILTVLISVFGLWLLISEIQYFLQEHYIYRFVPDTDYE 73

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
           SKLPI++DI V + +CD +  D VD++G+                               
Sbjct: 74  SKLPINIDITVAS-TCDSIGADIVDTTGQ------------------------------- 101

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNE-VKEAYRYKKWALPELDT 184
                    N     EL+  +        + +  +     N  ++E Y   K  L   D 
Sbjct: 102 ---------NMMLFGELKTDDTWWEMTKEQQQHFEKMRKFNAYLREEYHSMKDILWMFDD 152

Query: 185 IVQCKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP-YT 242
               KN+      K NT  + C+I+G L +N+V G+FHI PG S  +   HVH   P + 
Sbjct: 153 YNTLKNKIFVRTDKPNTLPDACRIHGSLILNKVIGNFHITPGKSLIVPGGHVHLTGPFFG 212

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           S A N +H I   SFG+  +       PL+G + +  E A  + Y+I ++ T
Sbjct: 213 SEATNFSHRINQFSFGVPTKG---IIYPLEGELYETNENAVSYKYFIDVVAT 261


>gi|452847826|gb|EME49758.1| hypothetical protein DOTSEDRAFT_58941 [Dothistroma septosporum
           NZE10]
          Length = 402

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 150/395 (37%), Gaps = 97/395 (24%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S  +K  DAF K    + ++T  GG  T+V  +    L   ++  ++   TT    V+  
Sbjct: 20  SSVVKSFDAFPKTKPSYTQRTESGGVWTVVLIVASLLLGWSEISGWWTGKTTHTFAVEQG 79

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
            G  L I+LD+VV  + C  L ++  DSSG++ L                          
Sbjct: 80  VGHDLQINLDVVV-AMQCGDLHVNVQDSSGDRIL------------------------AG 114

Query: 124 NAVKKKKVTTEN-GTTTTEL--EDPNKCGSCY---GAETETRKCCNTCNEVKEAYRYKKW 177
           +A+KK   T    G  +  L  E   +  S Y   GAE E     N     K   ++KK 
Sbjct: 115 SALKKDPTTWRQWGGRSHALASEKEERIRSGYDGKGAEYEEEDVHNYLGAAKRQKKFKK- 173

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
             P L    Q               + C+IYG +  N+V G FHI A G  Y     H+ 
Sbjct: 174 -TPGLPWGAQA--------------DSCRIYGSMHGNKVQGDFHITARGHGYMEFGAHL- 217

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
                  + FN +H +  LSFG           PLD TVA   +    F YY+ ++PTIY
Sbjct: 218 -----DHSTFNFSHTVNELSFGPFYP---SLTNPLDNTVATTPDHFYKFQYYLSVVPTIY 269

Query: 297 -------ERLDG---SKLGGGDG------------------------------GMPGIFF 316
                   ++D    S   G DG                               +PG+F 
Sbjct: 270 TTDAKTLRKIDKHHESPSSGEDGLSQYPHRYSRNTVFTNQYAVTEQSHRVPENAVPGVFI 329

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
            +++ P+ + I E+  S+  L  +++  +SG  + 
Sbjct: 330 KFDIEPIGLTIAEEWSSIPALLIRLVNVVSGLLVA 364


>gi|331239265|ref|XP_003332286.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|309311276|gb|EFP87867.1| hypothetical protein PGTG_14582 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 366

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 156/377 (41%), Gaps = 78/377 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K   ++ +++  GG +T+     I  LI  ++ +Y          VD S   
Sbjct: 15  IREFDAFPKTLPNYKQRSSRGGVLTVFVACLILVLIWHELKEYLFGEPKYSFLVDPSIAH 74

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+ V  + C YL++D  D+ G++ +++     K     D          + +A 
Sbjct: 75  SLGINIDLTV-AMPCHYLSVDIKDAVGDR-MYMNQEFKKEGTHFD----------IGDAK 122

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +     + +  + T++   +K G  +G   +TR                   +P+     
Sbjct: 123 RIDHNNSTSELSATQILHASKKGQTFG---KTRPL-----------------VPD----- 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                             C+IYG  +V +V+G+ HI      ++ H ++   +       
Sbjct: 158 ---------------GPACRIYGNTQVKKVTGNLHIT-----TLGHGYL-SWEHTDHKLM 196

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-ERL------ 299
           N +H I   SFG   Q   +  +PLD +V   ++   +F Y+I ++PT Y +RL      
Sbjct: 197 NLSHVITEFSFG---QFFPKIVQPLDNSVELTDKPFHIFQYFISVVPTTYIDRLGRQLHT 253

Query: 300 ------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI--- 350
                 D S+      G+PG+FF Y++ P+ + + E++ SL     ++   I G  +   
Sbjct: 254 NQYSVTDMSRPVEHGQGIPGLFFKYDMEPMSLILHERTTSLIQFLVRLAGMIGGIVVCTG 313

Query: 351 -TFMLVDALLHSCVKKI 366
            TF LVD  +   V  I
Sbjct: 314 WTFRLVDRFVQKIVPGI 330


>gi|254572003|ref|XP_002493111.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv46p [Komagataella pastoris GS115]
 gi|238032909|emb|CAY70932.1| Protein localized to COPII-coated vesicles, forms a complex with
           Erv46p [Komagataella pastoris GS115]
          Length = 333

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 152/393 (38%), Gaps = 84/393 (21%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M +   ++  DAF K       ++  G   TI+   FI +LI V++  Y       +  +
Sbjct: 1   MDYHRTIRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFML 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +    L I+LD+ V T  C+YL  +  D + ++ L  E      +L+ +G     P  
Sbjct: 61  DRNIQRVLNINLDMFVAT-PCNYLHTNVKDITQDRFLAQE------QLNFEGVNFFIPDS 113

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
             VN         E+  +T +L++                              ++ AL 
Sbjct: 114 FRVNG-------DESQGSTLDLDEV----------------------------MRESALA 138

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
           E              + K +FT G    C I+G + VN+V G FHI   G  Y       
Sbjct: 139 EF-------------REKKSFTHGDAPACHIFGSIPVNKVHGFFHITGKGYGY------- 178

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
            D       A N TH I   SFG   +       PLD T     +    FNYY+ ++PT 
Sbjct: 179 RDRSIVPKEALNFTHVISEFSFG---EFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTE 235

Query: 296 YERL----DGSKLG------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           Y++L    D ++         G    PG+FF+Y+  P+++ I EK  S      +++   
Sbjct: 236 YKKLGIVIDTTQYSMTVTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTIC 295

Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            G     M+V   +   V K+ +V  G +   +
Sbjct: 296 GGI----MVVAKWIFRTVDKLIRVVFGNQVANR 324


>gi|213512030|ref|NP_001133523.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
 gi|209154344|gb|ACI33404.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
          Length = 381

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/171 (30%), Positives = 81/171 (47%), Gaps = 18/171 (10%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
            C+I+G+L VN+V+G+FHI  G +      H H     +   +N +H I HLSFG   ++
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFG---EE 225

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
                 PLDGT     +   MF Y+I I+PT               + ER        G 
Sbjct: 226 IPGIINPLDGTEKVCTDHNQMFQYFITIVPTKLNTYQISADTNQYSVTERERVINHAVGS 285

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
            G+ GIF  Y++S LMVK+TE+   L     ++   I G + T  ++  ++
Sbjct: 286 HGVSGIFMKYDISSLMVKVTEQHMPLWRFLVRLCGIIGGIFSTTGMIHGMV 336



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + E T  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 14 VKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKDFSS 73

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI V  + C ++  D +D
Sbjct: 74 KLRINIDITV-AMRCQFVGADVLD 96


>gi|405968654|gb|EKC33703.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Crassostrea gigas]
          Length = 345

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/176 (30%), Positives = 82/176 (46%), Gaps = 24/176 (13%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSI-NHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           + C++YG LEVN+V+G+FHI  G S  +    H H         +N +H I H SFG  +
Sbjct: 122 DACRVYGSLEVNKVAGNFHITAGKSVPVFPRGHAHISMMVHEKEYNFSHRIDHFSFGESV 181

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------------IYERLDGSKLG 305
           +       PLDG    + +   +FNY+IKI+PT                + +R       
Sbjct: 182 KG---IINPLDGEEQVSSDNFHVFNYFIKIVPTEVRTYAAGNIDTYQFSVTQRNRTINHS 238

Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
            G  G+PGIF  Y+L+ L +++ EK +       + +C I G       V  +LH+
Sbjct: 239 KGSHGVPGIFVKYDLNALKIRVVEKHRPFSQFLIR-LCGIVG---GIFAVSGMLHN 290


>gi|145235453|ref|XP_001390375.1| COPII-coated vesicle protein (Erv41) [Aspergillus niger CBS 513.88]
 gi|134058058|emb|CAK38286.1| unnamed protein product [Aspergillus niger]
 gi|350632895|gb|EHA21262.1| hypothetical protein ASPNIDRAFT_191708 [Aspergillus niger ATCC
           1015]
          Length = 399

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 145/381 (38%), Gaps = 77/381 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   +  GG  T++  +  +     +   +   S      V+   G 
Sbjct: 24  LKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLHGSENHHFSVEKGVGH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LD+VV  + CD L ++  D+SG++              L G  +Q  +      +
Sbjct: 84  DLQLNLDLVV-RMPCDTLDVNIQDASGDR-------------ILAGDLLQRERTSWKLWM 129

Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
            K+   T  G     T   ED ++      A        +   EV++  R K    P L 
Sbjct: 130 DKRNRETSGGVHEYQTLSQEDTDRIS----AREADAHVHHVLGEVRKNPRRKFAKGPRLR 185

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
                         +    + C+IYG LE N+V G FHI A G  Y     H+       
Sbjct: 186 --------------RGDTVDSCRIYGSLEGNKVQGDFHITARGHGYRNFGEHL------D 225

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER---- 298
              FN +H +  LSFG           PLD T+A  E     + Y++ ++PT+Y +    
Sbjct: 226 HGVFNFSHMVTELSFGPHYP---TLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGASA 282

Query: 299 LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
           LD                             ++L      +PGIFF Y + P+++ I+E+
Sbjct: 283 LDTYTNHPDLIATNRNRNLVFTNQYAATTQATELPENPYFIPGIFFKYNIEPILLMISEE 342

Query: 331 SKSLGHLWTKIMCNISGTYIT 351
             S   L  +++  +SG  +T
Sbjct: 343 RTSFLSLLIRLVNTVSGVMVT 363


>gi|320591987|gb|EFX04426.1| copii-coated vesicle protein [Grosmannia clavigera kw1407]
          Length = 385

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 88/368 (23%), Positives = 151/368 (41%), Gaps = 67/368 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K    + ++T  GG  T+   +    L   ++  ++  S      V    G 
Sbjct: 26  VQAFDAFPKAKPQYVQRTAGGGKWTVAMIVVSLLLFWTELRRWWAGSQEHTFAVAKGVGH 85

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
            + I++DIVV  + CD L ++  D++G++      L  +   + + +D  G         
Sbjct: 86  SMQINMDIVV-KMRCDDLHINVQDAAGDRIMAAAKLQRDATTWAQWVDHGGN------HR 138

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
           +    + + +T E  TT      P++ G  +G E       +  + V    R  +W    
Sbjct: 139 LGRDTQGRMITGEGWTTL-----PHEEG--FGEE-------HVHDIVALGRRKARWG--- 181

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
                       T +L+    + C+I+G L++NRV G +HI A G  Y    + + D   
Sbjct: 182 -----------KTPRLRGAAPDSCRIFGSLDLNRVQGDYHITARGHGY----MEMGDHLD 226

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-RL 299
           +TS  FN +H +  LSFG           PLD TV +A      F Y++ I+PT+Y    
Sbjct: 227 HTS--FNFSHVVNELSFGPFYP---SLVNPLDQTVNEATANFYRFQYFMSIVPTVYSVGH 281

Query: 300 DGSKLGGG----------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            GS+                       +PGIFF Y++ P+++ I E          KI+ 
Sbjct: 282 AGSRSARSIVTNQYAVTEQSAEIDQRAIPGIFFKYDIEPILLYIEESRDGFLVFVLKIVN 341

Query: 344 NISGTYIT 351
            +SG  + 
Sbjct: 342 VLSGALVA 349


>gi|392577310|gb|EIW70439.1| hypothetical protein TREMEDRAFT_43159 [Tremella mesenterica DSM
           1558]
          Length = 435

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/379 (22%), Positives = 152/379 (40%), Gaps = 55/379 (14%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  ++  G  +T +    I  L+  D+ +Y   +      VD     
Sbjct: 33  IKSFDAFPKVQSTYTSQSRRGAVLTALVGFIIFLLVLNDLGEYLYGAPDYTFDVDQQLQK 92

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D+ V  + C +L++D  D+ G++ LH+                        +  
Sbjct: 93  DLQLNVDLTV-AMPCHFLSIDLRDAVGDR-LHLS-----------------------DGF 127

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            K+  T   G   T    P    S     + +R+   T        R    + P+  T  
Sbjct: 128 TKEGTTFAVGKAVTSKTHPTPI-SASQVISSSRRRTPTQQRSFSGIRRLLSSRPKRRTRK 186

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                 +  K  N     C+IYG +EV +V+ + HI      ++ H ++   +    A  
Sbjct: 187 HAMFRPTPNKADNG--PACRIYGSVEVKKVTANLHIT-----TLGHGYM-SFEHTDHALM 238

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
           N +H +   SFG       +   PLD T+  ++   +   Y+++++PT Y   +G KL  
Sbjct: 239 NLSHVVHEFSFGPFFPAIAQ---PLDMTMQVSDNPFTAIQYFLRVVPTTYIDANGRKLVT 295

Query: 307 GD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG---TYI 350
                           G+PGIFF Y+L  + V + E++ SL H   +++  I G   T  
Sbjct: 296 SQYAVTDYLRSFQHGQGVPGIFFKYDLEAMAVTVRERTTSLYHFVIRLIGVIVGGVWTVA 355

Query: 351 TFMLVDALLHSCVKKISKV 369
           ++ L   +L+   K+ +KV
Sbjct: 356 SYAL--RVLNRAEKQFTKV 372


>gi|328352874|emb|CCA39272.1| Peroxisomal membrane protein PEX28 [Komagataella pastoris CBS 7435]
          Length = 849

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 95/393 (24%), Positives = 154/393 (39%), Gaps = 84/393 (21%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M +   ++  DAF K       ++  G   TI+   FI +LI V++  Y       +  +
Sbjct: 517 MDYHRTIRVFDAFPKTEPVNTVRSTKGSYSTILMGFFILFLIWVEIGGYVDGYIDRQFML 576

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           D +    L I+LD+ V T  C+YL  +  D + ++ L  E      +L+ +G     P  
Sbjct: 577 DRNIQRVLNINLDMFVAT-PCNYLHTNVKDITQDRFLAQE------QLNFEGVNFFIPDS 629

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
             VN         E+  +T +L++                       ++E+      AL 
Sbjct: 630 FRVNG-------DESQGSTLDLDE----------------------VMRES------ALA 654

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEG----CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
           E              + K +FT G    C I+G + VN+V G FHI   G  Y       
Sbjct: 655 EF-------------REKKSFTHGDAPACHIFGSIPVNKVHGFFHITGKGYGY------- 694

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
            D       A N TH I   SFG   +       PLD T     +    FNYY+ ++PT 
Sbjct: 695 RDRSIVPKEALNFTHVISEFSFG---EFYPYMNNPLDFTARTTNDHIHTFNYYLDVVPTE 751

Query: 296 YERL----DGSKLG------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           Y++L    D ++         G    PG+FF+Y+  P+++ I EK  S      +++   
Sbjct: 752 YKKLGIVIDTTQYSMTVTELPGLSRPPGLFFNYQFEPIILSIEEKRISFVRFLVRLVTIC 811

Query: 346 SGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            G     M+V   +   V K+ +V  G +   +
Sbjct: 812 GG----IMVVAKWIFRTVDKLIRVVFGNQVANR 840


>gi|146163751|ref|XP_001012240.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila]
 gi|146145943|gb|EAR91995.2| hypothetical protein TTHERM_00103890 [Tetrahymena thermophila
           SB210]
          Length = 331

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 95/399 (23%), Positives = 145/399 (36%), Gaps = 107/399 (26%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++GLD F K  +D    T  GG  +I+ ++    L   ++ DY       ++ V      
Sbjct: 1   MRGLDFFQKVNQDIDTSTATGGVYSIIAFVVGFILFWNELKDYRTDQMIYKMRVQQLEVE 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            +  ++D+ +    C  LALD  D  G   L     I K R+  DG              
Sbjct: 61  SVKANIDLHIYGSPCTLLALDLQDEVGNHTLDYTDTIKKIRVLKDG-------------- 106

Query: 127 KKKKVTTENGTTTTELE------DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALP 180
                        TELE      +PN  GS                E+ EA         
Sbjct: 107 -------------TELESGFGDGNPNYRGSS--------------QEIDEA--------- 130

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
               I    NE           EGC+I GY+ + +V G+FHI+      + +  +   +P
Sbjct: 131 ----IDAVNNE-----------EGCRINGYINLKKVPGNFHISYHAKMDVMN-RIASTKP 174

Query: 241 YTSAAFNTTHHIRHLSFG---------------IKLQDDDERRKPLDGTVAKAEEGASMF 285
            T +  N  + I HL FG                  Q+ +    P D T      G + +
Sbjct: 175 DTYSKINLNYKINHLGFGENTNHMATIFKIMGRTLFQETNTNDYPHDDT-KYINPGKNDY 233

Query: 286 NYYIKIIPTIYERLDGSKL----------------GGGDGGMPGIFFSYELSPLMVKITE 329
           + Y+KI+P    R D +KL                      +P IFF YE+SP+ V  + 
Sbjct: 234 DNYLKILPC---RYDSNKLHMSVSRYKYAMYSTHTPKSSTEIPTIFFRYEISPINVYYST 290

Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           KSKS  H   +I   + G +    + ++L    + KISK
Sbjct: 291 KSKSFYHFLVQIFAIVGGIFAVMGIFNSLTTGVISKISK 329


>gi|392297516|gb|EIW08616.1| Erv41p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 352

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 90/354 (25%), Positives = 147/354 (41%), Gaps = 66/354 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E + +K+  GG  +++ +LF+ ++   +  +YF     ++  VDS    
Sbjct: 4   LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V T  CD+L ++  D + ++ L +E    +        P   P    VN +
Sbjct: 64  TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T EL++    G    A  E R+  +T +   E+    K  LPE +   
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                            GC I+G + VNRVSG   I   ++ S+ +V      P     F
Sbjct: 161 -----------------GCHIFGSIPVNRVSGELQI---IAKSLGYVASRK-APLEELKF 199

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DG 301
           N  H I   SFG      D    PLD T     +E  + + YY  ++PT++++L    D 
Sbjct: 200 N--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDT 254

Query: 302 SKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           ++    D              MPGIFF Y   PL + +++   S      +++ 
Sbjct: 255 NQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308


>gi|358374656|dbj|GAA91246.1| COPII-coated vesicle protein [Aspergillus kawachii IFO 4308]
          Length = 399

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 144/381 (37%), Gaps = 77/381 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   +  GG  T++  +  +     +   +   S      V+   G 
Sbjct: 24  LKTFDAFPKTKPSYTAPSRRGGQWTVLILVICTVFTFSEFRTWLNGSENHHFSVEKGVGH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LD+VV  + CD L ++  D+SG++              L G  +Q  +      +
Sbjct: 84  DLQLNLDLVV-RMPCDTLDVNIQDASGDR-------------ILAGDLLQRERTSWKLWM 129

Query: 127 KKKKVTTENGT---TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
            K+   T  G     T   ED ++      A        +   EV++  R K    P L 
Sbjct: 130 DKRNRETSGGVHEYQTLSQEDSDRIS----AREADAHVHHVLGEVRKNPRRKFAKGPRLR 185

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
                         +    + C+IYG LE N+V G FHI A G  Y     H+       
Sbjct: 186 --------------RGDTVDSCRIYGSLEGNKVQGDFHITARGHGYRNFGEHL------D 225

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER---- 298
              FN +H +  LSFG           PLD T+A  E     + Y++ ++PT+Y +    
Sbjct: 226 HGVFNFSHMVTELSFGPHYP---TLLNPLDKTIATTETHYYKYQYFLSVVPTLYSKGASA 282

Query: 299 LD----------------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
           LD                              +L      +PGIFF Y + P+++ I+E+
Sbjct: 283 LDTYTNHPDLIATNRNRNLVFTNQYAATTQAQELPENPYFIPGIFFKYNIEPILLMISEE 342

Query: 331 SKSLGHLWTKIMCNISGTYIT 351
             S   L  +++  +SG  +T
Sbjct: 343 RTSFLSLLIRLVNTVSGVMVT 363


>gi|297262047|ref|XP_001105686.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like isoform 2 [Macaca mulatta]
          Length = 374

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+I+G+L VN+V+G+FHI  G +      H H        ++N +H I HLSFG  + 
Sbjct: 165 DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHESYNFSHRIDHLSFGELVP 224

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
                  PLDGT   A +   MF Y+I ++PT               + ER        G
Sbjct: 225 ---AIINPLDGTEKIAIDHNQMFQYFITVVPTKLHTYKISADTHQFSVTERERIINHAAG 281

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
             G+ GIF  Y+LS LMV +TE+       + ++   + G + T      +LH   K I 
Sbjct: 282 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST----TGMLHGIGKFIV 337

Query: 368 KV 369
           ++
Sbjct: 338 EI 339


>gi|322697212|gb|EFY88994.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium acridum
           CQMa 102]
          Length = 372

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 85/359 (23%), Positives = 146/359 (40%), Gaps = 57/359 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K   ++  +T  GG  T+   +   +L+  ++  +++ S +    V+     
Sbjct: 21  VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGSESHTFAVEKGISH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQKEVVN 124
            + I+LD V+  + C  L ++  D++G++ L         +L++D     +   QK V  
Sbjct: 81  SMQINLDTVI-LMKCGDLHINVQDAAGDRIL------AGAKLNMDETSWSQWVNQKGVHK 133

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             +  +     G     L+D       +G E       +  + V    R  KWA      
Sbjct: 134 LGRDSEGRVVTGAGWQNLDDEG-----FGEE-------HVHDIVALGQRRAKWA------ 175

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
                    T ++K    + C+IYG L++N+V G FHI A G  Y     H+   Q    
Sbjct: 176 --------KTPRVKGP-PDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHLDHSQ---- 222

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
             FN +H I  LSFG           PLD T+  AE     F YY+ ++PT Y     S 
Sbjct: 223 --FNFSHIISELSFGSYYP---SLVNPLDRTINIAENHFHKFQYYVSVVPTRYSVGSSSI 277

Query: 304 L-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       G  +  +PGIF  Y++ P+++ + E    +     K++  +SG  + 
Sbjct: 278 FTNQYAVTEQSKGVSEYNVPGIFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336


>gi|154343635|ref|XP_001567763.1| hypothetical protein, unknown function [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134065095|emb|CAM43209.1| hypothetical protein, unknown function [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 309

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 53/184 (28%), Positives = 86/184 (46%), Gaps = 22/184 (11%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---I 259
           EGC++ GY++V +V G+FHI+       +H   H +  +     N  H I HLSFG   +
Sbjct: 131 EGCRLEGYIKVGKVPGNFHIS-------SHGRQHLLMTHFPNGTNAEHSIHHLSFGTLDV 183

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE---------RLDGSKLGGGDGG 310
           K  D   +  PLDG   ++E    ++ Y++ I+PTIYE         +  G+        
Sbjct: 184 KKLDKKAQLHPLDGKEHRSEV-PKIYQYFLDIVPTIYESSFSTAHTYQFTGTSSSSPVPS 242

Query: 311 --MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
             M  + F Y++SP+ V+ +    SL H  T +   I G Y    L+   +HS   +  +
Sbjct: 243 SQMAAVVFQYQMSPITVRYSSARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQFQR 302

Query: 369 VEIG 372
             +G
Sbjct: 303 RILG 306



 Score = 47.0 bits (110), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 46/109 (42%), Gaps = 2/109 (1%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
           D F    +D  E T  G  ++I C   +  L   +V  Y       ++ +  D    S +
Sbjct: 11  DFFRHIPKDLTESTTSGAIISIACVTVMVLLFVGEVISYVSPRIQSDMIILPDLDETSTI 70

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
            + +DI  P + C  L LD +D       +   +I + RLD  GKPI +
Sbjct: 71  KVSMDITFPKMPCAILTLDILDVLHNHMFNSMDHITRTRLDPAGKPISD 119


>gi|70988875|ref|XP_749289.1| COPII-coated vesicle protein (Erv41) [Aspergillus fumigatus Af293]
 gi|66846920|gb|EAL87251.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           fumigatus Af293]
 gi|159128703|gb|EDP53817.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           fumigatus A1163]
          Length = 379

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 71/371 (19%)

Query: 12  AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
           A TKP   +   +  GG  T++  L  ++L   +   + + +  +   V+      L ++
Sbjct: 13  AKTKP--SYTAPSPRGGQWTVLVLLVCTFLSISEFRTWLKGTEKQHFSVEKGISHDLQLN 70

Query: 72  LDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKV 131
           LDIVV  +SCD L ++  D+SG++ L  +  + KR          EP    +   K+   
Sbjct: 71  LDIVV-HMSCDMLDVNIQDASGDRILAGQ--LLKR----------EPTSWQLWMDKRNYE 117

Query: 132 TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
           T                G  +  +T +++  +  +E +EA  +    L E+    + K  
Sbjct: 118 T---------------YGGAHEYQTLSQEHADRLSE-QEADAHVHHVLGEVRRNPRKKFA 161

Query: 192 YSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTH 250
              +  +    + C+IYG LE N+V G FHI A G  Y  N  H+          FN +H
Sbjct: 162 KGPKLRRGDAVDSCRIYGSLEGNKVQGDFHITARGHGYHNNAPHLEH------KTFNFSH 215

Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LD------ 300
            I  LSFG           PLD T+A  E+    + Y++ I+PTIY +    LD      
Sbjct: 216 MITELSFGPHY---PTLLNPLDKTIATTEDHYYKYQYFLSIVPTIYSKGNLALDTYANAP 272

Query: 301 --------------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
                                S +      +PG+FF Y + P+++ I+E+  S   L  +
Sbjct: 273 PSNRRGKNLVFTNQYAVTSQSSVIPESPYFIPGLFFKYNIEPILLLISEERTSFLSLLVR 332

Query: 341 IMCNISGTYIT 351
           ++  +SG  +T
Sbjct: 333 LVNTVSGVMVT 343


>gi|74189495|dbj|BAE22750.1| unnamed protein product [Mus musculus]
          Length = 303

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 56/182 (30%), Positives = 84/182 (46%), Gaps = 22/182 (12%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+I+G+L VN+V+G+FHI  G +      H H        ++N +H I HLSFG  + 
Sbjct: 94  DACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSYNFSHRIDHLSFGELVP 153

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
                  PLDGT   A +   MF Y+I ++PT               + ER        G
Sbjct: 154 G---IINPLDGTEKIAVDHNQMFQYFITVVPTKLHTYKISADTHQFSVTERERIINHAAG 210

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
             G+ GIF  Y+LS LMV +TE+       + ++   I G + T      +LH   K I 
Sbjct: 211 SHGVSGIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIIGGIFST----TGMLHGIGKFIV 266

Query: 368 KV 369
           ++
Sbjct: 267 EI 268


>gi|325184531|emb|CCA19024.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 466

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 58/190 (30%), Positives = 86/190 (45%), Gaps = 30/190 (15%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           N   EGCQ+YG+L V RV G+FHI         H+  H      S+  N +H +  L FG
Sbjct: 284 NAGPEGCQLYGHLIVKRVPGNFHI---------HLS-HPFYSMNSSLVNASHTVNELWFG 333

Query: 259 IKLQDDDERRKP----LDG-TVAKAEEGASMFNY----YIKIIPTIYERLDGSKLGG--- 306
             L      + P    LD   +A+ E  A M NY    YIK++   Y + +G  +     
Sbjct: 334 EVLSASALAKLPPNTRLDSHRLARQEFTAYMQNYTYVHYIKVVTNTYVQRNGEVISAYRY 393

Query: 307 --------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDAL 358
                       +P + F Y+LSP+ V+ITE+S    H  T     I G +    ++D L
Sbjct: 394 TAHSNEYLETEDLPSVMFRYDLSPMSVRITERSMPFYHFVTSACAIIGGVFTVIGIIDQL 453

Query: 359 LHSCVKKISK 368
           +H  V+ ++K
Sbjct: 454 VHQTVRAMNK 463



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/118 (24%), Positives = 58/118 (49%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K  ED    T+ G +++IV    +  L  ++   Y  V+   ++ +D     
Sbjct: 7   LKKWDFYKKIPEDLTVSTLPGVSLSIVGCFIMLILFILEFNAYLSVNHAYDIVIDEGLDE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           K  I+ +I +P + C++ ++D  D +G +  ++  N+ K R+D  G+ +     EV +
Sbjct: 67  KFEINFNITIPDLPCEFASIDVSDMTGTRKHNMTKNVSKFRIDTKGRLVGFASDEVTH 124


>gi|346322712|gb|EGX92310.1| COPII-coated vesicle protein (Erv41), putative [Cordyceps militaris
           CM01]
          Length = 376

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 149/359 (41%), Gaps = 54/359 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISY-LICVDVCDYFQVSTTEELFVDSSRG 65
           +   DAF K   ++  +T  GG  T+V  +FIS  L+  +V  +++ S T    V+    
Sbjct: 21  VSAFDAFPKSKPEYVTRTAGGGKWTVVI-VFISLVLMGSEVGRWWRGSETHNFAVEKGIS 79

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + I+LDIVV  + C+ L ++  D+SG++ L    ++  R   +    + +     +  
Sbjct: 80  HDMQINLDIVVHML-CNDLHINVQDASGDRILAA--SMLHRDPTMWSHWVDQAGVHKLGH 136

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
               +V T  G T+    D       +G E       +  + V    +  KW+       
Sbjct: 137 DANGRVNTGEGWTSLAHNDEG-----FGEE-------HVHDIVALGKKRAKWS------- 177

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
                   T +   T  + C++YG L++N+V G FHI A G  Y     H+   Q     
Sbjct: 178 -------KTPRFWGT-ADSCRVYGSLDLNKVQGDFHITARGHGYMEFGQHLDHNQ----- 224

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------- 297
            FN +H I  LS+G           PLD TV  A      F YY+ ++PTIY        
Sbjct: 225 -FNFSHVISELSYGAFYP---SLVNPLDRTVNLAAAHFHKFQYYLSVVPTIYSVGSSTIQ 280

Query: 298 -----RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                  + SK       +PGIF  Y++ P+++ + E   S      K++  +SG  + 
Sbjct: 281 TNQYAVTEQSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFPVFLLKLINIVSGVLVA 339


>gi|296821254|ref|XP_002850059.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
 gi|238837613|gb|EEQ27275.1| ER-derived vesicles protein ERV41 [Arthroderma otae CBS 113480]
          Length = 399

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 85/388 (21%), Positives = 154/388 (39%), Gaps = 83/388 (21%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           + +LK  DAF K    +   +  GG  T+   +  + L C ++  +++        V+  
Sbjct: 21  AAKLKTFDAFPKTKPSYTSTSRSGGLWTVFIAILCAILSCSELVTWYRGHENHHFSVERG 80

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
              ++ ++LD+VV  + CD + ++  D+ G+   H+          L G+ +   Q+   
Sbjct: 81  VSQEMQLNLDVVV-AMPCDDVRINVQDAVGD---HI----------LAGELLT--QQPTS 124

Query: 124 NAVKKKKVTTENGTTTTEL-----EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWA 178
            A   ++   + G  + E      EDP +       + E     +   EV+   + K   
Sbjct: 125 WAAWNREFNRQRGGGSPEYQTLSKEDPFRLEE----QEEDLHVEHVLGEVRRGRKKKFPK 180

Query: 179 LPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHD 237
            P+L               K+   + C+++G LE N+V G+ HI A G  Y      +  
Sbjct: 181 APKLK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY------LEW 220

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
            QP    + N TH I  LSFG           PLD TV+        + Y++ ++PTIY 
Sbjct: 221 GQPTNPHSLNFTHLITELSFGPHYA---RLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYT 277

Query: 298 R-----------LDGSKLGGGDG-----------------------GMPGIFFSYELSPL 323
           +            D S +   D                         +PGIFF Y + P+
Sbjct: 278 KSGHIDPNHRSLPDPSSITAKDSKTTVSTNQYAVTSYSQPVQPRIESIPGIFFKYNIEPI 337

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
           ++ ++++  SL  L  +++  +SG  +T
Sbjct: 338 LLIVSQERDSLLALLVRLVNVVSGVLVT 365


>gi|332373256|gb|AEE61769.1| unknown [Dendroctonus ponderosae]
          Length = 382

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 84/381 (22%), Positives = 152/381 (39%), Gaps = 64/381 (16%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
            R+K +D F K  + +   +  GG  +I+ +L I +L+  ++  Y       +   D   
Sbjct: 17  NRVKKMDIFPKVEDPYKMTSSVGGTFSIISFLIIGWLVYSEISYYLNSKFVFKFSPDVQL 76

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
             KL +++DI V  + C  L  D +DS+ +       N YK                   
Sbjct: 77  EDKLDMNIDITV-AMPCSKLGTDVLDSTNQ-------NTYKFG----------------- 111

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                  T +   T  EL D  K         E +K  N+   ++E Y   K  L +   
Sbjct: 112 -------TLKQDDTWFELSDNQK------VHFEHKKHFNSY--LREEYHAIKDLLWKNSF 156

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
             Q  +    +   +   + C+IYG L +N+V+G+F I+ G  Y     +       +  
Sbjct: 157 STQFGDLPPRDHTPSRPHDACRIYGTLGLNKVAGNFLISGGKRYMFGLGYQQFRTLISEG 216

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
            +N TH I   SFG           PL+G      +  ++ NY+I+I+PT          
Sbjct: 217 EYNFTHRINRFSFG---HSSPGIVHPLEGDELILPDPMTVVNYFIEIVPTTVNTFMYTIS 273

Query: 295 --------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                   +   +D +K   G  G P I+F Y++S L V ++++   LG    ++   + 
Sbjct: 274 TYQYSVKELTRPIDHNK---GSHGTPAIYFKYDMSALRVTVSQERDHLGMFLARLCSIVG 330

Query: 347 GTYITFMLVDALLHSCVKKIS 367
           G Y+   ++++++   +  I+
Sbjct: 331 GVYVCSGILNSIVQLLLNFIT 351


>gi|242783317|ref|XP_002480163.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218720310|gb|EED19729.1| COPII-coated vesicle protein (Erv41), putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 400

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/400 (22%), Positives = 162/400 (40%), Gaps = 73/400 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K   ++   +  GG  T++     ++L   ++  +++ +  +   V+     
Sbjct: 24  LKTFDAFPKTKPNYTTPSRRGGQWTVIIIAICTFLSIGELITWYRGTENQHFSVEKGVSR 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L +++D+VV  + C+ + ++  D+SG+   H+   +   +   + +   E   +  + V
Sbjct: 84  QLQMNIDMVV-KMPCNDIRVNVQDASGD---HIMAGMLLMKDSTNWEMWNEKLNQQSSGV 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            + +  T N   T  L +  +    +   + TR             R  +   P+     
Sbjct: 140 TEYQ--TLNAEDTKRLLEQEEDMHAHHVLSHTR-------------RNPRRKFPK----- 179

Query: 187 QCKNEYSTEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
                  T +L   + T+ C+IYG LE N+V G FHI A G  Y+    H+         
Sbjct: 180 -------TPRLSAKYPTDSCRIYGSLESNKVHGDFHITARGHGYNELGEHL------DHK 226

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS-- 302
            FN TH I  LSFG           PLD TVA  E+    F Y++ ++PTIY + + +  
Sbjct: 227 TFNFTHMITELSFGPHYPS---LLNPLDKTVAYTEDHYYKFQYFLNVVPTIYAKGNNAVE 283

Query: 303 -----------------------------KLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
                                         L       PGIFF Y + P+++ ++E+  S
Sbjct: 284 KYTANPALAFKKSRNTIFTNQYSATSQSHALPENPYNTPGIFFKYNIEPILLFVSEERGS 343

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGG 373
              L  +++  +SG  +T   +  L    ++ + +   GG
Sbjct: 344 FLALLVRLVNVVSGVIVTGGWLYQLSGWAMEVLRRRRRGG 383


>gi|260800124|ref|XP_002594986.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
 gi|229280225|gb|EEN50997.1| hypothetical protein BRAFLDRAFT_128968 [Branchiostoma floridae]
          Length = 292

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/199 (29%), Positives = 90/199 (45%), Gaps = 32/199 (16%)

Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIR 253
           TEK+      GC+  G   +N+V G+FH++          H   +QP   A+ + TH + 
Sbjct: 103 TEKVPVNNGLGCRFEGRFWINKVPGNFHMS---------THSAHVQP---ASPDMTHVVH 150

Query: 254 HLSFGIKLQD--DDERR---KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK----- 303
            L FG  L     D  +    PLD          S  +Y++KI+PTI+E     K     
Sbjct: 151 DLRFGEDLAAFLPDHIKGSFNPLDEVERLHANALSSHDYFLKIVPTIFENRSDKKSFAFQ 210

Query: 304 ----------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
                      G G+  MP I+F Y+LSP+ VK T+K K   H  T I   + GT+    
Sbjct: 211 YTYAYKDYISFGHGNRVMPAIWFRYDLSPITVKYTDKRKPFYHFITTICAVVGGTFTVAG 270

Query: 354 LVDALLHSCVKKISKVEIG 372
           ++D+++ +  +   K E+G
Sbjct: 271 IIDSVIFTAAEVFKKAELG 289



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 48/93 (51%), Gaps = 2/93 (2%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--R 64
          +K  D + K  +D  + T+ G  V+I+  +FI +L+  +   +       ELFVD+S   
Sbjct: 5  VKRFDIYRKIPKDLTQPTLTGALVSILSGMFIVFLLLSEFHAFIMSDIMSELFVDNSGGG 64

Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
          G ++ + L+I +P + C+ + LD  D  G   +
Sbjct: 65 GGQISVFLNISLPRLKCEVVGLDIQDEMGRHEV 97


>gi|254579156|ref|XP_002495564.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
 gi|238938454|emb|CAR26631.1| ZYRO0B14344p [Zygosaccharomyces rouxii]
          Length = 353

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 94/381 (24%), Positives = 150/381 (39%), Gaps = 80/381 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E   +K+  GG  +I  +LF+ ++   +   YF     E   VD     
Sbjct: 4   LRSFDAFPKTDETHVKKSSNGGLSSIFTYLFLLFIAWTEFGSYFGGYVDEHYEVDDQLRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I++D+ V T  C YL ++  D++      ++     + L+L+  P   P    VN +
Sbjct: 64  TFQINMDLYVKT-PCQYLDINVRDTT------MDRKFVSKELNLEDMPFFIPYGSRVNDM 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T +L++                     N +   +R K      +DT  
Sbjct: 117 --------NEIVTPDLDN------------------VLSNAIPAQFREK------IDT-- 142

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
              N    E+ ++ F   C I+G ++VNRV+G   I A G  YS                
Sbjct: 143 ---NNMFDEEERDAFN-SCHIFGSVQVNRVAGELQITAKGHGYS-------SFMRAPPEE 191

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDGSKL 304
            + +H I  LS+G      D    PLD T     +   + F Y   I+PTIYE+L G+K+
Sbjct: 192 IDFSHVINELSYGEFYPYID---NPLDSTAKFVPDAPRTTFVYDTAIVPTIYEKL-GAKI 247

Query: 305 ------------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                             G G    PGIF  Y+  PL + I++   S      +++  +S
Sbjct: 248 DTNQYAVSEYHINPEAQQGKGPIRFPGIFLRYDFEPLSIHISDVRLSFIQFVVRLVAILS 307

Query: 347 GTYIT----FMLVDALLHSCV 363
               T    F L+D +L +C+
Sbjct: 308 FVIYTASWAFRLIDLVLLTCL 328


>gi|340507573|gb|EGR33515.1| hypothetical protein IMG5_050820 [Ichthyophthirius multifiliis]
          Length = 290

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/354 (23%), Positives = 140/354 (39%), Gaps = 102/354 (28%)

Query: 55  TEELFVDSSRG-SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
           T E+FVDS RG  K+ ++LDI  P   CD L+LD  D  G   ++VE +++K R+   G+
Sbjct: 3   TSEMFVDSLRGGQKIRVNLDIDFPKFPCDILSLDFQDIMGSHSVNVEGDLHKTRITKTGE 62

Query: 114 PIQEPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYR 173
                +++                        NK  S +  +          N+V     
Sbjct: 63  YFDRHEQQ-----------------------QNKQHSGHAHDQ--------SNQVD---- 87

Query: 174 YKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-------PGL 226
                L  +   +Q K             EGC++ G++ VNRV G+FHI+        G 
Sbjct: 88  -----LQRIQQAIQNK-------------EGCKLSGFMYVNRVPGNFHISCHAFGQILGY 129

Query: 227 SYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR-----------KPLDGTV 275
            + I  ++  D+          +H I HLSFG    D+DE +            P+D  V
Sbjct: 130 VFRITGINTIDL----------SHKINHLSFG----DEDEIKIVKKQFTLGVLNPMDKLV 175

Query: 276 AKAEE-----GASMFNYYIKIIPTIY----------ERLDGSKLGGGDGGMPGIFFSYEL 320
              ++     G S +NYY+ ++PT Y           +   ++       +P I+F Y+L
Sbjct: 176 KTKQKHFENYGIS-YNYYLNVVPTTYIDEWGYTYYVNQFVFTENQIQTDYIPAIYFRYDL 234

Query: 321 SPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGK 374
           SP+ V   +      H   ++   + G +     +D +    V ++ K   G K
Sbjct: 235 SPVTVMFKKDRMPFLHFLVQVSAIVGGIFTIAAFMDEIAFKIVIQLFKNSEGEK 288


>gi|57208595|emb|CAI42844.1| ERGIC and golgi 3 [Homo sapiens]
          Length = 156

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 72/159 (45%), Gaps = 51/159 (32%)

Query: 250 HHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS------- 302
           H+I+HLSFG   +D      PLD T   A + + MF Y++K++PT+Y ++DG        
Sbjct: 1   HYIQHLSFG---EDYPGIVNPLDHTNVTAPQASMMFQYFVKVVPTVYMKVDGEAQQERGR 57

Query: 303 KLGGGDGG-----------------------------------------MPGIFFSYELS 321
             GG DGG                                         +PG+F  YELS
Sbjct: 58  SRGGADGGWSQVLALALAQAPLPPQVLRTNQFSVTRHEKVANGLLGDQGLPGVFVLYELS 117

Query: 322 PLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH 360
           P+MVK+TEK +S  H  T +   I G +    L+D+L++
Sbjct: 118 PMMVKLTEKHRSFTHFLTGVCAIIGGMFTVAGLIDSLIY 156


>gi|403216157|emb|CCK70655.1| hypothetical protein KNAG_0E04020 [Kazachstania naganishii CBS
           8797]
          Length = 351

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 143/365 (39%), Gaps = 72/365 (19%)

Query: 18  EDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVP 77
           E + +K+  GG  +I+ +LF+ ++   +   YF     ++  VDS     + ++LD+ V 
Sbjct: 10  EQYKQKSSKGGLTSILTYLFLIFIAYSEFGSYFGGYLDQQYIVDSELREDVELNLDVFV- 68

Query: 78  TISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGT 137
            + CD++ ++  DS+       +  I    L  +  P   P    VN + +  +T E   
Sbjct: 69  HMPCDFIHVNVRDST------FDRKIVSEELKFEDMPFFIPYDTKVNDIPEI-ITPEMDE 121

Query: 138 TTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKL 197
              E        + +  + + R   +  +     +      LPE +              
Sbjct: 122 ILGE-----AIPASFREKVDMRLYYDENDPDTHHH------LPEFN-------------- 156

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS 256
                 GC I+G + VNRV G F I A GL Y        D+        N  H I   S
Sbjct: 157 ------GCHIFGSIPVNRVRGEFQITAKGLGY-------RDMNAAPKEKINFAHVINEWS 203

Query: 257 FGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGS-----------KL 304
           FG      D    PLD T     ++  + F YY+ ++PTIY++L              + 
Sbjct: 204 FGDFYPYID---NPLDATAKFDKDDPLTAFVYYLSVVPTIYQKLGAEVDTNQYSVSEYRF 260

Query: 305 GGGD------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS-GTYIT---FML 354
              D      G +PGIFF Y    L + +T++  S      +++  +S   YI    F+L
Sbjct: 261 NSTDKTFRDTGYVPGIFFRYNFESLSIVMTDRRLSFLQFIVRLVAIMSFAVYIASWIFIL 320

Query: 355 VDALL 359
            D LL
Sbjct: 321 TDTLL 325


>gi|151946097|gb|EDN64328.1| ER vesicle protein [Saccharomyces cerevisiae YJM789]
 gi|190408176|gb|EDV11441.1| hypothetical protein SCRG_01831 [Saccharomyces cerevisiae RM11-1a]
 gi|259148509|emb|CAY81754.1| Erv41p [Saccharomyces cerevisiae EC1118]
          Length = 352

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 142/355 (40%), Gaps = 68/355 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E + +K+  GG  +++ +LF+ ++   +  +YF     ++  VDS    
Sbjct: 4   LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V T  CD+L ++  D + ++ L +E    +        P   P    VN +
Sbjct: 64  TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T EL++    G    AE   +    +  +  +  R     LPE +   
Sbjct: 117 --------NEIITPELDE--ILGEAIPAEFREKLDTRSFFDESDPNRAH---LPEFN--- 160

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                            GC I+G + VNRVSG   I A  L Y  +        P     
Sbjct: 161 -----------------GCHIFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
           FN  H I   SFG      D    PLD T     +E  + + YY  ++PT++++L    D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253

Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            ++    D              MPGIFF Y   PL + +++   S      +++ 
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308


>gi|6323573|ref|NP_013644.1| Erv41p [Saccharomyces cerevisiae S288c]
 gi|2497084|sp|Q04651.1|ERV41_YEAST RecName: Full=ER-derived vesicles protein ERV41
 gi|558408|emb|CAA86254.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|285813935|tpg|DAA09830.1| TPA: Erv41p [Saccharomyces cerevisiae S288c]
          Length = 352

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 68/355 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E + +K+  GG  +++ +LF+ ++   +  +YF     ++  VDS    
Sbjct: 4   LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V T  CD+L ++  D + ++ L +E    +        P   P    VN +
Sbjct: 64  TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T EL++    G    A  E R+  +T +   E+    K  LPE +   
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                            GC ++G + VNRVSG   I A  L Y  +        P     
Sbjct: 161 -----------------GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
           FN  H I   SFG      D    PLD T     +E  + + YY  ++PT++++L    D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253

Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            ++    D              MPGIFF Y   PL + +++   S      +++ 
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308


>gi|349580221|dbj|GAA25381.1| K7_Erv41p [Saccharomyces cerevisiae Kyokai no. 7]
          Length = 352

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/355 (25%), Positives = 145/355 (40%), Gaps = 68/355 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E + +K+  GG  +++ +LF+ ++   +  +YF     ++  VDS    
Sbjct: 4   LKTFDAFPKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVRD 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V T  CD+L ++  D + ++ L +E    +        P   P    VN +
Sbjct: 64  TVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVNDI 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T EL++    G    A  E R+  +T +   E+    K  LPE +   
Sbjct: 117 --------NEIITPELDE--ILGEAIPA--EFREKLDTRSFFDES-DPNKAHLPEFN--- 160

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                            GC ++G + VNRVSG   I A  L Y  +        P     
Sbjct: 161 -----------------GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELK 198

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
           FN  H I   SFG      D    PLD T     +E  + + YY  ++PT++++L    D
Sbjct: 199 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 253

Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            ++    D              MPGIFF Y   PL + +++   S      +++ 
Sbjct: 254 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 308


>gi|121710902|ref|XP_001273067.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           clavatus NRRL 1]
 gi|119401217|gb|EAW11641.1| COPII-coated vesicle protein (Erv41), putative [Aspergillus
           clavatus NRRL 1]
          Length = 401

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 87/382 (22%), Positives = 148/382 (38%), Gaps = 76/382 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +   +  GG  T++  L  ++    +   + + +      V+     
Sbjct: 24  LKIFDAFPKTKPSYTAPSHRGGQWTVLILLICTFFSLSEFRAWLRGTEKHHFSVEKGISH 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LDIVV  + C+ L ++  D+SG++              L G+ +Q  +      +
Sbjct: 84  DLQLNLDIVV-DMPCESLDVNIQDASGDR-------------ILAGELLQRERTSWNLWM 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +K+      G    +  +  + G     + +     +   EV+   R K    P L    
Sbjct: 130 EKRNYEIHGGAHEYQTLN-QEHGDRLAEQEQDAHVHHVLGEVRRNPRKKFPRGPRLR--- 185

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS-A 244
                      +    + C+IYG LE N+V G FHI A G  Y       H   P+   +
Sbjct: 186 -----------RGDVVDSCRIYGSLEGNKVQGDFHITARGHGY-------HAAAPHLEHS 227

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LD 300
            FN +H +  LSFG           PLD T+A  EE    + Y++ ++PTIY +    LD
Sbjct: 228 TFNFSHMVTELSFGPHYPTI---LNPLDKTIATTEEHYYKYQYFLSVVPTIYSKGNLALD 284

Query: 301 G-------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
                                           + L      +PGIFF Y + P+++ I+E
Sbjct: 285 AYSGSAPTLHDPNRNRNRNLIFTNQYAATSQSTALPESPYFVPGIFFKYSIEPILLIISE 344

Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
           +  S   L  +++  +SG  +T
Sbjct: 345 ERGSFLTLLVRLVNTVSGVIVT 366


>gi|66773206|ref|NP_080631.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           isoform 2 [Mus musculus]
 gi|12854944|dbj|BAB30175.1| unnamed protein product [Mus musculus]
          Length = 302

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 44/288 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 73  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 158 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 211

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT
Sbjct: 212 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPT 256


>gi|302422316|ref|XP_003008988.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
 gi|261352134|gb|EEY14562.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Verticillium albo-atrum VaMs.102]
          Length = 374

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/392 (22%), Positives = 155/392 (39%), Gaps = 77/392 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    + ++T  GG  T+     IS ++           + E   + S R S
Sbjct: 20  VSAFDAFPKSKPQYVQRTSGGGKWTVAM-AVISVMLFWPELGRGGRGSREPTRLRSRRAS 78

Query: 67  K--LPIHLDIVVPTISCDYLALDAVDSSGE-----QHLHVEHNIYKRRLDLDGKPIQEPQ 119
              L ++LDIVV  + C+ L ++  D+SG+       L  E   + +  D+ G       
Sbjct: 79  ATTLQVNLDIVV-KMRCEDLHINVQDASGDLILAATKLREEITSWHQWADITGN------ 131

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
              +      ++ T +G    E          +G E       +  + V ++ + +KWA 
Sbjct: 132 -HKLGRSPSGRIETNSGYHLDE---------GFGEE-------HVHDIVAQSKKRQKWA- 173

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
                         T +L+    + C+I+G L++N+V G FHI A G  Y     H+   
Sbjct: 174 -------------RTPRLRGP-PDSCRIFGSLDLNKVQGDFHITARGHGYQGAGQHL--- 216

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                 +FN +H +  LSFG    + +    PLD TV  A      F YY+ I+PT+Y  
Sbjct: 217 ---DHTSFNFSHIVNELSFGAFYPNLE---NPLDRTVNLASANFHKFQYYLSIVPTVYTV 270

Query: 299 LDGSKLGG----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
              +                    GD  +PG+F  Y++ P+++ + E        W K++
Sbjct: 271 GRSASKANTVYTNQFAVTEQSKEVGDHSVPGVFVKYDIEPILLLVEETRPGFVQFWLKVI 330

Query: 343 CNISGTYIT----FMLVDALLHSCVKKISKVE 370
             +SG  +     F L +    +  KK  + +
Sbjct: 331 NVLSGVLVAGHWGFTLSEWFKENWAKKKERTQ 362


>gi|148678795|gb|EDL10742.1| ERGIC and golgi 2, isoform CRA_b [Mus musculus]
          Length = 310

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 72/288 (25%), Positives = 120/288 (41%), Gaps = 44/288 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 21  VKELDAFPKVPDSYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +   Y+  L  D  P Q   + ++  +
Sbjct: 81  KLRINIDITV-AMKCHYVGADVLDLAETMVASADGLAYEPAL-FDLSPQQREWQRMLQLI 138

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 139 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 165

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI  G +      H H        ++
Sbjct: 166 ------PREDDSSLTPDACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVNHDSY 219

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT 294
           N +H I HLSFG  +        PLDGT   A +   MF Y+I ++PT
Sbjct: 220 NFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVPT 264


>gi|390594538|gb|EIN03948.1| DUF1692-domain-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 551

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/366 (22%), Positives = 146/366 (39%), Gaps = 78/366 (21%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E LK  DAF K    +  ++   G  T +      +L+  D+ ++       E  VD+  
Sbjct: 21  ESLKHFDAFPKLPASYKARSESRGLFTALVAFIAFFLVLNDLGEFIWGWPDYEFSVDNEA 80

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
            S + I++D+VV  + C YL++D  D+ G+            RL L             +
Sbjct: 81  RSHMNINVDMVV-KMPCQYLSVDLRDAVGD------------RLYLS------------S 115

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
           A ++     + G  T   E        + A+   RK      + +  +          D 
Sbjct: 116 AFRRDGTLFDIGQATALKE--------HAAQLSARKAVAQSRQSRGLF----------DV 157

Query: 185 IVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSGSFHIA-PGLSY-SINHVHVHD 237
           +++     S +  K T+        C+IYG L+V +V+ + HI   G  Y S+ HV  HD
Sbjct: 158 LLR----RSGQGYKPTYNHQPDGGACRIYGTLQVKKVTANLHITTAGHGYASVQHV-PHD 212

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                    N +H I   SFG    D  +   PLD +     +    + Y++ ++PT Y 
Sbjct: 213 -------QMNLSHVITEFSFGPYFPDITQ---PLDDSFEITTDPFIAYQYFLHVVPTTYV 262

Query: 298 RLDGSKLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
               S L                  G PGIFF +EL PL + + +++ +L  L+ +++  
Sbjct: 263 APRSSPLKTAQYSVTHYTRVLEHGRGTPGIFFKFELDPLSITVNQRTTTLAQLFIRVIGV 322

Query: 345 ISGTYI 350
           + G ++
Sbjct: 323 VGGIFV 328


>gi|255726548|ref|XP_002548200.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240134124|gb|EER33679.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 355

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 93/399 (23%), Positives = 158/399 (39%), Gaps = 92/399 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F+ +++  DAF K   +   ++  GG  T+V ++F   ++ +++  Y       +  VD+
Sbjct: 4   FTNKVRTFDAFPKVDPNQQVRSQRGGFSTLVTYMFGLLILWIEIGGYIGGYVDRQFTVDN 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I+LD++V  + C++L  +  D + +++L  E       L+ +G     P    
Sbjct: 64  QIRSDLTINLDMIV-GMPCEFLHTNVEDITRDRYLAGE------TLNFEGIHFIVPPSFR 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N                   +PN                                 P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-SINHVHVHD 237
           D I+Q   + E+ ++  + N     C I+G + V +V G F I A G  Y   +HV +  
Sbjct: 130 DEIMQESLRAEFRSQGARVNEGAPACHIFGSIPVTQVRGDFRITAKGFGYRDRSHVPIE- 188

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                  AFN +H I+  SFG   +       PLD T    EE    + YY K++PT+YE
Sbjct: 189 -------AFNFSHVIQEFSFG---EFYPFINNPLDATGKITEEKLQTYLYYAKVVPTMYE 238

Query: 298 RL----DGSKLGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           +L    D ++    +               G+PGI+F Y+  P+ + I EK         
Sbjct: 239 QLGLEIDTNQYSLTESQHVIQVDEQTKRPNGIPGIYFRYDFEPIKLVIREKRIPFFQFIA 298

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
           K +  I G     M+    L    +K+  +  G K V K
Sbjct: 299 K-LGTIGG---GIMIAAGYLFKLYEKLLLILYGKKYVDK 333


>gi|401427507|ref|XP_003878237.1| hypothetical protein, unknown function [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322494484|emb|CBZ29786.1| hypothetical protein, unknown function [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 309

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)

Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
           +  EGC++ GY++V +V G+FHI+       +H   H +  +     N  H I HLSFG 
Sbjct: 128 SVAEGCRLEGYIKVGKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180

Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
             +K         PLDG   ++E    ++ Y++ I+PTIYE     +   +  G     P
Sbjct: 181 TDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239

Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
                   + F Y+LSP+ V+ +    SL H  T +   I G Y    L+   +HS   +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299

Query: 366 ISKVEIG 372
             +  +G
Sbjct: 300 FQRRVLG 306



 Score = 45.1 bits (105), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 2/109 (1%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
           D F     D  E T  G  ++I C + ++ L   +V  Y       ++ +  D    + +
Sbjct: 11  DFFRHIPRDLTESTTAGSIISIACVVLMALLFAGEVISYVFPRIQSDMIIMPDLDDQNTI 70

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
            + +D+  P + C  L LD +D       +   +I + RLD  G+PI +
Sbjct: 71  KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119


>gi|398021306|ref|XP_003863816.1| hypothetical protein, unknown function [Leishmania donovani]
 gi|322502049|emb|CBZ37133.1| hypothetical protein, unknown function [Leishmania donovani]
          Length = 309

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)

Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
           +  EGC++ GY++V +V G+FHI+       +H   H +  +     N  H I HLSFG 
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180

Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
             +K         PLDG   ++E    ++ Y++ I+PTIYE     +   +  G     P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239

Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
                   + F Y+LSP+ V+ +    SL H  T +   I G Y    L+   +HS   +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299

Query: 366 ISKVEIG 372
             +  +G
Sbjct: 300 FQRRVLG 306



 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
           D F     D  E T  G  +++ C + +  L   +V  Y       ++ +  D    + +
Sbjct: 11  DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMIIMPDLDDRNTI 70

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
            + +D+  P + C  L LD +D       +   +I + RLD  G+PI +
Sbjct: 71  KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119


>gi|146097219|ref|XP_001468078.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
 gi|134072444|emb|CAM71154.1| hypothetical protein, unknown function [Leishmania infantum JPCM5]
          Length = 309

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)

Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
           +  EGC++ GY++V +V G+FHI+       +H   H +  +     N  H I HLSFG 
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180

Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGGMP 312
             +K         PLDG   ++E    ++ Y++ I+PTIYE     +   +  G     P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEV-PMVYQYFLDIVPTIYESSFSTVHTYQFTGTSSSTP 239

Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
                   + F Y+LSP+ V+ +    SL H  T +   I G Y    L+   +HS   +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299

Query: 366 ISKVEIG 372
             +  +G
Sbjct: 300 FQRRVLG 306



 Score = 42.4 bits (98), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
           D F     D  E T  G  +++ C + +  L   +V  Y       ++ +  D    + +
Sbjct: 11  DFFRHIPRDLTEPTTAGSIISVACVVVMVLLFAGEVISYVFPRIQSDMIIMPDLDDRNTI 70

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
            + +D+  P + C  L LD +D       +   +I + RLD  G+PI +
Sbjct: 71  KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119


>gi|294655234|ref|XP_457337.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
 gi|199429792|emb|CAG85341.2| DEHA2B08778p [Debaryomyces hansenii CBS767]
          Length = 354

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 95/401 (23%), Positives = 162/401 (40%), Gaps = 95/401 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGG---AVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
           F+ +++  DAF K   +   ++  GG    VTIVC L I   + V++  +       +  
Sbjct: 4   FTTKVRTFDAFPKVDAEHTVRSSRGGFSTLVTIVCGLLI---LWVEIGGFLGGYVDHQFT 60

Query: 60  VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           +D    S L +++D++V  + C++L  + +D + ++ L  E       L+ +G     PQ
Sbjct: 61  IDDKVKSDLSLNIDMLV-AMPCEFLHTNVMDITDDRFLAGE------LLNFEGTNFFLPQ 113

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
              +N+                                     NT ++            
Sbjct: 114 HFEINS------------------------------------KNTDHDT----------- 126

Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
           P+LD ++Q   + E+     + N     C I+G + VN+V G FHI   G  Y+      
Sbjct: 127 PDLDHVMQETLRAEFRVAGARVNEGAPACHIFGSIPVNQVKGDFHITGKGFGYNDGR--- 183

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
             + P+   A N TH I   S+G      +    PLD T    E+    + YY K++PTI
Sbjct: 184 -SVVPF--EALNFTHVISEFSYGDFYPFIN---NPLDFTGKVTEQKLQAYKYYSKVVPTI 237

Query: 296 YERL----DGSKLGGGDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           YE+L    D ++    +              G+PGIFF YE  P+ + I+EK        
Sbjct: 238 YEKLGMIIDTNQYSLTEQHNVYKVNRFNNVEGIPGIFFKYEFEPIKLIISEKRIPFIQFV 297

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTKR 379
           +++   I G     ++V   L+   +K   V + GK  T+R
Sbjct: 298 SRLATIIGG----LLIVAGYLYRLYEKFLTV-LFGKRYTER 333


>gi|313247758|emb|CBY15879.1| unnamed protein product [Oikopleura dioica]
          Length = 285

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 93/201 (46%), Gaps = 30/201 (14%)

Query: 189 KNEYSTEKL-KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
           KN   T+ L KN    GC+ +G   VN+V G+FH++          H    QP+    FN
Sbjct: 98  KNTRKTDMLNKNQQKSGCRFHGEFYVNKVPGNFHVS---------THASKKQPH-KHDFN 147

Query: 248 TTHHIRHLSFGIKLQ--DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
             H I  L FG  L   +    +  L G     E   S ++Y +KI+PT+          
Sbjct: 148 --HKINKLFFGEDLSALELPGNQTSLAGQATTNEPSLS-YDYTLKIVPTVHNDNKRRTTF 204

Query: 296 -YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
            Y+    SK      G P I+F YE++P+ VK T K K   HL T I   + GT+    +
Sbjct: 205 GYQYTVTSKTFKNTRGTPAIWFRYEIAPITVKYTHKKKPFYHLLTTICAIVGGTFTVAGM 264

Query: 355 VDALL---HSCVKKISKVEIG 372
           +D+++   H  VKK S+ ++G
Sbjct: 265 IDSMIFSAHQAVKKASEGKLG 285



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 58/112 (51%), Gaps = 3/112 (2%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           +K  D + K  +D  + T  G  ++I    FI +L+  +V  + Q     EL+VD  + G
Sbjct: 3   IKRFDIYRKLPKDLTQPTTTGALISICSTFFIIFLLVSEVLSFLQEEVVSELYVDDPTTG 62

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
           + +P+ +D+ +P ++C+Y+A+   D+ G   +    N   R+ D+  K  Q+
Sbjct: 63  ATIPVIVDLEIPNMACEYVAIPKKDNQGRHEVGYLKNT--RKTDMLNKNQQK 112


>gi|367038975|ref|XP_003649868.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
 gi|346997129|gb|AEO63532.1| hypothetical protein THITE_2126029 [Thielavia terrestris NRRL 8126]
          Length = 380

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 76/372 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L    L   ++  +++ +      V+     
Sbjct: 23  VSAFDAFPKSKPQYVTRTSGGGKWTVAMGLVSLVLFWSELGRWWRGTEEHTFAVEKGVSH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD---------LDGKPIQE 117
            L I+LD+VV  + C  L ++  D++G++ L  +      RL          +DGK + +
Sbjct: 83  VLNINLDVVV-RMRCADLHVNVQDAAGDRILAAD------RLSRDPTAWAHWVDGKGMHK 135

Query: 118 PQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
             ++      + +V T  G T    E        +G E       +  + V    R  KW
Sbjct: 136 LGRDA-----QGRVITGEGYTAEHDE-------GFGEE-------HVHDIVALGRRRAKW 176

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
           +               T +L     + C+IYG LE+N+V G FHI A G  Y     H+ 
Sbjct: 177 S--------------RTPRLWGAEPDSCRIYGSLELNKVQGDFHITARGHGYMAFGDHL- 221

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
                   AFN +H I  LSFG  L        PLD TV  A      F Y++ ++PT Y
Sbjct: 222 -----DHNAFNFSHIISELSFGPFLP---SLANPLDRTVNIATAHFHKFQYFLSVVPTTY 273

Query: 297 ERLDGSKLGG-----------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
                  LG                   D  +PGIF  Y++ P+++ I E          
Sbjct: 274 SVGRPGALGARSIFTNQYAVTEQSQEVPDTTIPGIFVKYDIEPILLNIVETRDGFFVFLL 333

Query: 340 KIMCNISGTYIT 351
           +++  +SG  + 
Sbjct: 334 RVINVVSGVLVA 345


>gi|50293697|ref|XP_449260.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528573|emb|CAG62234.1| unnamed protein product [Candida glabrata]
          Length = 352

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 90/380 (23%), Positives = 150/380 (39%), Gaps = 81/380 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E + +K+  GG  +I+ ++F+ ++   +   +F     ++  VD     
Sbjct: 4   LRTFDAFPKTDETYKKKSTKGGVTSILTYIFLLFIAWTEFGKFFGGYIDQQYTVDKVVRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I++D+ V  I C+ + ++  D + ++ L ++       L L+  P   P    VN V
Sbjct: 64  TAQINMDLYV-NIKCENIHINVRDQTQDRKLVIQD------LKLEDMPFFIPYDSKVNGV 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAE----TETRKCCNTCNEVKEAYRYKKWALPEL 182
                   N   T ++++    G    AE     +TR+  +  +   E Y      LP+ 
Sbjct: 117 --------NSIVTPDIDE--ILGEAIPAEFREKLDTRQFYDENDPESEKY------LPKF 160

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
           +                    GC I+G + VNRV G   I A G  Y        +I   
Sbjct: 161 N--------------------GCHIFGSVPVNRVKGELQITASGYGYPGKRAPKEEI--- 197

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL- 299
                +  H I  LSFG      D    PLD T     E   S + YYI  +PT+Y++L 
Sbjct: 198 -----DFAHAINELSFGDFYPYID---NPLDKTARFDKEHPLSAYMYYISAVPTMYKKLG 249

Query: 300 ----------DGSKLGGGDGG------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                     +  K    D        +PGIFF Y   PL ++IT+   S      +++ 
Sbjct: 250 VEIETFQYSVNDYKYSMTDADPATVRKIPGIFFRYGFEPLSIEITDVRISFLQFIVRLVA 309

Query: 344 NISGTYITFMLVDALLHSCV 363
            +S     FM V + + + +
Sbjct: 310 ILS----FFMFVVSWIFTII 325


>gi|256269733|gb|EEU05000.1| Erv41p [Saccharomyces cerevisiae JAY291]
          Length = 353

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 91/355 (25%), Positives = 146/355 (41%), Gaps = 67/355 (18%)

Query: 7   LKGLDAF-TKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           LK  DAF TK  E + +K+  GG  +++ +LF+ ++   +  +YF     ++  VDS   
Sbjct: 4   LKTFDAFRTKTEEQYKKKSTKGGLTSLLTYLFLLFIAWTEFGEYFGGYIDQQYVVDSQVR 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + I++DI V T  CD+L ++  D + ++ L +E    +        P   P    VN 
Sbjct: 64  DTVQINMDIYVNT-KCDWLQINVRDQTMDRKLVLEELQLEEM------PFFIPYDTKVND 116

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +        N   T EL++    G    AE   R+  +T +   E+    K  LPE +  
Sbjct: 117 I--------NEIITPELDE--ILGEAIPAEF--REKLDTRSFFDES-DPNKAHLPEFN-- 161

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
                             GC I+G + VNRVSG   I      S+ +V      P     
Sbjct: 162 ------------------GCHIFGSIPVNRVSGELQITAN---SLGYVASRK-APLEELK 199

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----D 300
           FN  H I   SFG      D    PLD T     +E  + + YY  ++PT++++L    D
Sbjct: 200 FN--HVINEFSFGDFYPYID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVD 254

Query: 301 GSKLGGGD------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            ++    D              MPGIFF Y   PL + +++   S      +++ 
Sbjct: 255 TNQYSVNDYRYLYKDVAAKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 309


>gi|322710423|gb|EFZ01998.1| COPII-coated vesicle protein (Erv41), putative [Metarhizium
           anisopliae ARSEF 23]
          Length = 372

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 83/359 (23%), Positives = 146/359 (40%), Gaps = 57/359 (15%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K   ++  +T  GG  T+   +   +L+  ++  +++ + +    V+     
Sbjct: 21  VSAFDAFPKSKPEYVTRTEGGGKWTVAMAVVSIFLLWAEIARWWRGAESHTFAVEKGVSH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE--PQKEVVN 124
            + I+LD V+  + C  L ++  D++G++ L         +L++D     +   QK V  
Sbjct: 81  SMQINLDTVI-LMKCGDLHINVQDAAGDRIL------AGSKLNMDETSWSQWVNQKGVHK 133

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             +  +     G     L+D       +G E       +  + V    R  KWA      
Sbjct: 134 LGRDSEGRVITGAGWQNLDDEG-----FGEE-------HVHDIVALGQRRAKWA------ 175

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS 243
                    T ++K    + C+IYG L++N+V G FHI A G  Y     H+   Q    
Sbjct: 176 --------KTPRVKGP-PDSCRIYGSLDLNKVQGDFHITARGHGYRGQGSHLDHEQ---- 222

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
             FN +H I  LSFG           PLD T+  AE     F YY+ ++PT Y     S 
Sbjct: 223 --FNFSHIISELSFGSYYP---SLVNPLDRTLNIAENHFHKFQYYVSVVPTRYSVGSSSI 277

Query: 304 L-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                       G  +  +PG+F  Y++ P+++ + E    +     K++  +SG  + 
Sbjct: 278 FTNQYAVTEQSKGVSEYNVPGVFVKYDIEPILLSVNEDRDGILMFVVKLINVLSGVLVA 336


>gi|213408569|ref|XP_002175055.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
           yFS275]
 gi|212003102|gb|EEB08762.1| COPII-coated vesicle component Erv41 [Schizosaccharomyces japonicus
           yFS275]
          Length = 331

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 85/364 (23%), Positives = 146/364 (40%), Gaps = 84/364 (23%)

Query: 5   ERLKGLDAFTKPYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           E ++  DAF K  + +  +++  GG ++I+  + I+ +  ++   YFQ +  ++ FV  +
Sbjct: 8   EGIRVFDAFPKVAKTYRKQRSSQGGLLSIILAICITCISIMEFFFYFQGTREQQFFVYET 67

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
               + I+LD+ +  + C +L +D +D                              + +
Sbjct: 68  ISEHMNINLDMTI-AMPCKFLQVDVLD------------------------------QTM 96

Query: 124 NAVKKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           + V   +V T+  TT  ++  +P    S     T +    +     ++ +  K   LP+ 
Sbjct: 97  DHVFATEVFTKQETTVEDMRHEPLPVTS-----TGSFDAADLRRTRRKKFNKKSKTLPDG 151

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
            +                    C+ YG + V+R  G  HI APG  Y ++++ ++     
Sbjct: 152 GS-------------------ACRFYGAVTVHRTQGLLHITAPGWGYGMSNIPLN----- 187

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-- 299
              A N TH I  LSFG            LDG+    +E A  F YY  IIPT Y     
Sbjct: 188 ---ALNFTHAIDELSFGDYYP---SLVNALDGSYGFTDEHAFAFQYYTSIIPTTYTSTFR 241

Query: 300 ------------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
                          +  G     PGIF SY++ PL + I E   SLG+   +I+  ISG
Sbjct: 242 NVQTNQYAVTENSVRRQTGFRSDPPGIFISYDIEPLGIHIRETYPSLGNTILRILA-ISG 300

Query: 348 TYIT 351
             +T
Sbjct: 301 GLVT 304


>gi|358390077|gb|EHK39483.1| hypothetical protein TRIATDRAFT_302881 [Trichoderma atroviride IMI
           206040]
          Length = 372

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 144/363 (39%), Gaps = 65/363 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L  S  +  ++  +++        V+   G 
Sbjct: 21  VSAFDAFPKSKPQYVTQTSGGGKWTVAMLLISSIFMWTELGRWWRGIEAHTFAVERGVGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-----HLHVEHNIYKRRLDLDGKPIQEPQKE 121
            + I+LDIVV  + CD L ++  D+SG++      L  E   + + +D  G         
Sbjct: 81  DMQINLDIVV-KMHCDDLHVNVQDASGDRILAADKLAREATTWSQWVDEKGM-------- 131

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                  K    ENG   T L   +K    +G E       +  + +    R  KWA   
Sbjct: 132 ------HKLGKNENGQLDTGLGWHSKHDEGFGEE-------HVHDIIALTQRRAKWA--- 175

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQ 239
                       T + +    + C+++G +++N+V G FHI A G  Y     H+ HD  
Sbjct: 176 -----------RTPRPRGK-PDSCRMFGSMDLNKVQGDFHITARGHGYMGMGQHLDHD-- 221

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY--- 296
                 FN +H I  +S+G           PLD TV  A      F YY+ ++PT+Y   
Sbjct: 222 -----KFNFSHIISEMSYGPYYP---SLVNPLDRTVNSAIVHFHKFQYYLSVVPTVYLAN 273

Query: 297 ERLDGSKLGG--------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
            R+  +             D  +PGIFF Y++ P+++ + E          KI+   SG 
Sbjct: 274 RRIVNTNQYAVTEHSKTISDHQIPGIFFKYDIEPILLSVEESRDGFLSFVIKIVNIFSGV 333

Query: 349 YIT 351
            + 
Sbjct: 334 MVA 336


>gi|157874469|ref|XP_001685717.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
           Friedlin]
 gi|68128789|emb|CAJ08922.1| hypothetical protein LMJF_32_3910 [Leishmania major strain
           Friedlin]
          Length = 309

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 85/187 (45%), Gaps = 22/187 (11%)

Query: 200 TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG- 258
           +  EGC++ GY++V +V G+FHI+       +H   H +  +     N  H I HLSFG 
Sbjct: 128 SVAEGCRLEGYIKVAKVPGNFHIS-------SHGRQHLLAQHFPNGINVEHSIHHLSFGT 180

Query: 259 --IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS----KLGGGDGGMP 312
             +K         PLDG   ++E    ++ Y++ I+PTIYE    +    +  G     P
Sbjct: 181 IDVKKLAKKAALHPLDGKEHRSEM-PMVYQYFLDIVPTIYESSFSTVYTYQFTGTSSSTP 239

Query: 313 -------GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
                   + F Y+LSP+ V+ +    SL H  T +   I G Y    L+   +HS   +
Sbjct: 240 VPARQMAAVVFQYQLSPITVRYSLARVSLTHFLTYVCAIIGGVYTVAGLLSRFVHSSAAQ 299

Query: 366 ISKVEIG 372
             +  +G
Sbjct: 300 FQRHVLG 306



 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 25/109 (22%), Positives = 46/109 (42%), Gaps = 2/109 (1%)

Query: 11  DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV--DSSRGSKL 68
           D F     D  E T  G  +++ C + +  L   +V  Y       ++ +  D    + +
Sbjct: 11  DFFRHIPRDLTESTTAGSIISVACVVVMVLLFAGEVIAYVFPRIQSDMIIMPDLDDRNTI 70

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQE 117
            + +D+  P + C  L LD +D       +   +I + RLD  G+PI +
Sbjct: 71  KVSMDMTFPKMPCAVLTLDILDVLHNHMFNSMEHITRTRLDAAGQPISD 119


>gi|453088947|gb|EMF16987.1| DUF1692-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 404

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 84/385 (21%), Positives = 151/385 (39%), Gaps = 81/385 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    + ++T  GG  T++  +    L   ++  +++  TT    V+   G 
Sbjct: 23  VKAFDAFPKTKPSYQQRTSTGGVWTVILIVASVALTWSELARWWKGETTHTFAVEQGVGH 82

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L ++LD VV  + C  L ++  D++G++ L    +++ +    DG    +       A 
Sbjct: 83  DLQMNLDTVV-RMKCADLHVNVQDAAGDRIL--AGSVFHK----DGTTWDQW------AG 129

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +K     +   +T+ E  ++ GS   AE       +  +  +  +++ +   P +    
Sbjct: 130 NRKA----HALGSTKEERLSQKGSAASAEYREEDVHHYLSSARMKHKFGR--TPHIP--- 180

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                      +    + C+IYG +  N+V G FHI      +  H ++   Q    + F
Sbjct: 181 -----------RGREADSCRIYGSMHGNKVKGDFHIT-----ARGHGYMEFGQHLDHSTF 224

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY---------- 296
           N +H I  LSFG           PLD T A  E     F YY+ ++PTIY          
Sbjct: 225 NFSHRITELSFGPYYP---SLTNPLDNTFATTESNFYKFQYYLSVVPTIYTADAKALRKI 281

Query: 297 ERLDGSKLGGGDG------------------------------GMPGIFFSYELSPLMVK 326
           ++   S   G DG                               +PGIF  +++ P+ + 
Sbjct: 282 DKYHESPTSGDDGLSQQPKRYSKNTVFTNQYAVTEQSHPVSESSVPGIFVKFDIEPIQLT 341

Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
           I E   S+  L  +I+  +SG  + 
Sbjct: 342 IAENWSSVPALLIRIVNVVSGLLVA 366


>gi|198421328|ref|XP_002120997.1| PREDICTED: similar to Endoplasmic reticulum-Golgi intermediate
           compartment protein 1 (ER-Golgi intermediate compartment
           32 kDa protein) (ERGIC-32) [Ciona intestinalis]
          Length = 289

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 89/198 (44%), Gaps = 30/198 (15%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           ++EK+      GC      ++N+V G+FH++          H    QP      + TH I
Sbjct: 101 NSEKVPTHDGNGCLFTSRFQINKVPGNFHVS---------THSARSQPDNP---DMTHEI 148

Query: 253 RHLSFGIKLQDDDERRK---PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS------- 302
           + L  G  +     + +    L+G     +   S  +Y +KI+PT+YE +DG+       
Sbjct: 149 KELRIGDNMVIPGVKSQSFNALEGKTTFDKHPLSSHDYIMKIVPTVYESIDGNLRYLYQY 208

Query: 303 --------KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
                     G G   MP I+F YE++P+ VK TE+ K   H  T +   I GT+    +
Sbjct: 209 TNAYKDYIAYGHGQRVMPAIWFRYEMTPITVKYTERRKPFYHFITMVCAIIGGTFTVAGI 268

Query: 355 VDALLHSCVKKISKVEIG 372
           +D+++ S  +   K+ IG
Sbjct: 269 IDSMIFSATEMYKKLTIG 286



 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 32/103 (31%), Positives = 56/103 (54%), Gaps = 3/103 (2%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MVF   ++  D + K  +D  + T  G A+++ C  FISYL+  ++  +  +    EL+V
Sbjct: 1   MVFD--IRRFDIYRKVPKDLTQPTTTGAAISVGCCFFISYLLISELLGFLTIDVASELYV 58

Query: 61  DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHN 102
           D  + G K+P+ + I +P + C+YL +D  DS G   + +  N
Sbjct: 59  DDPQSGDKIPVQIIISLPKMKCEYLGMDIQDSMGRHEVGMVDN 101


>gi|443700340|gb|ELT99344.1| hypothetical protein CAPTEDRAFT_162161 [Capitella teleta]
          Length = 110

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 42/110 (38%), Positives = 63/110 (57%), Gaps = 18/110 (16%)

Query: 284 MFNYYIKIIPTIYERLDG--------------SKLGGG---DGGMPGIFFSYELSPLMVK 326
           MF+YY+K++PT Y R +G               K+GGG   + G+PG+F +YELSP+MVK
Sbjct: 1   MFSYYVKVVPTSYLRANGEFVSSNQYSVTKHHKKVGGGILGEQGLPGVFVTYELSPMMVK 60

Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS-KVEIGGKT 375
            TEK++S  H  T +   I G +    LVDA ++   + I  K+++G  T
Sbjct: 61  YTEKNRSFMHFLTGVCAIIGGVFTVAGLVDAFIYHSARAIQKKIDLGKAT 110


>gi|320580226|gb|EFW94449.1| COPii-coated vesicle-associated protein, putative [Ogataea
           parapolymorpha DL-1]
          Length = 901

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 142/367 (38%), Gaps = 78/367 (21%)

Query: 23  KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPTISCD 82
           ++  G   TI+ +LF+ +LI V+V  Y   +   +  VD      L I+LD+VV  + C+
Sbjct: 584 RSTRGSYSTIITYLFLLFLIWVEVGGYIDGAIDHQFTVDELVRKDLVINLDLVV-AMPCN 642

Query: 83  YLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTEL 142
           Y+  +  D + ++ L  E       L+  G     P+    +A K           T EL
Sbjct: 643 YIHTNVRDLTDDRFLAAE------LLNYQGTTFNIPRWYEQSAKK---------IVTPEL 687

Query: 143 EDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT 202
           E                                      L+  +Q + +Y  E   +   
Sbjct: 688 E------------------------------------AVLERSLQARFQYQGEH-HDEGA 710

Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
             C+I+G + VNRV G  HI A G  Y        D     +   N TH I   SFG   
Sbjct: 711 PACRIFGAIPVNRVKGELHITAKGYGY-------RDRTRIPAEGLNFTHAISEFSFGEFF 763

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----DGSK----LGGGDGG-MP 312
              D    PLD T+   +     F Y+I ++PT+Y +L    D ++    L    G  +P
Sbjct: 764 PYLD---NPLDMTLKTTDAHLHTFKYHINVVPTLYRKLGVEIDTNQYSLSLTESSGKYVP 820

Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           GIFF YE  P+ + + E   S      ++   + G     ++V   L+    K+  + + 
Sbjct: 821 GIFFQYEFEPIKLVVEETRLSFWQFVVRLATIMGG----ILVVAGWLYKLFDKLILLTL- 875

Query: 373 GKTVTKR 379
           GK   KR
Sbjct: 876 GKEFAKR 882


>gi|170108190|ref|XP_001885304.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
 gi|164639780|gb|EDR04049.1| endoplasmic reticulum-derived transport vesicle ERV46 [Laccaria
           bicolor S238N-H82]
          Length = 398

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 83/358 (23%), Positives = 146/358 (40%), Gaps = 66/358 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  +T   G +TI   L    L+  D+ +Y       E  VD ++ S
Sbjct: 19  LAKFDAFPKLPSTYKTRTESRGFMTIFVILLAFLLMLNDIGEYIWGWPDFEFSVDDNKSS 78

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D+VV  + C ++++D  D+ G+            RL L G              
Sbjct: 79  FLDVNVDLVV-NMPCKFISVDLRDAMGD------------RLYLSGG------------- 112

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            ++  T  N    T L++ ++  S   A +++RK       +   +R  K          
Sbjct: 113 LRRDGTEFNVGQATALKEHSEALSARQAVSQSRKSRGLFANL---FRRNK---------S 160

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
             K  Y+ +   N     C+++G L+V RV+ + HI   G  Y+ ++ HV   Q      
Sbjct: 161 NFKPTYNYQPHGN----ACRVWGSLQVKRVTANLHITTLGHGYA-SYEHVDHNQ------ 209

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I   SFG    D  +   PLD +    +E    + Y++ ++PT Y     + L 
Sbjct: 210 MNLSHVITEFSFGPHFPDITQ---PLDNSFESTDERFVAYQYFLHVVPTTYIAPRSAPLQ 266

Query: 306 -------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                          + G PGIFF ++L PL +   +++ +   L  + +  I G ++
Sbjct: 267 THQYSVTHYTRVMQHNQGTPGIFFKFDLDPLAITQHQRTTTFLQLLIRCVGVIGGVFV 324


>gi|367012766|ref|XP_003680883.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
 gi|359748543|emb|CCE91672.1| hypothetical protein TDEL_0D00880 [Torulaspora delbrueckii]
          Length = 348

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 92/375 (24%), Positives = 149/375 (39%), Gaps = 76/375 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E   +++  GG  +++ +LF+ ++   +   YF     ++  VD     
Sbjct: 4   LRSFDAFPKTDETHQQRSFKGGLSSVMTYLFLLFMCWTEFGSYFGGYVDQQYKVDGEVRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I++D+ V  + C+ L ++  D +      ++  +  + L +   P   P   +VN +
Sbjct: 64  TFQINMDMYV-NMPCNLLHINVRDKT------MDRKVVSKELSMQNMPFFVPYGTMVNDM 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           KK          T +L++    G    A+   R   +    V EA          L + V
Sbjct: 117 KK--------IATPDLDE--ILGEAIPAQFRERMDPS----VLEA---------SLGSDV 153

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                        TF +GC IYG + VNRV+G   I A G  Y        D +    + 
Sbjct: 154 -------------TF-DGCHIYGSVPVNRVAGELQITAKGWGY-------QDFEKAPVSE 192

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIYERL----- 299
            N +H I   S+G      D    PLD T   +     M + Y   I+PT+YE+L     
Sbjct: 193 INFSHVINEFSYGDFFPYID---NPLDNTAKISIVDRLMGYLYDTSIVPTVYEKLGAYVD 249

Query: 300 -----------DGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG- 347
                      D      G   +PGIFF Y+  PL + I ++  S      +++  +S  
Sbjct: 250 TNQYAVSERQFDQKSTKRGSTTVPGIFFRYDFEPLSISIKDRRLSFIQFIIRLVALLSFV 309

Query: 348 TYI---TFMLVDALL 359
            YI   TF +VD  L
Sbjct: 310 VYIASWTFRMVDLTL 324


>gi|225712562|gb|ACO12127.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Lepeophtheirus salmonis]
          Length = 290

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 84/188 (44%), Gaps = 31/188 (16%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC    +  +N+V G+FH++          H  D+QP     +N +H I  +SFG K++ 
Sbjct: 112 GCLFEAHFHINKVPGNFHVS---------THSVDVQP---DEYNFSHEIHEVSFGSKIKK 159

Query: 264 DDERR----KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
              +       L G  +          Y +KI+PT YE L G+KL               
Sbjct: 160 ISSKNIGTFNSLSGRDSSESGALDSHEYVMKIVPTTYESLGGAKLFAYQYTYAYRSYVSF 219

Query: 307 GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
           G GG  +P ++F Y+L+P+ VK  E    + H  T +   + GT+    ++D+ L +  +
Sbjct: 220 GHGGRVVPALWFRYDLNPITVKYHETRPPIYHFLTTVCAIVGGTFTVAGIIDSTLFTATQ 279

Query: 365 KISKVEIG 372
              K E+G
Sbjct: 280 LFKKFELG 287



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 3/106 (2%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MVF   +K  D + K  +D  + TV G  ++I C +FI  ++  +   +       EL V
Sbjct: 1   MVFD--VKRFDVYRKIPKDLTQPTVAGAIISICCTIFIFLMLVTEFWFFITPDVQSELIV 58

Query: 61  DSSRGS-KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
           +++  + ++P+ ++I +P + C+YL +D  D  G   +    N  K
Sbjct: 59  ENANPTDRIPVRINISLPKMKCEYLGIDIQDDMGRHEVGFVENTAK 104


>gi|145546125|ref|XP_001458746.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124426567|emb|CAK91349.1| unnamed protein product [Paramecium tetraurelia]
          Length = 325

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 106/220 (48%), Gaps = 34/220 (15%)

Query: 170 EAYRYKKWALPELDTIVQCKNEYSTEKLKNTFT--EGCQIYGYLEVNRVSGSFHIAPGLS 227
           E Y+ +      +D  +   +  + E+ +  +   EGC + GY+ ++RV G+FHI+    
Sbjct: 96  ELYKSRTLNGKVIDKYLSTNDSLNLERAQQAYQQKEGCDLAGYIIISRVPGNFHISA--- 152

Query: 228 YSINHVH---VHDIQPYTS-AAFNTTHHIRHLSFGIK--LQDDDERRK-----PLDGTV- 275
               H +   V+ + P+   +  + +H I+HLSFG +  +Q   E+ K     PLDG   
Sbjct: 153 ----HPYGGQVNMVLPFVGLSVIDLSHSIKHLSFGKQNDIQKIREKFKQGLLNPLDGIRR 208

Query: 276 AKAEEGASM---FNYYIKIIPTIYERLDGSKLGGGDGG----------MPGIFFSYELSP 322
            K +E  ++     YYI I+PT+Y  +D  +                 MP ++F Y++SP
Sbjct: 209 IKTQELTNVGVTHQYYISIVPTLYVDIDNKEYFVNQFAANTNEAQTTQMPAVYFRYDISP 268

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
           + V+ T+  +S  H   ++   + G +    ++D++ ++C
Sbjct: 269 VTVQFTKYYESFNHFIVQLCAILGGVFTIAGIIDSIFYAC 308



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 1/106 (0%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            D + K  +D  E +  G  ++    + +  L   +  +Y       E+++D ++  KL 
Sbjct: 4   FDLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDKLL 63

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
           +++DI  P + CD++++D  D  G    +VE  +YK R  L+GK I
Sbjct: 64  VNMDISFPNMPCDFISIDQQDVIGTHQQNVEGELYKSR-TLNGKVI 108


>gi|358058634|dbj|GAA95597.1| hypothetical protein E5Q_02253 [Mixia osmundae IAM 14324]
          Length = 682

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 80/359 (22%), Positives = 138/359 (38%), Gaps = 78/359 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   +  GG  T++  + I  L+  +  +Y       E  VD   G 
Sbjct: 29  LRTFDAFPKTLPTYRSTSSRGGVYTVLLAVAILVLVWYEATEYLFGEPLYEFSVDKGIGK 88

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+ V  + C YL +D  D+ G++ LHV     K     +   I + Q+ V  A 
Sbjct: 89  MLQINVDMTV-AMPCHYLTVDIRDAVGDR-LHVSDEFVKDGTTFE---IGQAQRLVTMAF 143

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +                DP                        EAY+           + 
Sbjct: 144 ES---------------DP------------------------EAYK----------VVQ 154

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           + +   + E+  +    G  C+IYG + V +V+G+ HI      ++ H ++   +     
Sbjct: 155 EARRPRAFEQTYHIVENGPACRIYGTMAVKKVTGNLHIT-----TLGHGYL-SWEHTDHK 208

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY-------- 296
             N +H I   SFG       +   PLD T+   E    +F Y++ I+ T Y        
Sbjct: 209 LMNLSHVIHEFSFGPLFPGISQ---PLDNTLEVTESSFHIFQYFMSIVSTTYVDHHRNVL 265

Query: 297 -----ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                   D S+      G+PGIF  Y+  P+M+ + E++ +LG    ++   + G  +
Sbjct: 266 ETAQYSVTDMSRATVHGRGVPGIFLKYDPEPMMLTLRERTTTLGQFLIRLAGIVGGVIV 324


>gi|353236810|emb|CCA68797.1| related to ERV41-component of copii vesicles involved in transport
           between the ER and golgi complex [Piriformospora indica
           DSM 11827]
          Length = 559

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/363 (22%), Positives = 143/363 (39%), Gaps = 72/363 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  +T +GG +T+        L+  D+ ++    +  E  +D+ +  
Sbjct: 47  IKQFDAFPKLPASYKSRTKFGGFMTLFVVTLSFLLVLNDIGEFIWGWSDYEFAIDTDQHR 106

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+VV T  C  L++D  D+ G++ LH+   I +     D     E  KE    +
Sbjct: 107 LLEINVDLVVNT-PCSILSVDLRDAVGDR-LHLSDTIVRDGTLFDISQAHE-FKEHQRVL 163

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
             +++                      A   +R   +     +  +R   W         
Sbjct: 164 STREIV--------------------AASRRSRGFFSMFKASRPQFR-PTW--------- 193

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
                       N   +G  C++YG   V +++G+FHI   G  Y  ++ H  HD     
Sbjct: 194 ------------NHTPDGGACRVYGSFAVRKLTGNFHITTLGHGYGGHNAHASHD----- 236

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
               N +H I   SFG    D     +PLD +    +E    F Y+I ++PT Y      
Sbjct: 237 --NINMSHVITEFSFGPYYPD---IVQPLDYSFETTQEHFVAFQYFITVVPTTYVAPRSK 291

Query: 303 KLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
            L                  G PGIFF Y++ P+ ++I +++ +L     +I+  I G +
Sbjct: 292 PLHTHQYSVTHYVKELPHSQGTPGIFFKYDIDPVALEIHQRTTTLTQFLVRIVGVIGGVW 351

Query: 350 ITF 352
           + F
Sbjct: 352 VCF 354


>gi|400594740|gb|EJP62573.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Beauveria bassiana ARSEF 2860]
          Length = 374

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 69/366 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISY-LICVDVCDYFQVSTTEELFVDSSRG 65
           +   DAF K   ++  +T  GG  T+   +FIS  L+  +V  +++   T    V+    
Sbjct: 21  VSAFDAFPKSKPEYVTRTAGGGKWTVAM-IFISLVLMGSEVARWWRGEQTHNFAVEKGIS 79

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ I+LDIVV  +  D L ++  D+SG++ L            L   P +  Q  V N 
Sbjct: 80  HEMQINLDIVVNMLCAD-LHINVQDASGDRIL--------ASAMLHRDPTKWSQ-WVDNG 129

Query: 126 VKK------KKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           V K       +V T  G T+    D       +G E       +  + V    +  KW+ 
Sbjct: 130 VHKLGHDANGRVNTGEGWTSLANNDEG-----FGEE-------HVHDIVALGKKRAKWS- 176

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HD 237
                         T +   T  + C+IYG L++N+V G FHI A G  Y     H+ HD
Sbjct: 177 -------------KTPRFWGT-ADSCRIYGSLDLNKVQGDFHITARGHGYMEFGQHLDHD 222

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                   FN +H I  LS+G           PLD TV  A      F YY+ ++PT+Y 
Sbjct: 223 -------KFNFSHVISELSYGAFYP---SLVNPLDRTVNVAAAHFHKFQYYLSVVPTVYS 272

Query: 298 ------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
                         + SK       +PGIF  Y++ P+++ + E   S      K++  +
Sbjct: 273 VGRSTIQTNQYAVTEQSKEIDEHSAVPGIFVKYDIEPILLAVHESRDSFIVFLLKLINVV 332

Query: 346 SGTYIT 351
           SG  + 
Sbjct: 333 SGVLVA 338


>gi|225712696|gb|ACO12194.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Lepeophtheirus salmonis]
          Length = 372

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 48/180 (26%), Positives = 84/180 (46%), Gaps = 20/180 (11%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+I+G L +N+V+G+FHI+PG +  +   HVH         +N TH I   SFG    
Sbjct: 172 DACRIHGSLTLNKVAGNFHISPGKTLPLFRAHVHFATFGGDEVYNFTHRIDRFSFGTP-- 229

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----------------IYERLDGSKLGG 306
                 +PL+G    A + +  + Y I+++PT                + E    +K   
Sbjct: 230 -HGGIVQPLEGEEKIAMQDSMHYQYLIQVVPTDIQGYTDLIWSTYQYSVKEHKRATK-ER 287

Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
           G G  PGI+F Y++S L V  ++  + +     +++  + G   T  +V   + S ++KI
Sbjct: 288 GSGDTPGIYFKYDMSALKVLASQDREPIFKFLVRLLAAVGGRIATSQIVCVFIKSMIEKI 347



 Score = 46.2 bits (108), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 42/84 (50%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + EKT  G A++I+  + +  L+C +   +           D+   S
Sbjct: 16 VKELDAFPKVPETYVEKTASGAAISIITTILVIVLLCSETSYFMDPGINFRFIPDTDFKS 75

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI + T  C  +  D +D
Sbjct: 76 KLEINVDITIAT-PCKAIGADVLD 98


>gi|164661257|ref|XP_001731751.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
 gi|159105652|gb|EDP44537.1| hypothetical protein MGL_1019 [Malassezia globosa CBS 7966]
          Length = 454

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 77/358 (21%), Positives = 146/358 (40%), Gaps = 66/358 (18%)

Query: 20  FHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIHLDIVVPT- 78
           + ++T YGG VT+  ++    +I  ++  Y  +  T    +DS  G  + I+LD+VV T 
Sbjct: 53  YQKRTSYGGFVTLAVFIATMVVIWYEIQHYLMLKPTYSFDIDSHVGGFMQINLDVVVATP 112

Query: 79  --------------ISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
                         ++   +++D  D+SG+     E +I K  +D + K  Q  QK  + 
Sbjct: 113 CGRTYPYDVRFPCILTLSGVSIDLRDASGDTLHFSEDDIVKDPVDFN-KERQRAQKRSLT 171

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
               K + ++      ++E  +K                    V    R++       D 
Sbjct: 172 QYFLKMLHSQY-RNMKKIERKDK------------------KIVAGGPRHRDSGFDFSDP 212

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           +   +               C++YG + V +V+G+ HI+  +  +   V+ H+       
Sbjct: 213 MENAEE-----------ARACRVYGSILVKKVTGNLHISTFVP-TFMAVNAHE----NGM 256

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------- 294
             + +H I   SFG    +  E   PLD ++   ++ A+ F Y++ ++PT          
Sbjct: 257 GIDMSHIIHEFSFGDYFPNIAE---PLDASLELTDDPAAAFQYFLSVVPTHFIHGRRVIK 313

Query: 295 --IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
              Y   D  +   G    PG++F Y++ PL +K+T KS SL     ++   + G +I
Sbjct: 314 TNQYSVHDYKRNPQGSLTFPGLYFKYDIEPLTMKVTHKSVSLVAFIVRVCSVLGGLWI 371


>gi|308806572|ref|XP_003080597.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
 gi|116059058|emb|CAL54765.1| COPII vesicle protein (ISS) [Ostreococcus tauri]
          Length = 327

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 82/384 (21%), Positives = 152/384 (39%), Gaps = 86/384 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
            + LDA T        KT  G  V++ C  F++ ++ +    D+F    T+   VD  R 
Sbjct: 5   FRSLDALTSAPAHLRRKTSTGAVVSL-CGTFVAVILTLSQTIDFFTPLRTKTTRVDEQRA 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ + +D+    + C  L +DA D+SG           K  +D+ G+ +    K  ++A
Sbjct: 64  GEMTMDIDVTFTRMPCQILYVDAYDASG-----------KHEVDVRGRLM----KTRLDA 108

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSC-YGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             +            EL +    G    G     R+     +EV++A             
Sbjct: 109 AGR------------ELGEYESAGGVDLGGLVLFRRRPEHGSEVRKA------------- 143

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                             EGC+++G +E  RV+GS  I+ G   S   +     +P+   
Sbjct: 144 --------------KADMEGCRLHGRVEARRVAGSLRISTG-PESFEFLREMFNEPW--- 185

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------- 297
             +  H I+  +FG +         PL+G V + E+ + ++ Y++K++PT Y        
Sbjct: 186 EIDARHAIKTFAFGPEFPGSV---NPLNG-VKRKEKKSGIYKYFMKVVPTTYANSRNLFG 241

Query: 298 ------RLDGSKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                 R+  ++    +        G +P I FSY++S + V +  +SKS  +  TK + 
Sbjct: 242 MIPWTMRVRTNQYSVTEHFTESAHWGMLPQILFSYDISAISVNVESQSKSGVYFLTKTIA 301

Query: 344 NISGTYITFMLVDALLHSCVKKIS 367
            + G +     +D  +   V+  S
Sbjct: 302 TVGGVFALTRTIDRYVDLAVRVTS 325


>gi|145551751|ref|XP_001461552.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124429387|emb|CAK94179.1| unnamed protein product [Paramecium tetraurelia]
          Length = 317

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 52/184 (28%), Positives = 92/184 (50%), Gaps = 26/184 (14%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS-AAFNTTHHIRHLSFGIKL 261
           EGC++ GY+ ++RV G+FHI+   SY      V+ + P+   +  + +H I+HLSFG + 
Sbjct: 131 EGCEMTGYIIISRVPGNFHISAH-SYG---GQVNIVLPFVEMSTIDLSHTIKHLSFGNQN 186

Query: 262 QDDDERRK-------PLDG-TVAKAEEGASM---FNYYIKIIPTIYERLDGSKL------ 304
                R K       PLDG +  K +E  ++     YYI I+PTIY  +D  +       
Sbjct: 187 DIQKIREKFQQGLLNPLDGISRIKTQELKNVGVTHQYYISIVPTIYVDIDNREYFVNQFT 246

Query: 305 ----GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH 360
                     MP I+F Y++SP+ V+ T+  ++  H   ++   + G +    ++D++ +
Sbjct: 247 ANTNEAQTNSMPAIYFRYDISPVTVQFTKYYETFNHFIVQLCAILGGVFTIAGIIDSVFY 306

Query: 361 SCVK 364
           +  K
Sbjct: 307 ALQK 310



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 24/106 (22%), Positives = 53/106 (50%), Gaps = 1/106 (0%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            D + K  +D  E +  G  ++    + +  L   +  +Y       E+++D ++   L 
Sbjct: 4   FDLYRKLPQDLIEPSKSGALISFTSLILMFILFITEFQEYLTQQVQTEMYIDQNKDDTLL 63

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
           +++DI  P + CD++++D  D  G    +V+  + K+R+ L+G+ I
Sbjct: 64  VNMDISFPNMPCDFISIDQQDVIGTHQQNVKGELLKKRI-LNGRVI 108


>gi|219125194|ref|XP_002182871.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405665|gb|EEC45607.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 467

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/190 (28%), Positives = 91/190 (47%), Gaps = 40/190 (21%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GCQ+ G+L VNRV G+FH+ A   S+++N           +A  N +H + HLSFG  + 
Sbjct: 285 GCQVSGHLMVNRVPGNFHLEAKSKSHNLN-----------AAMTNLSHVVNHLSFGEPID 333

Query: 263 DDDERRK--------------PLDGTVAKAEEGASMFNYYIKIIPT-------------I 295
           +++ + K              P+DG     +     F++YIK++ T              
Sbjct: 334 ENNRKSKRILKQVPEEHRQFAPMDGQAFLTKAFHQAFHHYIKVVSTHLNMGSSDANSMLT 393

Query: 296 YERLDGSKLGG-GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
           Y+ L+ S++    D  +P   FSY+LSP+ V + ++ +      T +   I GT+ T  L
Sbjct: 394 YQFLEQSQIVFYDDVNVPEARFSYDLSPMSVVVEKEGRKWYDYLTSLCAIIGGTFTTLGL 453

Query: 355 VDALLHSCVK 364
           +DA L+  +K
Sbjct: 454 IDATLYKVLK 463



 Score = 44.3 bits (103), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 21/106 (19%), Positives = 50/106 (47%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +  +D + +  +D  E T  G  +++   + +  L   +   + +      + +D +   
Sbjct: 1   MSSVDFYRRVPKDLTEATSLGAIMSVCALVVMGVLFLSETAAFARTGIATSITLDENTSP 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           ++ ++ +I +  + CDY+++D  D+ G    +V  NI K +LD  G
Sbjct: 61  QIRLNFNITLTDLQCDYVSIDVWDALGTNKQNVTKNIDKWQLDAQG 106


>gi|392594239|gb|EIW83563.1| DUF1692-domain-containing protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 506

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 86/358 (24%), Positives = 144/358 (40%), Gaps = 68/358 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  ++   G +TI        LI  D+ +Y       E  VD    S
Sbjct: 20  LAKFDAFPKLPSSYKSRSESRGFLTIFVGFLCFLLILNDLSEYIWGWPDYEFGVDKQSKS 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + +++D+VV  + C +L++D  D SG++ L++      RR   DG              
Sbjct: 80  FMDVNVDMVV-NMPCQFLSVDLRDVSGDR-LYLSKGF--RR---DG-------------- 118

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                T  +    T L++  K  S   A +++RK     +  K +               
Sbjct: 119 -----TLFDIGQATSLKEHAKMLSAQQAVSQSRKSRGFFSWFKRS--------------- 158

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
             K E+            C+IYG L V +V+ + H+   G  Y+ +H+HV   +      
Sbjct: 159 --KAEFRPTYNHQPDGSACRIYGTLAVKKVTANLHVTTLGHGYT-SHMHVDHTK------ 209

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
            N +H I   SFG    D  +   PLD +   A++  + F YY+ ++PT Y       L 
Sbjct: 210 MNLSHVITEFSFGPYFPDISQ---PLDYSFEVAKDPYTAFQYYMHVVPTNYIAPRSKPLE 266

Query: 306 GGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                             G+PGIFF ++L P+++ I +++ SL  L  + +  I G +
Sbjct: 267 TNQYSVTHYTHIYKTPHEGIPGIFFKFDLDPMVLSIHQRTTSLTALIIRCVGVIGGVF 324


>gi|260950511|ref|XP_002619552.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
 gi|238847124|gb|EEQ36588.1| hypothetical protein CLUG_00711 [Clavispora lusitaniae ATCC 42720]
          Length = 347

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/380 (24%), Positives = 141/380 (37%), Gaps = 89/380 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGG---AVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
           FS +++  DAF K   +   ++  GG    +T+ C L I   I + +  Y       +  
Sbjct: 4   FSSKVRVFDAFPKVAPEASVRSQRGGFSTILTVFCGLLI---IWIQIGGYLGGYIDRQFS 60

Query: 60  VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           VD+     L I+LD+VV  + C +++ + +D + +++L  E       L+  G     P+
Sbjct: 61  VDNETRKDLNINLDMVV-AMPCQFISTNVMDITSDRYLAGE------VLNFQGTGFYVPE 113

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
              +N         EN    T                                       
Sbjct: 114 FFALN--------RENNDYDT--------------------------------------- 126

Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
           PELD I+Q   + EY     + N     C I+G + VN V G F I P  S         
Sbjct: 127 PELDEIMQETLRAEYGIAGARVNEDAPACHIFGTIPVNHVRGEFFIVPKGS------MYR 180

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
           D       A+N +H I   SFG           PLD T    EE    + Y+ K++PT Y
Sbjct: 181 DRSSIDPKAYNFSHVISEFSFGDFYP---FITNPLDFTAKVTEENRQAYRYFAKLVPTHY 237

Query: 297 ERL---------DGSKLGGGDGGM----PGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
           E+L           +++   D       PGIFF Y   P+ + I EK         ++M 
Sbjct: 238 EKLGLVVDTYQYSLTEIHNVDHNRGIPPPGIFFDYSFEPIKLTIREKRIGFFAFVARLMT 297

Query: 344 NISGTYIT----FMLVDALL 359
            +SG  I     F L + LL
Sbjct: 298 VLSGLLIAAGYLFRLYEKLL 317


>gi|45190741|ref|NP_984995.1| AER136Wp [Ashbya gossypii ATCC 10895]
 gi|44983720|gb|AAS52819.1| AER136Wp [Ashbya gossypii ATCC 10895]
 gi|374108218|gb|AEY97125.1| FAER136Wp [Ashbya gossypii FDAG1]
          Length = 340

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 143/375 (38%), Gaps = 76/375 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  +    ++  GG ++I+ +LF+ ++   +   YF     E+  +D     
Sbjct: 4   LRTFDAFPKTDQQHVRRSSRGGIMSIMMYLFLLFIAWGEFGSYFGGYLDEQYIIDPELRQ 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I++D++V  + C YL + A D + +       N   +RL     P   P     ++V
Sbjct: 64  TTQINMDVMV-QMPCKYLDVKATDITRDI------NDVSKRLVFKNIPFFVPYGTTFDSV 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                         E+  P+  G                + +   +R     +P+ D   
Sbjct: 117 N-------------EVRTPDIDGML-------------ADAIPLKFREN---IPDAD--- 144

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
               E+           GC IYG + VNRV G  HI P G  YS      HD        
Sbjct: 145 -LPEEFEFN--------GCHIYGSIPVNRVKGELHITPKGWRYSSRQRVPHD-------E 188

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----DG 301
            N TH     SFG      D     LD     A++  + F+Y++ ++PTIY ++    D 
Sbjct: 189 INLTHIFNEFSFGEFFPYIDNT---LDQVGRYAQQRLTRFHYFVSVLPTIYRKMGAVVDT 245

Query: 302 SKLGGGDGGM---------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG-TYI- 350
           ++       +         PGIF  Y    L V + +K  S      +++  +S   YI 
Sbjct: 246 NQYSVSHNDITYTSSRLYTPGIFILYNFEALTVVVQDKRISFWAFLIRLVTMLSFIVYIA 305

Query: 351 --TFMLVDALLHSCV 363
              F LVD LL S +
Sbjct: 306 AWAFRLVDWLLISTL 320


>gi|410914052|ref|XP_003970502.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Takifugu rubripes]
          Length = 290

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 85/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FHI+          H    QP      + TH I  L+FG KLQ 
Sbjct: 114 GCRFEGEFIINKVPGNFHIS---------THSASAQPQNP---DMTHFIHKLAFGDKLQM 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
             E+     L G    A    +  +Y +KI+PT+YE L G +             +    
Sbjct: 162 HQEKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ +      T I   + GT+    ++D+ + +  +  
Sbjct: 222 TGRIVPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K++IG
Sbjct: 282 KKIQIG 287



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           ++  D + K  +D  + T  G  ++I+C +FI +L   ++  +       EL+VD     
Sbjct: 5   VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + CD + LD  D  G   + H+E+++
Sbjct: 65  SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEVGHIENSM 105


>gi|326470603|gb|EGD94612.1| COPII-coated vesicle protein [Trichophyton tonsurans CBS 112818]
          Length = 399

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 87/385 (22%), Positives = 152/385 (39%), Gaps = 77/385 (20%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           + +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+  
Sbjct: 21  ATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVERG 80

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEV 122
              ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP    
Sbjct: 81  VSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW- 125

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALPE 181
             A   +++       + E +  NK  S    E  E     +   EV+ + + K    P+
Sbjct: 126 --AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAPK 183

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
           L               K+   + C+++G LE N+V G+ HI A G  Y           P
Sbjct: 184 LK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATNP 226

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER-- 298
           ++    N TH I  LSFG           PLD TV+        + YY+ ++PTIY +  
Sbjct: 227 HS---LNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYYLSVVPTIYTKSG 280

Query: 299 ---------LDGSKLGGGDG-----------------------GMPGIFFSYELSPLMVK 326
                     D S +   D                          PGIFF Y + P+++ 
Sbjct: 281 HIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILLI 340

Query: 327 ITEKSKSLGHLWTKIMCNISGTYIT 351
           ++++  SL  L  +++  +SG  +T
Sbjct: 341 VSQERDSLLALMVRLVNVVSGVLVT 365


>gi|387015778|gb|AFJ50008.1| ER Golgi intermediate [Crotalus adamanteus]
          Length = 290

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 54/187 (28%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G+  +N+V G+FHI+          H    QP      + TH I  LSFG KLQ
Sbjct: 113 DGCRFEGHFSINKVPGNFHIS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 160

Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             +       L GT   +    +  +Y +KI+PT+YE + G +             +   
Sbjct: 161 VPNIHGAFNALGGTDRLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 220

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 280

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 281 WKKIQLG 287



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C  FI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTFTGAIISICCCFFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ ++L+I +P++ C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIEVNLNISLPSLHCELIGLDIQDEMGRHEVGHIDNSM 105


>gi|195439332|ref|XP_002067585.1| GK16119 [Drosophila willistoni]
 gi|194163670|gb|EDW78571.1| GK16119 [Drosophila willistoni]
          Length = 443

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 82/389 (21%), Positives = 154/389 (39%), Gaps = 52/389 (13%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ +     +   D +
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELQYYWHETQIIYQFEPDIA 75

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV----EHNIYKRRLDLDGKPIQEPQ 119
              ++P+H+DI V         +D +D + +            ++    D D    Q  Q
Sbjct: 76  LEEQVPMHVDITVAMPCASLSGVDLMDETQQDVFAYGTLQREGVWWEMSDADRMQFQSAQ 135

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
             + N   +++    +       +D  + G   G    + K                 A 
Sbjct: 136 --LTNHYLREQY---HSVADILFKDIMRDGILKGRSDSSAKPA---------------AP 175

Query: 180 P--ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
           P   L  ++    +   ++ +  F + C+++G L +N+V+G  H+  G    +     H 
Sbjct: 176 PPGSLPAVLDLHQDTHLQQPEAKF-DACRLHGTLGINKVAGVLHLVGGAQPVVGLFQDHW 234

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           +  +     N TH I  LSFG   Q      +PL+G     +E A+   Y++KI+PT  E
Sbjct: 235 MIEFRRMPANFTHRINRLSFG---QYSRRIVQPLEGDETIIQEEATTVQYFLKIVPTEIE 291

Query: 298 ------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
                             +LD  +      G PGI+F Y+ S L + ++     +     
Sbjct: 292 QTFSTINTFQYSVTENVRKLDSER---NSYGSPGIYFKYDWSALKIVVSNDRDHILTFVI 348

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISK 368
           ++   ISG  +    +++LL    +++ +
Sbjct: 349 RLCSIISGIIVLSGAINSLLLGMQRRLLR 377


>gi|402591333|gb|EJW85263.1| hypothetical protein WUBG_03826, partial [Wuchereria bancrofti]
          Length = 244

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 85/188 (45%), Gaps = 29/188 (15%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
           T GC++ G  E+++V G+FHI+          H  D QP T   ++  H I  + FG  I
Sbjct: 66  TSGCRLEGKFEISKVPGNFHIS---------THAADTQPET---YDMRHTIHSVVFGDDI 113

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
               +     PL    A   +G+   +Y +KI+P++YE + G+K                
Sbjct: 114 STSQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTY 173

Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
                 MP ++F YEL P+ +K TE+ +      T I   + GT+    ++DA L S  +
Sbjct: 174 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 233

Query: 365 KISKVEIG 372
              K ++G
Sbjct: 234 LYRKHQMG 241


>gi|326928384|ref|XP_003210360.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Meleagris gallopavo]
          Length = 321

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G+  +N+V G+FH++          H    QP      + TH I  LSFG KLQ
Sbjct: 144 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDKLQ 191

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             +       L+G    +    +  +Y +KI+PT+YE + G +             +   
Sbjct: 192 VQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 251

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 252 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 311

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 312 WKKIQLG 318



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 53/105 (50%), Gaps = 4/105 (3%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            S  + G D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD 
Sbjct: 32  LSHCVVGFDIYRKVPKDLTQPTYTGALISVCCCLFILFLFLSELTGFIATEIVNELYVDD 91

Query: 63  ---SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
                G K+ ++L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 92  PDKDSGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 136


>gi|345320110|ref|XP_001521132.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like, partial [Ornithorhynchus anatinus]
          Length = 283

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG KLQ
Sbjct: 106 DGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 153

Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             +       L G   ++    + ++Y +KI+PT+YE  +G +             +   
Sbjct: 154 VQNIHGAFNALGGADKRSSNPLASYDYILKIVPTVYEDKNGKQRYSYQYTVANKEYVAYS 213

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 214 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 273

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 274 WKKIQLG 280



 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
            D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD      G 
Sbjct: 1   FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + CD + LD  D  G   + H+++++
Sbjct: 61  KIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 98


>gi|158292439|ref|XP_313915.3| AGAP005044-PA [Anopheles gambiae str. PEST]
 gi|157016993|gb|EAA09437.3| AGAP005044-PA [Anopheles gambiae str. PEST]
          Length = 371

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 48/181 (26%), Positives = 86/181 (47%), Gaps = 26/181 (14%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           + C+I+G L +N+V+G+FHI  G  + +S  H+H++ I  + +   N +H I   SFG  
Sbjct: 169 DACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSI--FANTQTNFSHRINRFSFG-- 224

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---------------TIYERLDGSKLG 305
                    PL+G     + G  M  Y+I+++P               T+ E L    + 
Sbjct: 225 -DHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHSKTYQYTVRENLQLIDID 283

Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
            G  G+ GI+F Y++S L V + +   S+ H   ++   I+G     +++  +L  C+  
Sbjct: 284 KGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAG----IVVISGMLSKCMHL 339

Query: 366 I 366
           I
Sbjct: 340 I 340



 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 44/85 (51%), Gaps = 1/85 (1%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           LDAF K  E+F + T  GG ++++  L I +LI  +V  Y           D+   SKL 
Sbjct: 17  LDAFPKVKEEFVQPTRVGGTLSLISRLVIVFLIYHEVTYYLDSRLVFTFVPDTDLQSKLK 76

Query: 70  IHLDIVVPTISCDYLALDAVDSSGE 94
           +H+D+ V  + C  +  D +DS+ +
Sbjct: 77  VHIDLTV-AMPCKSIGADILDSTNQ 100


>gi|344229081|gb|EGV60967.1| DUF1692-domain-containing protein [Candida tenuis ATCC 10573]
 gi|344229082|gb|EGV60968.1| hypothetical protein CANTEDRAFT_115996 [Candida tenuis ATCC 10573]
          Length = 352

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 83/371 (22%), Positives = 146/371 (39%), Gaps = 86/371 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F+ R++  DAF K   +   +++ G   TI  + F   ++ V+V  +       +  VD 
Sbjct: 4   FATRVRTFDAFPKVDSEHTVRSLRGALSTIATYFFALVILWVEVGGFLGGYVDHQFVVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              + L I++D+ V T+ C+ +  + VD + ++ L  E       L+ +G     P +  
Sbjct: 64  QIRTNLSINIDMTV-TMPCELIHTNVVDITDDRFLAAE------LLNFEGVHFFAPPQFF 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                  ++ ++N                                       K++  P+L
Sbjct: 117 -------RINSQN---------------------------------------KEYETPDL 130

Query: 183 DTIVQ--CKNEY--STEKLKNTF-TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
           D +++   + E+  S +K+        C I+G + VN V G FHI A G+ Y  + +H  
Sbjct: 131 DHVMRENIRAEFYISGQKINQVAGAPACHIFGTIPVNHVQGEFHITAKGVGYQ-DSLHT- 188

Query: 237 DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIY 296
              P+    F  +H I+  SFG      D    PLD +     E    + YY  ++PT+Y
Sbjct: 189 ---PWERMNF--SHVIQEFSFGTFYPMID---NPLDMSGKITHESLQSYKYYSNVVPTLY 240

Query: 297 ERL----DGSKLGGGDGGM-------------PGIFFSYELSPLMVKITEKSKSLGHLWT 339
           ERL    D ++    +  +             PGIFF YE  P+ + I EK         
Sbjct: 241 ERLGIVVDTNQYSISEQHLVIRKDSNGRIYSPPGIFFKYEFEPIKLTIVEKRLPFIQFVA 300

Query: 340 KIMCNISGTYI 350
           ++   + G  I
Sbjct: 301 RLGTILGGLLI 311


>gi|115623567|ref|XP_794044.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Strongylocentrotus purpuratus]
          Length = 289

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 88/197 (44%), Gaps = 28/197 (14%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           +T+K+     +GC  Y    +N+V G+FH++            H +      + +  H I
Sbjct: 101 NTKKIPLNNGQGCLFYSAFTINKVPGNFHVS-----------THAVGMNQPQSTDFAHII 149

Query: 253 RHLSFGIKLQDD--DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK------- 303
             +SFG  +Q+        PL+G   +  +     +YY+KI+PT+YE L G+K       
Sbjct: 150 HEVSFGDDIQNKTLGASFNPLEGRDKRDSKSDLSHDYYMKIVPTVYEDLWGTKNVSYQYT 209

Query: 304 --------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                    G G   +P I+F Y++SP+ VK  EK        T +   + GT+    + 
Sbjct: 210 YAYKDYGSQGHGRRVLPAIWFRYDISPITVKYHEKRAPFYTFITTVCAIVGGTFTVAGIF 269

Query: 356 DALLHSCVKKISKVEIG 372
           D+++ +  +   K E+G
Sbjct: 270 DSIIFTAAEVFKKAELG 286



 Score = 46.6 bits (109), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 3/110 (2%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MVF  R   LD + K  +D  + T  G  V+++  LFI++L+  +   + +     EL+V
Sbjct: 1   MVFDFRR--LDVYRKIPKDLTQPTYAGACVSLLSMLFITFLLLSEFMSFIRPEVVSELYV 58

Query: 61  DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
           D+     +L + +++ +P + C  + LD  D  G   +    N  K  L+
Sbjct: 59  DNPGEIERLTVRVNLSLPKLHCGVVGLDIQDDMGRHEVGYVDNTKKIPLN 108


>gi|224067439|ref|XP_002195791.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Taeniopygia guttata]
          Length = 290

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G+  +N+V G+FH++          H    QP      + TH I  LSFG KLQ
Sbjct: 113 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 160

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             +       L+G    +    +  +Y +KI+PT+YE + G +             +   
Sbjct: 161 VHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAYS 220

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 280

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 281 WKKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ ++L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|154415829|ref|XP_001580938.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121915161|gb|EAY19952.1| hypothetical protein TVAG_402060 [Trichomonas vaginalis G3]
          Length = 359

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 88/350 (25%), Positives = 143/350 (40%), Gaps = 62/350 (17%)

Query: 7   LKGLDAFTKPYE-DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF----VD 61
           LK LD F K  + +F   T+ G  ++ +  +    LI  ++ +Y +     +L     +D
Sbjct: 5   LKELDIFDKFADAEFALHTIGGKFMSAIFSIIAVILIFAELFNYTKPIVYRDLLNIPQLD 64

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
                     + + +P   C +L  DA+DS G + L V ++I  +R+ +D + I      
Sbjct: 65  KDNTVNFTFSIQVALP---CFFLHFDALDSIGVEMLDVSNDIKFKRMSVDNRFID----- 116

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                           +   L+D   C  C+G + E  +CCNTC+EVK  +  +      
Sbjct: 117 ---------------YSNESLKD--ICLPCHGLKPEG-ECCNTCDEVKAIFEARGEDFNP 158

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHV-HVHD--I 238
           L    QC    +    K   +E C I G +   +  G FHIAPG +       H HD  +
Sbjct: 159 L-PFDQCMGNVN---FKKDMSESCLIEGTIHTFKSPGQFHIAPGRNTKFRRTGHQHDTGL 214

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYI-KIIPT 294
            P  S      H I     G   Q  D  R P+ G + +  +      +++ +I K++ T
Sbjct: 215 SPEASCP----HTIHEFYVG---QKYDNVRSPIRGKIFRDRDSLPRIYLYDLFITKVLHT 267

Query: 295 IYERLD----------GSKL-GGGDGGMPGIFFSYELSPLMVKITEKSKS 333
             + L           G+K+   G    PGI+F Y  SP+   I E+S S
Sbjct: 268 FNDALQYTSYEYSYNLGAKIFNPGSFYQPGIYFKYMFSPM--TIVERSIS 315


>gi|291244956|ref|XP_002742359.1| PREDICTED: endoplasmic reticulum-golgi intermediate compartment
           (ERGIC) 1-like [Saccoglossus kowalevskii]
          Length = 318

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/195 (26%), Positives = 87/195 (44%), Gaps = 24/195 (12%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           +T K+      GC+   Y ++N+V G+FH++   + S         QP  +   +T H I
Sbjct: 130 NTNKIPLNNNAGCRFEAYFKINKVPGNFHVSTHAAGSR--------QPQKADFVHTIHEI 181

Query: 253 RHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS---------- 302
             +   I+ +  +    PL G         S  +YY+K++PT+YE + G           
Sbjct: 182 I-IGDDIQNKSINAAFNPLAGYDRSDAAAESSHDYYMKVVPTVYEDVWGRVNLSYQYTYA 240

Query: 303 -----KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                  G G   MP I+F Y++SP+ VK  EK        T I   + GT+    ++D+
Sbjct: 241 YKDYVSYGHGHRVMPAIWFRYDISPITVKYHEKRAPFYTFITTICAIVGGTFTVAGIIDS 300

Query: 358 LLHSCVKKISKVEIG 372
           +++S  +   K EIG
Sbjct: 301 MIYSASEVFKKAEIG 315



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/108 (28%), Positives = 52/108 (48%), Gaps = 4/108 (3%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS R    D + K  +D  + T+ G  V+I   LFI +L+  +   +       EL+VD+
Sbjct: 33  FSNRF---DVYRKIPKDLTQPTLAGAMVSICSALFIVFLLLSEFTSFIAPDVRSELYVDN 89

Query: 63  -SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
                KL + L+I +P + C+++ LD  D  G   + +  N  K  L+
Sbjct: 90  PGHIEKLNVKLNISLPRLKCEFIGLDIQDDMGRHEVGLVDNTNKIPLN 137


>gi|154335780|ref|XP_001564126.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061160|emb|CAM38182.1| conserved hypothetical protein [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 309

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 69/326 (21%), Positives = 132/326 (40%), Gaps = 66/326 (20%)

Query: 68  LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVK 127
           +P+H D++ P +SC+ L++D VD++G    +    I+K  +  DG+              
Sbjct: 1   MPVHFDVLFPYMSCNRLSIDVVDATGTAKFNCTGTIHKLPISGDGE-------------V 47

Query: 128 KKKVTTENGTTTTELEDPN---KCGSC-----YGAETETR-----KCCNTCNEVKEAYRY 174
           + K T ++     E++D     KC  C      G   + R     KCC++C+ V E Y+ 
Sbjct: 48  QYKGTMKDLGNDIEMDDTGGDKKCRRCPSFAFEGVAADVRNAAASKCCDSCDSVFELYKD 107

Query: 175 KKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP---GLSYSIN 231
            +   P ++   QC  +            GC + G L++ +V  +    P   G  YS+ 
Sbjct: 108 LEKEFPGIEYFPQCLEQLYER------ARGCNVIGSLDLKKVPVTVIFGPRRTGRRYSLK 161

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERR---KPLDGTVAKAEEGASMFNYY 288
            V             +T+H I+ L  G +  +   +    +PL G   +  +  S   Y 
Sbjct: 162 DV----------IRLDTSHVIKKLRIGDEAVERFSKHGVAEPLCGH-ERFSKTYSETRYL 210

Query: 289 IKIIPTIYE--RLDGSKLG---------------GGDGGMPGIFFSYELSPLMVKITEKS 331
           +K++PT Y   R   +K                 G  G +P + F++E + + V    + 
Sbjct: 211 VKVVPTTYRKTRTRDAKASTYEYSAQCSSQAIVVGFSGVVPAVLFAFEPAAIQVNNVFER 270

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDA 357
           + + H   ++   + G ++    +D+
Sbjct: 271 QPVSHFLVQLCGIVGGLFVVLGFIDS 296


>gi|443921357|gb|ELU41041.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
           solani AG-1 IA]
          Length = 579

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 73/312 (23%), Positives = 123/312 (39%), Gaps = 69/312 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGG--AVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           L+ +D+  KP     E T+Y    +VT++    I     +++ DY ++    ++ VD SR
Sbjct: 164 LEQVDSVGKP---LRENTLYANRFSVTLISMGIILIFTIIEIIDYRRIGMASDIIVDVSR 220

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G ++ ++++I  P + C  L+LD  D SG+    V H+I K RL+  G  I E       
Sbjct: 221 GEQISVNMNITFPRVPCYLLSLDITDVSGDIQQDVSHHILKTRLEPSGAMIHE------- 273

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
                ++ +E G +   +E                                    PE D 
Sbjct: 274 NTLNYRIKSETGISHQGME---------------------------------LRRPEHDR 300

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
                 E    K  + F         L +N+V+G+FH +PG S+     H +D+ PY   
Sbjct: 301 AGMLLLELIPFKEPHPF---------LRINKVTGNFHFSPGRSFLSQRGHAYDLVPYLKD 351

Query: 245 A--FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS-------------MFNYYI 289
               +  H+I    F    + +D  R+   GT  +A  G+              M  Y++
Sbjct: 352 GNHHDFGHYIHEFHFEGDREIEDRWREGNRGTEWRARVGSDKQPLDGLEQPSNWMIQYFL 411

Query: 290 KIIPTIYERLDG 301
           K++ T    LDG
Sbjct: 412 KVVSTEVRHLDG 423


>gi|158292441|ref|XP_001688474.1| AGAP005044-PB [Anopheles gambiae str. PEST]
 gi|157016994|gb|EDO64057.1| AGAP005044-PB [Anopheles gambiae str. PEST]
          Length = 287

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 48/184 (26%), Positives = 87/184 (47%), Gaps = 26/184 (14%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           + C+I+G L +N+V+G+FHI  G  + +S  H+H++ I  + +   N +H I   SFG  
Sbjct: 85  DACRIHGVLTLNKVAGNFHITVGKTIHFSRGHIHLNSI--FANTQTNFSHRINRFSFG-- 140

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP---------------TIYERLDGSKLG 305
                    PL+G     + G  M  Y+I+++P               T+ E L    + 
Sbjct: 141 -DHTAGIIHPLEGDEKLFDNGQVMMQYFIEVVPTDVQKFYSHSKTYQYTVRENLQLIDID 199

Query: 306 GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH----S 361
            G  G+ GI+F Y++S L V + +   S+ H   ++   I+G  +   ++   +H    +
Sbjct: 200 KGMQGVAGIYFKYDMSALRVLVRQDRDSIAHFIVRLSSIIAGIVVISGMLSKCMHLIGDA 259

Query: 362 CVKK 365
           C K+
Sbjct: 260 CCKR 263


>gi|300121843|emb|CBK22417.2| unnamed protein product [Blastocystis hominis]
          Length = 251

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 65/218 (29%), Positives = 97/218 (44%), Gaps = 34/218 (15%)

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPE--LDTIVQCKNEYSTEKLKNTFTEGCQIY 208
           CYGA  E  +CCNTC+ + EAY  + W+ P   L     C+N   +     +F  GC I+
Sbjct: 35  CYGAGAEG-QCCNTCSAIVEAYNSRGWS-PHFVLQFSPLCRNSRPSVL---SFKSGCMIW 89

Query: 209 GYLEVNRVSGSFHIAPGLSY-SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDER 267
           G ++V++V+G  HI        I    V+D +    +   ++H I H SFG  +   +  
Sbjct: 90  GAIDVHQVAGDIHIQTTTGMIDILGAPVYDAE--IISKLKSSHFIEHFSFGKHIPGVE-- 145

Query: 268 RKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL------------------GGGDG 309
             PL+G    A +  S   Y I+I+P IYER  G ++                   G   
Sbjct: 146 -NPLNGRRFLANQLTS-HAYQIEILPAIYER-GGVEIRSNEISVYETDKVVTVEPSGTAD 202

Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             PG+FF Y +SP    I E  K    L  + +C + G
Sbjct: 203 VEPGLFFKYRISPFEHVIREDRKEFWSLVVR-LCGVMG 239


>gi|449272958|gb|EMC82607.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Columba livia]
          Length = 297

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G+  +N+V G+FH++          H    QP      + TH I  LSFG KLQ
Sbjct: 120 DGCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDKLQ 167

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             +       L+G    +    +  +Y +KI+PT+YE + G +             +   
Sbjct: 168 VHNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMGGKQRYSYQYTVANKEYVAYS 227

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 228 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASEA 287

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 288 WKKIQLG 294



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 50/98 (51%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
            D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD      G 
Sbjct: 15  FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKDSGG 74

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ ++L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 75  KIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 112


>gi|149241719|ref|XP_001526345.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146450468|gb|EDK44724.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 353

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 84/356 (23%), Positives = 137/356 (38%), Gaps = 100/356 (28%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           S+R+K  DAF K       ++  GG  T++ + F   ++ V+V  +       +  VD  
Sbjct: 5   SKRVKTFDAFPKVDPQHQVRSERGGLSTLLTYFFGLLILWVEVGGFIGGYVDRQFEVDRV 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
             S L I++D++V  + C+++  +  D + ++ L  E       L+ +G     PQ   +
Sbjct: 65  VRSDLSINVDMIV-AMPCEFIHTNVEDITRDRFLAGE------TLNFEGIHFFIPQNFKI 117

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           N                   +PN                                 P+LD
Sbjct: 118 N-------------------NPNDFHET----------------------------PDLD 130

Query: 184 TIVQCKNEYSTEKLKNTFTEG----------CQIYGYLEVNRVSGSFHI-APGLSYSINH 232
            ++Q       E L+  F +G          C I+G + VN+V G F I   G  YS + 
Sbjct: 131 EVMQ-------ESLRAEFRQGGQRINEGAPACHIFGSIPVNQVKGDFRITGKGFGYS-DR 182

Query: 233 VHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKII 292
           +HV        AA N TH I+  S+G   +       PLD T    EE    + Y  +++
Sbjct: 183 LHV------PLAALNFTHVIQEFSYG---EFFPFLNNPLDATGKVTEEKLQAYIYNAQVV 233

Query: 293 PTIYERL------------------DGSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
           PT+YE+L                     ++     G+PGI+F YE  P+ + I EK
Sbjct: 234 PTLYEKLGLEVDTNQYSLTENHHVIKLDEISNRPQGVPGIYFRYEFEPIKLTIREK 289


>gi|229366152|gb|ACQ58056.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Anoplopoma fimbria]
          Length = 290

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 88/187 (47%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
           +GC+  G   +N+V G+FH++          H    QP +    + TH+I  L+FG  I+
Sbjct: 113 DGCRFEGEFTINKVPGNFHVS---------THSATAQPQSP---DMTHNIHKLAFGEKIQ 160

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
           +Q        L G    +    +  +Y +KI+PT+YE L G +             +   
Sbjct: 161 VQRVQGAFNALGGADRLSSNPLASHDYILKIVPTVYEDLSGKQRFSYQYTVANKEYVAYS 220

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + +    T I   + GT+    ++D+ + +  + 
Sbjct: 221 HAGRIIPAIWFRYDLSPITVKYTERRQPVYRFITTICAIVGGTFTVAGIIDSCIFTASEA 280

Query: 366 ISKVEIG 372
             K++IG
Sbjct: 281 WKKIQIG 287



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 52/101 (51%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           ++  D + K  +D  + T  G  ++I+C +FI +L   ++  +       EL+VD     
Sbjct: 5   VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATELVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + CD + LD  D  G   + H+++++
Sbjct: 65  SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 105


>gi|357627966|gb|EHJ77470.1| putative PTX1 protein isoform 1 [Danaus plexippus]
          Length = 353

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 41/176 (23%), Positives = 83/176 (47%), Gaps = 18/176 (10%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           N   + C+++G L +N+V+G+FHI  G S  +   H+H    +     N +H I  LSFG
Sbjct: 138 NRRPDACRLHGVLTLNKVAGNFHITAGKSLHLPRGHIHLNMLFDDTPQNFSHRINRLSFG 197

Query: 259 IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSK 303
                 +    PL+G      + + ++ Y+++++PT               + E      
Sbjct: 198 ---SPANGIIYPLEGDEKITSDESMLYQYFLEVVPTDVDTTFESIKTFQYSVKELARPIS 254

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
              G  G+PG+FF Y+++ L V++ ++ ++L     ++   I G Y+    ++ ++
Sbjct: 255 HSKGSHGVPGVFFKYDMAALKVQVYQERENLLQFMLRLFSIIGGIYVIISFINTIV 310


>gi|47222972|emb|CAF99128.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 288

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 53/186 (28%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FHI+          H    QP      + TH I  L+FG KLQ 
Sbjct: 112 GCRFEGEFNINKVPGNFHIS---------THSASAQPQNP---DMTHFIHKLAFGDKLQM 159

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
              +     L G    A    +  +Y +KI+PT+YE L G +             +    
Sbjct: 160 HQVKGAFNALGGADRLASNPLASHDYILKIVPTVYEDLSGKQKFSYQYTVANKEYVAYSH 219

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ +      T I   + GT+    ++D+ + +  +  
Sbjct: 220 TGRIVPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEAW 279

Query: 367 SKVEIG 372
            K++IG
Sbjct: 280 KKIQIG 285



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           L   D + K  +D  + T  G  ++I+C +FI +L   ++  +       EL+VD     
Sbjct: 3   LHRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 62

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + CD + LD  D  G   + H+E+++
Sbjct: 63  SGGKIEVSLNITLPNLHCDLVGLDIQDEMGRHEVGHIENSM 103


>gi|296415728|ref|XP_002837538.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633410|emb|CAZ81729.1| unnamed protein product [Tuber melanosporum]
          Length = 341

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 81/163 (49%), Gaps = 29/163 (17%)

Query: 205 CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---IK 260
           C+IYG + VNR+ G FHI A G  Y  +  H+         +FN +H I  LSFG    K
Sbjct: 155 CRIYGSMGVNRILGDFHITAKGHGYWEDGAHI------DHRSFNFSHVITELSFGDYYPK 208

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-RLDGSKLGGGD----------- 308
           L +      PLDG V+K +E    F Y++ I+PT YE +  G  L               
Sbjct: 209 LVN------PLDGVVSKTDENFHKFQYFLSIVPTTYESQTSGKSLLTNQYAVTEQSRKIS 262

Query: 309 -GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
              +PGI+F Y++ P+ +KI+++  +L     +++  +SG  +
Sbjct: 263 SHSVPGIYFKYDIEPISLKISDRRTALLAFVVRLVNIVSGILV 305


>gi|19112857|ref|NP_596065.1| COPII-coated vesicle component Erv41 (predicted)
           [Schizosaccharomyces pombe 972h-]
 gi|74582843|sp|O94283.1|ERV41_SCHPO RecName: Full=ER-derived vesicles protein 41
 gi|3850069|emb|CAA21880.1| COPII-coated vesicle component Erv41 (predicted)
           [Schizosaccharomyces pombe]
          Length = 333

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 77/164 (46%), Gaps = 29/164 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
            C+IYG L VNRV+G  HI APG  Y  +++  H +        N TH+I  LSFG   +
Sbjct: 151 ACRIYGQLVVNRVNGQLHITAPGWGYGRSNIPFHSL--------NFTHYIEELSFG---E 199

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS---------------KLGGG 307
                   LDG    A +    F YY+ ++PT Y+    S               +LG G
Sbjct: 200 YYPALVNALDGHYGHANDHPFAFQYYLSVLPTSYKSSFRSFETNQYSLTENSVVRQLGFG 259

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
               PGIF  Y+L PL V++ +K  ++     +I+  ISG  IT
Sbjct: 260 SLP-PGIFIDYDLEPLAVRVVDKHPNVASTLLRILA-ISGGLIT 301


>gi|289741661|gb|ADD19578.1| cOPII vesicle protein [Glossina morsitans morsitans]
          Length = 418

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/387 (22%), Positives = 159/387 (41%), Gaps = 68/387 (17%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E  K LDAF K  E + E T  GG ++++  L I YLI  +V  Y+Q +     F     
Sbjct: 16  ELAKNLDAFKKVPEKYTEATEIGGTLSLISRLLIIYLIYREV-KYYQDAGLVYQFEPDID 74

Query: 65  GSKLPIHLDIVVPTISCDYLA-LDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
             K+ +H+DI V  + C+ L+ +D +D                          E Q++V 
Sbjct: 75  KEKVQMHVDITV-AMPCNSLSGVDLMD--------------------------ETQQDVF 107

Query: 124 --NAVKKKKV----TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKW 177
              A++++ V    T    T    ++  N            R+  ++  ++   Y  +  
Sbjct: 108 AYGALRRQGVWWHLTPHERTEFERVQHENHF---------LREEYHSVADLLFKYIIQS- 157

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHD 237
             PE+D   +   E   + L     + C+++G L +N+V+G  H+  G    ++ +  H 
Sbjct: 158 --PEVD---ETATEEDEKPLSEEQYDACRLHGTLGINKVAGVLHLVGGTQPVVDLLGEHL 212

Query: 238 IQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--- 294
           +  +   A N TH I  LSFG   Q      +PL+G      E  ++  Y++ I+PT   
Sbjct: 213 MIGFRHIAANFTHRINRLSFG---QYARRIVQPLEGDETFVSEEGTIVQYFLNIVPTEIH 269

Query: 295 ---------IYERLDGSKLGGGDG---GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
                     Y   +  ++   D    G PGI+F Y+ S L + +     ++     ++ 
Sbjct: 270 KTFTTISTYQYSVTENVRVLDSDRNSYGSPGIYFKYDWSALKIIVRTDRDNMLQFIIRLC 329

Query: 343 CNISGTYITFMLVDALLHSCVKKISKV 369
             ISG  +   +++  L +  + I K+
Sbjct: 330 SIISGIVVLSGILNVFLLTLRRNIIKI 356


>gi|326479518|gb|EGE03528.1| COPII-coated vesicle protein [Trichophyton equinum CBS 127.97]
          Length = 399

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/386 (22%), Positives = 152/386 (39%), Gaps = 77/386 (19%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            + +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+ 
Sbjct: 20  IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
               ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP   
Sbjct: 80  GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAE-TETRKCCNTCNEVKEAYRYKKWALP 180
              A   +++       + E +  NK  S    E  E     +   EV+ + + K    P
Sbjct: 126 ---AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQAEDLHVEHVLGEVRRSRKKKFPKAP 182

Query: 181 ELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQ 239
           +L               K+   + C+++G LE N+V G+ HI A G  Y           
Sbjct: 183 KLK--------------KSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATN 225

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           P+   + N TH I  LSFG           PLD TV+        + Y++ ++PTIY + 
Sbjct: 226 PH---SLNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTKS 279

Query: 300 -----------DGSKLGGGDG-----------------------GMPGIFFSYELSPLMV 325
                      D S +   D                          PGIFF Y + P+++
Sbjct: 280 GHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILL 339

Query: 326 KITEKSKSLGHLWTKIMCNISGTYIT 351
            ++++  SL  L  +++  +SG  +T
Sbjct: 340 IVSQERDSLLALMVRLVNVVSGVLVT 365


>gi|308198100|ref|XP_001386838.2| predicted protein [Scheffersomyces stipitis CBS 6054]
 gi|149388859|gb|EAZ62815.2| putative ER to golgi transport [Scheffersomyces stipitis CBS 6054]
          Length = 352

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 81/348 (23%), Positives = 140/348 (40%), Gaps = 84/348 (24%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F+++++  DAF K       ++  GG  T++       ++ V++  +       +  VD+
Sbjct: 4   FAKKVRTFDAFPKVDSQHTVRSQRGGFSTLMTAFCGLLIVWVEIGGFLGGYVDHQFIVDN 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I++D++V  + C++L  +  D + +++L  E       L+  G     P    
Sbjct: 64  EIKSSLVINVDMLV-AMPCEFLHTNVEDITKDRYLAGE------TLNFQGTNFITPPTFN 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N +  K  T                                               P+L
Sbjct: 117 INNINDKHDT-----------------------------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D I+Q   + E+S    + N     C I+G + V+ V G FHI A GL YS +  HV   
Sbjct: 130 DEIMQDSLRAEFSVSGARINEGAPACHIFGSIPVSHVKGDFHITAKGLGYS-DRSHV--- 185

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                 A N +H I+  SFG      +    PLD +    EE    ++Y+ K++PT+Y+R
Sbjct: 186 ---PLEALNFSHVIQEFSFGDFYPFIN---NPLDASGKLTEEPLISYSYFAKVVPTLYQR 239

Query: 299 L----DGSKLGGGDG------------GMPGIFFSYELSPLMVKITEK 330
           L    D ++    +             G+PGIFF Y+  P+ + I E+
Sbjct: 240 LGLVVDTNQYSLTENNHVFKLEHKRPTGIPGIFFKYDFEPIKLIIIER 287


>gi|71480113|ref|NP_001025133.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Danio rerio]
 gi|78099248|sp|Q4V8Y6.1|ERGI1_DANRE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|66911928|gb|AAH97146.1| Zgc:114085 [Danio rerio]
          Length = 290

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP +    + TH I  L+FG KLQ 
Sbjct: 114 GCRFEGEFSINKVPGNFHVS---------THSATAQPQSP---DMTHIIHKLAFGAKLQV 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
              +     L G         +  +Y +KI+PT+YE L G +             +    
Sbjct: 162 QHVQGAFNALGGADRLQSNALASHDYILKIVPTVYEELGGKQRFSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ +      T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRRPFYRFITTICAIIGGTFTVAGIIDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K++IG
Sbjct: 282 KKIQIG 287



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 44/188 (23%), Positives = 85/188 (45%), Gaps = 9/188 (4%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           ++  D + K  +D  + T  G  ++I C +F+ +L   ++  +       EL+VD     
Sbjct: 5   VRRFDIYRKVPKDLTQPTYTGAFISICCCVFMLFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNIYKRRLDLDGKPIQEPQKEV 122
            G K+ + L+I +P + CD + LD  D  G   + H+E+++ K  L+ +G   +   +  
Sbjct: 65  SGGKIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIENSM-KVPLN-NGHGCRFEGEFS 122

Query: 123 VNAVKKK-KVTTENGTTTTELEDPNKC--GSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           +N V     V+T + T   +  D         +GA+ + +      N +  A R +  AL
Sbjct: 123 INKVPGNFHVSTHSATAQPQSPDMTHIIHKLAFGAKLQVQHVQGAFNALGGADRLQSNAL 182

Query: 180 PELDTIVQ 187
              D I++
Sbjct: 183 ASHDYILK 190


>gi|148223633|ref|NP_001084786.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Xenopus laevis]
 gi|78099249|sp|Q6NS19.1|ERGI1_XENLA RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|47125098|gb|AAH70532.1| MGC78834 protein [Xenopus laevis]
          Length = 290

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/186 (27%), Positives = 86/186 (46%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP   A  +  H I  LSFG  LQ 
Sbjct: 114 GCRFEGLFSINKVPGNFHVS---------THSAIAQP---ANPDMRHIIHKLSFGNTLQV 161

Query: 264 DD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           D+       L G    A +     +Y +KI+PT+YE L+G +             +    
Sbjct: 162 DNIHGAFNALGGADKLASKALESHDYVLKIVPTVYEDLNGKQQFSYQYTVANKAYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + +    T +   I GT+    ++D+ + +  +  
Sbjct: 222 TGRVVPAIWFRYDLSPITVKYTERRQPMYRFITTVCAIIGGTFTVAGILDSFIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI++L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L++ +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVTLNVTLPNLPCEVVGLDIQDEMGRHEVGHIDNSM 105


>gi|410918691|ref|XP_003972818.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Takifugu rubripes]
          Length = 378

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/171 (28%), Positives = 76/171 (44%), Gaps = 18/171 (10%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
            C+IYG++ VN+V+G+ HI  G        H H     +   +N +H I HLSFG ++  
Sbjct: 168 ACRIYGHIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHETYNFSHRIDHLSFGEEITG 227

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGGD 308
                 PLDGT     +   M+ Y+I ++PT               + ER        G 
Sbjct: 228 II---NPLDGTEKITSKHTQMYQYFITVVPTRLVTHKVSADTHQFSVTERERVINHAAGS 284

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
            G+ GIF  Y+ S L V +TE+   L     ++   + G + T  ++  L+
Sbjct: 285 HGVSGIFVKYDTSSLTVTVTEQHMPLWQFLVRLCGIVGGIFSTTGMLHGLV 335


>gi|225685292|gb|EEH23576.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 386

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 146/375 (38%), Gaps = 75/375 (20%)

Query: 12  AFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLPIH 71
           A TKP   +   T  GG  T+V ++  + L   ++  +++        V+     +L ++
Sbjct: 16  AKTKP--TYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISRELQLN 73

Query: 72  LDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKKKV 131
           LDIVV  ++CD L ++  D++G++ L           D+  K   EP        +    
Sbjct: 74  LDIVV-AMTCDALRINVQDAAGDRILAS---------DMLNK---EPTSWAAWNRELNVA 120

Query: 132 TTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNE 191
            +  G     L + +      G   E  +  +  + + EA R  K   P+          
Sbjct: 121 LSGGGREYQTLAEEDA-----GRLMEQEEDMHVGHALGEARRSHKRKFPK---------- 165

Query: 192 YSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTH 250
               KLK     + C+IYG LE N+V G FHI      +  H +    +     AFN +H
Sbjct: 166 --GPKLKRGEMPDSCRIYGSLEGNKVQGDFHIT-----ARGHGYFEFGEHLDHHAFNFSH 218

Query: 251 HIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL----------- 299
            I  LSFG           PLD T++        + YY+ I+PTIY R            
Sbjct: 219 MITELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTIDPYSQVLP 275

Query: 300 DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKITEKSKSLGH 336
           D S +                             +PGIFF Y + P+++ I+E+  SL  
Sbjct: 276 DPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFHVPGIFFKYNIEPILLIISEERGSLLA 335

Query: 337 LWTKIMCNISGTYIT 351
           L  +++  +SG  + 
Sbjct: 336 LLVRLVNVMSGVVVA 350


>gi|315054535|ref|XP_003176642.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
 gi|311338488|gb|EFQ97690.1| hypothetical protein MGYG_00729 [Arthroderma gypseum CBS 118893]
          Length = 399

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 84/384 (21%), Positives = 155/384 (40%), Gaps = 75/384 (19%)

Query: 4   SERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           + +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+  
Sbjct: 21  ATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAILCTLLTCSELITWYRGHENHHFSVERG 80

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEV 122
              ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP    
Sbjct: 81  VSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW- 125

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
             A   +++       + E +  NK  +    E E  +  +  + + E  R +K   P+ 
Sbjct: 126 --AAWNREMNKRRSGGSPEYQTLNKEDTLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK- 180

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
                     + +  K+   + C+++G LE N+V G+ HI A G  Y           P+
Sbjct: 181 ----------APKMKKSDVVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRATNPH 227

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER--- 298
              + N TH I  LSFG           PLD TV+        + Y++ ++PTIY +   
Sbjct: 228 ---SLNFTHLITELSFGPHY---GRLLNPLDKTVSTTSVNFYKYQYHLSVVPTIYTKSGH 281

Query: 299 LDGSKLGGGDG-------------------------------GMPGIFFSYELSPLMVKI 327
           +D S+    D                                  PGIFF Y + P+++ +
Sbjct: 282 MDPSRRSLPDSSTITAKDSKTTVSTNQYAVTSYSQPIQPRIDSTPGIFFKYNIEPILLIV 341

Query: 328 TEKSKSLGHLWTKIMCNISGTYIT 351
           +++  SL  L  +++  +SG  +T
Sbjct: 342 SQERDSLLGLMIRLVNVVSGVLVT 365


>gi|301626814|ref|XP_002942582.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like, partial [Xenopus (Silurana) tropicalis]
          Length = 298

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 87/186 (46%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G+  +N+V G+FH++          H    QP   A  +  H I  LSFG  LQ 
Sbjct: 122 GCRFEGFFSINKVPGNFHVS---------THSAMAQP---ANPDMRHIIHKLSFGNTLQV 169

Query: 264 DD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           ++       L G    A +     +Y +KI+PT+YE ++G +             +    
Sbjct: 170 ENIHGAFNALGGADKLASQALESHDYVLKIVPTVYEDMNGEQQFSYQYTVANKAYVAYSH 229

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + +    T +   I GT+    ++D+ + +  +  
Sbjct: 230 TGRVVPAIWFRYDLSPITVKYTERRQPIYRFITTVCAIIGGTFTVAGILDSFIFTASEAW 289

Query: 367 SKVEIG 372
            K+++G
Sbjct: 290 KKIQLG 295



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 52/98 (53%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
            D + K  +D  + T  G  ++I C LFI++L   ++  +       EL+VD    + G 
Sbjct: 16  FDIYRKVPKDLTQPTYTGAIISICCCLFITFLFLSELTGFIANEIVNELYVDDPDKNSGG 75

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L++ +P ++C+ + LD  D  G   + H+++++
Sbjct: 76  KIEVTLNVSLPNLACEVVGLDIQDEMGRHEVGHIDNSM 113


>gi|432943284|ref|XP_004083140.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 2-like [Oryzias latipes]
          Length = 372

 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/175 (28%), Positives = 79/175 (45%), Gaps = 18/175 (10%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+I+G + VN+V+G+ HI  G        H H     +  ++N +H I  L FG   +
Sbjct: 159 DACRIHGDIYVNKVAGNLHITVGKPIHHPQGHAHIAAFVSHESYNFSHRIDRLCFG---E 215

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
           +      PLDGT     +   M+ Y+I ++PT               + ER        G
Sbjct: 216 EIPGIINPLDGTEKITYDNNQMYQYFITVVPTKLKTYKITADTHQFSVTERERVINHTAG 275

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
             G+ GIFF Y+ S LMV ++E+   L     ++   I G Y T  ++ +L+  C
Sbjct: 276 SHGVSGIFFKYDTSSLMVTVSEQHMPLWQFLVRLCGIIGGIYSTTGMLHSLIGFC 330



 Score = 38.5 bits (88), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  + + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13 VKELDAFPKVSDSYVETSTSGGTVSLIAFSTMALLSVLEFFVYQDTWMKYEYEVDKDFSS 72

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++D+ V  + C ++  D +D
Sbjct: 73 KLRINVDVTV-AMRCQHVGADILD 95


>gi|195130281|ref|XP_002009580.1| GI15435 [Drosophila mojavensis]
 gi|193908030|gb|EDW06897.1| GI15435 [Drosophila mojavensis]
          Length = 433

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/393 (23%), Positives = 163/393 (41%), Gaps = 63/393 (16%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV---D 61
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+  + TE ++    D
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYW--NETEIIYQFEPD 73

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHV-------EHNIYKRRLDLDGKP 114
            S   ++ +H+DI   T++    +L  VD   E  L V          ++ +  D D + 
Sbjct: 74  ISLDEQVQMHVDI---TVAMPCASLSGVDLMDETQLDVFAYGTLQREGVWWQMSDADRRH 130

Query: 115 IQEPQKEVVNAVKKKKVTTENGTTTTEL---EDPNKCGSCYGAETETRKCCNTCNEVKEA 171
            Q  Q  + N   +++  +       ++     P K       E++T+            
Sbjct: 131 FQSMQ--MTNHYLREEYHSVADILFKDILRERSPPK-------ESDTQSDAAAPPPPG-- 179

Query: 172 YRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
                 AL +L  I Q +++Y          + C+++G L +N+V+G  H+  G    + 
Sbjct: 180 ------ALQQLQQISQMESKY----------DACRLHGTLGINKVAGVLHLVGGAQPVVG 223

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKI 291
               H +  +     N TH I  LSFG   Q      +PL+G      E A+   Y+IK+
Sbjct: 224 MFEDHWMIEFRRMPANFTHRINRLSFG---QYSRRIVQPLEGDETIIREEATTVQYFIKV 280

Query: 292 IPT---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGH 336
           +PT               + E +          G PGI+F Y+ S L + ++    +L  
Sbjct: 281 VPTEIRHTFSTISTFQYAVTENVRKLDAERNSYGSPGIYFKYDWSALKIVVSHDRDNLVT 340

Query: 337 LWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
              ++   ISG  +    V+ALL +  +++ ++
Sbjct: 341 FVIRLCSIISGIIVISGAVNALLVAIQRRLLRM 373


>gi|354545468|emb|CCE42196.1| hypothetical protein CPAR2_807450 [Candida parapsilosis]
          Length = 351

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/383 (23%), Positives = 141/383 (36%), Gaps = 90/383 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS+R+K  DAF K       ++  GG  T++ +     ++ V+V  Y       +  VD 
Sbjct: 4   FSKRVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFLGLLILWVEVGGYIGGYVDRQFLVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I+LD++V  + C+YL  +  D + ++ L  E       L+ +G     P    
Sbjct: 64  VLRSDLTINLDMIV-AMPCEYLHTNVEDITRDRFLAGE------TLNFEGVKFFIP---- 112

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                                 PN                N  N+  E         P+L
Sbjct: 113 ----------------------PNFS-------------INNPNDFHET--------PDL 129

Query: 183 DTIVQCKNEYSTEKLKNTFTEG---CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D ++Q        +L     EG   C I+G + VN+V G F I A G  Y        D 
Sbjct: 130 DEVMQESLRAEFSQLGRRVNEGAPACHIFGSIPVNQVKGDFRITAKGFGY-------RDR 182

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
                 A N +H I+  S+G           PLD T    EE    + Y+ K++PT+YE+
Sbjct: 183 SFVPLEALNFSHVIQEFSYG---DFYPFLNNPLDATGKVTEENLQTYLYHAKVVPTLYEK 239

Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           L    D ++    +                + GI+F+YE  P+ + I EK         K
Sbjct: 240 LGLEVDTTQYSLTENHHVVKVDPHSKRPQEISGIYFAYEFEPIKLIIREKRIPFLQFIAK 299

Query: 341 IMCNISGTYIT----FMLVDALL 359
           +     G  +     F L + LL
Sbjct: 300 LGTIAGGVVVAAGYLFKLYEKLL 322


>gi|312081872|ref|XP_003143209.1| HT034 [Loa loa]
 gi|307761627|gb|EFO20861.1| hypothetical protein LOAG_07628 [Loa loa]
          Length = 292

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 84/188 (44%), Gaps = 29/188 (15%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
           T GC+  G  E+++V G+FH++          H  D QP T   ++  H I  + FG  I
Sbjct: 114 TSGCRFEGKFEISKVPGNFHLS---------THAADTQPET---YDMRHTIHSVVFGDNI 161

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
               +     PL    A   +G+   +Y +KI+P++YE ++G+                 
Sbjct: 162 ITSQNLGSFNPLKNREALQTDGSFTHDYVLKIVPSVYEDINGNTKYSYQYTYAHKEYVTY 221

Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
                 MP ++F YEL P+ +K TE+ +      T I   + GT+    ++DA L S  +
Sbjct: 222 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 281

Query: 365 KISKVEIG 372
              K +IG
Sbjct: 282 LYRKHQIG 289



 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/104 (27%), Positives = 51/104 (49%), Gaps = 1/104 (0%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRGSKL 68
           LD + K   D  + T  G  +++VC  FI +++  D+  +  +    ELFVD   R  ++
Sbjct: 13  LDIYRKVPRDLTQPTTTGAVISVVCISFILFMVINDLLSFLTLEIRSELFVDDPGREGRI 72

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
            + L+I +P +SC Y+ +D  D +G   +    N  K  +   G
Sbjct: 73  EVQLNISLPYLSCYYIGIDIQDDNGRHEVGFVQNTEKIPIGTSG 116


>gi|170587366|ref|XP_001898447.1| HT034 [Brugia malayi]
 gi|158594071|gb|EDP32661.1| HT034, putative [Brugia malayi]
          Length = 286

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/188 (26%), Positives = 84/188 (44%), Gaps = 29/188 (15%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--I 259
           T GC+  G  ++++V G+FHI+          H  D QP T   ++  H I  + FG  +
Sbjct: 108 TSGCRFEGKFDISKVPGNFHIS---------THAADTQPET---YDMRHTIHSVVFGDDV 155

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG------------- 306
               +     PL    A   +G+   +Y +KI+P++YE + G+K                
Sbjct: 156 STSQNLGSFNPLKNREALESDGSFTHDYVLKIVPSVYEDITGNKKYSYQYTYAHKEYVTY 215

Query: 307 --GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
                 MP ++F YEL P+ +K TE+ +      T I   + GT+    ++DA L S  +
Sbjct: 216 HYSGKVMPALWFRYELQPITIKYTERRQPFYTFITSICAVVGGTFTVAGIIDASLFSLTE 275

Query: 365 KISKVEIG 372
              K ++G
Sbjct: 276 LYRKHQMG 283



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 1/100 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           +K  D + K   D  + T  G  +++VC  FI +++  D+ ++  +    ELFVD   R 
Sbjct: 4   IKRFDIYRKVPRDLTQPTTTGAIISVVCISFILFMVINDLLNFLTLEVRSELFVDDPGRE 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
            ++ + L+I +P +SC Y+ +D  D +G   +    N  K
Sbjct: 64  GRIEVQLNISLPYLSCYYIGIDIQDDNGRHEVGFVRNTEK 103


>gi|449489976|ref|XP_004158474.1| PREDICTED: protein disulfide-isomerase 5-3-like [Cucumis sativus]
          Length = 224

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 36/212 (16%)

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           +  NE    K       GC+I GY+ V +V GS  IA   + S +H        + ++  
Sbjct: 20  KSNNETGNVKRPAPSAGGCRIEGYVRVKKVPGSLVIA---ARSESH-------SFDASQM 69

Query: 247 NTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
           N +H I HLSFG K+      D ++  P  G       G S  N           +Y++I
Sbjct: 70  NMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQI 129

Query: 292 IPT-IYERLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           + T +  R  G  L                +P + F + LSP+ V ITE  KS  H  T 
Sbjct: 130 VKTEVLTRRSGKLLEEYEYTAHSSVSQSLYIPVVKFHFVLSPMQVVITENQKSFSHFITN 189

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +   I G +    ++DALLH+ ++ + KVE+G
Sbjct: 190 VCAIIGGVFTVAGILDALLHNTIRLMKKVELG 221


>gi|348516790|ref|XP_003445920.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Oreochromis niloticus]
          Length = 290

 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+  G   +N+V G+FH++          H    QP      + TH I  L+FG KLQ
Sbjct: 113 DGCRFEGEFTINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLAFGEKLQ 160

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
               +     L G    +    +  +Y +KI+PT+YE L G +             +   
Sbjct: 161 VQKVQGAFNALGGADKMSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVANKEYVAYS 220

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I G +    ++D+ + +  + 
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGAFTVAGIIDSCIFTASEA 280

Query: 366 ISKVEIG 372
             K++IG
Sbjct: 281 WKKIQIG 287



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           ++  D + K  +D  + T  G  ++I+C +FI +L   ++  +       EL+VD     
Sbjct: 5   VRRFDIYRKVPKDLTQPTYTGAFISILCCVFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + CD + LD  D  G   + H+E+++
Sbjct: 65  SGGKIEVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIENSM 105


>gi|224117462|ref|XP_002317580.1| predicted protein [Populus trichocarpa]
 gi|222860645|gb|EEE98192.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 64/227 (28%), Positives = 98/227 (43%), Gaps = 43/227 (18%)

Query: 178 ALPELDTIVQCKNEYSTEKLKNTFTE--GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
           A+      ++ K E +T+ +K       GC+I GY+ V +V G+  I+     +++  H 
Sbjct: 266 AMESQRQALEHKPENATQHVKRPAPSAGGCRIEGYVRVKKVPGNLMIS-----ALSGAHS 320

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEEGASMFNY---- 287
            D     S   N +H I H SFG+K+      D +R  P  G       G S  N+    
Sbjct: 321 FD-----SKQMNLSHVISHFSFGMKVLPRVMSDVKRLLPYIGRSHDKLNGRSFINHRDVG 375

Query: 288 -------YIKIIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
                  Y++++ T                YE    S L      MP   F +ELSP+ V
Sbjct: 376 ANVTIEHYLQVVKTEVVTRRSSSERKLIEEYEYTAHSSLSQ-TVYMPTAKFHFELSPMQV 434

Query: 326 KITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            ITE SKS  H  T +   I G +    ++D++LH  V+ + KVE+G
Sbjct: 435 LITENSKSFSHFITNVCAIIGGVFTVAGILDSILHHTVRMMKKVELG 481



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 37/115 (32%), Positives = 64/115 (55%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MV + +LK +D + K   D  E ++ G  ++IV  L + +L  +++ +Y  V+T+  + V
Sbjct: 1   MVSTNKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMMFLFGMELNNYLTVNTSTTVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  +I  P++SC++ ++D  D  G   L++   I K  +D D KP
Sbjct: 61  DNSSDGEFLRIDFNISFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSIDHDLKP 115


>gi|449468488|ref|XP_004151953.1| PREDICTED: protein disulfide-isomerase 5-4-like [Cucumis sativus]
          Length = 481

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 36/212 (16%)

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           +  NE    K       GC+I GY+ V +V GS  IA   + S +H        + ++  
Sbjct: 277 KSNNETGNVKRPAPSAGGCRIEGYVRVKKVPGSLVIA---ARSESH-------SFDASQM 326

Query: 247 NTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
           N +H I HLSFG K+      D ++  P  G       G S  N           +Y++I
Sbjct: 327 NMSHIISHLSFGRKISPKAFSDAKQLIPYIGISHDRLNGRSFINQRDLGANVTIEHYLQI 386

Query: 292 IPT-IYERLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           + T +  R  G  L                +P + F + LSP+ V ITE  KS  H  T 
Sbjct: 387 VKTEVLTRRSGKLLEEYEYTAHSSVSQSLYIPVVKFHFVLSPMQVVITENQKSFSHFITN 446

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +   I G +    ++DALLH+ ++ + KVE+G
Sbjct: 447 VCAIIGGVFTVAGILDALLHNTIRLMKKVELG 478



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 60/110 (54%), Gaps = 1/110 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E T+ G  ++IV  L + +L  +++ +Y  VST+  + V
Sbjct: 1   MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
           D S+ G  L +  +I  P +SC++ A+D  D  G   L++   I K  +D
Sbjct: 61  DNSTDGDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSID 110


>gi|440632946|gb|ELR02865.1| hypothetical protein GMDG_05797 [Geomyces destructans 20631-21]
          Length = 384

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 86/376 (22%), Positives = 151/376 (40%), Gaps = 67/376 (17%)

Query: 3   FSER---LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
           F+E+   +   DAF K   ++  KT  GG  T++  +  + L   ++  +++ +      
Sbjct: 14  FAEKGSIVSAFDAFPKSKPEYVTKTSGGGKWTVLMLIISALLTMSELGRWWRGNEDHTFE 73

Query: 60  VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           V+      L ++LD+VV  + C  + ++  D+SG++ L     + K  L    + +    
Sbjct: 74  VEKFVSRDLQVNLDMVV-AMRCPDIHINVQDASGDRIL--ASKVLKTELTNWLQWVNMKG 130

Query: 120 KEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL 179
           +  +       V T+ G  +         G   G E E     +  + +  A R  KWA 
Sbjct: 131 QHQLGHNADGSVITDEGWESD--------GHDEGFEEE-----HVHDIIYTAMRSNKWA- 176

Query: 180 PELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVH 236
                         T K+K    +G  C+I+G + +N+V G FHI A G  Y        
Sbjct: 177 -------------KTPKIKGHPRDGDSCRIFGSMMLNKVQGDFHITARGHGYQ----EAF 219

Query: 237 DIQPYTSAAFNTTHHIRHLSFGI---KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
             +    ++FN +H +   SFG    KL +      PLD T+           Y++ ++P
Sbjct: 220 GTKHLDHSSFNFSHIVSEFSFGAFYPKLIN------PLDQTITTTANQFYKSQYFMSVVP 273

Query: 294 TIYERLDGSKLGG------------------GDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           TIY     + L                     +  +PGIFF Y++ PLM+ I E+  S  
Sbjct: 274 TIYTVSSPNPLSSKSTIFTNQYAVTHEDRKINERTVPGIFFKYDIEPLMLTIEERRDSFL 333

Query: 336 HLWTKIMCNISGTYIT 351
               K++  +SG  + 
Sbjct: 334 RFAIKVVNILSGVLVA 349


>gi|302659461|ref|XP_003021421.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
 gi|291185318|gb|EFE40803.1| hypothetical protein TRV_04495 [Trichophyton verrucosum HKI 0517]
          Length = 427

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 88/407 (21%), Positives = 158/407 (38%), Gaps = 91/407 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            + +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+ 
Sbjct: 20  IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
               ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP   
Sbjct: 80  GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
              A   +++       + E +  NK  S    E E  +  +  + + E  R +K   P+
Sbjct: 126 ---AAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK 180

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-----SINHVHV 235
                      S +  K+   + C+++G LE N+V G+ HI A G  Y     + N   +
Sbjct: 181 -----------SPKLKKSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSM 229

Query: 236 HDIQPYTS-----------------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
             +QP  +                    N TH I  LSFG           PLD TV+  
Sbjct: 230 SLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHY---GRLLNPLDKTVSST 286

Query: 279 EEGASMFNYYIKIIPTIYER-----------LDGSKLGGGDG------------------ 309
                 + Y++ ++PTIY +            D S +   D                   
Sbjct: 287 SINFYKYQYHLSVVPTIYTKSGHIDPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPI 346

Query: 310 -----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                  PGIFF Y + P+++ ++++  SL  L  +++  +SG  +T
Sbjct: 347 QPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVT 393


>gi|67482091|ref|XP_656395.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS]
 gi|56473591|gb|EAL51010.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS]
 gi|449705171|gb|EMD45274.1| Hypothetical protein EHI5A_018710 [Entamoeba histolytica KU27]
          Length = 315

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 86/203 (42%), Gaps = 39/203 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPG-LSYSINHV-------------HVHDIQPYTSAAFNTT 249
           GC++YG ++V+RVSG FH+A G +S+    +             H+H        +FN T
Sbjct: 116 GCRMYGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQEMKSFNPT 175

Query: 250 HHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIY--------- 296
           H+I HLSF   L         PL+G   T++  +       YYI +IPT++         
Sbjct: 176 HYINHLSFSNTLGSTVHSGETPLNGKKFTLSGFDNARK--TYYINVIPTLFKYPSYTLRT 233

Query: 297 ------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                 ER D     G     PG+FF YELSP +V       S  H    +   I G  I
Sbjct: 234 YQLSVNER-DVPVTYGASFTQPGVFFKYELSPYIVINEMNDHSFAHSLASVGAIIGGVLI 292

Query: 351 TFMLVDALL---HSCVKKISKVE 370
              L+  L    H  V  + ++E
Sbjct: 293 IMGLLSRLFDSKHELVTSVVEME 315


>gi|301093181|ref|XP_002997439.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262110695|gb|EEY68747.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 278

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 44/179 (24%), Positives = 79/179 (44%), Gaps = 26/179 (14%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC++YG ++V +V+G    A   S ++          +    FN++H + HL FG ++ D
Sbjct: 109 GCRLYGTVQVQKVAGDLSFAHEGSLTV-------FSFFDFLNFNSSHVVNHLRFGPQIPD 161

Query: 264 DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----------------SKLGGG 307
                 PL        +  + + Y++ ++P+ Y  L+G                S+   G
Sbjct: 162 ---METPLIDVSKILTKNLATYKYFVSVVPSRYVYLNGRSVTTFQYSVTEHETSSRGPNG 218

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
               PG+ FSYE SP+ V+  E   S+ H  T     + G +    ++D  ++S  KK+
Sbjct: 219 QVSFPGVIFSYEFSPIAVEYIESKLSVLHFLTSTSAIVGGVFAVARMIDGAIYSVSKKV 277



 Score = 45.1 bits (105), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 25/98 (25%), Positives = 54/98 (55%), Gaps = 3/98 (3%)

Query: 8   KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
           +  D   K  E   E+T+ GG VT++  + +++L+  ++  ++ VS T  + VD+     
Sbjct: 4   RRFDLNAKGVEGIQERTIGGGVVTLMSCVAVAFLLLSELSVWWTVSVTHRMHVDTDP-QD 62

Query: 68  LPIHLDIVVPTI--SCDYLALDAVDSSGEQHLHVEHNI 103
            PI++++ V  +  +C  +A+D  DS G + + ++ +I
Sbjct: 63  FPINIEVDVSFLHEACKEVAMDVSDSKGHKEIMLQKDI 100


>gi|238880883|gb|EEQ44521.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 345

 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS+++K  DAF K       ++  GG  T++ +     ++ +++  Y       +  VD 
Sbjct: 4   FSQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I++D++V  + C ++  +  D + + +L  E       L+ +G     P    
Sbjct: 64  QIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N                   +PN                                 P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D ++Q   + E+ +E  + N     C I+G + VN+V G F I   G  Y  +  HV   
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
            P+ S   N +H I+  SFG   +       PLD T    EE    + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQ 239

Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           L    D ++    +               G+PGI+F Y+  P+ + I EK         K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            +  I G     ++    L    +K+  +  G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333


>gi|448105220|ref|XP_004200441.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|448108351|ref|XP_004201072.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|359381863|emb|CCE80700.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
 gi|359382628|emb|CCE79935.1| Piso0_003028 [Millerozyma farinosa CBS 7064]
          Length = 344

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 83/369 (22%), Positives = 145/369 (39%), Gaps = 85/369 (23%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           FS +++  DAF K      +++  GG  T+V  LFI  +  V++  +       +  VD 
Sbjct: 4   FSTKVRTFDAFPKIDPHKTQRSSSGGFSTLVTALFILLVTWVEIGGFLGGYVDHQFIVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I+LD++V  + C+YL  + +D + ++ L  E       L+  G     P  ++
Sbjct: 64  KLTSDLFINLDMLV-GMPCEYLHTNVMDVTHDRLLAGE------LLNFQGMNFFVP--DI 114

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           V      ++ +EN    T                                       P+L
Sbjct: 115 V------QMNSENNDHNT---------------------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D +++   + E++    + N     C IYG + VN+V+G FHI   G  Y+  H      
Sbjct: 130 DEVMRETVRAEFNVAGTRMNEDASACHIYGSIPVNKVAGDFHITGKGFGYADRHR----- 184

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
            P+     N +H I   SFG   +     + PLD T   A +    + Y++  +PT+YE+
Sbjct: 185 VPF--EKLNFSHVIMEFSFG---EFYPMIKNPLDFTGKIASQKLQSYKYFMTAVPTLYEK 239

Query: 299 LD-----------------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
           L                   +   G    +PG++F Y+   + + I EK         ++
Sbjct: 240 LGIEVDTYQYSLTEQHRAITTDETGLPSDIPGLYFKYDFDTIKLLIAEKRIPFLQFVARL 299

Query: 342 MCNISGTYI 350
              +SG +I
Sbjct: 300 ATIVSGLFI 308


>gi|427788003|gb|JAA59453.1| Putative copii vesicle protein [Rhipicephalus pulchellus]
          Length = 285

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 84/191 (43%), Gaps = 28/191 (14%)

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
           K     GC+  G   +++V G+FH++          H    QP      + TH I  L+F
Sbjct: 104 KTPVGSGCRFEGKFFIHKVPGNFHVS---------THAAAKQP---DKIDMTHIIHDLTF 151

Query: 258 GIKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG---------- 306
           G+K+ D+       LD        G    +Y +KI+PT+YE+  G ++            
Sbjct: 152 GVKMTDEVRGSFNSLDEMDKSGANGIESHDYVMKIVPTVYEKSKGERIESYQYTYAYKSY 211

Query: 307 ---GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
                 G  MP I+F Y+L+P+ VK T +   L    T +   + GT+    +VD+L+ +
Sbjct: 212 VSISHSGRIMPAIWFRYDLTPITVKYTRRGIPLYSFLTSVCAIVGGTFTVAGIVDSLVFT 271

Query: 362 CVKKISKVEIG 372
             +   K E+G
Sbjct: 272 ASEVFRKFEMG 282



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 31/106 (29%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MVF  R    D + K  +D  + TV G  ++I+   FIS L   +   Y       EL+V
Sbjct: 1   MVFDVR--RFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYV 58

Query: 61  DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
           D+ S   K+P+ ++I +  + C  + LD  D  G   +    N  K
Sbjct: 59  DNPSSADKIPVSINITLLKLDCSVVGLDIQDDMGRHEVGFVENTEK 104


>gi|347828541|emb|CCD44238.1| similar to endoplasmic reticulum-Golgi intermediate compartment
           protein 2 [Botryotinia fuckeliana]
          Length = 381

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 78/354 (22%), Positives = 136/354 (38%), Gaps = 81/354 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  +T  GG  T+   L    L+  +   ++    T    V+   G 
Sbjct: 21  VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLH---VEHNIYKRRLDLDGKPIQEPQKEVV 123
            L +++D+VV  + C  L ++  D++G++ L    ++ +       +D K + +  K+  
Sbjct: 81  SLQVNMDMVV-KMKCSELHINVQDAAGDRILAGIMLKEDATNWNQWVDAKGMHQLGKDAH 139

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
             V   +   E G     + D    G                   K+  ++ K       
Sbjct: 140 GRVITGEEYHEEGFGEEHVHDIVTLGG------------------KKRAKFAK------- 174

Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIA------PGLSYSINHVHV 235
                     T ++K     G  C++YG LEVN+V G FH+       P + + ++H   
Sbjct: 175 ----------TPRVKGGPKGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHLDH--- 221

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
                   +AFN +H I  LSFG           PLD T+A        + Y++ I+PT+
Sbjct: 222 --------SAFNFSHIINELSFGPFYP---SLLNPLDRTIAGTPNHFHKYQYFLSIVPTL 270

Query: 296 YERLDGSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
           Y     +                        G+  +PGIFF Y++ PL++ + E
Sbjct: 271 YSLSPSTFSPSSSPTLLRTNQYAVTSQEHIVGERSVPGIFFKYDIEPLLLTVEE 324


>gi|432879813|ref|XP_004073560.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Oryzias latipes]
          Length = 271

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           EGC+  G   +N+V G+FH++          H    QP      + TH I  L+FG  LQ
Sbjct: 94  EGCRFEGKFTINKVPGNFHVS---------THSATAQPQNP---DMTHSIHKLAFGDTLQ 141

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
             + +     L G    +    +  +Y +KI+PT+YE L G +             +   
Sbjct: 142 VHNVKGAFNALGGADKLSSNPLASHDYILKIVPTVYEDLSGRQRFSYQYTVANKEYVAYS 201

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ +      T I   + GT+    ++D+ + +  + 
Sbjct: 202 HTGRIIPAIWFRYDLSPITVKYTERRQPFYRFITTICAIVGGTFTVAGIIDSCIFTASEA 261

Query: 366 ISKVEIG 372
             K++IG
Sbjct: 262 WKKIQIG 268



 Score = 43.1 bits (100), Expect = 0.24,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 40/77 (51%), Gaps = 4/77 (5%)

Query: 31  TIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGSKLPIHLDIVVPTISCDYLALD 87
           +I+C  FI +L   ++  +       EL+VD      G K+ + L+I +P + CD + LD
Sbjct: 10  SILCCFFILFLFLSELTGFIATEIVNELYVDDPDKDSGGKIDVSLNISLPNLHCDLVGLD 69

Query: 88  AVDSSGEQHL-HVEHNI 103
             D  G   + H+++++
Sbjct: 70  IQDEMGRHEVGHIDNSM 86


>gi|403337257|gb|EJY67839.1| hypothetical protein OXYTRI_11647 [Oxytricha trifallax]
          Length = 279

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 81/345 (23%), Positives = 146/345 (42%), Gaps = 99/345 (28%)

Query: 58  LFVDSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQ 116
           +FVD+S    +L I++DIV P + C+ L LD +D  G   + +  ++YK+ L        
Sbjct: 1   MFVDASHHDDRLNINIDIVFPKMPCEVLTLDIMDIMGTHIVDIGGSLYKKGL-------- 52

Query: 117 EPQKEVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKK 176
                           ++NG   +E        S  G   +TR+                
Sbjct: 53  ----------------SQNGEFVSET-------SMLGG-IQTRQ---------------- 72

Query: 177 WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVH 236
                 D + + K+E   +       +GCQ+ G+  +NRV G+FHI+   S+S   + V+
Sbjct: 73  ------DLLKRIKDEMDQK-------QGCQLKGFFNINRVPGNFHIS---SHSQKDLIVN 116

Query: 237 -DIQPYTSAAFNTTHHIRHLSFGIK-----LQDDDERR---KPLDGTVAKAEEG------ 281
            ++Q YT   F+ TH I H+SFG +     +Q + +++    PLDG    A +       
Sbjct: 117 LEMQGYT---FDFTHKINHVSFGRQEDFKVIQKNFKQQGVLNPLDGLEFSANQDNKGKPQ 173

Query: 282 ASMFNYYIKIIPTIYERLDGSK--------------LGGGDGGMPGIFFSYELSPLMVKI 327
           A   N+++  + + Y  +D ++                  +     + FSYELSP+ V  
Sbjct: 174 ALATNFFMVAVSSYY--MDTNRNTYNMYQLTSTHKSQSNANVNENMLVFSYELSPIKVLF 231

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            ++ +++     ++   I G +    +VD ++H  V  + K  IG
Sbjct: 232 NQEKENIVDFMIQLCAIIGGVFTISSVVDTIIHRSVSLLFKQRIG 276


>gi|50303625|ref|XP_451754.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49640886|emb|CAH02147.1| KLLA0B04950p [Kluyveromyces lactis]
          Length = 341

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/371 (22%), Positives = 150/371 (40%), Gaps = 76/371 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E   + +  GG  +I+ +LF+ ++   +   +F     ++  VD     
Sbjct: 4   LRTFDAFPKTDEQHVKTSSKGGLSSILTYLFLLFIAWSEFGSFFGGYIDQQYVVDDQIKE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LD+ V  ++C  + ++A D +G++ L  E+      + ++G P   P    VN +
Sbjct: 64  TVTINLDLYV-NMACKNIRINARDITGDRGLISEN------IQMEGMPFYIPVGTRVNEM 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   + +L++    G    A+   R+  +T                     +
Sbjct: 117 --------NNIVSPDLDE--ILGEAIPAQF--REAIDTSE-------------------L 145

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
             +++++          GC I+G + VN+V G  HI A G  Y        D        
Sbjct: 146 TGRDDFN----------GCHIFGSVPVNKVKGELHITAHGWGYRSASAIPKD-------Q 188

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG---- 301
            N  H I  LSFG      D    PLD T   ++E    + Y+  I+PT+Y+++      
Sbjct: 189 INFNHVINELSFGDFYPYID---NPLDNTAKFSDEKIKAYYYFTSIVPTLYKKMGAEVDT 245

Query: 302 -----SKLGGGDG----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT- 351
                S+   G+     G+PGIF  Y+  P+ + I++          +++  +S    T 
Sbjct: 246 NQYALSETEYGESSKATGVPGIFIRYQFEPMKIIISDMRIGFFQFIIRLVAILSFIVYTA 305

Query: 352 ---FMLVDALL 359
              F LVD  L
Sbjct: 306 SWIFRLVDKSL 316


>gi|323448816|gb|EGB04710.1| hypothetical protein AURANDRAFT_55105 [Aureococcus anophagefferens]
          Length = 324

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 87/187 (46%), Gaps = 36/187 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC + G++ VNRV G+FHI    + SI+H          +A  N +H + HLSFG  L  
Sbjct: 143 GCMVSGHVLVNRVPGNFHIE---ARSIHH-------NLNAAMTNLSHVVNHLSFGTPLAK 192

Query: 264 DDERR----------KPLDGTVAKAEEGASMFNYYIKIIPTIYE---------RLDGSKL 304
           D +R+           PLDG +  + +   + ++Y K++ T +E          + G ++
Sbjct: 193 DMQRKVSKYPQFQSVHPLDGGIFVSRDYHQVHHHYSKVVSTHFEVGGMMTKSREIVGYQM 252

Query: 305 GGGDGGM-------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                 M       P   FSY+LSP+ V ++ K +      T +   I GT+    +VDA
Sbjct: 253 LAQSQIMHYNEMDVPEAKFSYDLSPMAVLVSSKGRRWYDFVTSVCAIIGGTFTVVGIVDA 312

Query: 358 LLHSCVK 364
           +L+  +K
Sbjct: 313 VLYKIIK 319


>gi|241953329|ref|XP_002419386.1| COPii-coated vesicle-associated protein, putative [Candida
           dubliniensis CD36]
 gi|223642726|emb|CAX42980.1| COPii-coated vesicle-associated protein, putative [Candida
           dubliniensis CD36]
          Length = 345

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 89/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F++++K  DAF K       ++  GG  T+V +     ++ +++  Y       +  VD 
Sbjct: 4   FAQKVKTFDAFPKVDPHHQVRSQRGGLSTLVTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I++D++V  + C ++  +  D + + +L  E       L+ +G     P    
Sbjct: 64  QIRSDLTINIDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N                   +PN                                 P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D ++Q   + E+ +E  + N     C I+G + VN+V G F I   G  Y  +  HV   
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
            P+ S   N +H I+  SFG   +       PLD T    EE    + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKITEERLQTYMYYAKVVPTLYEQ 239

Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           L    D ++    +               G+PGI+F Y+  P+ + I EK         K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            +  I G     ++    L    +K+  +  G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333


>gi|301100294|ref|XP_002899237.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
 gi|262104154|gb|EEY62206.1| endoplasmic reticulum-Golgi intermediate compartment protein,
           putative [Phytophthora infestans T30-4]
          Length = 469

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 86/189 (45%), Gaps = 34/189 (17%)

Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           EGC+++G+L V RV G+FH+     +YS++           S+  N +H +  L FG  L
Sbjct: 289 EGCRLFGHLYVKRVPGNFHVHLANPAYSMD-----------SSLVNASHTVNELWFGEHL 337

Query: 262 QDDDERRKPLDGTVA------KAEEGASMFN-----YYIKIIPTIYERLDGSKLGG---- 306
              D  R P +          + ++  S++      +YIK++   Y + DGS++      
Sbjct: 338 APGDMSRLPREAQTQLYTHRLENQDFTSLYKNHTYVHYIKVVTNSYVQGDGSEINVYKYT 397

Query: 307 -------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
                      +P + F Y+LSP+ V+I+E +    H  T     I G +    +VD ++
Sbjct: 398 AHSNEYLETDDLPSVMFRYDLSPMSVRISEDTVPFYHFVTSACAIIGGVFTVIGIVDQII 457

Query: 360 HSCVKKISK 368
           H   + ++K
Sbjct: 458 HQTARALNK 466



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 54/109 (49%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K  ED    T+ G +++I     +  L  ++   Y  V    ++ +D     
Sbjct: 7   LKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEGLDQ 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
            + I+ +I VP + C++ ++D  D +G +  ++  +I+K RLD  G+ +
Sbjct: 67  TMRINFNITVPDLPCEFASVDVSDMTGTRKHNMTSDIFKIRLDQKGRMV 115


>gi|334311203|ref|XP_001380577.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Monodelphis domestica]
          Length = 321

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
           EGC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  ++
Sbjct: 144 EGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQ 191

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
           +Q+       L G         +  +Y +KI+PT+YE   G +             +   
Sbjct: 192 VQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYS 251

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 252 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 311

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 312 WKKIQLG 318



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 51/102 (50%), Gaps = 4/102 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS--- 62
           RL   D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD    
Sbjct: 35  RLTRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEVVNELYVDDPDK 94

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
             G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 95  DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 136


>gi|169860063|ref|XP_001836668.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Coprinopsis cinerea okayama7#130]
 gi|116502344|gb|EAU85239.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Coprinopsis cinerea okayama7#130]
          Length = 516

 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 84/364 (23%), Positives = 141/364 (38%), Gaps = 78/364 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  ++   G + +   +    L+  D+ ++       E  VD+ +GS
Sbjct: 19  LTKFDAFPKLPSTYKARSESRGFLMVFVIILAFLLMLNDIGEFIWGWPDFEFGVDNDKGS 78

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            LPI+LD+ V  + C YL +D  D+ G+            RL L             N  
Sbjct: 79  TLPINLDMTV-NMPCKYLTVDLRDAMGD------------RLFLS------------NGF 113

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           ++     + G  T  L++     S   A  ++RK       +   +R KK          
Sbjct: 114 RRDGTIFDVGQATA-LKEHAAALSAQEAVAQSRKSRGFFATL---FRSKK---------- 159

Query: 187 QCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAP-GLSY-SINHVHVHDIQ 239
                    K K T+        C+I+G + V +V+ + H+   G  Y S  HV  H + 
Sbjct: 160 --------SKFKPTYNHQADASACRIWGTMYVKKVTANLHVTTLGHGYASYEHVDHHLM- 210

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
                  N +H I+  SFG       E  +PLD +     E    + Y++ ++PT Y   
Sbjct: 211 -------NLSHVIQEFSFGPHFP---EIVQPLDNSFEATHEHFIAYQYFLHVVPTTYVAP 260

Query: 300 DGSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
             + L                + G PGIFF +EL PL +   +++ +L  L  + +  I 
Sbjct: 261 RTAPLETNQYSVTHYTRVLEHNRGTPGIFFKFELDPLKITQYQRTTTLLQLMIRCVGVIG 320

Query: 347 GTYI 350
           G ++
Sbjct: 321 GVFV 324


>gi|407037175|gb|EKE38536.1| hypothetical protein ENU1_163530 [Entamoeba nuttalli P19]
          Length = 315

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 87/211 (41%), Gaps = 39/211 (18%)

Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPG-LSYSINHV-------------HVHDIQPY 241
           K  N    GC+++G ++V+RVSG FH+A G +S+    +             H+H     
Sbjct: 108 KFDNRLLGGCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQ 167

Query: 242 TSAAFNTTHHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIY- 296
              +FN TH+I HLSF   L         PL+G   T+   +       YYI +IPT++ 
Sbjct: 168 EMKSFNPTHYINHLSFSNILGSTVHSGETPLNGKEFTLNGFDNARK--TYYINVIPTLFK 225

Query: 297 --------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
                         ER D     G     PG+FF YELSP +V       S  H    + 
Sbjct: 226 YPSYTLRTYQLSVNER-DVPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVG 284

Query: 343 CNISGTYITFMLVDALL---HSCVKKISKVE 370
             I G  I   L+  L    H  V  + ++E
Sbjct: 285 AIIGGVLIIMGLLSRLFDSKHELVTSVVEME 315


>gi|340505495|gb|EGR31815.1| hypothetical protein IMG5_101180 [Ichthyophthirius multifiliis]
          Length = 327

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 86/191 (45%), Gaps = 23/191 (12%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG---- 258
           EGC I G + VN+V G+FHI+   S++  HV    +        + +H ++HLSFG    
Sbjct: 137 EGCNISGTMLVNKVPGNFHIS---SHAYGHVLGQVLSNAGKNTIDLSHKVKHLSFGDEFD 193

Query: 259 ---IKLQDDDERRKPLDGTVAKAEEG---ASMFNYYIKIIPTIY----------ERLDGS 302
              IK Q       P+D       +       + YYI I+PT Y           +   +
Sbjct: 194 LKNIKRQFSQGLLHPMDNKQKDKPQNILNGITYQYYINIVPTTYVDTGNKNYHVYQFTYN 253

Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                +  +P +++ Y+LSP+ VK + + +S  H   +I   I G +    +VD++++  
Sbjct: 254 SNEQINNHLPTVYYRYDLSPVTVKFSMQKESFLHFLVQICAIIGGIFTVASIVDSIVYRA 313

Query: 363 VKKISKVEIGG 373
           V  I K +  G
Sbjct: 314 VLNILKRDASG 324



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/109 (32%), Positives = 56/109 (51%), Gaps = 1/109 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
            +K  D + K   D  + T  G  V+I+C + +  L   ++  +  +  T E+F+D  RG
Sbjct: 2   NIKSFDMYRKLPSDLTQSTTSGAVVSIICGIIVLILFISELRSFLAIEETSEMFIDIVRG 61

Query: 66  -SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
             K+ ++LDI  P   CD L+LD  D  G   +++E  I KRR+  DG 
Sbjct: 62  GQKIKVNLDIDFPKFPCDILSLDMQDIMGSHTVNIEGTINKRRISSDGN 110


>gi|154305556|ref|XP_001553180.1| hypothetical protein BC1G_08547 [Botryotinia fuckeliana B05.10]
          Length = 381

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/354 (21%), Positives = 136/354 (38%), Gaps = 81/354 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K  DAF K    +  +T  GG  T+   L    L+  +   ++    T    V+   G 
Sbjct: 21  VKAFDAFPKAKPQYITQTSGGGKWTVAMMLVSFALLVSEFMRWWTGHETHTFVVEKGVGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLH---VEHNIYKRRLDLDGKPIQEPQKEVV 123
            L +++D+VV  + C  L ++  D++G++ L    ++ +       +D K + +  K+  
Sbjct: 81  SLQVNMDMVV-KMKCSELHINVQDAAGDRILAGIMLKEDATNWNQWVDAKGMHQLGKDAH 139

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
             V   +   E G     + D    G                   K+  ++ K       
Sbjct: 140 GRVITGEEYHEEGFGEEHVHDIVTLGG------------------KKRAKFAK------- 174

Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIA------PGLSYSINHVHV 235
                     T ++K     G  C++YG LEVN+V G FH+       P + + ++H   
Sbjct: 175 ----------TPRVKGGPKGGDSCRVYGSLEVNKVQGDFHLTARGHGYPEMGHHLDH--- 221

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
                   +AFN +H I  LSFG           PLD T+A        + Y++ ++PT+
Sbjct: 222 --------SAFNFSHIINELSFGPFYPS---LLNPLDRTIAGTPNHFHKYQYFLSVVPTL 270

Query: 296 YERLDGSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
           Y     +                        G+  +PGIFF Y++ PL++ + E
Sbjct: 271 YSLSPSTFSPSSSPTLLRTNQYAVTSQEHIVGERSVPGIFFKYDIEPLLLTVEE 324


>gi|426200953|gb|EKV50876.1| hypothetical protein AGABI2DRAFT_113626 [Agaricus bisporus var.
           bisporus H97]
          Length = 542

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 147/360 (40%), Gaps = 69/360 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    F  ++   G +TI   L    L+  D+ +Y       +  VD     
Sbjct: 20  LAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFAVDQDNAP 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + ++LD+VV  + C YL++D  D  G+            RL L G             +
Sbjct: 80  YMFVNLDMVV-NMQCRYLSVDLRDVVGD------------RLLLSG------------GL 114

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           ++  V    G  T  L++ +K  S   A +++RK                      D+++
Sbjct: 115 QRDGVKFNIGEATA-LKEHSKGLSARQALSQSRKSRGF-----------------FDSLL 156

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
           +  +E   +   N   +G  C+IYG + V RV+ + HI   G  YS ++ HV   Q    
Sbjct: 157 RRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYS-SYQHVDHNQ---- 211

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +    ++  + + Y++ ++PT Y     S 
Sbjct: 212 --MNLSHVITEFSFGPYF---PEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSP 266

Query: 304 LGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                + G PGIFF ++L PL + I +K+ +L  L  + +  I G ++
Sbjct: 267 LRTNQYSVTHYTRQVEHNKGTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFV 326


>gi|346469653|gb|AEO34671.1| hypothetical protein [Amblyomma maculatum]
          Length = 285

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 51/191 (26%), Positives = 85/191 (44%), Gaps = 28/191 (14%)

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
           K     GC+  G   +++V G+FH++          H    QP      + TH I  L+F
Sbjct: 104 KTPVGSGCRFEGKFFIHKVPGNFHVS---------THAAAKQP---EKIDMTHIIHDLTF 151

Query: 258 GIKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG---------- 306
           G+K+ D+ +     LD        G    +Y +KI+PT+YE+  G ++            
Sbjct: 152 GVKMTDEVKGSFNSLDEMDKSGGNGIESHDYVMKIVPTVYEKSRGERIESYQYTYAYKSY 211

Query: 307 ---GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHS 361
                 G  MP I+F Y+L+P+ VK T +   L    T +   + GT+    +VD+L+ +
Sbjct: 212 VSISHTGRIMPAIWFRYDLTPITVKYTRRGVPLYSFLTSVCAIVGGTFTVAGIVDSLIFT 271

Query: 362 CVKKISKVEIG 372
             +   K E+G
Sbjct: 272 ASEVFRKFEMG 282



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 46/100 (46%), Gaps = 1/100 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           ++  D + K  +D  + TV G  ++I+   FIS L   +   Y       EL+VD+ S  
Sbjct: 5   VRRFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELVSELYVDNPSSA 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
            K+P+ ++I +  + C  + LD  D  G   +    N  K
Sbjct: 65  EKIPVSINITLLKLDCSVVGLDIQDDMGRHEVGFVENTEK 104


>gi|115433364|ref|XP_001216819.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114189671|gb|EAU31371.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 449

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 128/320 (40%), Gaps = 75/320 (23%)

Query: 68  LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVK 127
           L ++LDIVV  + CD L ++  D++G++ L  E  + KR          EP    +  + 
Sbjct: 133 LQLNLDIVV-EMPCDTLDVNIQDAAGDRVLAGE--LLKR----------EPTSWQL-WMD 178

Query: 128 KKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQ 187
           K+   +  G+   +       G    A+ E     +   EV+   R K    P+L     
Sbjct: 179 KRNYESYGGSHEYQTLSQEDAGRLE-AQDEDAHVHHVLGEVRRNPRKKFPKSPKLR---- 233

Query: 188 CKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTS-AA 245
                     +    + C+IYG LE N+V G FHI A G  Y        D  P+     
Sbjct: 234 ----------RGDAVDSCRIYGSLEGNKVQGDFHITARGHGYR-------DFAPHLDHQT 276

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER------- 298
           FN +H I  LSFG           PLD T+A+ E     F Y++ ++PTIY +       
Sbjct: 277 FNFSHMITELSFGPHYP---TLLNPLDKTIAETETHYYKFQYFLSVVPTIYSKGNRVLDT 333

Query: 299 --------LDGSK-------------------LGGGDGGMPGIFFSYELSPLMVKITEKS 331
                    D S+                   L      +PGIFF Y + P+++ I+E+ 
Sbjct: 334 YSIAPPTLHDNSRHNKNLVFTNQYAATSQSDALPESPFFVPGIFFKYNIEPILLLISEER 393

Query: 332 KSLGHLWTKIMCNISGTYIT 351
            S   L  +++  +SG  +T
Sbjct: 394 GSFLSLLIRLVNTVSGVMVT 413


>gi|409083992|gb|EKM84349.1| hypothetical protein AGABI1DRAFT_32491 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 542

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/360 (24%), Positives = 147/360 (40%), Gaps = 69/360 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    F  ++   G +TI   L    L+  D+ +Y       +  VD     
Sbjct: 20  LAKFDAFPKVPSAFKARSESRGFMTIFVMLVALLLMLNDIGEYIWGWPEFKFAVDQDNAP 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + ++LD+VV  + C YL++D  D  G+            RL L G             +
Sbjct: 80  YMFVNLDMVV-NMQCRYLSVDLRDVVGD------------RLLLSG------------GL 114

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           ++  V    G  T  L++ +K  S   A +++RK                      D+++
Sbjct: 115 QRDGVKFNIGEATA-LKEHSKGLSARQALSQSRKSRGF-----------------FDSLL 156

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
           +  +E   +   N   +G  C+IYG + V RV+ + HI   G  YS ++ HV   Q    
Sbjct: 157 RRNSEPKFKPTYNHVPDGGACRIYGTMPVKRVTANLHITTVGHGYS-SYQHVDHNQ---- 211

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +    ++  + + Y++ ++PT Y     S 
Sbjct: 212 --MNLSHVITEFSFGPYF---PEIVQPLDESFEVTQDHFTAYQYFLHVVPTTYIAPRTSP 266

Query: 304 LGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                + G PGIFF ++L PL + I +K+ +L  L  + +  I G ++
Sbjct: 267 LRTNQYSVTHYTRQVEHNKGTPGIFFKFDLDPLNITIHQKTTTLIQLLIRCVGVIGGVFV 326


>gi|68465583|ref|XP_723153.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|68465876|ref|XP_723006.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|46445018|gb|EAL04289.1| likely COPII secretory vesicle component [Candida albicans SC5314]
 gi|46445174|gb|EAL04444.1| likely COPII secretory vesicle component [Candida albicans SC5314]
          Length = 345

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 88/398 (22%), Positives = 153/398 (38%), Gaps = 90/398 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           F++++K  DAF K       ++  GG  T++ +     ++ +++  Y       +  VD 
Sbjct: 4   FAQKVKTFDAFPKVDPQHQVRSQRGGLSTLLTYFCGLLILWIEIGGYIGGYVDRQFTVDD 63

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
              S L I++D++V  + C ++  +  D + + +L  E       L+ +G     P    
Sbjct: 64  QIRSALTINVDMIV-AMPCQFIHTNVEDITHDTYLAGE------TLNFEGIHFFVPDSFK 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
           +N                   +PN                                 P+L
Sbjct: 117 IN-------------------NPNDFHET----------------------------PDL 129

Query: 183 DTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDI 238
           D ++Q   + E+ +E  + N     C I+G + VN+V G F I   G  Y  +  HV   
Sbjct: 130 DEVMQESLRAEFRSEGARVNEGAPACHIFGSIPVNQVRGDFRITGKGFGYR-DRSHV--- 185

Query: 239 QPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER 298
            P+ S   N +H I+  SFG   +       PLD T    EE    + YY K++PT+YE+
Sbjct: 186 -PFES--LNFSHVIQEFSFG---EFYPYLNNPLDATGKVTEERLQTYMYYAKVVPTLYEQ 239

Query: 299 L----DGSKLGGGDG--------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
           L    D ++    +               G+PGI+F Y+  P+ + I EK         K
Sbjct: 240 LGLEIDTNQYSLTENQHVIKVDQSTHRPDGIPGIYFLYDFEPIKLVIREKRIPFFQFIAK 299

Query: 341 IMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
            +  I G     ++    L    +K+  +  G K V +
Sbjct: 300 -LATIGG---GLLIAAGYLFRLYEKLLFIFYGQKAVQQ 333


>gi|395505103|ref|XP_003756885.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Sarcophilus harrisii]
          Length = 290

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 85/187 (45%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IK 260
           EGC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  ++
Sbjct: 113 EGCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQ 160

Query: 261 LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGG 307
           +Q+       L G         +  +Y +KI+PT+YE   G +             +   
Sbjct: 161 VQNIHGAFNALGGADKLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYS 220

Query: 308 DGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  + 
Sbjct: 221 HTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEA 280

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 281 WKKIQLG 287



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|156065931|ref|XP_001598887.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980]
 gi|154691835|gb|EDN91573.1| hypothetical protein SS1G_00976 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 421

 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 81/349 (23%), Positives = 134/349 (38%), Gaps = 71/349 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K    +  +T  GG  T+   +    L+  +   ++    T    V+   G 
Sbjct: 21  VQAFDAFPKAKPQYITQTSGGGKWTVAMLIISFALLLSEFSRWWTGYETHTFVVEKGIGH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL---HVEHNIYKRRLDLDGKPIQEPQKEVV 123
            L I++D+VV  + C  L ++  D++G++ L    ++ +       +D K + +  K+  
Sbjct: 81  SLQINMDMVV-KMKCSGLHINVQDAAGDRILAGIMLKEDPTNWSQWVDAKGVHQLGKDAH 139

Query: 124 NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELD 183
             V   +   E G     + D    G                   K+  ++ K       
Sbjct: 140 GRVVTGEEYHEEGFGEEHVHDIVALGG------------------KKRAKFAK------- 174

Query: 184 TIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQP 240
                     T +LK     G  C++YG LEVN+V G FHI A G  Y     H+     
Sbjct: 175 ----------TPRLKGGPRGGDSCRVYGSLEVNKVQGDFHITAKGHGYPELGQHL----- 219

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
               AFN +H I  LSFG           PLD T+A        + Y++ I+PT+Y    
Sbjct: 220 -DHNAFNFSHIINELSFGPFYPS---LLNPLDRTIAGTPNHFHKYQYFLSIVPTLYSLSP 275

Query: 301 GSKLGG--------------------GDGGMPGIFFSYELSPLMVKITE 329
            +                        G+  +PGIFF Y++ PL++ + E
Sbjct: 276 STFSPSSSPSLLRTNQYAVTSQEHIVGERNVPGIFFKYDIEPLLLTVEE 324


>gi|255563725|ref|XP_002522864.1| thioredoxin domain-containing protein, putative [Ricinus communis]
 gi|223537948|gb|EEF39562.1| thioredoxin domain-containing protein, putative [Ricinus communis]
          Length = 478

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 92/217 (42%), Gaps = 41/217 (18%)

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           ++ +N   + K     T GC+I GY+ V +V G+  I+            H   P   + 
Sbjct: 270 LKSENATQSTKRPAPLTGGCRIEGYVRVKKVPGNLIISA-------RSGAHSFDP---SQ 319

Query: 246 FNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFNY-----------YIK 290
            N +H I HLSFG+K+     ++ +R  P  G       G S  N+           Y++
Sbjct: 320 MNMSHVISHLSFGLKVSPKVMNEAKRLVPYIGGSHDKLNGRSFVNHRDVDANVTIEHYLQ 379

Query: 291 IIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLG 335
           I+ T                YE    S L      +P   F +ELSP+ V ITE  KS  
Sbjct: 380 IVKTEVVTRRSSREHKLLEEYEYTAHSSLVQS-VYIPAAKFHFELSPMQVLITENPKSFS 438

Query: 336 HLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           H  T +   I G +    ++D++LH  V+ + KVE+G
Sbjct: 439 HFITNVCAIIGGVFTVAGILDSILHHTVRLMKKVELG 475


>gi|361126303|gb|EHK98312.1| putative ER-derived vesicles protein 41 [Glarea lozoyensis 74030]
          Length = 343

 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 85/206 (41%), Gaps = 40/206 (19%)

Query: 194 TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA-----AFNT 248
           T +L+    + C+IYG LEVN+V G FH+             H  Q + +      AFN 
Sbjct: 140 TPRLRGNVGDSCRIYGNLEVNKVQGDFHLT---------ARGHGYQEWGAGHLDHTAFNF 190

Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD 308
           +H +  LSFG           PLD TV+        F Y++ ++PT Y  +D S     D
Sbjct: 191 SHIVNELSFGAFYP---SLLNPLDRTVSTTPNHFHKFQYFLSVVPTAYT-VDSSSRSARD 246

Query: 309 G------------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                               +PGIFF Y++ P+++ + E   S      K++   SG  +
Sbjct: 247 TIFTNQYAVTEQSHEVNERSVPGIFFKYDIEPMLLTVEESRDSFLRFVVKVVNVFSGVLV 306

Query: 351 T----FMLVDALLHSCVKKISKVEIG 372
                F L +  + +  K+   + +G
Sbjct: 307 AGHWGFTLTEWAVSAFGKRKRSMSVG 332


>gi|406868300|gb|EKD21337.1| copii-coated vesicle protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 382

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 83/352 (23%), Positives = 137/352 (38%), Gaps = 67/352 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K    +  +T  GG  T+   +    LI  +   +++   T    V+ +   
Sbjct: 22  VQAFDAFPKAKPQYVTRTSGGGKWTVAMLIVSFMLIYSEFSRWWRGHETHTFTVEKAVER 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I+LDIVVP + C+ + ++  D++G++ L     ++ R       P Q  Q      V
Sbjct: 82  GLQINLDIVVP-MKCEDIHINVQDAAGDRIL--AGVMFTR------NPTQWAQWVHERGV 132

Query: 127 KKKKVTTENGTTTTE--LEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
            +          T E  L+     G  +  +           +  +  R +K A  E+D+
Sbjct: 133 HRLGTDANGKIITGEEYLDHDEGFGEEHVHDIVAAAGKLKKAKFAKTPRSRKSA--EMDS 190

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY---SINHVHVHDIQP 240
                               C+I+G LEVN+V G  HI A G  Y   +  H+  H    
Sbjct: 191 --------------------CRIFGNLEVNKVQGELHITARGHGYQELAAGHLDHH---- 226

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
               AFN +H +  LSFG           PLD TV+        F Y++ ++PT+Y  +D
Sbjct: 227 ----AFNFSHVVSELSFGPFYP---SLHNPLDRTVSTTPNNFHKFQYFLSVVPTVYS-VD 278

Query: 301 GSKLGGG------------------DGGMPGIFFSYELSPLMVKITEKSKSL 334
            S                       +  +PGIFF Y+  P+++ + E   S 
Sbjct: 279 SSTTYSSQTLFTNQYAVTEQSHVVSEFSVPGIFFKYDFEPMLLTVQESRDSF 330


>gi|357474735|ref|XP_003607653.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355508708|gb|AES89850.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 477

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/197 (27%), Positives = 83/197 (42%), Gaps = 36/197 (18%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           T GC++ GY+ V +V GS  ++             D   + ++  N +H I HLSFG K+
Sbjct: 288 TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 337

Query: 262 QD----DDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL-- 304
                 D +   P  G       G S  N           +YI+++ T      G KL  
Sbjct: 338 TPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 397

Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                          +P   F  ELSP+ V ITE  KS  H  T +   I G +    ++
Sbjct: 398 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVFTVAGIL 457

Query: 356 DALLHSCVKKISKVEIG 372
           D++LH+ +K + K+EIG
Sbjct: 458 DSILHNTIKAMKKIEIG 474


>gi|21618302|gb|AAM67352.1| unknown [Arabidopsis thaliana]
          Length = 317

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 95/211 (45%), Gaps = 37/211 (17%)

Query: 189 KNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFN 247
           K++ S+  LK    T GC++ GY+ V +V G+  ++   + S +H        + S+  N
Sbjct: 114 KSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSSQMN 163

Query: 248 TTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFN-----------YYIKII 292
            +H + HLSFG ++        +R  P  G      +G S  N           +Y++I+
Sbjct: 164 MSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYLQIV 223

Query: 293 PTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
            T   + +G  L                 +P   F +ELSP+ V ITE SKS  H  T +
Sbjct: 224 KTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFITNV 283

Query: 342 MCNISGTYITFMLVDALLHSCVKKISKVEIG 372
              I G +    ++D++LH  +  + K+E+G
Sbjct: 284 CAIIGGAFTVAGILDSILHHSMTLMKKIELG 314


>gi|238480964|ref|NP_680742.2| protein PDI-like 5-4 [Arabidopsis thaliana]
 gi|332659898|gb|AEE85298.1| protein PDI-like 5-4 [Arabidopsis thaliana]
          Length = 532

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 37/214 (17%)

Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           ++ K++ S+  LK    T GC++ GY+ V +V G+  ++   + S +H        + S+
Sbjct: 326 LEDKSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSS 375

Query: 245 AFNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYI 289
             N +H + HLSFG ++      + +R  P  G      +G S  N           +Y+
Sbjct: 376 QMNMSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYL 435

Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +I+ T   + +G  L                 +P   F +ELSP+ V ITE SKS  H  
Sbjct: 436 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 495

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           T +   I G +    ++D++LH  +  + K+E+G
Sbjct: 496 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 529


>gi|22328963|ref|NP_567765.2| protein PDI-like 5-4 [Arabidopsis thaliana]
 gi|75213708|sp|Q9T042.1|PDI54_ARATH RecName: Full=Protein disulfide-isomerase 5-4; Short=AtPDIL5-4;
           AltName: Full=Protein disulfide-isomerase 7; Short=PDI7;
           AltName: Full=Protein disulfide-isomerase 8-2;
           Short=AtPDIL8-2; Flags: Precursor
 gi|4490704|emb|CAB38838.1| putative protein [Arabidopsis thaliana]
 gi|7269561|emb|CAB79563.1| putative protein [Arabidopsis thaliana]
 gi|15450832|gb|AAK96687.1| putative protein [Arabidopsis thaliana]
 gi|20259836|gb|AAM13265.1| putative protein [Arabidopsis thaliana]
 gi|332659897|gb|AEE85297.1| protein PDI-like 5-4 [Arabidopsis thaliana]
          Length = 480

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 98/214 (45%), Gaps = 37/214 (17%)

Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           ++ K++ S+  LK    T GC++ GY+ V +V G+  ++   + S +H        + S+
Sbjct: 274 LEDKSDNSSRTLKKAPSTGGCRVEGYMRVKKVPGNLMVS---ARSGSH-------SFDSS 323

Query: 245 AFNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYI 289
             N +H + HLSFG ++      + +R  P  G      +G S  N           +Y+
Sbjct: 324 QMNMSHVVNHLSFGRRIMPQKFSEFKRLSPYLGLSHDRLDGRSFINQRDLGPNVTIEHYL 383

Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +I+ T   + +G  L                 +P   F +ELSP+ V ITE SKS  H  
Sbjct: 384 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 443

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           T +   I G +    ++D++LH  +  + K+E+G
Sbjct: 444 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 477


>gi|73953406|ref|XP_852891.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 isoform 1 [Canis lupus familiaris]
          Length = 290

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G+  +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGHFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|327307836|ref|XP_003238609.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
 gi|326458865|gb|EGD84318.1| COPII-coated vesicle protein [Trichophyton rubrum CBS 118892]
          Length = 399

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 85/383 (22%), Positives = 152/383 (39%), Gaps = 77/383 (20%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+    
Sbjct: 23  KLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVERGVS 82

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKEVVN 124
            ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP      
Sbjct: 83  QEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSWTA- 127

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
               +++       + E +  NK  +    E E  +  +  + + E  R +K   P+   
Sbjct: 128 --WNREMNQRRSGGSPEYQTLNKEDTFRLEEQE--EDLHVEHVLGEVRRSRKKKFPK--- 180

Query: 185 IVQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYT 242
                      KLK +   + C+++G LE N+V G+ HI A G  Y           P+ 
Sbjct: 181 ---------APKLKRSDAVDSCRVFGSLEGNKVQGNLHITARGFGY---FEWGRTTNPH- 227

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL--- 299
             + N TH I  LSFG           PLD TV+        + Y++ ++PTIY +    
Sbjct: 228 --SLNFTHLITELSFGPHY---GRLLNPLDKTVSSTSINFYKYQYHLSVVPTIYTKSGHI 282

Query: 300 --------DGSKLGGGDG-----------------------GMPGIFFSYELSPLMVKIT 328
                   D S +   D                          PGIFF Y + P+++ ++
Sbjct: 283 DPNRRSLPDASTITAKDSKTTVSTNQYAVTSYSQPIQPRIDATPGIFFKYNIEPILLIVS 342

Query: 329 EKSKSLGHLWTKIMCNISGTYIT 351
           ++  SL  L  +++  +SG  +T
Sbjct: 343 QEWDSLLALMVRLVNVVSGVLVT 365


>gi|302508773|ref|XP_003016347.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
 gi|291179916|gb|EFE35702.1| hypothetical protein ARB_05746 [Arthroderma benhamiae CBS 112371]
          Length = 427

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 87/407 (21%), Positives = 157/407 (38%), Gaps = 91/407 (22%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            + +LK  DAF K    +   +  GG  TI   +  + L C ++  +++        V+ 
Sbjct: 20  IATKLKTFDAFPKTKPSYTSTSRGGGLWTIFVAIICTILSCSELITWYRGHENHHFSVER 79

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI-QEPQKE 121
               ++ +++D VV  + CD + ++  D++G+   H+          L G  + QEP   
Sbjct: 80  GVSQEMQLNIDTVV-AMPCDDVRINIQDAAGD---HI----------LAGDLLTQEPTSW 125

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                  +++       + E +  NK  S    E E  +  +  + + E  R +K   P+
Sbjct: 126 ---GAWNREMNQRRSGGSPEYQTLNKEDSLRLEEQE--EDLHVEHVLGEVRRSRKKKFPK 180

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSY-----SINHVHV 235
                      S +  K+   + C+++G LE N+V G+ HI A G  Y     + N   +
Sbjct: 181 -----------SPKLKKSDAVDSCRVFGSLEGNKVQGNLHITARGFGYFEWGRATNPHSM 229

Query: 236 HDIQPYTS-----------------AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA 278
             +QP  +                    N TH I  LSFG           PLD TV+  
Sbjct: 230 SLLQPIITCIHGDAKNLTDQLTKLFPGLNFTHLITELSFGPHY---GRLLNPLDKTVSST 286

Query: 279 EEGASMFNYYIKIIPTIYERL-----------DGSKLGGGDG------------------ 309
                 + Y++ ++PTIY +            D S +   D                   
Sbjct: 287 SINFYKYQYHLSVVPTIYTKSGHIDPNRRSLPDTSTITAKDSKTTVSTNQYAVTSYSQPI 346

Query: 310 -----GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                  PGIFF Y + P+++ ++++  SL  L  +++  +SG  +T
Sbjct: 347 QPRIDATPGIFFKYNIEPILLIVSQERDSLLALMVRLVNVVSGVLVT 393


>gi|226294628|gb|EEH50048.1| endoplasmic reticulum-Golgi intermediate compartment protein
           [Paracoccidioides brasiliensis Pb18]
          Length = 392

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 89/383 (23%), Positives = 145/383 (37%), Gaps = 88/383 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   T  GG  T+V ++  + L   ++  +++        V+     
Sbjct: 24  LRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISR 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L ++LDIVV  ++CD L ++  D++G++ L           D+  K   EP        
Sbjct: 84  ELQLNLDIVV-AMTCDALRINVQDAAGDRILAS---------DMLNK---EPTSWAAWNR 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +     +  G     L + +      G   E  +  +  + + EA R  K   P+     
Sbjct: 131 ELNVALSGGGREYQTLAEEDA-----GRLMEQEEDMHVGHALGEARRSHKRKFPK----- 180

Query: 187 QCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
                    KLK     + C+IYG LE N+V G FHI A G  Y                
Sbjct: 181 -------GPKLKRGEMPDSCRIYGSLEGNKVQGDFHITARGHGY---------------- 217

Query: 245 AFNTTHHIRH--LSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL--- 299
            F    H+ H  LSFG           PLD T++        + YY+ I+PTIY R    
Sbjct: 218 -FEFGEHLDHHELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYTRAGTV 273

Query: 300 --------DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKIT 328
                   D S +                             +PGIFF Y + P+++ I+
Sbjct: 274 DPYSQVLPDPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFHVPGIFFKYNIEPILLIIS 333

Query: 329 EKSKSLGHLWTKIMCNISGTYIT 351
           E+  SL  L  +++  ++G  + 
Sbjct: 334 EERGSLLALLVRLVNVMAGVVVA 356


>gi|224126339|ref|XP_002319814.1| predicted protein [Populus trichocarpa]
 gi|222858190|gb|EEE95737.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 63/219 (28%), Positives = 95/219 (43%), Gaps = 43/219 (19%)

Query: 186 VQCKNEYSTEKLKNTFTE--GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
           ++ K E +TE +K       GC+I GY+ V +V G+  I+   + S  H        + S
Sbjct: 274 LEHKPENATEHVKRPAPSAGGCRIEGYVRVKKVPGNLVIS---ARSGAH-------SFDS 323

Query: 244 AAFNTTHHIRHLSFGIKL----QDDDERRKPLDGTVAKAEEGASMFNY-----------Y 288
           A  N +H I H SFG+K+      D +R  P  G       G S  N+           Y
Sbjct: 324 AQMNLSHVISHFSFGMKVLPRVMSDVKRLIPHIGRSHDKLNGRSFINHRDVGANVTIEHY 383

Query: 289 IKIIPTI---------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKS 333
           ++++ T                YE    S L      MP   F +ELSP+ V ITE  KS
Sbjct: 384 LQVVKTEVVTRRSSAEHKLIEEYEYTAHSSLAQ-TVYMPTAKFHFELSPMQVLITENPKS 442

Query: 334 LGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
             H  T +   I G +    ++D++LH+  + + KVE+G
Sbjct: 443 FSHFITNVCAIIGGVFTVAGILDSILHNTFRMMKKVELG 481



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 36/115 (31%), Positives = 64/115 (55%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MV + +LK +D + K   D  E ++ G  ++IV  L + +L  +++ +Y  V+T+  + V
Sbjct: 1   MVSTNKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELNNYLTVNTSTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P++SC++ ++D  D  G   L++   I K  +D D KP
Sbjct: 61  DNSSDGEFLRIDFNLSFPSLSCEFASVDVSDVLGTNRLNITKTIRKFSIDHDLKP 115


>gi|295663046|ref|XP_002792076.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226279251|gb|EEH34817.1| conserved hypothetical protein [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 392

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/388 (23%), Positives = 148/388 (38%), Gaps = 98/388 (25%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K    +   T  GG  T+V ++  + L   ++  +++        V+     
Sbjct: 24  LRTFDAFPKTKPTYTSSTRRGGQWTVVVFVLCALLSISELRTWYKGVENHHFSVEKGISR 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           +L ++LDIVV  ++CD L ++  D++G++ L  +       L+ +        +E+  A+
Sbjct: 84  ELQLNLDIVV-AMTCDALRINVQDAAGDRILASD------MLNKEPTSWAAWNRELNVAL 136

Query: 127 -----KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                + + +T E+     E E+    G   G                EA R  K   P+
Sbjct: 137 SGGGREYQTLTEEHAGRLMEQEEDMHVGHALG----------------EARRSHKRKFPK 180

Query: 182 LDTIVQCKNEYSTEKLKN-TFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQ 239
                         KLK     + C+IYG LE N+V G FHI A G  Y           
Sbjct: 181 ------------GPKLKRGEMPDSCRIYGSLEGNKVQGDFHITARGHGY----------- 217

Query: 240 PYTSAAFNTTHHIRH--LSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                 F    H+ H  LSFG           PLD T++        + YY+ I+PTIY 
Sbjct: 218 ------FEYGEHLDHHELSFGPHYST---LLNPLDKTMSTTPFNFYKYQYYMSIVPTIYT 268

Query: 298 RL-----------DGSKLGGGDGG-----------------------MPGIFFSYELSPL 323
           R            D S +                             +PGIFF Y + P+
Sbjct: 269 RTGTIDPYSQVLPDPSTISPSQRKNTIFTNQYAVTSRSHELPDVQFYVPGIFFKYSIEPI 328

Query: 324 MVKITEKSKSLGHLWTKIMCNISGTYIT 351
           ++ I+E+  SL  L  +++  ++G  + 
Sbjct: 329 LLIISEERGSLLALLVRLVNVMAGVVVA 356


>gi|432100023|gb|ELK28916.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Myotis davidii]
          Length = 298

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 122 GCRFEGQFSINKVPGNFHVS---------THSASAQPQNP---DMTHVIHKLSFGDTLQV 169

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 170 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 229

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 230 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 289

Query: 367 SKVEIG 372
            K+++G
Sbjct: 290 KKIQLG 295



 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
            D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD      G 
Sbjct: 16  FDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 75

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + CD + LD  D  G   + H+++++
Sbjct: 76  KIDVSLNISLPNLHCDLVGLDIQDEMGRHEVGHIDNSM 113


>gi|217072996|gb|ACJ84858.1| unknown [Medicago truncatula]
 gi|388501234|gb|AFK38683.1| unknown [Medicago truncatula]
          Length = 243

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 82/197 (41%), Gaps = 36/197 (18%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK- 260
           T GC++ GY+ V +V GS  ++             D   + ++  N +H I HLSFG K 
Sbjct: 54  TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 103

Query: 261 --------------LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL-- 304
                         L  + +R         +  EG     +YI+++ T      G KL  
Sbjct: 104 TPRAMIDVKHWIPYLGINHDRLNGRSFVNTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 163

Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLV 355
                          +P   F  ELSP+ V ITE  KS  H  T +   I G +    ++
Sbjct: 164 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGVFTVAGIL 223

Query: 356 DALLHSCVKKISKVEIG 372
           D++LH+ +K + K+EIG
Sbjct: 224 DSILHNTIKAMKKIEIG 240


>gi|167382848|ref|XP_001736294.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165901464|gb|EDR27547.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 315

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 86/206 (41%), Gaps = 34/206 (16%)

Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPG------------LSYSINHV--HVHDIQPY 241
           K  N    GC+++G ++V+RVSG FH+A G            ++ +  H   H+H     
Sbjct: 108 KFDNRLLGGCRMHGTMKVSRVSGEFHVAFGKISFRQGRLNQFITATQKHTQGHIHQFTIQ 167

Query: 242 TSAAFNTTHHIRHLSFGIKLQDD-DERRKPLDG---TVAKAEEGASMFNYYIKIIPTIYE 297
              +FN TH+I HLSF   L         PL+G   T+   +       YYI +IPT+++
Sbjct: 168 EMKSFNPTHYINHLSFSNTLGSTVHSGETPLNGKEFTLNGFDNARK--TYYINVIPTLFK 225

Query: 298 ----RLDGSKLG----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                L   +L           G     PG+FF YELSP +V       S  H    +  
Sbjct: 226 YPSYTLRTYQLSVSERDIPVTYGASFAQPGVFFKYELSPYIVINEMNDHSFAHSLASVGA 285

Query: 344 NISGTYITFMLVDALLHSCVKKISKV 369
            + G  I    +  L  S  + ++ V
Sbjct: 286 IVGGVLIIIGWLSKLFDSNRELVTSV 311



 Score = 38.1 bits (87), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 27/108 (25%), Positives = 46/108 (42%), Gaps = 4/108 (3%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           + LK  D F K  E     T      +++ ++ I  L+  +  ++        + VD+ +
Sbjct: 6   QVLKECDIFLKVPEKLKITTNTTKLFSVISYIIIGLLVFSETYNFLNPQWVSHVDVDTVK 65

Query: 65  GSKLP---IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNI-YKRRL 108
              LP   I++DI  P + CD   LD  + +G   L V   I +  RL
Sbjct: 66  AGVLPNMYINIDITFPKMKCDDFGLDVTEITGSLQLGVTDGIKFDNRL 113


>gi|224013158|ref|XP_002295231.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969193|gb|EED87535.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 492

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 90/206 (43%), Gaps = 55/206 (26%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
           GCQ+ G+L VNRV G+FHI    + S+NH          +A  N TH + HLSFG     
Sbjct: 293 GCQVSGHLMVNRVPGNFHIE---AKSVNH-------NLNAAMTNLTHRVNHLSFGEPITK 342

Query: 259 ------------------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
                              ++ ++ ++  P+D T     +    F++YIK++ T      
Sbjct: 343 LPPHMENTPFMRKVKRVLKQVPEEHKQFNPMDDTEYVTAQFHQAFHHYIKVVSTHLNMGS 402

Query: 295 ---------------IYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLW 338
                          +Y+ L+ S++   D   +P   FSY++SP+ V + ++ +      
Sbjct: 403 SSKSEYSVNDVNAVTVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDYL 462

Query: 339 TKIMCNISGTYITFMLVDALLHSCVK 364
           T +   I GT+ T  L+DA L+   K
Sbjct: 463 TSLCAIIGGTFTTLGLIDATLYKVFK 488



 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 22/107 (20%), Positives = 53/107 (49%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +  +D + +  +D  E T  G  ++I     ++ L   +   + + +    + +D +   
Sbjct: 13  MSSVDFYRRVPKDLTEATSLGAIMSICAITVMAILFFSETLAFARTAMVTSIALDENDQP 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
           ++ ++ +I +  + CD++++D  D+ G    +V  NI K +LD DG+
Sbjct: 73  QIRLNFNITLMDLHCDFVSVDVWDTLGTNRQNVTKNIEKWQLDEDGQ 119


>gi|13385678|ref|NP_080446.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Mus
           musculus]
 gi|52000733|sp|Q9DC16.1|ERGI1_MOUSE RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|12835932|dbj|BAB23423.1| unnamed protein product [Mus musculus]
 gi|13529617|gb|AAH05516.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
           musculus]
 gi|26351067|dbj|BAC39170.1| unnamed protein product [Mus musculus]
 gi|26353098|dbj|BAC40179.1| unnamed protein product [Mus musculus]
 gi|53236959|gb|AAH83144.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Mus
           musculus]
 gi|71059789|emb|CAJ18438.1| 1200007D18Rik [Mus musculus]
 gi|74185526|dbj|BAE30231.1| unnamed protein product [Mus musculus]
 gi|148690563|gb|EDL22510.1| RIKEN cDNA 1200007D18 [Mus musculus]
 gi|158148953|dbj|BAF82010.1| MAA-136 protein [Mus musculus]
          Length = 290

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|50510831|dbj|BAD32401.1| mKIAA1181 protein [Mus musculus]
          Length = 320

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 144 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHTIHKLSFGDTLQV 191

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 192 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 251

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 252 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 311

Query: 367 SKVEIG 372
            K+++G
Sbjct: 312 KKIQLG 317



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 35  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 94

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 95  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 135


>gi|301763094|ref|XP_002916978.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Ailuropoda melanoleuca]
          Length = 306

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 130 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 177

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 178 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 237

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 238 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 297

Query: 367 SKVEIG 372
            K+++G
Sbjct: 298 KKIQLG 303



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
            D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD      G 
Sbjct: 24  FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 83

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 84  KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 121


>gi|344265732|ref|XP_003404936.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Loxodonta africana]
          Length = 338

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 85/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 162 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 209

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE  +G +             +    
Sbjct: 210 QNVQGAFNALGGADRLHSNPLASHDYILKIVPTVYEDKNGKQRYSYQYTVANKEYVAYSH 269

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 270 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 329

Query: 367 SKVEIG 372
            K+++G
Sbjct: 330 KKIQLG 335



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 53  FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 112

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 113 SGGKIDVTLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 153


>gi|114603487|ref|XP_001145588.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pan troglodytes]
          Length = 424

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 248 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 295

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 296 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 355

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 356 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 415

Query: 367 SKVEIG 372
            K+++G
Sbjct: 416 KKIQLG 421



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
            D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD      G 
Sbjct: 142 FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 201

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 202 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 239


>gi|109079798|ref|XP_001099287.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Macaca mulatta]
          Length = 379

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 203 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 250

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 251 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 310

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 311 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 370

Query: 367 SKVEIG 372
            K+++G
Sbjct: 371 KKIQLG 376



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 94  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 153

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 154 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 194


>gi|194382656|dbj|BAG64498.1| unnamed protein product [Homo sapiens]
          Length = 235

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 59  GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 106

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 107 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 166

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 167 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 226

Query: 367 SKVEIG 372
            K+++G
Sbjct: 227 KKIQLG 232


>gi|156406959|ref|XP_001641312.1| predicted protein [Nematostella vectensis]
 gi|156228450|gb|EDO49249.1| predicted protein [Nematostella vectensis]
          Length = 287

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 87/190 (45%), Gaps = 36/190 (18%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           EGC I     +N+V G+FH++          H    QP +    +  H I  ++FG ++ 
Sbjct: 111 EGCFISTRFTINKVPGNFHVS---------THGAGKQPDSP---DMNHIINAVNFGSRIM 158

Query: 263 DDDERRKPLDGTVAKAEE-----GASMFNYYIKIIPTIYERLDGS--------------- 302
           D    + P   T  K  +     G +  +Y +KI+PTIY++LDG+               
Sbjct: 159 D----KLPGAFTALKDRKRHDTNGLASHDYILKIVPTIYQKLDGTTTFSYQYTWAYKEYV 214

Query: 303 KLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
               G   +P I+F Y+LSP+ VK  E+ + L H  T +   + GT+    ++D+ + + 
Sbjct: 215 SYSHGGQMLPAIWFRYDLSPITVKYIERRQPLYHFITTVCAIVGGTFTVAGIIDSAVFTA 274

Query: 363 VKKISKVEIG 372
            +   K ++G
Sbjct: 275 SEMWRKHQLG 284



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 53/104 (50%), Gaps = 1/104 (0%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
           ++  D + K  +D  E T  G  ++I   LFI++L   +   +       ELFVD+ +  
Sbjct: 5   VRRFDIYRKVPKDLTEPTFAGAVISICSCLFITFLFLSEFYGFIGTEIASELFVDNPTED 64

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
            K+P+ L+I +P + C++  LD  D  G   +  + N+ +R ++
Sbjct: 65  DKIPVILNITLPRMKCEFPGLDIQDEMGRHEVGFKENVERREIN 108


>gi|403290258|ref|XP_003936243.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Saimiri boliviensis boliviensis]
          Length = 415

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 239 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 286

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 287 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGRQQYSYQYTVANKEYVAYSH 346

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 347 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 406

Query: 367 SKVEIG 372
            K+++G
Sbjct: 407 KKIQLG 412



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
           L   D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 130 LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 189

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 190 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 230


>gi|410349413|gb|JAA41310.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
 gi|410349417|gb|JAA41312.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
          Length = 290

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 48/101 (47%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P   C  + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNSQCRLVGLDIQDEMGRHEVGHIDNSM 105


>gi|417409674|gb|JAA51332.1| Putative endoplasmic reticulum-golgi intermediate compartment
           protein, partial [Desmodus rotundus]
          Length = 318

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 142 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 189

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 190 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 249

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 250 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 309

Query: 367 SKVEIG 372
            K+++G
Sbjct: 310 KKIQLG 315



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 33  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 92

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 93  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 133


>gi|390459630|ref|XP_002744599.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Callithrix jacchus]
          Length = 342

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 166 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 213

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 214 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 273

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 274 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 333

Query: 367 SKVEIG 372
            K+++G
Sbjct: 334 KKIQLG 339



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
           L   D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 57  LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 116

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 117 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 157


>gi|348575225|ref|XP_003473390.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Cavia porcellus]
          Length = 345

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 169 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 216

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 217 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 276

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 277 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 336

Query: 367 SKVEIG 372
            K+++G
Sbjct: 337 KKIQLG 342



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 60  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 119

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 120 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 160


>gi|395817675|ref|XP_003782285.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Otolemur garnettii]
          Length = 356

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 180 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 227

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 228 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 287

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 288 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 347

Query: 367 SKVEIG 372
            K+++G
Sbjct: 348 KKIQLG 353



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 71  FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 130

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 131 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 171


>gi|355691849|gb|EHH27034.1| hypothetical protein EGK_17136, partial [Macaca mulatta]
 gi|355750428|gb|EHH54766.1| hypothetical protein EGM_15664, partial [Macaca fascicularis]
          Length = 290

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           L+  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   LRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|338713524|ref|XP_001499596.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Equus caballus]
          Length = 356

 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 180 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 227

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 228 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 287

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 288 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 347

Query: 367 SKVEIG 372
            K+++G
Sbjct: 348 KKIQLG 353



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           L+  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 71  LRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 130

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 131 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 171


>gi|6330243|dbj|BAA86495.1| KIAA1181 protein [Homo sapiens]
          Length = 336

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 160 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 207

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 208 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 267

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 268 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 327

Query: 367 SKVEIG 372
            K+++G
Sbjct: 328 KKIQLG 333



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 51  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 110

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 111 SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 151


>gi|354477345|ref|XP_003500881.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Cricetulus griseus]
          Length = 333

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 157 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 204

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 205 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 264

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 265 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 324

Query: 367 SKVEIG 372
            K+++G
Sbjct: 325 KKIQLG 330



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRGS 66
            D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD      G 
Sbjct: 51  FDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKDSGG 110

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 111 KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 148


>gi|72534712|ref|NP_001026881.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Homo sapiens]
 gi|332248275|ref|XP_003273290.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Nomascus leucogenys]
 gi|426351000|ref|XP_004043047.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Gorilla gorilla gorilla]
 gi|51701446|sp|Q969X5.1|ERGI1_HUMAN RecName: Full=Endoplasmic reticulum-Golgi intermediate compartment
           protein 1; AltName: Full=ER-Golgi intermediate
           compartment 32 kDa protein; Short=ERGIC-32
 gi|15215343|gb|AAH12766.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [Homo sapiens]
 gi|15680269|gb|AAH14490.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [Homo sapiens]
 gi|119581826|gb|EAW61422.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1,
           isoform CRA_a [Homo sapiens]
 gi|208966210|dbj|BAG73119.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1
           [synthetic construct]
 gi|410301142|gb|JAA29171.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
 gi|410349415|gb|JAA41311.1| endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Pan
           troglodytes]
          Length = 290

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|402873423|ref|XP_003900575.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Papio anubis]
 gi|380784387|gb|AFE64069.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
 gi|383408185|gb|AFH27306.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
 gi|384941372|gb|AFI34291.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Macaca mulatta]
          Length = 290

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|350594414|ref|XP_003134100.3| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Sus scrofa]
          Length = 313

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 137 GCRFEGQFSINKVPGNFHVS---------THSATAQPPNP---DMTHVIHKLSFGDTLQV 184

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 185 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 244

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 245 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 304

Query: 367 SKVEIG 372
            K+++G
Sbjct: 305 KKIQLG 310



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS---RGS 66
            D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD      G 
Sbjct: 31  FDIYRKVPKDLTQPTYTGAIISICCCLFIFFLFLSELTGFITTEIVNELYVDDPDKDSGG 90

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 91  KIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 128


>gi|149052230|gb|EDM04047.1| rCG34297 [Rattus norvegicus]
          Length = 283

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 107 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 154

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 155 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 214

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 215 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 274

Query: 367 SKVEIG 372
            K+++G
Sbjct: 275 KKIQLG 280



 Score = 43.1 bits (100), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 21/80 (26%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64 RGSKLPIHLDIVVPTISCDY 83
           G K+ + L+I +P + C++
Sbjct: 65 SGGKIDVSLNISLPNLHCEH 84


>gi|297803392|ref|XP_002869580.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297315416|gb|EFH45839.1| hypothetical protein ARALYDRAFT_492089 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 480

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 96/214 (44%), Gaps = 37/214 (17%)

Query: 186 VQCKNEYSTEKLKNT-FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
           ++ K++ S+  LK    T GC+I GY+ V +V G+  ++   + S +H        + S+
Sbjct: 274 LEDKSDNSSRTLKKAPSTGGCRIEGYIRVKKVPGNLMVS---ARSGSH-------SFDSS 323

Query: 245 AFNTTHHIRHLSFGIKLQDDD----ERRKPLDGTVAKAEEGASMFN-----------YYI 289
             N +H + HLSFG ++        +R  P  G      +G    N           +Y+
Sbjct: 324 QMNMSHVVNHLSFGQRIMPQKFSELKRLSPYLGLSHDRLDGRPFINQRDLGPNVTIEHYL 383

Query: 290 KIIPTIYERLDGSKLG-----------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           +I+ T   + +G  L                 +P   F +ELSP+ V ITE SKS  H  
Sbjct: 384 QIVKTEVVKSNGQALVEAYEYTAHSSVAHSYYLPVAKFHFELSPMQVLITENSKSFSHFI 443

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           T +   I G +    ++D++LH  +  + K+E+G
Sbjct: 444 TNVCAIIGGVFTVAGILDSILHHSMTLMKKIELG 477


>gi|392331685|ref|XP_003752358.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Rattus norvegicus]
          Length = 290

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|356543934|ref|XP_003540413.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 54/198 (27%), Positives = 86/198 (43%), Gaps = 38/198 (19%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--- 258
           T GC+I GY+ V +V G+  I+             +   + ++  N +H I HLSFG   
Sbjct: 291 TGGCRIDGYVRVKKVPGNLIISAR----------SNAHSFDASQMNMSHVINHLSFGRKV 340

Query: 259 -IKLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI----------- 295
            +++  D +R  P  G+      G S  N           +Y++I+ T            
Sbjct: 341 SLRVMSDVKRLIPYVGSSHDRLNGRSFINTHDLGANVTIEHYLQIVKTEVITRKEYKLVE 400

Query: 296 -YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
            YE    S +      +P   F  ELSP+ V ITE  KS  H  T +   I G +    +
Sbjct: 401 EYEYTAHSSVAQS-LHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIIGGIFTVAGI 459

Query: 355 VDALLHSCVKKISKVEIG 372
           +DA+ H+ ++ + KVE+G
Sbjct: 460 MDAIFHNTIRLMKKVELG 477



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 63/115 (54%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S ++K +D + K   D  E ++ G  ++IV  L + +L  +++  Y  VST+ ++ V
Sbjct: 1   MISSSKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVSTSTQVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  +I  P +SC++ A+D  D  G   L++   + K  +D + +P
Sbjct: 61  DKSSDGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSNLRP 115


>gi|397485838|ref|XP_003814045.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pan paniscus]
          Length = 290

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDMLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|410949214|ref|XP_003981318.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Felis catus]
          Length = 398

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 222 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 269

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 270 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 329

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 330 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 389

Query: 367 SKVEIG 372
            K+++G
Sbjct: 390 KKIQLG 395



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/108 (26%), Positives = 54/108 (50%), Gaps = 7/108 (6%)

Query: 3   FSERLKG---LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELF 59
           FS+  +G    D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+
Sbjct: 106 FSKPYEGTPLFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELY 165

Query: 60  VDSS---RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           VD      G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 166 VDDPDKDSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 213


>gi|395736490|ref|XP_002816264.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Pongo abelii]
          Length = 290

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 QNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPHLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|351705474|gb|EHB08393.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Heterocephalus glaber]
          Length = 305

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 129 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 176

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 177 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQWYSYQYTVANKEYVAYSH 236

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 237 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 296

Query: 367 SKVEIG 372
            K+++G
Sbjct: 297 KKIQLG 302



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 52/101 (51%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           ++G D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 20  VEGFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 79

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 80  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 120


>gi|355686511|gb|AER98080.1| endoplasmic reticulum-golgi intermediate compartment 1 [Mustela
           putorius furo]
          Length = 312

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 137 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 184

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 185 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 244

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 245 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 304

Query: 367 SKVEIG 372
            K+++G
Sbjct: 305 KKIQLG 310



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 28  FRRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 87

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 88  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 128


>gi|336370998|gb|EGN99338.1| hypothetical protein SERLA73DRAFT_108802 [Serpula lacrymans var.
           lacrymans S7.3]
 gi|336383753|gb|EGO24902.1| hypothetical protein SERLADRAFT_449635 [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 503

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 78/357 (21%), Positives = 139/357 (38%), Gaps = 66/357 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  ++   G +TI        L+  D  +Y       E  VDS   S
Sbjct: 16  LAQFDAFPKLPSTYKSRSESRGFITIFITFLAFLLVLNDFGEYIWGWPDYEFSVDSQSNS 75

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++D+ V  + C  L++D  D  G++ L++                           
Sbjct: 76  FMSINVDMAV-NMPCHLLSVDLRDVVGDR-LYLSKGF----------------------- 110

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            ++  T  +    T L++     S   A +++RK     + V                  
Sbjct: 111 -RRDGTLFDVGQATSLKEHAAMLSARQALSQSRKSRGLLSSV----------------FR 153

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAA 245
           + + +Y            C+IYG L+V +V+ + HI   G  Y+ N VHV   +      
Sbjct: 154 RSQPDYRPTYNYQADGSACRIYGTLQVKKVTANLHITTLGHGYTSN-VHVDHTK------ 206

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
            N +H I   SFG    D  +   PLD +   A++    + Y++ ++PT           
Sbjct: 207 MNLSHVITEFSFGPYFPDITQ---PLDYSFEVAKDPFVAYQYFLHVVPTTFIAPRSEPLH 263

Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
              Y     +++  G  G PGIFF ++L P+++ I +++ S   L+ + +  I G +
Sbjct: 264 TNQYSVTHYTRVLKGHHGTPGIFFKFDLDPMVITIHQRTTSFLQLFIRCVGVIGGVF 320


>gi|392351111|ref|XP_001066818.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Rattus norvegicus]
          Length = 497

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 321 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 368

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 369 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 428

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 429 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 488

Query: 367 SKVEIG 372
            K+++G
Sbjct: 489 KKIQLG 494



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/102 (25%), Positives = 52/102 (50%), Gaps = 4/102 (3%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-- 63
           +++  D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD    
Sbjct: 211 KVERFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDK 270

Query: 64  -RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
             G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 271 DSGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 312


>gi|281351238|gb|EFB26822.1| hypothetical protein PANDA_005115 [Ailuropoda melanoleuca]
          Length = 238

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 50/186 (26%), Positives = 84/186 (45%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 62  GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 109

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 110 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 169

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 170 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 229

Query: 367 SKVEIG 372
            K+++G
Sbjct: 230 KKIQLG 235


>gi|393221326|gb|EJD06811.1| DUF1692-domain-containing protein [Fomitiporia mediterranea MF3/22]
          Length = 537

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 85/348 (24%), Positives = 150/348 (43%), Gaps = 78/348 (22%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICV-DVCDYFQVSTTEELFVDSSRG 65
           L   DAF K    +  ++   G +T++   FIS+L+ V D+ +Y     T +  +D+  G
Sbjct: 22  LNQFDAFPKLPSTYKARSGGRGFLTVLV-AFISFLLVVNDIGEYIFGWPTYKFGLDNRPG 80

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             L I++D+VV  + C +L++D  D+ G++ L++    +KR                   
Sbjct: 81  HYLAINVDLVV-NMPCKHLSVDLRDAVGDR-LYLSDG-FKR------------------- 118

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                    +GT    L D    G     ++ T +  +    V +A + + +     DTI
Sbjct: 119 ---------DGT----LFD---IGQAQALQSHT-QALDARLAVAQARKSRGF----FDTI 157

Query: 186 VQCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQ 239
           ++ +N+   +K + T+        C++YG ++  +V+ + HI   G  Y   H HV   Q
Sbjct: 158 LR-RNK---DKFRPTYNYKPDGGACRVYGSIQAKKVTANLHITTAGHGYRSMH-HVDHSQ 212

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
                  N +H I   SFG    D     +PL  T     E    + Y++ ++PT Y   
Sbjct: 213 ------MNLSHVITDFSFGPYFPD---MAQPLKNTFELTHEPFIAYQYFLSVVPTTYIAS 263

Query: 300 DGSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSL 334
           +G ++                + G PGIFF Y+L PL + I +K+ +L
Sbjct: 264 NGKQVHTSQYSVTHYTRVLQHEQGTPGIFFKYDLEPLQMTIHQKTTTL 311


>gi|297830752|ref|XP_002883258.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329098|gb|EFH59517.1| hypothetical protein ARALYDRAFT_479582 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 483

 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 93/204 (45%), Gaps = 39/204 (19%)

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
           K   T GC++ GY+ V +V G+  I+        H   H    + S+  N +H + HLSF
Sbjct: 287 KAPVTGGCRVEGYVRVKKVPGNLVISA-------HSGAHS---FDSSQMNMSHVVSHLSF 336

Query: 258 G----IKLQDDDERRKP--------LDGT--VAKAEEGASM-FNYYIKIIPT-IYERLDG 301
           G     +L  D +R  P        LDG   + + E GA++   +Y++I+ T +  R  G
Sbjct: 337 GRMISPRLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQIVKTEVITRRSG 396

Query: 302 SKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
            +                    +P   F +ELSP+ + ITE  KS  H  T +   I G 
Sbjct: 397 QEHSLIEEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGV 456

Query: 349 YITFMLVDALLHSCVKKISKVEIG 372
           +    ++D++ H+ V+ I KVE+G
Sbjct: 457 FTVAGILDSIFHNTVRLIKKVELG 480



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/110 (33%), Positives = 61/110 (55%), Gaps = 1/110 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MV S +LK +D + K   D  E ++ G  ++IV  LF+ +L  +++  Y +V+TT  + V
Sbjct: 1   MVSSTKLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
           D SS G  L I  +I  P +SC++ ++D  D  G   L++   I K  +D
Sbjct: 61  DKSSDGDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTIRKFPID 110


>gi|226497610|ref|NP_001145501.1| uncharacterized protein LOC100278902 [Zea mays]
 gi|195657145|gb|ACG48040.1| hypothetical protein [Zea mays]
          Length = 110

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/99 (35%), Positives = 56/99 (56%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK L+AF    E   +KT  G  VTI+  L +  L   ++  Y    T  ++ VD  RG 
Sbjct: 7   LKSLNAFPHAEEHLLKKTYSGAVVTILGLLIMITLFVHELQFYLTTYTVHQMSVDLKRGE 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
            LPIH+++  P++ C+ L++DA+D SG+  + +  NI+K
Sbjct: 67  TLPIHVNMSFPSLPCEVLSVDAIDMSGKHEVDLHTNIWK 105


>gi|389749487|gb|EIM90658.1| DUF1692-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 533

 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 73/361 (20%), Positives = 138/361 (38%), Gaps = 68/361 (18%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E +K  DAF K    +  ++   G +TI        L+  D+ ++       E  VD   
Sbjct: 18  ESIKSFDAFPKLPATYKSRSESRGFLTIFVAFLAFLLVLNDIGEFIWGWPDHEFAVDRDD 77

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
            S + +++D+VV  + C +L++D  D  G+            RL L              
Sbjct: 78  SSFMNVNVDLVV-NMPCRWLSVDLRDVVGD------------RLFLS------------K 112

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             ++     + G  T              A  E  K  +T   V+++ + + +     D 
Sbjct: 113 GFRRDGTLFDIGQAT--------------ALKEHAKALSTRQAVRQSRKSRGF----FDL 154

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTS 243
             + ++ Y            C++YG LEV +V+ + HI   G  Y+ + VHV   +    
Sbjct: 155 FRRSQDIYKPTYNYQADGSACRVYGSLEVKKVTANLHITSLGHGYA-SKVHVDHTK---- 209

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG    D  +   PLD +     +  + + Y+++++PT Y     + 
Sbjct: 210 --INMSHVITEFSFGPHFPDIVQ---PLDNSFEITHDHFTAYQYFMRVVPTTYVAPRSAP 264

Query: 304 LGGGD--------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
           L                  G  PGIFF +E+ P+ +   +++ +    + +    + G +
Sbjct: 265 LNTNQYSVTHYTRTFEQHSGLAPGIFFKFEIEPVRLIQHQRTTTFAQFFVRWAGVVGGVF 324

Query: 350 I 350
           +
Sbjct: 325 V 325


>gi|440798302|gb|ELR19370.1| golgi family protein, putative [Acanthamoeba castellanii str. Neff]
          Length = 328

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 94/205 (45%), Gaps = 44/205 (21%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAP----------GLSYSINHVHVHDIQP---YTSA----A 245
            GC I GY+ V +V G+FH++            + ++IN     D  P   Y S     A
Sbjct: 120 SGCSIAGYINVPKVPGNFHLSTHGRNVQAQDIDMQHNINSFFFTD-SPRVFYPSGVSVPA 178

Query: 246 FNTTHH--IRHLSFGIKLQDDDERR----KPLDGTV---AKAEEGASM-FNYYIKIIPTI 295
           +   H   +  L+   + QD D+      +PLDG     ++ + G  + + YYI+I+PTI
Sbjct: 179 WRNWHSNVVAELNAQARDQDTDDDVVGLFRPLDGITKANSQRKNGVGVSYEYYIQIVPTI 238

Query: 296 YERLDG------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
            E  DG            + +   +G  P ++F Y++SP+ VKIT    SLGH   + +C
Sbjct: 239 LEFPDGRTKHTYQFTYNFNDVATPEGKTPSVYFKYDISPITVKITRGRGSLGHFLLQ-LC 297

Query: 344 NISGTYITFMLVDALLHSCVKKISK 368
            I G   T   V  L+ S   +++K
Sbjct: 298 AIVGGIFT---VSGLIASVTARVAK 319



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 28/88 (31%), Positives = 48/88 (54%), Gaps = 1/88 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG- 65
          LK  D + +  +D  + +V G  V++VC   ++ LI  +V  Y  + T  ++ VD+ R  
Sbjct: 9  LKSFDLYRRVPKDLTKGSVPGAIVSLVCLTIMAMLISWEVYCYASIKTETQMLVDTPRNL 68

Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSG 93
           K+ I++++ VP I C  +ALD  D  G
Sbjct: 69 EKIRININVTVPRIPCYVIALDTEDVLG 96


>gi|356517290|ref|XP_003527321.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 52/195 (26%), Positives = 84/195 (43%), Gaps = 36/195 (18%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
           GC++ GY+ V +V G+  I+             D   + ++  N +H I +LSFG K+  
Sbjct: 293 GCRVEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHFINNLSFGKKVTP 342

Query: 262 --QDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL---- 304
               D +   P  G+      G S  N           +YI+I+ T     +G KL    
Sbjct: 343 RAMSDVKLLIPYIGSSHDRLNGRSFTNTHDLGANVTIEHYIQIVKTEVVTRNGYKLIEEY 402

Query: 305 -------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                        +P   F  ELSP+ V ITE  +S  H  T +   I G +    ++D+
Sbjct: 403 EYTAHSSVAHSVDIPAAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAGILDS 462

Query: 358 LLHSCVKKISKVEIG 372
           +LH+ ++ + KVE+G
Sbjct: 463 ILHNTIRMMKKVELG 477


>gi|451847161|gb|EMD60469.1| hypothetical protein COCSADRAFT_98785 [Cochliobolus sativus ND90Pr]
          Length = 395

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 86/384 (22%), Positives = 144/384 (37%), Gaps = 90/384 (23%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K  + +  +     A T+   +   YL   ++  ++  +TT+   ++     
Sbjct: 22  VSSFDAFPKTKKTYLVQGRNSSAWTVTLIITCIYLTWSEIARWYAGTTTQSFTIEKGVSH 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LDI+V  + C  L ++  D++G++ L  E                         +
Sbjct: 82  DMQINLDIIV-AMKCADLHVNMQDAAGDRTLAGE------------------------LL 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +K               DP       G  TE        +E  +   ++++         
Sbjct: 117 RK---------------DPTSWSQWTGKNTEKGTHELGKDETTQIPEWEEYGDVHEHLGK 161

Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSA 244
             K ++S T KL+   T+ C+IYG L  N+V G FHI A G  Y     H+        +
Sbjct: 162 ATKKKFSKTPKLRGP-TDSCRIYGNLVGNKVQGDFHITARGHGYMEFGEHLE------HS 214

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYIKIIPTIYE---- 297
           +FN +H IR +SFG           PLD T+A     A     F YY+ I+PTIY     
Sbjct: 215 SFNFSHIIREMSFGPYYP---SLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPA 271

Query: 298 -------------------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
                                           +        D  +PGIF  +++ P+M+ 
Sbjct: 272 LMPIMESMVSTNDQPSSNMFRMAHAIKTNQYAVTSQSHKVDDSYVPGIFVKFDIEPIMLA 331

Query: 327 ITEKSKSLGHLWTKIMCNISGTYI 350
           I E+SKS   L   ++  +SG  +
Sbjct: 332 IVEESKSFWKLVITLVNVVSGVMV 355


>gi|145524934|ref|XP_001448289.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124415833|emb|CAK80892.1| unnamed protein product [Paramecium tetraurelia]
          Length = 324

 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 61/231 (26%), Positives = 97/231 (41%), Gaps = 27/231 (11%)

Query: 161 CCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSF 220
             N   +  E    KK+     DTI+   ++     +K        I GY+ VN+V G+F
Sbjct: 99  VVNVEEQRMERQFLKKFIQIMKDTIIIINHQQILRDVK--------IAGYIIVNKVPGNF 150

Query: 221 HIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLS-----FGIKLQDDDERRKPLDGT- 274
           H++      I H      Q  T    +T     HL        IK Q       PLD T 
Sbjct: 151 HVSAHAFGGILHQVFQRSQISTLDLSHTYQSYSHLVKKDDLVKIKKQFQKGVLNPLDNTK 210

Query: 275 -VAKAEEGASM-FNYYIKIIPTIYERLDGS-----KLGGGDG-----GMPGIFFSYELSP 322
            +A+ + G  M F YYI ++PT Y  + G+     +            +P ++F Y+LSP
Sbjct: 211 KIAQPQGGTGMMFQYYISVVPTTYIDVSGNEYYVHQFTANSNEVQTDHLPAVYFRYDLSP 270

Query: 323 LMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLH-SCVKKISKVEIG 372
           + VK  +  +S  H   +I   + G +    ++D ++H S V  + K E+G
Sbjct: 271 VTVKFLQYRESFLHFLVQICAILGGVFTIASIIDGMIHKSVVALLKKYEMG 321



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 1/104 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR- 64
           RL+ LD + K   D  E T  G  ++++  + I  L   ++  Y +V  + E+FVD +R 
Sbjct: 8   RLRKLDIYRKLPADLTEPTTAGALISVISTIVIVILFTTELQAYIEVDNSSEMFVDINRG 67

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           G ++ ++LDI      CD L+LD  D  G   ++VE    +R+ 
Sbjct: 68  GEQIRVNLDIEFHKFPCDILSLDVQDIMGSHVVNVEEQRMERQF 111


>gi|348667045|gb|EGZ06871.1| hypothetical protein PHYSODRAFT_319561 [Phytophthora sojae]
          Length = 469

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 54/231 (23%), Positives = 95/231 (41%), Gaps = 46/231 (19%)

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTF------------TEGCQIYGYLEVNRVSGSF 220
           ++K+    E+D +   K E   +  KN               EGC++YG+L V RV G+F
Sbjct: 247 KFKQLMAGEVDAVEARKKELFEQDKKNAREQGKAIARSAVGPEGCRLYGHLYVKRVPGNF 306

Query: 221 HI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA--- 276
           H+     +YS++           S+  N +H +  L FG  L   +    P D  +    
Sbjct: 307 HVHLANPAYSMD-----------SSLVNASHTVNELWFGEHLTSGEMSMLPRDAQMQLYT 355

Query: 277 ---KAEEGASMFN-----YYIKIIPTIYERLDGSKLGG-----------GDGGMPGIFFS 317
                ++  S +      +YIK++   Y + D + +                 +P I F 
Sbjct: 356 HRLDNQDYTSFYKNHTYVHYIKVVTNSYVQSDAADINVYKYTAHSNEYLETDDLPSIMFR 415

Query: 318 YELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISK 368
           Y+LSP+ V+I+E S    H  T     I G +    ++D ++H   + ++K
Sbjct: 416 YDLSPMSVRISEDSVPFYHFLTSACAIIGGVFTVIGILDQIIHQTARALNK 466



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 57/117 (48%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  D + K  ED    T+ G +++I     +  L  ++   Y  V    ++ +D     
Sbjct: 7   LKKWDFYKKIPEDLTVSTLPGVSLSIAGCFIMFLLFILEFNSYLTVDYKYDIVMDEGLDQ 66

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVV 123
            + I+ +I VP + C++  +D  D +G +  ++  NIYK RLD  G+ +   Q++ +
Sbjct: 67  TMRINFNITVPDLPCEFATVDVSDMTGTRKHNMTSNIYKIRLDQKGRSVGLAQEKQI 123


>gi|366997520|ref|XP_003678522.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
 gi|342304394|emb|CCC72184.1| hypothetical protein NCAS_0J02050 [Naumovozyma castellii CBS 4309]
          Length = 347

 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 81/353 (22%), Positives = 133/353 (37%), Gaps = 70/353 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E++ +K+  GG  TI  +LF+ ++   +   YF     ++  VD+    
Sbjct: 4   LKSFDAFPKTDEEYTKKSTKGGLSTIATYLFLLFIAWSEFGSYFGGFVEQKYVVDNQVRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I+LDI V T +C  L +   D + +  +  E       L  +      P    VN +
Sbjct: 64  VTEINLDIYVNT-TCRLLDVRVFDETKDMRMVSEE------LSFEDMVFFIPFGVKVN-L 115

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
             + VT +     +E        + +G   ++R+  N   +          ALP      
Sbjct: 116 MNEIVTADIDKILSE-----AVPAQFGPRVDSREFLNQGTD--------DVALPL----- 157

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
               EYS           C I+G + VNRV+G F I          +  H  QP  +   
Sbjct: 158 ----EYS----------ACHIFGSIPVNRVAGEFQIT--------TIDRH--QPIENVV- 192

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKA-EEGASMFNYYIKIIPTIYERLD----- 300
           + TH I   SFG      D    PLD T     +E  + + Y++ ++PTIY ++      
Sbjct: 193 DFTHVINEFSFGDFFPYVD---NPLDSTAKYVPDEKLTSYQYHLSVVPTIYNKMGVLINT 249

Query: 301 ----------GSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                      +     D   PGIF  Y    L + + ++         +++ 
Sbjct: 250 NQYSLSEYHYKNITNANDKNSPGIFIKYNFESLTIIVNDRRLGFTQFLIRLIA 302


>gi|440902711|gb|ELR53466.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1,
           partial [Bos grunniens mutus]
          Length = 290

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  LQ 
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
            +       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
           L+  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   LRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|366998832|ref|XP_003684152.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
 gi|357522448|emb|CCE61718.1| hypothetical protein TPHA_0B00460 [Tetrapisispora phaffii CBS 4417]
          Length = 349

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 81/360 (22%), Positives = 135/360 (37%), Gaps = 73/360 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E   +K+  GG  +I+ + F+  +   +   YF     ++  VD     
Sbjct: 4   LKTFDAFPKTEERHVKKSKKGGLSSILTYAFLLLIAWTEFGSYFGGYIDKQYSVDKDIRK 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++DI V  + C++L ++ +D + ++ +  E  I+      +  P   P    VN +
Sbjct: 64  VVQINMDIYV-KMPCEWLHVNVLDDTNDRKIVSEELIF------EDMPFFVPHGSKVNNL 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                   N   T EL+D         AE           E +E    K    P+   I 
Sbjct: 117 --------NKVVTPELDD-------ILAEA-------IPAEFREKIETKPLLGPDGKPIF 154

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
           +                GC +YG + VNRV+G   I A G  Y        D+       
Sbjct: 155 ELT--------------GCHVYGSVTVNRVAGEMQITAKGYGYRDRKRAPKDL------- 193

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGA-SMFNYYIKIIPTIYERLDG--- 301
            +  H +   SFG           PLDGT         S +NY++ ++PT Y++L     
Sbjct: 194 IDFNHVVNEFSFG---DFYPYIENPLDGTCKMYPNSPFSSYNYFMSVVPTFYQKLGAEID 250

Query: 302 ---------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                          S +      +PGIF  Y+  PL + I++   +      +++  +S
Sbjct: 251 TNQYSIREYHVDLKNSNVNAKLSTIPGIFLKYDFEPLAIIISDVRLTFLQFIVRLVAILS 310


>gi|327265232|ref|XP_003217412.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Anolis carolinensis]
          Length = 291

 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 85/187 (45%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC+   +  +N++ G+FH++          H    QP      + TH I  LSFG +LQ
Sbjct: 114 DGCRFESHFSINKIPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDQLQ 161

Query: 263 DDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK--------------LGG 306
               R     L+G    +    +  +Y +KI+PT+YE + G +              +  
Sbjct: 162 AQKIRGSFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQQYPFQYTVANKEYVVYS 221

Query: 307 GDGGM-PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
             G + P I+F Y+L+P+ +K  E+ + L    T I   I GT+    + D+ + +  + 
Sbjct: 222 HTGRITPAIWFRYDLTPITLKYIERRQPLYRFITTICAIIGGTFTVAGIFDSCIFTASEA 281

Query: 366 ISKVEIG 372
             K+++G
Sbjct: 282 WKKIQLG 288



 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 23/98 (23%), Positives = 48/98 (48%), Gaps = 4/98 (4%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV---DSSRGS 66
            D + K  +D  + T  G  +++ C  FI +L+  ++  +       EL+V   D     
Sbjct: 9   FDIYRKVPKDLTQPTFTGAIISVCCCFFILFLLLSELTGFIATEVVNELYVEDPDKDSSG 68

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
           K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 69  KIEVTLNISLPNLHCELIGLDIQDEMGRHEIGHIDNSV 106


>gi|426246271|ref|XP_004016918.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1 [Ovis aries]
          Length = 290

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  LQ 
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
            +       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|115497382|ref|NP_001069885.1| endoplasmic reticulum-Golgi intermediate compartment protein 1 [Bos
           taurus]
 gi|111308658|gb|AAI20358.1| Endoplasmic reticulum-golgi intermediate compartment (ERGIC) 1 [Bos
           taurus]
          Length = 290

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  LQ 
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
            +       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|296475934|tpg|DAA18049.1| TPA: endoplasmic reticulum-golgi intermediate compartment 32 kDa
           protein [Bos taurus]
          Length = 290

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/186 (27%), Positives = 82/186 (44%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  LQ 
Sbjct: 114 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 161

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
            +       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 162 HNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSH 221

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +  
Sbjct: 222 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAW 281

Query: 367 SKVEIG 372
            K+++G
Sbjct: 282 KKIQLG 287



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFITTEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|389640739|ref|XP_003718002.1| hypothetical protein MGG_00949 [Magnaporthe oryzae 70-15]
 gi|351640555|gb|EHA48418.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae 70-15]
 gi|440464580|gb|ELQ33987.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae Y34]
 gi|440481695|gb|ELQ62250.1| endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Magnaporthe oryzae P131]
          Length = 376

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 71/364 (19%), Positives = 139/364 (38%), Gaps = 63/364 (17%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K    +  +T  GG  T+   L  + L   ++  +++   T    V+   G 
Sbjct: 22  VSAFDAFPKSKPQYVTRTSGGGKWTVAMLLVSAILTWSELARWWRGVETHTFAVEKGVGQ 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++D VV  + C             Q +HV               +Q+   + + A 
Sbjct: 82  SMQINMDTVV-HMRC-------------QDIHVN--------------VQDAAGDRIMAA 113

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            + K+         +    ++ G     +  T +        +  +        ++  + 
Sbjct: 114 ARLKMDDTTWAQWVDGSGVHRLGHDQHGKVVTGEGHEEGFGEEHIH--------DIVALG 165

Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
           + +  +S T +L     + C+I+G L++N+V G FHI      +  H ++        +A
Sbjct: 166 KKRARWSKTPRLWGATPDSCRIFGSLDLNKVQGDFHIT-----ARGHGYIEFGDHLDHSA 220

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLG 305
           FN +H +   SFG           PLD TV   E+    F Y++ ++PT+Y     +   
Sbjct: 221 FNFSHIVNEFSFGDFYP---SLVNPLDKTVNTCEKNFHKFQYFLSVVPTLYSVKSSTGAF 277

Query: 306 G------------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           G                   +  +PGIFF Y++ P+++ I E   ++     K++  +SG
Sbjct: 278 GYSTIFTNQYAVTEQSSEISEMNVPGIFFKYDIEPILLDIEESRDTILVFLIKVINILSG 337

Query: 348 TYIT 351
             + 
Sbjct: 338 AMVA 341


>gi|397641928|gb|EJK74922.1| hypothetical protein THAOC_03372 [Thalassiosira oceanica]
          Length = 583

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 52/202 (25%), Positives = 89/202 (44%), Gaps = 51/202 (25%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
           GCQ+ G+L VNRV G+FHI    + S+NH          +A  N TH + H+SFG     
Sbjct: 388 GCQVSGHLMVNRVPGNFHIE---AKSVNH-------NLNAAMTNLTHRVNHISFGEPITK 437

Query: 259 ------------------IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------ 294
                              ++ ++ ++  P+D       +    F++YIK++ T      
Sbjct: 438 LPYHMENTPFMRKVKRVLKQVPEEHKQFNPMDDQEYITTQFHQAFHHYIKVVSTHLNMGS 497

Query: 295 -----------IYERLDGSKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIM 342
                      +Y+ L+ S++   D   +P   FSY++SP+ V + ++ +      T + 
Sbjct: 498 SSTVNDVNSITVYQMLEQSQIVFYDEVNVPEARFSYDMSPMSVVVQKEGRKWYDYLTSLC 557

Query: 343 CNISGTYITFMLVDALLHSCVK 364
             I GT+ T  L+DA L+   K
Sbjct: 558 AIIGGTFTTLGLIDATLYKVFK 579


>gi|451997913|gb|EMD90378.1| hypothetical protein COCHEDRAFT_27091 [Cochliobolus heterostrophus
           C5]
          Length = 395

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 91/409 (22%), Positives = 154/409 (37%), Gaps = 88/409 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K  + +  +     A T+   +   YL   ++  ++  +TT+   ++     
Sbjct: 22  VSSFDAFPKTKKTYLVQGRNSSAWTVTLIITCIYLTWSEIARWYAGTTTQSFTIEKGVSH 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LDI+V  + C  L ++  D++G++ L  E         L   P    Q    N  
Sbjct: 82  DMQINLDIIV-AMKCADLHVNMQDAAGDRTLAGEL--------LRKDPTSWSQWTGKN-- 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                 TE GT     +D  +         E  +  +    + +A               
Sbjct: 131 ------TEKGTHELGKDDTTQI-------PEWEEYGDVHEHLGKA--------------- 162

Query: 187 QCKNEYS-TEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
             K ++S T KL+   T+ C+IYG L  N+V G FHI      +  H ++   +    ++
Sbjct: 163 -TKKKFSKTPKLRGP-TDSCRIYGNLVGNKVQGDFHIT-----ARGHGYMEFGEHLDHSS 215

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS---MFNYYIKIIPTIYE----- 297
           FN +H IR +SFG           PLD T+A     A     F YY+ I+PTIY      
Sbjct: 216 FNFSHIIREMSFGPYYP---SLTNPLDSTIAVTPTPADHFYKFQYYLSIVPTIYTDDPSL 272

Query: 298 ------------------------------RLDGSKLGGGDGGMPGIFFSYELSPLMVKI 327
                                          +        D  +PGIF  +++ P+M+ I
Sbjct: 273 MPLMESVVSTNDQPSSNMFRMAHAIKTNQYAVTSQSHKVDDTYVPGIFVKFDIEPIMLAI 332

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTV 376
            E+SKS   L   ++  +SG  +    V  +     + + K +  G  V
Sbjct: 333 VEESKSFWKLLITLVNVVSGVMVAGSWVWQMFDWASEFVGKRKRRGDGV 381


>gi|156841160|ref|XP_001643955.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114586|gb|EDO16097.1| hypothetical protein Kpol_1001p9 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 349

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 87/368 (23%), Positives = 148/368 (40%), Gaps = 78/368 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  E   +K+V GG  +I+ +  +  +   +   YF     E+  VD +   
Sbjct: 4   LKTFDAFPKTEERHVKKSVNGGLSSILTYFMLLLIAWTEFGSYFGGYIDEQYSVDPTIRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++D+ +  + C  + ++A+D +      ++       L  +  P   P        
Sbjct: 64  TVQINMDMYI-KMPCQLIHVNAMDET------MDRKFVSNELIFEDMPFFVPYG------ 110

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
              KV  +N   +  L++    G    AE             +E   +K     + + + 
Sbjct: 111 --TKVNNKNDIVSPGLDE--IIGEAIPAE------------FREKLDFKSQVDADGNPLF 154

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
           +               +GC IYG +++NRV+G     A G  Y  N        P     
Sbjct: 155 KV--------------DGCHIYGSVKLNRVAGELQFTAKGWGYRDNGR-----APLDQID 195

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASM--FNYYIKIIPTIYERLDGSK 303
           FN  H I   SFG      D    PLDGT AK E+  S+  + Y   ++PTI+++L G++
Sbjct: 196 FN--HVINEFSFGDFYPYID---NPLDGT-AKIEKQKSISRYIYSTSVVPTIFQKL-GAE 248

Query: 304 L------------GGGDG------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNI 345
           +               DG       +PGIFF Y+  PL + I++K  S      +++  +
Sbjct: 249 VDTNQYSLAEYHTAPKDGKIKLTTSIPGIFFRYDFEPLSIVISDKRLSFVQFIVRLVAIL 308

Query: 346 SGTYITFM 353
           S  +I +M
Sbjct: 309 S--FILYM 314


>gi|356549839|ref|XP_003543298.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 480

 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 66/249 (26%), Positives = 104/249 (41%), Gaps = 42/249 (16%)

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
            Y  + +T     T   +  +   +   LP  D     KN   TE+   + T GC+I GY
Sbjct: 244 SYYGDRDTDSLVKTMENLVASLPSESQKLPLEDKSDVAKN---TERPAPS-TGGCRIDGY 299

Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ----DDDE 266
           + V +V G+      L +S       +   + ++  N +H I HLSFG K+      D +
Sbjct: 300 VRVKKVPGN------LIFSARS----NAHSFDASQMNMSHVINHLSFGRKVSPRVMSDVK 349

Query: 267 RRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI------------YERLDGSK 303
           R  P  G+      G S  N           +Y++I+ T             YE    S 
Sbjct: 350 RLIPYVGSSHDRLNGRSFINTHDLGANVTMEHYLQIVKTEVITRKDYKLVEEYEYTAHSS 409

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
           +      +P   F  ELSP+ V ITE  KS  H  T +   + G +    ++DA+LH+ +
Sbjct: 410 VAQS-LHIPVAKFHLELSPMQVLITENQKSFSHFITNVCAIVGGIFTVAGIMDAILHNTI 468

Query: 364 KKISKVEIG 372
           + + KVE+G
Sbjct: 469 RLMKKVELG 477



 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 63/115 (54%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S ++K +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+T+ ++ V
Sbjct: 1   MISSSKIKSVDFYRKIPRDLTEASLSGAGLSIVAALAMIFLFGMELNSYLSVTTSTQVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  +I  P +SC++ A+D  D  G   L++   + K  +D + +P
Sbjct: 61  DKSSDGDYLRIDFNISFPALSCEFAAVDVSDVLGTNRLNLTKTVRKFSIDSNLRP 115


>gi|363738942|ref|XP_414530.3| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 1 [Gallus gallus]
          Length = 291

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 86/188 (45%), Gaps = 30/188 (15%)

Query: 203 EGCQIYGYLEVNRVSG-SFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           +GC+  G+  +N+VS    H++          H    QP      + TH I  LSFG KL
Sbjct: 113 DGCRFEGHFSINKVSPWXLHVS---------THSATAQPQNP---DMTHIIHKLSFGDKL 160

Query: 262 QDDDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGG 306
           Q  +       L+G    +    +  +Y +KI+PT+YE + G +             +  
Sbjct: 161 QVQNVHGAFNALEGADKLSSNPLASHDYILKIVPTVYEDMSGKQRYSYQYTVANKEYVAY 220

Query: 307 GDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
              G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +
Sbjct: 221 SHTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITSICAIIGGTFTVAGILDSCIFTASE 280

Query: 365 KISKVEIG 372
              K+++G
Sbjct: 281 AWKKIQLG 288



 Score = 50.4 bits (119), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---S 63
            +  D + K  +D  + T  G  +++ C LFI +L   ++  +       EL+VD     
Sbjct: 5   FRRFDIYRKVPKDLTQPTYTGAIISVCCCLFILFLFLSELTGFIATEIVNELYVDDPDKD 64

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ ++L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 65  SGGKIEVNLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 105


>gi|407927953|gb|EKG20833.1| protein of unknown function DUF1692 [Macrophomina phaseolina MS6]
          Length = 366

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 78/360 (21%), Positives = 141/360 (39%), Gaps = 69/360 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  + + ++T  G   T++  +   +L   +   ++   T+    V+   G 
Sbjct: 20  LQAFDAFPKTKKTYLQQTTQGANWTLLLIVTCVWLSITETRRWWTGETSHTFSVEKGVGH 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ I+LDIVV  + C  L ++  D+SG++              L G  + +     +  V
Sbjct: 80  EMQINLDIVV-AMRCRDLHVNIQDASGDR-------------ILAGVALAKDDTRWLQWV 125

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           +K K                                +     +E  RY +  + +     
Sbjct: 126 EKSK------------------------------NVHKLERSQEQKRYDEEDVHDYLGAS 155

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
           + K    T + +    + C+IYG L+ NRV G FHI A G  Y     H+   Q      
Sbjct: 156 KSKKFPKTPRYRGV-PDSCRIYGSLDANRVQGDFHITARGHGYMEFGEHLDHSQ------ 208

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYERLDGS 302
           FN +H I  LSFG           PLD T A     ++    F YY+ ++PT+Y     +
Sbjct: 209 FNFSHQINELSFGPYYP---SLTNPLDYTRAVTPTPDDHFYKFQYYLSVVPTVYTDNSHT 265

Query: 303 KLGGG-----------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
            +              +  +PG+F  +++ P+ + I+E +     L  +++  +SG  + 
Sbjct: 266 IVTNQYAVTEQSHSVPEMSVPGVFVKFDIEPIKLTISEYNGGFLALLIRLVNVVSGVMVA 325


>gi|345567560|gb|EGX50490.1| hypothetical protein AOL_s00075g219 [Arthrobotrys oligospora ATCC
           24927]
          Length = 354

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/176 (28%), Positives = 85/176 (48%), Gaps = 27/176 (15%)

Query: 205 CQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           C+I+G ++VNRV G FHI A G  Y     HV HD        FN +H +  LSFG   +
Sbjct: 164 CRIWGSMDVNRVMGDFHITAKGHGYWDPGQHVDHD-------TFNFSHVVNELSFG---E 213

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYER----LDGSKLGGGDGG-------M 311
              +   PLDG  +  E+    + Y++ ++PT Y+     L  ++    + G       +
Sbjct: 214 FYPKLVNPLDGVASVTEDKFYRYQYFMSVVPTTYKAHGRTLQTNQYSVTEQGRSMNPQSV 273

Query: 312 PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT----FMLVDALLHSCV 363
           PGIFF +++ P+M+ IT+      +L  ++   I G  +     + + D +L S +
Sbjct: 274 PGIFFKFDIEPIMLTITDTHTPWIYLIVRLANVIGGVMVAGGWLYKISDGVLGSVL 329



 Score = 45.1 bits (105), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 1/93 (1%)

Query: 5  ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
          E LK  DAF K    +  ++  GG +T+V      +L+  ++  Y    + E   V    
Sbjct: 8  EGLKSFDAFPKTRVSYTTRSSKGGVITMVFVAICVWLVWGELSLYLDGKSEEHFSVQGGE 67

Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
          G  + I+LD++V  + CD L ++  D++G++ L
Sbjct: 68 GHFMQINLDVIV-AMPCDSLHVNVQDAAGDRIL 99


>gi|219111363|ref|XP_002177433.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411968|gb|EEC51896.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 520

 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 51/192 (26%), Positives = 88/192 (45%), Gaps = 43/192 (22%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----- 258
           GC I G+L ++RV G+FHI     +       HD+ P+ +   N +H + HLS G     
Sbjct: 338 GCNIAGHLLLDRVPGNFHIQARSPH-------HDLVPHMT---NVSHVVHHLSIGEPVAE 387

Query: 259 -------IKLQDDDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD-- 308
                  + L +D +R+ KP++G     +E    +++Y+K+I T    +DG K G  D  
Sbjct: 388 RLIEQEKVILPEDVKRKLKPMNGNAYVTKELHEAYHHYLKVITT---NVDGLKFGKRDLR 444

Query: 309 ---------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
                            +P   F ++LSP+ V     S+     +T I+  I GT+    
Sbjct: 445 AYQILQSSQLSFYRNDIIPEAKFVFDLSPVAVSYRTTSRRWYDYFTSILAIIGGTFTVVG 504

Query: 354 LVDALLHSCVKK 365
           L+++ +H+ V +
Sbjct: 505 LLESTIHATVAR 516


>gi|18402672|ref|NP_566664.1| protein PDI-like 5-3 [Arabidopsis thaliana]
 gi|75273652|sp|Q9LJU2.1|PDI53_ARATH RecName: Full=Protein disulfide-isomerase 5-3; Short=AtPDIL5-3;
           AltName: Full=Protein disulfide-isomerase 12;
           Short=PDI12; AltName: Full=Protein disulfide-isomerase
           8-1; Short=AtPDIL8-1; Flags: Precursor
 gi|11994143|dbj|BAB01164.1| unnamed protein product [Arabidopsis thaliana]
 gi|15215847|gb|AAK91468.1| AT3g20560/K10D20_9 [Arabidopsis thaliana]
 gi|332642877|gb|AEE76398.1| protein PDI-like 5-3 [Arabidopsis thaliana]
          Length = 483

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 36/110 (32%), Positives = 61/110 (55%), Gaps = 1/110 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MV S +LK +D + K   D  E ++ G  ++IV  LF+ +L  +++  Y +V+TT  + V
Sbjct: 1   MVSSTKLKSVDFYRKIPRDLTEASLSGAGLSIVAALFMMFLFGMELSSYLEVNTTTAVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
           D SS G  L I  +I  P +SC++ ++D  D  G   L++   + K  +D
Sbjct: 61  DKSSDGDFLRIDFNISFPALSCEFASVDVSDVLGTNRLNITKTVRKFPID 110



 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 55/204 (26%), Positives = 91/204 (44%), Gaps = 39/204 (19%)

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF 257
           K   T GC++ GY+ V +V G+  I+        H   H    + S+  N +H + H SF
Sbjct: 287 KGPVTGGCRVEGYVRVKKVPGNLVISA-------HSGAHS---FDSSQMNMSHVVSHFSF 336

Query: 258 G----IKLQDDDERRKP--------LDGT--VAKAEEGASM-FNYYIKIIPT-IYERLDG 301
           G     +L  D +R  P        LDG   + + E GA++   +Y++ + T +  R  G
Sbjct: 337 GRMISPRLLTDMKRLLPYLGLSHDRLDGKAFINQHEFGANVTIEHYLQTVKTEVITRRSG 396

Query: 302 SKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGT 348
            +                    +P   F +ELSP+ + ITE  KS  H  T +   I G 
Sbjct: 397 QEHSLIEEYEYTAHSSVAQTYYLPVAKFHFELSPMQILITENPKSFSHFITNLCAIIGGV 456

Query: 349 YITFMLVDALLHSCVKKISKVEIG 372
           +    ++D++ H+ V+ + KVE+G
Sbjct: 457 FTVAGILDSIFHNTVRLVKKVELG 480


>gi|449542382|gb|EMD33361.1| hypothetical protein CERSUDRAFT_117979 [Ceriporiopsis subvermispora
           B]
          Length = 530

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/357 (21%), Positives = 134/357 (37%), Gaps = 68/357 (19%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF K    +  ++   G +T+        L+  D+ +Y       +  VDS   S L 
Sbjct: 27  FDAFPKLPTTYKARSESRGFLTLFVAFAAFLLVLNDLGEYIWGWPVYDFTVDSDPSSDLK 86

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           I++D++V  + C YL++D  D+ G+            RL L             NA ++ 
Sbjct: 87  INVDMMV-NMPCAYLSVDLRDAMGD------------RLYLS------------NAFRRD 121

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
               + G  TT  E        + A    R+      + +  +          +   +  
Sbjct: 122 GTKFDIGQATTLQE--------HAAALSARQVIAQSRKSRGFFS---------NLFRRTN 164

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNT 248
             Y            C+++G +   +V+ + HI   G  Y+  H HV   +       N 
Sbjct: 165 GGYKATYNHQPDGSACRVFGSITAKKVTANLHITTLGHGYA-THSHVDHSK------MNL 217

Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGD 308
           +H I   SFG    D  +   PLD +   A +    + Y++ ++PT Y     S L    
Sbjct: 218 SHVITEFSFGPHFPDITQ---PLDNSFEVAHDPFVAYQYFLHVVPTTYIAPRSSPLHTHQ 274

Query: 309 GGM---------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
             +               PGIFF ++L PL +KI +++ SL  L  + +  I G ++
Sbjct: 275 YSVTHYTRILDPSHHRHTPGIFFKFDLDPLAIKIEQRTTSLVQLAIRCVGVIGGVFV 331


>gi|302675040|ref|XP_003027204.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
 gi|300100890|gb|EFI92301.1| hypothetical protein SCHCODRAFT_70909 [Schizophyllum commune H4-8]
          Length = 528

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 77/353 (21%), Positives = 136/353 (38%), Gaps = 64/353 (18%)

Query: 2   VFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD 61
           V    L  LDAF K    +  ++   G +T+        L+  D+ +Y       E  VD
Sbjct: 11  VLPPGLAKLDAFPKLPGTYKARSESRGFLTLFVAFICFILVFNDISEYIWGWPDYEFSVD 70

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
               S + I++D+VV  + C ++++D  D+ G++     H +  RR   DG         
Sbjct: 71  RHSSSFMNINVDMVV-NMPCRFISVDLRDAVGDRLFLSNHGL--RR---DG--------- 115

Query: 122 VVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPE 181
                     T  +    T+L++  +  S   A  + RK     + +             
Sbjct: 116 ----------TKFDVGQATKLKEHARALSAREAVAQGRKNRGLFSGLFGG---------- 155

Query: 182 LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQP 240
                + K+ +            C+++G LEV +V+ + HI   G  Y+      H +  
Sbjct: 156 -----KSKDLFPPTYNYEPHGSACRVWGSLEVKKVTANLHITTAGHGYASREHADHKV-- 208

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
                 N TH I   SFG    D  +   PLD T   A++    + YY+ ++PT Y    
Sbjct: 209 -----MNLTHVISEFSFGPHFPDIVQ---PLDYTFEVAKDPFVAYQYYLHVVPTTYIAPR 260

Query: 301 GSKLGGG-------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
            + L                +   PGIFF +++ PL ++I +++ S   L+ +
Sbjct: 261 SAPLSTNQYSVTHYKKVFEHNQATPGIFFKFDIDPLAIQIHQRTTSFARLFIR 313


>gi|366987569|ref|XP_003673551.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
 gi|342299414|emb|CCC67168.1| hypothetical protein NCAS_0A06100 [Naumovozyma castellii CBS 4309]
          Length = 355

 Score = 66.2 bits (160), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 87/377 (23%), Positives = 146/377 (38%), Gaps = 74/377 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E   +K+  GG  TI+ ++F  ++   +   YF     E   VD     
Sbjct: 6   LRVFDAFPKTEEQHEKKSTKGGVSTILIYIFAIFIAWSEFGSYFGGFVGERYVVDGDVKE 65

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++D+ V  I C ++ ++  D + ++ L  E       L+ +  P   P    +N +
Sbjct: 66  TVSINMDLFV-NIPCKWITVNVRDQTMDRKLASEE------LNFEEMPFFIPFDVRINDI 118

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +          T +L++    G    AE   +       +  +   Y    LP+ +   
Sbjct: 119 AE--------IITPQLDE--ILGEAIPAEFREKLDTRMYYDENDPETYNN--LPDFN--- 163

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                            GC I+G L VNRV+G   I A G  Y+       +  P     
Sbjct: 164 -----------------GCHIFGSLPVNRVAGELQITAKGYGYA-----DRERTPMDQIK 201

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKL 304
           FN  H I   SFG      D    PLD +     E   + ++Y + +IPT + +L G+++
Sbjct: 202 FN--HVINEFSFGDFYPYID---NPLDKSAKFDLETPKTAYSYDLSVIPTTFRKL-GTEV 255

Query: 305 G------------GGD------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                        G D      G +PGIFF Y    L + +++   +      +++  +S
Sbjct: 256 NTFQYSVAEYHYKGKDSPVPRSGRVPGIFFDYNFESLSIIVSDSRLNFIQFIIRLIAILS 315

Query: 347 -GTYIT---FMLVDALL 359
              YI    F L D L+
Sbjct: 316 FALYIASWIFTLGDLLI 332


>gi|118386954|ref|XP_001026594.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila]
 gi|89308361|gb|EAS06349.1| hypothetical protein TTHERM_01146090 [Tetrahymena thermophila
           SB210]
          Length = 712

 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 62/205 (30%), Positives = 95/205 (46%), Gaps = 16/205 (7%)

Query: 5   ERLKGLDAFTKPYEDFH-EKTVYGGAVTI-VCWLFISYLICVDVCDYFQVSTTEELFVDS 62
           ER K  D F K  +D   EKT+ GG +     +L I+ +I      +F    T     + 
Sbjct: 2   ERFKQFDYFRKVQDDLKSEKTLIGGLIGFSTIFLVITLVIYETYQVFFGNYKTFPFINNY 61

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH-NIYKRRLDLDGKPIQEPQKE 121
           +   K+ ++L+I    I C  L++D  D SG  HL   H  ++K RLD  GK I     +
Sbjct: 62  NPNEKVRVNLNITFEEIFCKALSVDYQDVSGA-HLEDMHWTVHKIRLDQFGKFIN---YD 117

Query: 122 VVNAVKKKKVTTENGT-------TTTELEDP-NKCGSCYGAETETRKCCNTCNEVKEAYR 173
             N +KK++     G        T  ++++  +   SCYGAE    + C TC++V  A+ 
Sbjct: 118 SANDIKKQEQKFYPGNPFFEAVKTNNQVQNQFSNSVSCYGAELYEGQICLTCSDVLIAFA 177

Query: 174 YKKWALPELDTIVQCKNEYSTEKLK 198
            + W  P  + I QC NE + E  K
Sbjct: 178 QRGWPQPMKEQISQC-NEGTKENFK 201



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 45/164 (27%), Positives = 68/164 (41%), Gaps = 32/164 (19%)

Query: 203 EGCQIYGYLEVNRVSGSFHIA---PGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSF-- 257
           E CQIYG+  V +V G+FH++    GL           +   ++  FN  H I  L F  
Sbjct: 547 EKCQIYGHFYVKKVPGNFHVSFHNEGL-----------LLMNSNLIFNLRHTIHTLEFTT 595

Query: 258 ---GIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-----------DGSK 303
               + L    +   PLD T+     G    +YY+K++ T++E +               
Sbjct: 596 EDGSLTLGKYTKSSNPLDKTIHNPGHGMDT-DYYLKVVNTVFENMLSEHNNIYSFTSLET 654

Query: 304 LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
            G  D  +P + F YE  P+ V    KS+SL       +C I G
Sbjct: 655 SGVRDFRLPSVNFRYEFDPITVLHYRKSRSLTQFIV-TLCAIVG 697


>gi|356545151|ref|XP_003541008.1| PREDICTED: protein disulfide-isomerase 5-4-like [Glycine max]
          Length = 453

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 51/195 (26%), Positives = 83/195 (42%), Gaps = 36/195 (18%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
           GC++ GY+ V +V G+  I+             D   + ++  N +H I +LSFG K+  
Sbjct: 266 GCRVEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHVINNLSFGKKVTP 315

Query: 262 --QDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL---- 304
               D +   P  G+      G S  N           +YI+I+ T      G KL    
Sbjct: 316 RAMSDVKLLIPYIGSSHDRLNGRSFINTRDLGANVTIEHYIQIVKTEVVTRKGYKLIEEY 375

Query: 305 -------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                        +P   F  ELSP+ V ITE  +S  H  T +   I G +    ++D+
Sbjct: 376 EYTAHSSVAHSLDIPVAKFHLELSPMQVLITENQRSFSHFITNVCAIIGGVFTVAGILDS 435

Query: 358 LLHSCVKKISKVEIG 372
           +LH+ ++ + K+E+G
Sbjct: 436 ILHNTIRMVKKIELG 450


>gi|321465392|gb|EFX76393.1| hypothetical protein DAPPUDRAFT_306117 [Daphnia pulex]
          Length = 289

 Score = 65.9 bits (159), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 81/187 (43%), Gaps = 29/187 (15%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           +GC       +NRV G+FH++          H  D QP ++   +  H+I  L+FG  L 
Sbjct: 112 KGCIFESRFHINRVPGNFHVS---------THSADKQPDSA---DMAHYITSLTFGEMLD 159

Query: 263 DDD--ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-------------- 306
           + +      PL        + A   +Y +KI+PTIYE   G+ L                
Sbjct: 160 NKNLPGNFNPLARRDRSQADPAESHDYTMKIVPTIYEDSAGTTLVSYQYTYAYSNYVSFS 219

Query: 307 -GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
            G      I+F Y+L+P+ VK  E+ + +    T +   I GT+    ++D+ + +  + 
Sbjct: 220 LGGRSPAAIWFRYDLNPITVKYHERRQPIYAFLTSVCAIIGGTFTVAGIIDSFVFTASEI 279

Query: 366 ISKVEIG 372
             K E+G
Sbjct: 280 FKKFELG 286



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/101 (27%), Positives = 52/101 (51%), Gaps = 2/101 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+ LD + K  +D  + TV G  ++I C  F+++L   +   +       ELFVD+   +
Sbjct: 5   LRRLDIYRKVPKDLTQPTVTGAVISICCCAFMTFLFFSEFFHFISPEVVSELFVDNPGNT 64

Query: 67  --KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
             K+P+ ++I +P ++C+Y+ +D  D  G   +    N  K
Sbjct: 65  DEKIPVQINITLPRLACEYVGIDIQDDLGRHDVGFIENTLK 105


>gi|384244593|gb|EIE18093.1| protein disulfide isomerase [Coccomyxa subellipsoidea C-169]
          Length = 479

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 61/110 (55%), Gaps = 1/110 (0%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SS 63
           ++L+ +D + K   D  E T+ G  +++V    I  L+  ++  +  + T EEL VD S+
Sbjct: 6   QKLRSVDFYRKIPNDLTEATLAGAGISLVAAFTIVVLLTAELSSFLAIETKEELIVDRSA 65

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
            G  L I+ +I  P++SC++  LD  D+ G + +++   I K  +D DG+
Sbjct: 66  HGDLLRINFNISFPSLSCEFATLDVSDALGTKRMNLTKTIRKLPIDEDGQ 115



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 49/204 (24%), Positives = 82/204 (40%), Gaps = 43/204 (21%)

Query: 202 TEGCQIYGYLEVNRVSGSFHI---APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           T GC + G++ V +V G+ H    +PG S+        D Q     A N +H + +L FG
Sbjct: 289 TSGCALSGFVLVKKVPGALHFLAKSPGHSF--------DYQ-----AMNMSHVVNYLYFG 335

Query: 259 IKLQD--------------DDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL 304
            K                  D+    L G    +    + F +Y++++ T  E       
Sbjct: 336 NKPSPRRHQSLAKLHPAGLSDDWADKLAGQDFFSRAAKATFEHYMQVVLTTIEPSKHRPE 395

Query: 305 GGGDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              D               +P   F+Y+LSP+ + ++EK ++  H  T     I G +  
Sbjct: 396 LSYDAYEYTVHSHTYDTADIPAAKFTYDLSPIQILVSEKRRAWYHFVTTTCAIIGGVFTV 455

Query: 352 FMLVDALLHSCVKKISKVEIGGKT 375
             +VD L+H+  +   KVE+G  T
Sbjct: 456 AGIVDGLVHTGARFAKKVELGKHT 479


>gi|17570549|ref|NP_508375.1| Protein Y102A11A.6 [Caenorhabditis elegans]
 gi|351063407|emb|CCD71590.1| Protein Y102A11A.6 [Caenorhabditis elegans]
          Length = 286

 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 47/193 (24%), Positives = 80/193 (41%), Gaps = 37/193 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+     E+N+V G+FH++          H    QP        ++ +RHL   IK  D
Sbjct: 110 GCRFESRFEINKVPGNFHLS---------THSAATQP-------ESYDMRHLIHSIKFGD 153

Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
           D   +       PL       E G +   Y +KI+P+++E   G+ L             
Sbjct: 154 DVSHKNLKGSFDPLAKRNTSQENGLNTHEYILKIVPSVHEDYSGTILNSYQYTFGHKSYI 213

Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                   +P ++F YEL P+ +K TE+ +S     T I   + GT+    ++D+   + 
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273

Query: 363 VKKISKVEIGGKT 375
            + + K  +G  T
Sbjct: 274 SELVKKQRLGKLT 286



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 50/92 (54%), Gaps = 1/92 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
          ++  D + K  +D  + T  G  ++I+C LFIS++I  D+  Y  +    E F+D   R 
Sbjct: 4  IRRFDIYRKVPKDLTQPTTVGAVISILCVLFISFMIFNDILAYIFIDLRSEFFIDDPGRE 63

Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
           K+ + +++  P ++C+YL +D  D +G   +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95


>gi|324499844|gb|ADY39943.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Ascaris suum]
          Length = 429

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 91/387 (23%), Positives = 160/387 (41%), Gaps = 41/387 (10%)

Query: 3   FSERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEE--LF 59
             E ++ LDAF K  ++   EK   G  +++VC+  I  L+  ++  Y    T  E    
Sbjct: 15  LQEIVQSLDAFDKTTDEIKEEKKTSGAIISVVCFTVIGVLVFGELKTYIYGDTEFEYKFT 74

Query: 60  VDSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ 119
           VD++   +  + LD++V T  C  L      ++ E+   +  N +KR  D       E +
Sbjct: 75  VDTAFDEQPELELDMIVAT-PCTNLVAQLSGTAAEEFFLL--NQFKR--DPTRFEFTERE 129

Query: 120 KEVVNAVKK-KKVTTENGTTTTELED----PNKCGSCYGAETETRKCCNTCNEVKEAYRY 174
           ++  + +K+   VT   G     LE              AE E ++        KE    
Sbjct: 130 QKYWDELKRVHGVTKPGGMVFKGLEKMEFVSGHVEEGLKAEAEVKQREEAIAIEKERKNN 189

Query: 175 KK-----WALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSG-SFHIAPGLSY 228
           K+      A+  +   +   +  +++  K+  T  C+++G + VN+V G S  I  G   
Sbjct: 190 KQEDTFGGAILLIGNGINVFHILASDSQKDEGT-ACRVHGRVRVNKVKGDSVIITAGKGA 248

Query: 229 SINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYY 288
            I+ +  H      S A N +H I  L FG  +        PL GT   +E G   + Y+
Sbjct: 249 GIDGLFAH--VDGASNAGNISHRIARLHFGPWIGG---LLTPLAGTEQISESGIDEYRYF 303

Query: 289 IKIIPT-IYER--LDGSKL-------------GGGDGGMPGIFFSYELSPLMVKITEKSK 332
           +K++PT I+      GS +              G +   P I   YE + L+V++ E   
Sbjct: 304 LKVVPTRIFHSGFFGGSTMRYQYSVTKTHKRPSGREHMHPAIAIHYEFAALVVEVRETQT 363

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALL 359
           SL  L+ ++   + G + T  +++ L 
Sbjct: 364 SLFQLFVRLCSVVGGVFATSSILNELF 390


>gi|154286632|ref|XP_001544111.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407752|gb|EDN03293.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 315

 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 77/318 (24%), Positives = 126/318 (39%), Gaps = 75/318 (23%)

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           ++LDIVV  + CD L ++  D++G++ L  +       LD          +E+       
Sbjct: 1   MNLDIVV-AMPCDALRVNVQDAAGDRILASD------LLDKQQTSWAAWNREL------N 47

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
            VT+  G     L + +   S    +       +   E K +Y+ K    P+L       
Sbjct: 48  GVTSGGGREYQTLNEEDL--SRLMEQEADAHVGHALGEAKRSYKRKFPKGPKLK------ 99

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV-HDIQPYTSAAFN 247
                   +    + C+IYG LE N+V G FHI A G  Y     H+ HD       AFN
Sbjct: 100 --------RGEKADSCRIYGSLEGNKVQGDFHITARGHGYFEFGEHLSHD-------AFN 144

Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL-------- 299
            +H +  LSFG           PLD T++        F YY+ ++PTIY R         
Sbjct: 145 FSHMVTELSFGPHYP---SLLNPLDKTISVTPARFFKFQYYLSVVPTIYTRAGIVDPYNH 201

Query: 300 ---DGSKLGGGDGG-----------------------MPGIFFSYELSPLMVKITEKSKS 333
              D + +   + G                       +PGIFF Y + P+++ ++E+  S
Sbjct: 202 VLPDPTTIRPSERGSTIFTNQYAATSQSHEVPDPQYHIPGIFFKYNIEPILLVVSEERGS 261

Query: 334 LGHLWTKIMCNISGTYIT 351
           L  L  +++  ++G  + 
Sbjct: 262 LLALLVRLVNVLAGVVVA 279


>gi|409048375|gb|EKM57853.1| hypothetical protein PHACADRAFT_116248 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 546

 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 75/345 (21%), Positives = 135/345 (39%), Gaps = 66/345 (19%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF K    +  ++   G +T+        L+  D+ +Y       E  VD  R S L 
Sbjct: 27  FDAFPKLPSTYKARSEGRGFLTVFVTFMAFLLVLNDLGEYIWGWPDHEFSVDRDRSSDLR 86

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           I++D++V  + C YL++D  D+ G++ L++  + ++R   L         KE   A+  +
Sbjct: 87  INVDMLV-NMPCQYLSVDLRDAVGDR-LYLS-DSFRRDGTLFDIGQATALKEHAAALSAR 143

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCK 189
           +V T++           K    +   T  R+        +  Y YK              
Sbjct: 144 QVVTQS----------RKSRGLF--ATLFRR---NSGGFRPTYNYKPSG----------- 177

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNT 248
                          C++YG + V +V+ + H+   G  Y+      H++        N 
Sbjct: 178 -------------SACRVYGSVAVKKVTANLHVTTLGHGYASRQHVDHNL-------MNL 217

Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG- 307
           +H I   SFG    D  +   PLD +    E+    + YY+ ++PT Y       L    
Sbjct: 218 SHVITEFSFGPYFPDITQ---PLDNSFELTEDSFVSYQYYLHVVPTTYIAPRSRPLHTHQ 274

Query: 308 ------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTK 340
                       + G+PGIFF +++ P+ + I +++ SL  L  +
Sbjct: 275 YSVTHYTRVLKHNNGIPGIFFKFDVDPMSLTIHQRTTSLLQLLIR 319


>gi|299116076|emb|CBN74492.1| DEAD box helicase [Ectocarpus siliculosus]
          Length = 865

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 83/189 (43%), Gaps = 34/189 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGI---- 259
           GC + G++ VNRV G+FHI    + S +H        +  A  N +H + H+SFG     
Sbjct: 684 GCMVTGHIMVNRVPGNFHIE---AASKSHT-------FHGATTNLSHIVHHMSFGNDPPR 733

Query: 260 -------KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------RLDGSKLGG 306
                  +L +D  +  PLDG V  A       ++Y++++ ++Y          G ++  
Sbjct: 734 RTQTKINRLTEDLRQNAPLDGNVYVANAYHQAPHHYLRVVGSMYHLSPMKTPWHGYQIVA 793

Query: 307 GDGGM-------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
               M       P   FSY +SP+ V +  + +      TK++  + GT+    LVDA +
Sbjct: 794 NSQMMLYDEEEVPEARFSYNISPMSVLVRSEKRPWYDFVTKVLAIVGGTFSMVGLVDAAV 853

Query: 360 HSCVKKISK 368
               +K  +
Sbjct: 854 FRASRKAGR 862



 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 1/104 (0%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           LD + K   D  + T  GG  + +  + +  L  V++  +       ++ VD+   +KL 
Sbjct: 411 LDLYPKIPTDLSQSTAVGGWFSTLTGVIMLLLFQVELFSFMSAPIESQVVVDNVLETKLQ 470

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDG 112
           I+ ++    + C+YL++DA+D  G   +++    + K  LD  G
Sbjct: 471 INFNMSFLDLPCEYLSVDALDVLGSNRVNITGKEVQKWHLDPQG 514


>gi|406607484|emb|CCH41148.1| Endoplasmic reticulum-Golgi intermediate compartment protein 3
           [Wickerhamomyces ciferrii]
          Length = 359

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 54/192 (28%), Positives = 86/192 (44%), Gaps = 33/192 (17%)

Query: 198 KNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSIN---HVHVHDIQPYTSAAFNTTHHIR 253
           K++    C IYG + VN+VSG FHI A G  Y  N   HV +  +        N TH I 
Sbjct: 163 KDSGAPACHIYGSIPVNKVSGDFHITAQGYGYRGNSRSHVGIDGL--------NFTHIIS 214

Query: 254 HLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD------------G 301
             SFG   +       PLD TV   +E    + YY+ ++PT+Y++L              
Sbjct: 215 EFSFG---EFYPYIHNPLDATVQITKEHLQSYQYYLSVVPTVYKKLGVEIETNQYSTSLQ 271

Query: 302 SKLGGGDG-GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI----TFMLVD 356
            KL   +  G+PG+FF Y+  P+ + + +K         ++     G  +    ++ L D
Sbjct: 272 KKLYSFENKGVPGLFFKYDFEPISLIVEDKRIPFSTFLVRLATIYGGIIVVAKFSYKLFD 331

Query: 357 -ALLHSCVKKIS 367
            AL++   K+ +
Sbjct: 332 KALIYFFGKRFA 343


>gi|328771759|gb|EGF81798.1| hypothetical protein BATDEDRAFT_86854 [Batrachochytrium
           dendrobatidis JAM81]
          Length = 333

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 48/169 (28%), Positives = 67/169 (39%), Gaps = 31/169 (18%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV-HDIQPYTSAAFNTTHHIRHLSFGIKL 261
           + C+  G  + N+V G  H    L +    VH  HD       A N TH I  LSFG + 
Sbjct: 158 DACRFRGSFQANKVEGMLHFT-ALGHGYFGVHTPHD-------AINFTHRIDELSFGARY 209

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG--------------- 306
            D      PLD T+         F Y++ ++PTIY     S  G                
Sbjct: 210 PD---LHNPLDHTLEIGTTNFDSFMYFLGVVPTIYVDKARSLFGATLLTNQYAVTEFSHA 266

Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                   +PGIF  Y + P+ V+ITE    L    T++   I G ++T
Sbjct: 267 VDPQNPDALPGIFIKYHIEPISVRITESRLGLVQFTTRMCGIIGGAFVT 315



 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 45/92 (48%), Gaps = 3/92 (3%)

Query: 3   FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
            S+RL  LDAF K  +   + T  GG V+++    + YL C ++  +  +    +  VD 
Sbjct: 10  LSKRLASLDAFPKIEKQLQQTTKSGGLVSLMMLAVLVYLACTEIYRWRSIDQRYDFIVDQ 69

Query: 63  SRGSK--LPIHLDIVVPTISCDYLALDAVDSS 92
           +R  +  L I++D+ +  + C  L  D  D S
Sbjct: 70  TRSHEHSLQINVDLTI-AMDCKVLRADIQDIS 100


>gi|189207969|ref|XP_001940318.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187976411|gb|EDU43037.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 394

 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 79/382 (20%), Positives = 147/382 (38%), Gaps = 90/382 (23%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF K  + +  +     A T+   L   YL   ++  +   ST++   V+      + 
Sbjct: 24  FDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEISRWLAGSTSQSFSVEKGISHDMQ 83

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           ++LD++V  + C  L ++  D++G++ L  E                         + +K
Sbjct: 84  LNLDVIV-AMRCADLHVNMQDAAGDRTLAGE-------------------------LLRK 117

Query: 130 KVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
             T+ +  T   LE   ++ G   G      +  +   ++ +A++ K    P +      
Sbjct: 118 DPTSWSQWTGRNLERGTHELGIDAGKAQPWEEVWDVHEQLGKAHKRKFSKTPRI------ 171

Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT 248
           + E          T+ C+IYG L+ N+V G FHI      +  H ++   Q    ++FN 
Sbjct: 172 RGE----------TDSCRIYGSLDGNKVQGDFHIT-----ARGHGYIEFGQHLDHSSFNF 216

Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIY--------- 296
           +H IR +SFG           PLD T+A     ++    F YY+ I+PTIY         
Sbjct: 217 SHIIREMSFGPYYP---SLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPSLIPL 273

Query: 297 -ERLDGSKLGGGDGGM--------------------------PGIFFSYELSPLMVKITE 329
            E +  +    G   M                          PGIF  +++ P+++++ E
Sbjct: 274 LELVGSTSNHPGAASMFHGAHAIKTNQYAVTSQSHKVPENYVPGIFVKFDIEPIVLRVVE 333

Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
           +      L   ++  +SG  + 
Sbjct: 334 EWGGFWRLIVTLINVVSGVMVA 355


>gi|449530722|ref|XP_004172342.1| PREDICTED: protein disulfide isomerase-like 5-4-like, partial
           [Cucumis sativus]
          Length = 176

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 35/110 (31%), Positives = 60/110 (54%), Gaps = 1/110 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E T+ G  ++IV  L + +L  +++ +Y  VST+  + V
Sbjct: 1   MISSTKLKSVDFYRKIPRDLTEATLSGAGLSIVAALSMVFLFGMELSNYLSVSTSTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLD 109
           D S+ G  L +  +I  P +SC++ A+D  D  G   L++   I K  +D
Sbjct: 61  DNSTDGDFLRMDFNISFPALSCEFAAVDVNDVLGTNRLNITKTIRKFSID 110


>gi|402224967|gb|EJU05029.1| DUF1692-domain-containing protein [Dacryopinax sp. DJM-731 SS1]
          Length = 517

 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 74/346 (21%), Positives = 131/346 (37%), Gaps = 72/346 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K    +  ++  GG +T+   L    L+  D  +Y   +TT    VD     
Sbjct: 20  LKSFDAFPKVPSTYRTRSSGGGFITLGIALLCLLLVLNDWAEYVWGTTTWRFVVDDKIEK 79

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           ++ +++DI V  + C Y+++D  D+ G++ LH+     +     D +     +++  +  
Sbjct: 80  EMMLNVDITV-AMPCHYISVDLRDAVGDR-LHLSDQFKRDGTLFDARQATHIREQYTD-- 135

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                                    Y A+   R+       +                I 
Sbjct: 136 -------------------------YSAQQMVREAKTRRGRIG---------------IF 155

Query: 187 QCKNEYSTEKLKNTFT-----EGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQP 240
                      + TF        C++YG +EV +V  + HI   G  Y  N    H +  
Sbjct: 156 DWLRRRQPSAFQPTFNHVKDGSACRVYGSMEVKKVQANLHITTLGHGYHSNEHTDHSL-- 213

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLD 300
                 N +H I   SFG    D  +   PLD T+  +++  + F Y++ ++PT Y    
Sbjct: 214 -----MNLSHIITEFSFGPYFPDIVQ---PLDYTIESSDDPFTAFQYFLTVVPTEYRTSK 265

Query: 301 G----SKLGGGD--------GGMPGIFFSYELSPLMVKITEKSKSL 334
           G    ++   G          G P IFF Y+L PL + + +++ +L
Sbjct: 266 GVVKTNQYSVGSHMQHIQHGRGTPVIFFKYDLEPLSLIVEQRTTTL 311


>gi|443683891|gb|ELT87978.1| hypothetical protein CAPTEDRAFT_224400 [Capitella teleta]
          Length = 292

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 85/200 (42%), Gaps = 32/200 (16%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           +T+K+     EGC+     ++N+V G+FHI+          H    QP      N  H +
Sbjct: 102 NTDKVPINNNEGCRFKSSFKINKVPGNFHIS---------THASKEQPPQP---NMKHIV 149

Query: 253 RHLSFGIKLQDDDERRKPLDGTVA--KAEEGA-SMFNYYIKIIPTIYERLDGSKL----- 304
             L FG ++          +  +   K+E  A S  +YY+KI+P ++    G  L     
Sbjct: 150 HELIFGDRVPQTIHIPGSFNPLLEKDKSESNALSSHDYYLKIVPAVFNDYSGKTLMHPYQ 209

Query: 305 -----------GGGDGGMPGIFFSYELSPLMVKITEKSK-SLGHLWTKIMCNISGTYITF 352
                       GG   +P I+F Y+L+P+ VK +E+      H  T +   + GT+   
Sbjct: 210 YTFAYRHSIRQRGGQVVIPAIWFKYKLNPMCVKYSEQRPIPFYHFLTAVCAIVGGTFTVA 269

Query: 353 MLVDALLHSCVKKISKVEIG 372
            + D+ L +  +   K E+G
Sbjct: 270 GIFDSFLFTAAEIFKKAELG 289



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 25/93 (26%), Positives = 47/93 (50%), Gaps = 2/93 (2%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD--SSR 64
          ++ LD + K  +D  + T  G  +++   LFI+YL   ++  Y       E++VD  ++ 
Sbjct: 5  IRRLDIYRKIPKDLTQPTKTGACISVGSVLFIAYLFISELTSYLSSEIVTEMYVDDPATN 64

Query: 65 GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
            ++P+ LDI +  + C Y+ LD  D  G   +
Sbjct: 65 SERIPVKLDISLLNMECKYIGLDIQDDLGRHEV 97


>gi|341874049|gb|EGT29984.1| hypothetical protein CAEBREN_24080 [Caenorhabditis brenneri]
          Length = 286

 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 44/193 (22%), Positives = 83/193 (43%), Gaps = 37/193 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+     E+N+V G+FH++                 +++A+    + ++H+   IK  D
Sbjct: 110 GCRFESRFEINKVPGNFHLST----------------HSAASQPENYDMKHIIHSIKFGD 153

Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
           D   +       PL    +  E G S   Y +KI+P+++E   G+ L             
Sbjct: 154 DVSHKNLKGSFDPLANRDSLQENGLSTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYI 213

Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                   +P ++F YEL P+ +K TE+ +S     T I   + GT+    ++D+   + 
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273

Query: 363 VKKISKVEIGGKT 375
            + + K ++G  T
Sbjct: 274 SELVKKQQMGKLT 286



 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
          ++  D + K  +D  + T  G  ++I C LFIS++I  DV  Y  +    E F+D   R 
Sbjct: 4  IRRFDIYRKVPKDLTQPTTVGALISIFCVLFISFMIFNDVLAYIFIDLRSEFFIDDPGRE 63

Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
           K+ + +++  P ++C+YL +D  D +G   +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95


>gi|324516732|gb|ADY46617.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Ascaris suum]
          Length = 286

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/186 (24%), Positives = 82/186 (44%), Gaps = 29/186 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+     E+N+V G+FH++          H    QP    +++  H +  + FG  LQ+
Sbjct: 110 GCRFEANFEINKVPGNFHLS---------THSAASQP---ESYDMRHIVNSVKFGDDLQE 157

Query: 264 DDE--RRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
             +     PL    A   +  +   Y +K++P++YE + G               +    
Sbjct: 158 KAQIGSFNPLQDRTALQGDPLNTHEYILKVVPSVYEDIAGRTKYSYQYTYAHKEYIAYHH 217

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
            G  +P ++F YEL P+ VK TE+ + L    T +   + GT+    ++D+ L S  +  
Sbjct: 218 SGRIIPAVWFKYELQPITVKYTERRQPLYAFITSVCAVVGGTFTVAGIIDSSLFSLSELY 277

Query: 367 SKVEIG 372
            K ++G
Sbjct: 278 KKHQLG 283



 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 51/92 (55%), Gaps = 1/92 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
          ++ LD + K  +D  + T  G  ++I+C  FI++++  D+  +  V    ELFVD   R 
Sbjct: 4  IRRLDIYRKVPKDLTQPTRTGAVISIICVCFIAFMLFNDLRMFLSVDLHSELFVDDPGRE 63

Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
           ++ +HL+  +P + C+YL +D  D +G   +
Sbjct: 64 GRIKVHLNATLPYLPCEYLGVDIQDENGRHEV 95


>gi|395326723|gb|EJF59129.1| hypothetical protein DICSQDRAFT_156384 [Dichomitus squalens
           LYAD-421 SS1]
          Length = 559

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/361 (22%), Positives = 141/361 (39%), Gaps = 72/361 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K  E +   +   G +T+        LI  D+ ++       E  VD    +
Sbjct: 28  LAQFDAFPKLPETYKTHSESRGFLTLFVAFVAFLLILNDLGEFIWGWPDFEFGVDKMPSA 87

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+VV  + C YL++D  D+ G+            RL L             +  
Sbjct: 88  NLDINVDMVV-NMPCQYLSIDLRDAVGD------------RLYLS------------DGF 122

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           ++     + G  T+  E        + A    R+       V ++ R + +     DT++
Sbjct: 123 RRDGTKFDIGQATSLKE--------HAAMLSARQA------VSQSRRSRGF----FDTLL 164

Query: 187 QCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
             + + S +   N   +G  C+IYG +   RV+ + H+   G  Y+ +H HV H      
Sbjct: 165 H-RTKSSFKPTYNYQPDGSACRIYGTITAKRVTANLHVTTLGHGYA-SHEHVDHKF---- 218

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
               N +H I   SFG    D  +   PLD +   A +    + Y++ ++PT Y      
Sbjct: 219 ---MNLSHVITEFSFGPYFPDITQ---PLDNSFEMAHDPFVAYQYFLHVVPTTYIAPRSK 272

Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
            L                  G PGIFF ++L P+ + I +++ SL     +    + G +
Sbjct: 273 PLHTNQYSVTHYTRVLDHHRGTPGIFFKFDLEPIHMTIHQRTTSLAAFLLRCAGVVGGVF 332

Query: 350 I 350
           +
Sbjct: 333 V 333


>gi|361132020|gb|EHL03635.1| hypothetical protein M7I_0279 [Glarea lozoyensis 74030]
          Length = 235

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 54/180 (30%), Positives = 71/180 (39%), Gaps = 68/180 (37%)

Query: 207 IYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPY----TSAAFNTTHHIRHLSFGIKLQ 262
           I G L VN+V G+FHIAPG S+S  ++HVHD+  Y           +H I HL FG +L 
Sbjct: 38  IEGALRVNKVIGNFHIAPGRSFSNGNMHVHDLNNYFDTPVEGGHVFSHTIHHLRFGPQLP 97

Query: 263 DD-------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL---------- 299
           ++             +    PLD T     E A  F Y++K++ T Y  L          
Sbjct: 98  EELTKKLGTKTNLWTNHHLNPLDDTKQTTTEPAYNFMYFVKVVSTSYLPLGWETQAYKSQ 157

Query: 300 ---------------DGS-------------KLGGGD-------------GGMPGIFFSY 318
                          DGS              L GGD             GG+PG+FFSY
Sbjct: 158 LGSEWVGIGSYGHQHDGSVETHQYSVTSHRRSLNGGDDASEGHKEKVHARGGIPGVFFSY 217


>gi|308494873|ref|XP_003109625.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
 gi|308245815|gb|EFO89767.1| hypothetical protein CRE_07522 [Caenorhabditis remanei]
          Length = 286

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 46/193 (23%), Positives = 79/193 (40%), Gaps = 37/193 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+     E+N+V G+FH++          H    QP         + +RH    IK  D
Sbjct: 110 GCRFESRFEINKVPGNFHLS---------THSAATQP-------DNYDMRHTIHSIKFGD 153

Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
           D   +       PL       E G +   Y +KI+P+++E   G+ L             
Sbjct: 154 DVSHKNLKGSFDPLANRDTSQENGLNTHEYILKIVPSVHEDYSGNILNSYQYTFGHKSYI 213

Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                   +P ++F YEL P+ +K TE+ +S     T I   + GT+    ++D+   + 
Sbjct: 214 TYHHSGKIIPAVWFKYELQPITLKQTEQRQSFYAFLTSICAVVGGTFTVAGIIDSTFFTI 273

Query: 363 VKKISKVEIGGKT 375
            + + K ++G  T
Sbjct: 274 SELVKKQQMGKLT 286



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRG 65
          ++  D + K  +D  + T  G  ++++C  FI+++I  DV  Y  +    E F+D   R 
Sbjct: 4  IRRFDIYPKIPKDLTQPTTAGAVISMLCVAFIAFMIFNDVLAYIFIDLRSEFFIDDPGRE 63

Query: 66 SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL 97
           K+ + +++  P ++C+YL +D  D +G   +
Sbjct: 64 GKIDVQVNVSFPHMACEYLGVDIQDENGRHEV 95


>gi|392564830|gb|EIW58008.1| DUF1692-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 539

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 77/361 (21%), Positives = 141/361 (39%), Gaps = 72/361 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  ++   G +T+        L+  D+ +Y       E  VD+ + +
Sbjct: 25  LAQFDAFPKVPSSYKTRSESRGFLTLFVAFVAFLLVLNDIGEYIWGWPDYEFGVDTDQTN 84

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L I++D+V+  + C +L++D  D+ G+            RL L             +  
Sbjct: 85  ALDINVDMVI-NMPCQFLSVDLRDAVGD------------RLFLS------------DGF 119

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEV--KEAYRYKKWALPELDT 184
           ++     + G  T+ L++  +  S   A +++R      + +  + A RYK         
Sbjct: 120 RRDGTKFDIGQATS-LKEHAEALSARQAVSQSRSSRGFFDVLLRRAAVRYKP-------- 170

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHV-HDIQPYT 242
                  Y  +         C+++G +   RV+ + HI   G  Y+ +  HV H +    
Sbjct: 171 ----TYNYQPDG------SACRVFGTITAKRVTANLHITTLGHGYA-SQTHVDHKL---- 215

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS 302
               N +H I   SFG    D  +   PLD +     E    + YY+ ++PT Y      
Sbjct: 216 ---MNLSHVITEFSFGPYFPDITQ---PLDNSFELTSEPFVAYQYYLHVVPTTYIAPRTK 269

Query: 303 KLGGGD-------------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
            L                  G PGIFF ++L P+ + I +++ S   L+ + +  I G +
Sbjct: 270 PLNTNQYSVTHYTRVLDHHRGTPGIFFKFDLEPMKLTIHQRTTSFVQLFIRTVGVIGGVF 329

Query: 350 I 350
           +
Sbjct: 330 V 330


>gi|300122162|emb|CBK22736.2| unnamed protein product [Blastocystis hominis]
          Length = 331

 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 30/112 (26%), Positives = 61/112 (54%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M +   ++ LD F K   D  E ++ G  +TIVC++ ++ L+ ++  +YF + T  +  +
Sbjct: 1   MGWRSTVRKLDMFRKVPVDLTEGSICGTILTIVCYILVAALVALEFNNYFTIDTRTDYII 60

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           +      + I+ DI + ++SCD  +LD V+  G   ++V  NI + ++  +G
Sbjct: 61  EQHDDEYIQINFDITMKSLSCDLASLDIVNQMGTHRINVTQNIRRWQVFENG 112


>gi|397568633|gb|EJK46248.1| hypothetical protein THAOC_35093 [Thalassiosira oceanica]
          Length = 601

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 60/214 (28%), Positives = 98/214 (45%), Gaps = 54/214 (25%)

Query: 196 KLKNTFTE----GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHH 251
           K+K+++ E    GCQI G+L V+R  G+FHI    + S N    HD+  + +   N +H 
Sbjct: 396 KVKHSWDEDEHPGCQISGFLLVDRAPGNFHIQ---AQSKN----HDLAAHMT---NVSHI 445

Query: 252 IRHLSFG-------IK--LQDDD----ERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE- 297
           I HLSFG       IK  L++      +  +P DG V          ++Y+K+I T +E 
Sbjct: 446 INHLSFGKPFSKYFIKEGLKNTPAGFLDTTRPFDGNVYVTHNEHEAHHHYLKVITTEFEP 505

Query: 298 RLDGSKLGGGDGG--------------------------MPGIFFSYELSPLMVKITEKS 331
           + D  K  G   G                          +P   F+Y+LSP+ V  ++K 
Sbjct: 506 QRDTKKQYGKKKGFYKPPEPQRAYQILQSSQLSLYRNDIVPEAKFTYDLSPIAVSYSKKY 565

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
           ++    +T +M  I GT+    +V++ L++  KK
Sbjct: 566 RAWYDYFTSLMAIIGGTFTVVGMVESSLYAVSKK 599



 Score = 41.2 bits (95), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 33/143 (23%), Positives = 64/143 (44%), Gaps = 5/143 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L  LD + K   D  E T  G  ++ +  + ++ L  ++   +F  S +  L +DS+   
Sbjct: 80  LASLDMYRKVPVDLLEGTKRGSIMSTLAIMSMATLFFLETRAFFSSSLSTNLALDSNTDQ 139

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + ++ +I +  + CDY  +D V   G Q  +V  ++ K  +D  G   +  Q+     +
Sbjct: 140 NVRVNFNITMMDLRCDYATIDVVSVLGTQQ-NVTQHVQKYPIDQYGVRQRYQQRN----L 194

Query: 127 KKKKVTTENGTTTTELEDPNKCG 149
           K+  V   + T    +ED +  G
Sbjct: 195 KQHDVQQFDATVEETIEDLHADG 217


>gi|168012320|ref|XP_001758850.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689987|gb|EDQ76356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 487

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 87/195 (44%), Gaps = 36/195 (18%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK--- 260
           GC++ G++ V +V G   I+   ++S +H        + + + N TH++   SFG K   
Sbjct: 300 GCRVEGFVRVKKVPGELMIS---AHSGSH-------SFDATSMNMTHYVGFFSFGRKTSW 349

Query: 261 ---------LQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSKLGGG 307
                    L   D     L G V  +E      ++Y++++ T    ++ + D   L   
Sbjct: 350 RSVHWVNEMLPALDSNIDRLTGQVFPSEYENITHDHYLQVVKTEVITLHRKQDLRVLEQY 409

Query: 308 D----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
           D            +P + F YELSP+ V + E  KS  H  T +   I G +    ++D+
Sbjct: 410 DYTAHSNMIQSTKVPVVKFHYELSPMQVLVKENPKSFSHFLTNLCAIIGGVFTVAGIIDS 469

Query: 358 LLHSCVKKISKVEIG 372
           +LH+ +  + KVE+G
Sbjct: 470 MLHNAMHIMKKVELG 484


>gi|330935325|ref|XP_003304912.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
 gi|311318248|gb|EFQ86993.1| hypothetical protein PTT_17645 [Pyrenophora teres f. teres 0-1]
          Length = 395

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/386 (20%), Positives = 144/386 (37%), Gaps = 91/386 (23%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +   DAF K  + +  +     A T+   L   YL   ++  ++  ST +   V+     
Sbjct: 21  VSSFDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEISRWYAGSTWQSFAVEKGVSH 80

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I+LDI+V  + C  L ++  D++G++ L  E                         +
Sbjct: 81  DMQINLDIIV-AMRCADLHVNMQDAAGDRTLAGE-------------------------L 114

Query: 127 KKKKVTTENGTTTTELE-DPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            +K  T+ +  T   LE   ++ G+  G      +  +   ++ +A+             
Sbjct: 115 LRKDPTSWSQWTGRNLERGTHELGTEAGDAPSWEEAWDVREQLGKAH------------- 161

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
              K ++S         + C+IYG L+ N+V G FHI      +  H ++   +    ++
Sbjct: 162 ---KRKFSKTPRIRGNPDSCRIYGSLDGNKVQGDFHIT-----ARGHGYMEFGEHLDHSS 213

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIYE----- 297
           FN +H IR +SFG           PLD T+A     ++    F YY+ I+PTIY      
Sbjct: 214 FNFSHIIREMSFGPYYP---SLTNPLDATIAVTPTPDDKFYKFQYYLSIVPTIYTDDPTL 270

Query: 298 ----RLDGSKLGGGDGG----------------------------MPGIFFSYELSPLMV 325
                   S  G   G                             +PG+F  +++ P+M+
Sbjct: 271 IPYLEAVSSTAGNHPGAASIFHGARAIKTNQYAVTSQSHKVPENYVPGVFVKFDIEPIML 330

Query: 326 KITEKSKSLGHLWTKIMCNISGTYIT 351
            + E+      L   ++  +SG  + 
Sbjct: 331 AVVEEWSGFWRLIVTLVNVVSGVMVA 356


>gi|241560364|ref|XP_002401002.1| COPII vesicle protein, putative [Ixodes scapularis]
 gi|215501827|gb|EEC11321.1| COPII vesicle protein, putative [Ixodes scapularis]
 gi|442749161|gb|JAA66740.1| Putative copii vesicle protein [Ixodes ricinus]
          Length = 285

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 48/185 (25%), Positives = 81/185 (43%), Gaps = 28/185 (15%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +++V G+FH++          H    QP      + TH I  L+FG K+ +
Sbjct: 110 GCRFEGKFYIHKVPGNFHMS---------THAAAKQP---DKIDMTHIIHDLTFGNKMVE 157

Query: 264 DDERR-KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG-------------GDG 309
                   LD        G    +Y +KI+PT++E+    ++                  
Sbjct: 158 GVRGSFNSLDEMDKSEANGLESHDYVMKIVPTVFEKSPSERIESYQYTYAYKSYVSISHS 217

Query: 310 G--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
           G  MP I+F Y+L+P+ VK T +S  L    T +   + GT+    +VD+L+ +  +   
Sbjct: 218 GRIMPAIWFRYDLTPITVKYTRRSVPLYSFLTSVCAIVGGTFTVAGIVDSLVFTASEIFK 277

Query: 368 KVEIG 372
           K E+G
Sbjct: 278 KYEMG 282



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 48/106 (45%), Gaps = 3/106 (2%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           MVF  R    D + K  +D  + TV G  ++I+   FIS L   +   Y       ELFV
Sbjct: 1   MVFDVR--RFDIYRKIPKDLTQPTVTGAVISILSCFFISILFLSEFISYMSPELASELFV 58

Query: 61  DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
           D+ S   K+P+ ++I +  + C  + LD  D  G   +    N  K
Sbjct: 59  DNPSSADKIPVSINITLLKLDCSAVGLDIQDDMGRHEVGFVENTEK 104


>gi|357452761|ref|XP_003596657.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355485705|gb|AES66908.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 482

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 53/198 (26%), Positives = 84/198 (42%), Gaps = 40/198 (20%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG----I 259
           GC+I GY+ V +V G+  I+             D   + ++  N +H + HLSFG     
Sbjct: 293 GCRIEGYVRVKKVPGNLIISAR----------SDAHSFDASQMNMSHAVHHLSFGKKLSP 342

Query: 260 KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTI------------Y 296
           KL  D +R  P  G      +G S  N           +Y++I+ T             Y
Sbjct: 343 KLMSDVQRLIPYVGNSHDRLDGLSFINSHDFGANVTLEHYLQIVKTEVITRQGYQLVEEY 402

Query: 297 ERLDGSKLGGGDGGMPGIFFSYELSPLMV--KITEKSKSLGHLWTKIMCNISGTYITFML 354
           E    S L      +P   F  +LSP+ V   ITE  KS  H  T +   + G +    +
Sbjct: 403 EYTAHSSLAHS-LHVPVARFHLQLSPMQVCVLITEDHKSFSHFITNVCAIVGGVFTVAGI 461

Query: 355 VDALLHSCVKKISKVEIG 372
            +++LH+ ++ + KVE+G
Sbjct: 462 TESILHNTIRLMRKVELG 479


>gi|393908149|gb|EJD74928.1| hypothetical protein LOAG_17836 [Loa loa]
          Length = 430

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 89/396 (22%), Positives = 166/396 (41%), Gaps = 56/396 (14%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
           E ++  DAF K  ++   EK   GG +  + +L I+ L+  ++ +YF           VD
Sbjct: 22  EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELQNYFYGDEGHYYRFSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++      + +D++V T   + +A     +S   H     N +K     D    +  +KE
Sbjct: 82  TAFSEHPELEVDMIVATPCTNLMAHLTGTAS---HEFNSMNGFK----YDPTRFEFTEKE 134

Query: 122 VV--NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY----RYK 175
            +  N +KK +  T+ GTT    +  ++     G   E  K      + +EA+    + K
Sbjct: 135 AMYWNELKKVQHRTKEGTTL--FKSLDEMTFVSGRVEEGLKTEAETKQREEAHAIQLQRK 192

Query: 176 KWALPELD--TIVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSG-SFHIAPGLS 227
           K     LD  T++   N ++   +  + +E      C+I+G + VN+V G SF I+ G  
Sbjct: 193 KNPKQSLDGGTLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFIISTGKG 252

Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
             ++ +  H      S+  N +H I   +FG ++        PL G    +E G   F Y
Sbjct: 253 LDVDGIFAHF--GGVSSPSNISHRIERFNFGPRIYG---LVTPLAGIEQISETGVDEFRY 307

Query: 288 YIKIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKIT 328
           ++KI+PT   R+  S L GG                         I   YE +  ++++ 
Sbjct: 308 FLKIVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHTAIIIHYEFAATVIEVR 364

Query: 329 EKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVK 364
               SL  +  ++   + G + T +L++++   C++
Sbjct: 365 HVQSSLLQMLVRLCSAVGGVFATSILLNSI---CIR 397


>gi|145540599|ref|XP_001455989.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124423798|emb|CAK88592.1| unnamed protein product [Paramecium tetraurelia]
          Length = 322

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 76/383 (19%), Positives = 144/383 (37%), Gaps = 89/383 (23%)

Query: 7   LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           L+ LD F K   D  +  +  GG +T + +  ++ L   +   +F      +  +D+   
Sbjct: 3   LRQLDFFRKLNTDIGDTSSALGGFLTTIAFALVTILTMNECRLFFSTELNYQTVIDNDTE 62

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + +HLD++V    C  L+LD  D  G   + V   + K  LD D        + V+ +
Sbjct: 63  QFIKVHLDMIVGA-PCMVLSLDQQDEVGVHVMDVSGTLKKISLDKD--------RHVLPS 113

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
           +                 D N+  +  G+E E        N+                  
Sbjct: 114 I-----------------DSNERPNYEGSEQELLDAIEAINQ------------------ 138

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSA 244
                            E CQ+ G+ +VN+V G+FH++     Y +  +H  D+  +   
Sbjct: 139 ----------------GEQCQLKGFFQVNKVPGNFHVSYHAHHYLLQRIHQRDLSVFRKM 182

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRK----------PLDGTVAKAEEGASM-FNYYIKIIP 293
             +  H I  L FG ++    + RK               V  A EG    + YYI  +P
Sbjct: 183 KLD--HSIYELRFG-EITTTSKMRKYSKSLQKFQNSWKQIVKSAPEGEKQDYEYYIDALP 239

Query: 294 T-IYERLDGS-----KLGGGDGGMP-------GIFFSYELSPLMVKITEKSKSLGHLWTK 340
              Y+  + +     K    +  MP        I+F Y++SP+ +  + + KS+ H   +
Sbjct: 240 VRFYDENERNYQTLYKYSINEAQMPRTFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQ 299

Query: 341 IMCNISGTYITFMLVDALLHSCV 363
           ++  I G +    ++++++   +
Sbjct: 300 LLAIIGGVFAVIGILNSIVQKAI 322


>gi|348690307|gb|EGZ30121.1| COPII vesicle trafficking protein [Phytophthora sojae]
          Length = 306

 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 45/205 (21%), Positives = 80/205 (39%), Gaps = 49/205 (23%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+++G ++V +V+G    A   S ++          +    FN++H + HL FG ++ D
Sbjct: 108 GCRLFGTVQVQKVAGDLSFAHEGSLTV-------FSFFDFLNFNSSHVVNHLRFGPQIPD 160

Query: 264 D--------------------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                                      D     L   +A      + + Y++ ++P+ Y 
Sbjct: 161 METPLIDVSKILERNCTQESCWLARSWDSVAALLTSFIALLLFTVATYKYFVNVVPSRYV 220

Query: 298 RLDG----------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
            L+G                S+   G    PG+ FSYE SP+ V+  E   S+ H  T  
Sbjct: 221 YLNGRSVTTFQYSVTEHETSSRGPNGQVSFPGVIFSYEFSPIAVEYIESKPSVLHFLTST 280

Query: 342 MCNISGTYITFMLVDALLHSCVKKI 366
              + G +    ++D  ++S  KKI
Sbjct: 281 SAIVGGVFAVARMIDGAIYSVSKKI 305



 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 26/105 (24%), Positives = 53/105 (50%)

Query: 8   KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
           +  D   K  E   E+T+ GG VT++  + +++L+  +   ++ VS T  + VD+     
Sbjct: 4   RRFDLNVKGVEGIQERTIGGGVVTLLSCVVVAFLLLSEFSVWWTVSVTHRMHVDTDPDYP 63

Query: 68  LPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           + I +D+     +C  +ALD  DS G + + ++ +I +     +G
Sbjct: 64  INIEVDVSFLHEACKEVALDVSDSKGHKEILLKKDIQEEPFGENG 108


>gi|363748002|ref|XP_003644219.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356887851|gb|AET37402.1| hypothetical protein Ecym_1151 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 340

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 72/294 (24%), Positives = 116/294 (39%), Gaps = 59/294 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  +   +K+  GG  +IV +LF+ ++   +   YF     E+  VD    +
Sbjct: 4   LRTFDAFPKTEQQHVKKSSKGGLTSIVIYLFLLFIAWSEFGSYFGGYIDEQYIVDDEIRT 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
              I+++I V  + C YL + A D +G+        I   RL+      + P        
Sbjct: 64  TAQINMNIYV-KMPCKYLEVTARDQTGDLQ------IVSERLNFQDIHFRVPYG------ 110

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
              K+T  N   + +L+D         A+  +                    +PEL  I 
Sbjct: 111 --TKMTEFNDVISPDLDD--ILADAIPAQFTSD-------------------MPELPMI- 146

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                      +    +GC IYG + VN+VSG   I A G +Y           P+  + 
Sbjct: 147 -----------EGINFDGCSIYGSVPVNKVSGELQITAKGWTYMSTRR-----TPF--SV 188

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
            N +H I  LSFG      D     LDG    A+E    + Y+  ++PT Y+++
Sbjct: 189 LNFSHVINELSFGDFFPYIDNT---LDGVGRIADEPLKAYYYFTSVLPTAYKKM 239


>gi|162462518|ref|NP_001105762.1| protein disulfide isomerase12 [Zea mays]
 gi|59861281|gb|AAX09970.1| protein disulfide isomerase [Zea mays]
 gi|414590455|tpg|DAA41026.1| TPA: putative thioredoxin superfamily protein [Zea mays]
          Length = 483

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+I G++ V RV GS  I+   + S +H        +  +  N +H++   SFG +L  
Sbjct: 292 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 341

Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
                            +R      TV   E  A++   +Y++++ T             
Sbjct: 342 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 401

Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
              YE    S L      +P + F +E SP+ V +TE  KS  H  T +   I G +   
Sbjct: 402 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 460

Query: 353 MLVDALLHSCVKKISKVEIG 372
            ++D++ H+ ++ + K+E+G
Sbjct: 461 GILDSIFHNTLRMVKKIELG 480


>gi|195639434|gb|ACG39185.1| PDIL5-4 - Zea mays protein disulfide isomerase [Zea mays]
          Length = 485

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+I G++ V RV GS  I+   + S +H        +  +  N +H++   SFG +L  
Sbjct: 294 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 343

Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
                            +R      TV   E  A++   +Y++++ T             
Sbjct: 344 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 403

Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
              YE    S L      +P + F +E SP+ V +TE  KS  H  T +   I G +   
Sbjct: 404 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 462

Query: 353 MLVDALLHSCVKKISKVEIG 372
            ++D++ H+ ++ + K+E+G
Sbjct: 463 GILDSIFHNTLRMVKKIELG 482


>gi|325187435|emb|CCA21973.1| endoplasmic reticulumGolgi intermediate compartment protein
           putative [Albugo laibachii Nc14]
          Length = 283

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 45/178 (25%), Positives = 75/178 (42%), Gaps = 24/178 (13%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           EGC+  G L + ++ G      G S SI ++            FN++H I  L+FG+ + 
Sbjct: 115 EGCRYKGTLTIQKLQGDIFFCHGGSLSIFNL-------MEMFRFNSSHVITKLNFGLSIP 167

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGS--------------KLGGGD 308
              + + PL           + + Y+ K++P+ Y  LDG               K+ G  
Sbjct: 168 ---KMQTPLTDVHKTVLAQVATYKYFAKVVPSRYVYLDGKSTMTYQYSVTEHLLKMDGFV 224

Query: 309 GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
             +PG+  SY+ SP+ V   E   ++ H  T     + G      + DA L+S  KK+
Sbjct: 225 TNIPGVIISYDFSPIAVDYIETKPNIFHFITNTCAILGGVIAVARIFDAALYSMSKKL 282



 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/98 (24%), Positives = 53/98 (54%), Gaps = 3/98 (3%)

Query: 8   KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSK 67
           +  DA+ K  E   E+T+ GG +T++  +F+ +L   ++  ++ V+    + VD++    
Sbjct: 5   RRFDAYAKAVEGIQERTIGGGIITLLSCVFVCFLFISEISVWWTVNVVHRMHVDTAPQES 64

Query: 68  LPIHLDIVVPTI--SCDYLALDAVDSSGEQHLHVEHNI 103
            PI LD+ +  +  +C  + +D  DS G+  + + +N+
Sbjct: 65  -PITLDVDISMLHETCRDIKVDVSDSQGDGSILIANNL 101


>gi|357122608|ref|XP_003563007.1| PREDICTED: protein disulfide isomerase-like 5-4-like [Brachypodium
           distachyon]
          Length = 485

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 49/202 (24%), Positives = 88/202 (43%), Gaps = 40/202 (19%)

Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
            T GC++ G++ V +V GS  I+   + S +H        +  +  N +H++   SFG +
Sbjct: 291 MTSGCRVEGFVRVKKVPGSVIIS---ARSGSH-------SFDPSQINVSHYVTQFSFGNR 340

Query: 261 LQ----DDDERRKPLDG-----------TVAKAEEGASM-FNYYIKIIPTIYERLDGSKL 304
           L      + +R  P  G            V   +  A++   +Y++I+ T    L  SK 
Sbjct: 341 LSPNMFSELKRLIPYVGGHHDRLAGQSYIVKHGDNNANVTIEHYLQIVKTELVTLRSSKE 400

Query: 305 GG--------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                               +P + F +E SP+ V +TE  KS  H  T +   I G + 
Sbjct: 401 LKVFEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVFT 460

Query: 351 TFMLVDALLHSCVKKISKVEIG 372
              ++D++LH+ ++ + KVE+G
Sbjct: 461 VAGILDSILHNTLRLVKKVELG 482


>gi|388501278|gb|AFK38705.1| unknown [Medicago truncatula]
          Length = 148

 Score = 62.4 bits (150), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
           N +H I  LSFG K         PLD T     + +  F YYIKI+PT Y  +    L  
Sbjct: 10  NVSHVIHDLSFGPKYPGI---HNPLDETSRILHDASGTFKYYIKIVPTEYRYISKEVLPT 66

Query: 307 G---------------DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                           D   P ++F Y+LSP+ V I E+ +S  H  T++   + GT+  
Sbjct: 67  NQFSVTEYFSPITSQFDRTWPAVYFLYDLSPITVTIKEERRSFLHFITRLCAVLGGTFAV 126

Query: 352 FMLVDALLHSCVKKISK 368
             ++D  ++  V+  +K
Sbjct: 127 TGMLDRWMYRLVEAATK 143


>gi|414590456|tpg|DAA41027.1| TPA: putative thioredoxin superfamily protein [Zea mays]
          Length = 439

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115


>gi|414590454|tpg|DAA41025.1| TPA: putative thioredoxin superfamily protein [Zea mays]
          Length = 435

 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115


>gi|268577857|ref|XP_002643911.1| Hypothetical protein CBG02175 [Caenorhabditis briggsae]
          Length = 282

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 48/193 (24%), Positives = 81/193 (41%), Gaps = 38/193 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+     E+N+V G+FH++          H    QP         + +RH+   IK  D
Sbjct: 107 GCRFESRFEINKVPGNFHLS---------THSATTQP-------DGYDMRHIIHSIKFGD 150

Query: 264 DDERRK------PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG----------- 306
           D   +       PL    AK E G +   Y +KI+P+++E   G+ L             
Sbjct: 151 DVSHKNLKGSFDPLANREAK-ESGLNTHEYILKIVPSVHEDYSGNILNSYQYTYGHKSYV 209

Query: 307 ----GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSC 362
                   +P ++F YEL P+ +K TE  +S     T I   + GT+    ++D+   + 
Sbjct: 210 TYHHSGKIIPAVWFKYELQPITLKQTEHRQSFYIFLTSICAVVGGTFTVAGIIDSTFFTI 269

Query: 363 VKKISKVEIGGKT 375
            + + K ++G  T
Sbjct: 270 SEMVKKQQMGKLT 282



 Score = 55.8 bits (133), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 24/88 (27%), Positives = 47/88 (53%), Gaps = 1/88 (1%)

Query: 11 DAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS-SRGSKLP 69
          D + K  +D  + T  G  ++I+C  FI+++I  D+  Y  +    E F+D   R  K+ 
Sbjct: 5  DIYRKVPKDLTQPTTAGAVISILCVAFITFMIFNDILAYIFIDLRSEFFIDDPGREGKID 64

Query: 70 IHLDIVVPTISCDYLALDAVDSSGEQHL 97
          + +++  P ++CDY+ +D  D +G   +
Sbjct: 65 VQVNVSFPHMACDYIGVDIQDENGRHEV 92


>gi|344301277|gb|EGW31589.1| hypothetical protein SPAPADRAFT_62204 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 353

 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 60/221 (27%), Positives = 87/221 (39%), Gaps = 36/221 (16%)

Query: 180 PELDTIVQ--CKNEYSTEKLK-NTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHV 235
           PELD I+Q   + E+  +  + N     C I+G + +N+V G F I A G  Y       
Sbjct: 127 PELDEIMQESLRAEFRVQGQRVNENAPACHIFGSIPINQVKGDFRITAKGYGY------- 179

Query: 236 HDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTI 295
            D+        N +H I+  S+G   +       PLD T    EE    + Y  K++PT 
Sbjct: 180 RDVIAAPIDKLNFSHVIQEFSYG---EFYPFINNPLDATGKVTEEKFQKYMYSAKVVPTS 236

Query: 296 YER------------------LDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           YE+                  L  +   G   G+PGI+  Y+  P+ + I EK       
Sbjct: 237 YEKLGLIVETNQYSVTENHQVLQKNSQTGVPIGVPGIYIKYDFEPIKMVIKEKRMPFMQF 296

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIGGKTVTK 378
             K+     G  IT     + L    +KI  V  G K V K
Sbjct: 297 VAKLATIAGGILIT----ASYLFRLYEKILGVVFGKKYVEK 333


>gi|388517493|gb|AFK46808.1| unknown [Lotus japonicus]
          Length = 156

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/153 (30%), Positives = 67/153 (43%), Gaps = 26/153 (16%)

Query: 246 FNTTHHIRHLSFGIKLQ----DDDERRKPLDGTVAKAEEGASMFN-----------YYIK 290
            N +H + HL+FG K+      D +R  P  G+      G S  N           +YI+
Sbjct: 1   MNMSHVVNHLTFGKKVTPRAISDMQRLIPHIGSSHDRLNGRSFVNTHNLEANVTIEHYIQ 60

Query: 291 IIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           I+ T     +G KL                 +P   F  ELSP+ V ITE  KS  H  T
Sbjct: 61  IVKTEVVTRNGYKLIEDYEYTAHSSVAHSLDIPVAKFHLELSPMQVLITENQKSFSHFIT 120

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            +   I G +    +VD++LH+ ++ I KVE+G
Sbjct: 121 NVCAIIGGVFTVAGIVDSILHNTIRMIKKVELG 153


>gi|297847442|ref|XP_002891602.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297337444|gb|EFH67861.1| hypothetical protein ARALYDRAFT_474215 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 484

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           +  N  ST K K   + GC+I GY+   +V G   I+        H   H    + ++  
Sbjct: 278 KSDNAASTIK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 326

Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
           N +H + HLSFG     +L  D +R  P  G       G S  N           +Y++I
Sbjct: 327 NMSHIVTHLSFGTMVSERLWTDMKRLLPYLGQSHDRLNGKSFINQRKFDVNVTIEHYLQI 386

Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           + T +  R  G +                     P   F +ELSP+ V I+E  KS  H 
Sbjct: 387 VKTEVISRRSGKEHSLIEEYEYTAHSSVAHSYHYPEAKFHFELSPMQVLISENPKSFSHF 446

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            T +   I G +    ++D++  + V+ + K+E+G
Sbjct: 447 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 481


>gi|224030141|gb|ACN34146.1| unknown [Zea mays]
          Length = 483

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 61/115 (53%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++IV  L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEVSLSGAGLSIVAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNMSFPALSCEFASVDVSDVLGTNRLNITKTVRKYSIDRNLVP 115



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 85/200 (42%), Gaps = 42/200 (21%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+I G++ V RV GS  I+   + S +H        +  +  N +H++   SFG +L  
Sbjct: 292 GCRIEGFVRVKRVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKRLSP 341

Query: 264 D---------------DERRKPLDGTVAKAEEGASM-FNYYIKIIPTI------------ 295
                            +R      TV   E  A++   +Y++++ T             
Sbjct: 342 RMLHEFIRLTPYLRGYHDRLAGQSYTVKHGEVNANVTIEHYLQVVKTELVTQRSSKELKV 401

Query: 296 ---YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
              YE    S L      +P + F +E SP+ V +TE  KS  H  T +   I G +   
Sbjct: 402 LEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTEVPKSFSHFITNVCAIIGGVFTVA 460

Query: 353 MLVDALLHSCVKKISKVEIG 372
            ++D++ H+ ++ + K+E+G
Sbjct: 461 GILDSIFHNTLRMVKKIELG 480


>gi|440293957|gb|ELP87004.1| hypothetical protein EIN_318630 [Entamoeba invadens IP1]
          Length = 316

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 77/189 (40%), Gaps = 34/189 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPG-LSY---SINHV----------HVHDIQPYTSAAFNTT 249
           GC+++G ++V+RVSG FH+A G ++Y     N V          H H        +FN T
Sbjct: 117 GCRMHGTMKVSRVSGEFHVAFGKIAYRQQRTNQVITATQKHTQMHTHQFTMQEMKSFNPT 176

Query: 250 HHIRHLSF-GIKLQDDDERRKPLDGT--VAKAEEGASMFNYYIKIIPT------------ 294
           H I +L+F             PL+G     K  + A  + YYI +IPT            
Sbjct: 177 HFINNLAFSNTPSYTTHAGETPLNGKEYTLKGYDNAR-YTYYINVIPTLNKYPTHTTRSY 235

Query: 295 ---IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
              I ER       G     PG+FF YELSP +V       S  H        I G +I 
Sbjct: 236 QLSINERFVPVTY-GPTFTQPGVFFKYELSPYIVINEMMDHSFAHSIASTAAIIGGVWII 294

Query: 352 FMLVDALLH 360
           F  +   L+
Sbjct: 295 FGWISRFLN 303


>gi|443920575|gb|ELU40475.1| endoplasmic reticulum-derived transport vesicle ERV46 [Rhizoctonia
           solani AG-1 IA]
          Length = 506

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 65/298 (21%), Positives = 119/298 (39%), Gaps = 63/298 (21%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           ++  DAF K   ++  +T  GG +T++  +    L+  D+ DY       E  VD++  +
Sbjct: 15  VRQFDAFPKVRPNYKARTTGGGLMTVLVAVISFILVLNDLGDYLWGWREYEFTVDNNLAT 74

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQ-HLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            + +++D+VV  + C +L++D  D++G++  L  EH  ++R                   
Sbjct: 75  VMYVNVDLVV-NMPCHFLSVDLRDAAGDRLFLTDEHGGFRR------------------- 114

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                    +G T           S Y       K   +  EV  A +  +  L    + 
Sbjct: 115 ---------DGAT-----------SAYALNFRDSKVSVSPQEVVSASKRSQRGL--FSSF 152

Query: 186 VQCKNEYSTEKLKNTF-----TEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQ 239
            + K+       + T+        C+++G + V +V+ + HI   G  Y       H + 
Sbjct: 153 KKPKD----PTFRPTYNHIPDASACRVFGTVAVKKVTANLHITTLGHGYRSAEHTDHTL- 207

Query: 240 PYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
                  N TH I   SFG  + D  +   PLD +     E  + F Y+I ++PT Y+
Sbjct: 208 ------MNLTHVINEFSFGPFIPDLSQ---PLDYSFEVTHEHFTAFQYFITVVPTTYQ 256


>gi|403413226|emb|CCL99926.1| predicted protein [Fibroporia radiculosa]
          Length = 546

 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 77/360 (21%), Positives = 143/360 (39%), Gaps = 70/360 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L   DAF K    +  ++   G +TI   L    LI  D+ +Y    +  E  VDS   +
Sbjct: 27  LAQFDAFPKLPSTYKARSESRGFLTIFVALVAFLLILNDLGEYLWGWSDHEFSVDSDTTN 86

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            L +++D++V  + C YL++D  D+ G++ L +     +  +  D          V +A 
Sbjct: 87  GLNLNVDLMV-NMPCQYLSVDLRDAVGDR-LFLSRGFRRDGIKFD----------VGHA- 133

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
                        T L++     S   A  ++RK     + +     ++K        + 
Sbjct: 134 -------------TALKEHAAALSAQQAIAQSRKSRGFFSTL-----FRK-------DVA 168

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV---HDIQPYTS 243
           Q +  ++ +K  +     C+IYG +   + + + HI      +I H +    H    Y  
Sbjct: 169 QYRPTHNYQKDGSA----CRIYGTITAKKATANLHIT-----TIGHGYASRDHVDHKY-- 217

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
              N +H I   SFG       E  +PLD +   A +    + YY+ ++PT Y     + 
Sbjct: 218 --MNLSHVINEFSFGPFFP---EIVQPLDNSFELALDPFVAYQYYLHVVPTTYIAPRSTP 272

Query: 304 LG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
           L                  G PGIFF ++L P+ + I +++ +L     + +  + G ++
Sbjct: 273 LHTHQYSVTHYTRTMSTHQGTPGIFFKFDLEPMHLTIHQRTTTLAQFLIRCVGVVGGIFV 332


>gi|47219772|emb|CAG03399.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 378

 Score = 61.6 bits (148), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 57/220 (25%), Positives = 83/220 (37%), Gaps = 79/220 (35%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--------------------------LSYSINHV--H 234
             C+I+G+L VN+V+G+FHI  G                          LS SI H   H
Sbjct: 130 RACRIHGHLYVNKVAGNFHITVGKYVTSLLGYSVVSLHSIPIGVTLFLLLSRSIPHPRGH 189

Query: 235 VHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGAS----------- 283
            H     +  ++N +H I HLSFG   +D      PLDGT   + +  +           
Sbjct: 190 AHLAALVSHDSYNFSHRIDHLSFG---EDLPGIISPLDGTEKVSADCTAVLSLTPLHRCD 246

Query: 284 ---------------------MFNYYIKIIPT---------------IYERLDGSKLGGG 307
                                +F Y+I I+PT               + E+        G
Sbjct: 247 FFLPRLFFKMCDFRFSLLANHIFQYFITIVPTKLNTYKVSAETHQYSVTEQDRAINHAAG 306

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             G+ GIF  Y++S LMVK+TE+   L     + +C I G
Sbjct: 307 SHGVSGIFMKYDISSLMVKVTEQHMPLWQFLVR-LCGIVG 345



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 57/117 (48%), Gaps = 8/117 (6%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E T  GG V+++ +  ++ L  ++   Y       E  VD   GS
Sbjct: 13  VKELDAFPKVPESYVESTASGGTVSLIAFSLMAILAFLEFFVYRDTWMKYEYEVDKDFGS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKR--RLDLDGKPIQEPQKE 121
           KL I++DI V     D + +  +    ++ L VEH++     +  + G P  +PQ +
Sbjct: 73  KLRINVDITV----ADEMPMTLLHI--QERLKVEHSLQDLIFKTAMKGAPPPQPQTD 123


>gi|42562656|ref|NP_175508.2| protein Disulfide Isomerase (PDIa) family, redox active TRX
           domain-containing protein [Arabidopsis thaliana]
 gi|332194483|gb|AEE32604.1| protein Disulfide Isomerase (PDIa) family, redox active TRX
           domain-containing protein [Arabidopsis thaliana]
          Length = 484

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           +  N  ST K K   + GC+I GY+   +V G   I+        H   H    + ++  
Sbjct: 278 KSDNAASTFK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 326

Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
           N +H + HL+FG     +L  D +R  P  G       G S  N           +Y++I
Sbjct: 327 NMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQI 386

Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           I T +  R  G +                     P   F +ELSP+ V I+E  KS  H 
Sbjct: 387 IKTEVISRRSGQEHSLIEEYEYTAHSSVARSYHYPEAKFHFELSPMQVLISENPKSFSHF 446

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            T +   I G +    ++D++  + V+ + K+E+G
Sbjct: 447 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 481


>gi|302808800|ref|XP_002986094.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
 gi|300146242|gb|EFJ12913.1| hypothetical protein SELMODRAFT_123299 [Selaginella moellendorffii]
          Length = 475

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 91/199 (45%), Gaps = 40/199 (20%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL-- 261
           GC+I G++   +V G+  I+   ++S +H        + ++A N TH++   SFG +L  
Sbjct: 284 GCRIEGFIRAKKVPGNIIIS---AHSGSH-------SFDASAMNMTHYVSQFSFGRELNF 333

Query: 262 --QDDDERRKP------------LDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSK 303
             + +  R  P            L G +  ++      ++Y++++ T    + +R + S 
Sbjct: 334 WMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYLQVVKTEVVSLQKRKEFSL 393

Query: 304 LGGGD----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
           L   D            +P   F YELSP+ V + E  KS  H  T +   I G +    
Sbjct: 394 LEQYDYTSHSNTVQNTNVPVAKFHYELSPMQVLVKENPKSFSHFITNVCAIIGGVFTVAG 453

Query: 354 LVDALLHSCVKKISKVEIG 372
           +VD++LH  ++ + K+E+G
Sbjct: 454 IVDSMLHGAMRMVKKIELG 472



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 60/114 (52%), Gaps = 1/114 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M  + ++K +D + K   D  E ++ G  ++++    + +L  +++ +Y  VS+T  + V
Sbjct: 1   MTTTSKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVV 60

Query: 61  DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
           D S+ G  L I  ++  P +SC++ ++D  D+ G    ++   + K  +D + K
Sbjct: 61  DRSKDGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLK 114


>gi|255714272|ref|XP_002553418.1| KLTH0D16324p [Lachancea thermotolerans]
 gi|238934798|emb|CAR22980.1| KLTH0D16324p [Lachancea thermotolerans CBS 6340]
          Length = 340

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 71/366 (19%), Positives = 133/366 (36%), Gaps = 74/366 (20%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+  DAF K  E    K+  GG  +I+ ++F+ ++   +   +F     E+  V      
Sbjct: 4   LRTFDAFPKTEEQHVRKSSKGGYTSILTYVFLIFIAWSEFGSFFGGYVDEQYGVSKDLRE 63

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            + I++D+ V  + C +L +   D +G++ L       +  L ++  P   P    VN  
Sbjct: 64  AVQINMDMFV-HMPCQWLDVIVQDHTGDRKL------VREELKMESIPFFLPFGTAVNER 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
            +      +      +    +    +G+E E+++                          
Sbjct: 117 NEIASLGLDEVLAEAIPGQFRDQIDFGSEDESKEF------------------------- 151

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                            GC ++G + VN V G   I P          V D       A 
Sbjct: 152 ----------------NGCHVFGTITVNMVKGDLIIIP------RSQSVRDFGRMPPDAI 189

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
           N +H I   SFG      D    PLD +     E  + F+Y+  ++PTI+++L G+++  
Sbjct: 190 NLSHVINEFSFGDFYPYID---NPLDRSARITAEHTTSFHYHTSVVPTIFQKL-GAEVNT 245

Query: 307 GDGGM--------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
               +              P I FSY    L + I ++  S      +++  +S  +I +
Sbjct: 246 NQYSLSETKHETPPSGLRVPAIIFSYSFEALTITIRDERISFWQFIVRLVAILS--FIVY 303

Query: 353 MLVDAL 358
           ++  A 
Sbjct: 304 IMTWAF 309


>gi|12321801|gb|AAG50943.1|AC079284_18 hypothetical protein [Arabidopsis thaliana]
          Length = 451

 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 88/215 (40%), Gaps = 40/215 (18%)

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
           +  N  ST K K   + GC+I GY+   +V G   I+        H   H    + ++  
Sbjct: 245 KSDNAASTFK-KAPVSGGCRIEGYVRAKKVPGELVISA-------HSGAHS---FDASQM 293

Query: 247 NTTHHIRHLSFGI----KLQDDDERRKPLDGTVAKAEEGASMFN-----------YYIKI 291
           N +H + HL+FG     +L  D +R  P  G       G S  N           +Y++I
Sbjct: 294 NMSHIVTHLTFGTMVSERLWTDMKRLLPYLGQSYDRLNGKSFINERQLDANVTIEHYLQI 353

Query: 292 IPT-IYERLDGSKLG-------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
           I T +  R  G +                     P   F +ELSP+ V I+E  KS  H 
Sbjct: 354 IKTEVISRRSGQEHSLIEEYEYTAHSSVARSYHYPEAKFHFELSPMQVLISENPKSFSHF 413

Query: 338 WTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            T +   I G +    ++D++  + V+ + K+E+G
Sbjct: 414 ITNVCAIIGGVFTVAGILDSIFQNTVRMVKKIELG 448


>gi|422295540|gb|EKU22839.1| hypothetical protein NGA_0271420 [Nannochloropsis gaditana CCMP526]
          Length = 405

 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 53/192 (27%), Positives = 82/192 (42%), Gaps = 50/192 (26%)

Query: 195 EKLKNT-FTE----GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTT 249
           EK++ T F E    GC + G+L VNRV G+FHI     Y       H++ P  +   N +
Sbjct: 217 EKIERTLFAEAEHPGCLLSGFLLVNRVPGNFHIEARSKY-------HNLNPTLT---NVS 266

Query: 250 HHIRHLSFGIKLQDD------------DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
           H +  L+FG  +  +             + R PL   V    +    F++Y+K++ T YE
Sbjct: 267 HVVHDLTFGPPVTREYREKLALLPKGFQQTRSPLADQVYVVSKVHHAFHHYLKVVSTHYE 326

Query: 298 RLDGSKLGGG--------------------DGGMPGIFFSYELSPLMVKITEKSKSLGHL 337
               S+  GG                    D  +P   FSY++SPL   I+ K ++    
Sbjct: 327 ---VSRTFGGQKSTVLQYQMVANSQVMHYQDDEVPEAKFSYDISPLATVISSKKRAWYEF 383

Query: 338 WTKIMCNISGTY 349
            T +M  I GT+
Sbjct: 384 LTSLMAIIGGTF 395


>gi|391338468|ref|XP_003743580.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Metaseiulus occidentalis]
          Length = 292

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 37/121 (30%), Positives = 63/121 (52%), Gaps = 2/121 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           L+ LD + K   D  + T +G A+++ C +FI+ L+  +  ++F      +L+VD+   S
Sbjct: 5   LRRLDVYRKVPADLTQPTYFGAAISVGCIIFITTLLIYETYNFFSPELVSDLYVDNPAPS 64

Query: 67  -KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            K+ + L+I +P +SCD + LD  D +G   +    N  K  L+ DGK      K  +N 
Sbjct: 65  EKIIVFLNISLPKLSCDVVGLDIQDENGRHEVGHIDNTEKTVLN-DGKGCNFVSKFTINK 123

Query: 126 V 126
           V
Sbjct: 124 V 124



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 46/201 (22%), Positives = 82/201 (40%), Gaps = 33/201 (16%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           +TEK      +GC       +N+V G+FH++          H    QP      + +H I
Sbjct: 101 NTEKTVLNDGKGCNFVSKFTINKVPGNFHVS---------THAAKTQP---DDIDMSHEI 148

Query: 253 RHLSFGIKL-----QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG- 306
             L+FG +L      D       L        +G    +Y +KI+PT+YE   G  L G 
Sbjct: 149 HSLTFGEQLIYELGDDIKGSFNALQNHDRLKADGKESHDYVMKIVPTVYELSSGDSLVGY 208

Query: 307 ---------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYIT 351
                              +P I+F Y+L+P+ V+   +++ L    T +   + GT+  
Sbjct: 209 QYTHAHKSYITLSFSAGRIIPAIWFKYDLNPITVRYHRRTQPLYSFLTNVCAIVGGTFTV 268

Query: 352 FMLVDALLHSCVKKISKVEIG 372
             +++++  +  +   K E+G
Sbjct: 269 VGIINSICFTAGEVFRKFEMG 289


>gi|299469370|emb|CBG91903.1| putative PDI-like protein [Triticum aestivum]
 gi|299469398|emb|CBG91917.1| putative PDI-like protein [Triticum aestivum]
          Length = 485

 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/115 (28%), Positives = 60/115 (52%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++I   L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L I  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRIDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 87/203 (42%), Gaps = 42/203 (20%)

Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
            T GC+I G++ V +V GS  I+   + S +H        +  +  N +H++   SFG +
Sbjct: 291 MTGGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTTFSFGKR 340

Query: 261 LQ----DDDERRKPLDGTVAKAEEGAS------------MFNYYIKIIPTI--------- 295
           L     ++ +R  P  G       G S               +Y++I+ T          
Sbjct: 341 LSSKMFNELKRLFPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLRYAKE 400

Query: 296 ------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                 YE    S L      +P + F +E SP+ V +TE  KS  H  T +   I G +
Sbjct: 401 LKVLEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459

Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
               ++D++LH+ ++ + KVE+G
Sbjct: 460 TVAGILDSILHNTLRLVKKVELG 482


>gi|145536478|ref|XP_001453961.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124421705|emb|CAK86564.1| unnamed protein product [Paramecium tetraurelia]
          Length = 592

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 45/179 (25%), Positives = 79/179 (44%), Gaps = 27/179 (15%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           ++ F K  ++ + +  +GG + ++  + I   I  ++ +  Q   T +L VD +  S++ 
Sbjct: 2   INLFPKIQDNQYNRQSWGGLLFLITIICIVVFIWAEITNALQ--GTIQLQVDPAIDSRIR 59

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           ++LD V+    C  L L+  D  G     V+H I K R+  D         E V+  +  
Sbjct: 60  VNLDAVIQA-PCQALTLNIQDMMGSYLQDVQHTIIKTRIVDDNL-------EYVDVKQNV 111

Query: 130 KVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
             T                 SCYGAE    + C +C +V  A+  ++W  P  ++IVQC
Sbjct: 112 NFT-----------------SCYGAELLIDQKCYSCQDVMMAFAQRRWRQPNFESIVQC 153


>gi|123407515|ref|XP_001303026.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121884369|gb|EAX90096.1| hypothetical protein TVAG_396530 [Trichomonas vaginalis G3]
          Length = 234

 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 57/236 (24%), Positives = 102/236 (43%), Gaps = 29/236 (12%)

Query: 147 KCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYS-TEKLKNTFTEGC 205
           +CGSCYGA   +  CCN+C EV +A++  + + P    I QC+N +S  + L N   + C
Sbjct: 14  ECGSCYGA---SNGCCNSCKEVLDAFQKIEKSHPPTAMIQQCRNTFSDADSLIN---DSC 67

Query: 206 QIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG------- 258
            +   L V    GSF I  G + +       D         N TH     S G       
Sbjct: 68  TLGITLTVPHTHGSFFITIGQNTTNTSA---DYLGVPKENLNFTHSFDFFSMGGGYHPAQ 124

Query: 259 -----IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDGGMPG 313
                +K+Q +  R K +     +A    + ++    +  T Y+R            +PG
Sbjct: 125 ILQNYMKVQKEYGRYKAM--YYIRATRILNDYDTQYSLSVTSYDRYRDES----SDKLPG 178

Query: 314 IFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
           +F +Y++SPL+++     + +  +   +M  I G +   +L+D +  +   + S++
Sbjct: 179 VFINYDISPLILQYV-LDRPIYQIIIDMMAIIGGIFAFGLLIDNIYLASTLQSSQI 233


>gi|393908150|gb|EJD74929.1| hypothetical protein, variant [Loa loa]
          Length = 368

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 153/363 (42%), Gaps = 40/363 (11%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
           E ++  DAF K  ++   EK   GG +  + +L I+ L+  ++ +YF           VD
Sbjct: 22  EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELQNYFYGDEGHYYRFSVD 81

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++      + +D++V T   + +A     +S   H     N +K     D    +  +KE
Sbjct: 82  TAFSEHPELEVDMIVATPCTNLMAHLTGTAS---HEFNSMNGFK----YDPTRFEFTEKE 134

Query: 122 VV--NAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY----RYK 175
            +  N +KK +  T+ GTT    +  ++     G   E  K      + +EA+    + K
Sbjct: 135 AMYWNELKKVQHRTKEGTTL--FKSLDEMTFVSGRVEEGLKTEAETKQREEAHAIQLQRK 192

Query: 176 KWALPELD--TIVQCKNEYSTEKLKNTFTE-----GCQIYGYLEVNRVSG-SFHIAPGLS 227
           K     LD  T++   N ++   +  + +E      C+I+G + VN+V G SF I+ G  
Sbjct: 193 KNPKQSLDGGTLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFIISTGKG 252

Query: 228 YSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNY 287
             ++ +  H      S+  N +H I   +FG ++        PL G    +E G   F Y
Sbjct: 253 LDVDGIFAH--FGGVSSPSNISHRIERFNFGPRIYG---LVTPLAGIEQISETGVDEFRY 307

Query: 288 YIKIIPTIYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
           ++KI+PT   R+  S L GG         +Y+ S   +K T K     H    I    + 
Sbjct: 308 FLKIVPT---RIYHSGLFGGST------LTYQYSVTFMKKTPKKDVHKHTAIIIHYEFAA 358

Query: 348 TYI 350
           T I
Sbjct: 359 TVI 361


>gi|255074657|ref|XP_002501003.1| predicted protein [Micromonas sp. RCC299]
 gi|226516266|gb|ACO62261.1| predicted protein [Micromonas sp. RCC299]
          Length = 515

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 86/210 (40%), Gaps = 54/210 (25%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           T GC I G   VNRV G+F++ P       H   H++ P      N TH ++HLSFG  +
Sbjct: 311 TSGCIIDGSFRVNRVPGAFYVTP-------HSMGHNLNP---DVINMTHTVKHLSFGKHV 360

Query: 262 QD------DDERR------KPLDGTVAK-------AEEGASMFNYYIKIIPTIYERLDGS 302
                    + RR      K L G  A        +EE  ++  +Y+KI+   +E L+G 
Sbjct: 361 PGRPSYVPRNLRRVWNRVPKDLGGRFAAGDEATFYSEEPNTVHEHYLKIVSRTFEPLEGQ 420

Query: 303 KLG-----------------GGDGGM------PGIFFSYELSPLMVKITEKSKSLGHLWT 339
            +                    DG        P I FSY++SP+ V + E  K L   W 
Sbjct: 421 AVQLYEYTFNSNRFRLNPPLAADGDPDQHVDGPMIKFSYDVSPMSVVLKEVKKPLLD-WI 479

Query: 340 KIMCN-ISGTYITFMLVDALLHSCVKKISK 368
             MC  + G Y    L++  L S V  + +
Sbjct: 480 LGMCALLGGVYTCAGLLETFLQSSVCAVKR 509


>gi|323449499|gb|EGB05387.1| hypothetical protein AURANDRAFT_31008 [Aureococcus anophagefferens]
          Length = 445

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 46/183 (25%), Positives = 78/183 (42%), Gaps = 35/183 (19%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           NT   GC + G+L VNRV G+FH+   +++S +H          +   N +H + HLSFG
Sbjct: 266 NTDHPGCLVSGFLLVNRVPGNFHV---MAHSRHH-------SLNTLRTNLSHTVHHLSFG 315

Query: 259 IKLQDDDERR-----------KPLDGTVAKAEEGASMFNYYIKIIPTIY-------ERLD 300
           + L D   R+             LDG     ++    + +++ I+PT Y       +R  
Sbjct: 316 VPLTDAQHRKLATIDVRHARTDTLDGEDYYHDDYHYAYQHFVHIVPTKYNLGVFWRDRFA 375

Query: 301 GSK-------LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
             +       L   +   P   FSY++SP+ V +           T ++  + GT+  F 
Sbjct: 376 AFQTLHSHHLLKYAEHVPPEARFSYDISPMAVVVDTVRVKWYDFLTSLLAIVGGTFALFK 435

Query: 354 LVD 356
           L +
Sbjct: 436 LAN 438



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/104 (24%), Positives = 53/104 (50%)

Query: 9   GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKL 68
            +D + K  ++  E +  GG +++     ++  +  ++  + +     ++ VD+  GS+L
Sbjct: 1   AMDFYRKVPDELKEASRTGGLLSLCACGVVALTLVTEIGAFLRTEVRTKIDVDTFAGSQL 60

Query: 69  PIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
            ++ ++  P + CDY ++D  D  G    +V  NI K +LD DG
Sbjct: 61  RVNFNLSFPHLHCDYASVDLWDKIGRNQANVTQNIEKWQLDEDG 104


>gi|223995687|ref|XP_002287517.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976633|gb|EED94960.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 457

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 89/206 (43%), Gaps = 48/206 (23%)

Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNT 248
           ++EYS   LKN    GCQI G+L V+R  G+FHI             HD+  + +   N 
Sbjct: 266 ESEYSV--LKNH--PGCQISGFLLVDRAPGNFHIQA-------QSKGHDLAAHMT---NV 311

Query: 249 THHIRHLSFGIK-----LQDDD--------ERRKPLDGTVAKAEEGASMFNYYIKIIPT- 294
           +H I HLSFG       L+D          E  KP DG V   +      ++Y+K+I T 
Sbjct: 312 SHIINHLSFGKPFSKYFLKDGLKNTPPGFLETTKPFDGNVYITQNEHEAHHHYLKVITTE 371

Query: 295 -------------------IYERLDGSKLGGGDGGM-PGIFFSYELSPLMVKITEKSKSL 334
                               Y+ L  S+L      + P   F+Y+LSP+ V   +K +  
Sbjct: 372 FEPEKGAQNSKYNKKEPSRAYQILQSSQLSLYRSDIVPEAKFTYDLSPIAVSYNKKYRHW 431

Query: 335 GHLWTKIMCNISGTYITFMLVDALLH 360
              +T +M  I GT+    ++++ +H
Sbjct: 432 YDYFTSLMAIIGGTFTVVGMLESGIH 457



 Score = 40.0 bits (92), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 42/95 (44%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +  LD + K   D  E T  G  ++ +    ++ L  ++   YF  +    L +DS+   
Sbjct: 1   IANLDMYRKVPVDLLEGTRRGSILSTIAIFTMTTLFFLETKAYFSSTLATSLALDSNSDP 60

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEH 101
            + ++ +I +  + CDY  +D V   G Q    +H
Sbjct: 61  NIRVNFNITMMDLKCDYATIDVVSVLGTQQNVTQH 95


>gi|299115405|emb|CBN74236.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 447

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 48/192 (25%), Positives = 86/192 (44%), Gaps = 33/192 (17%)

Query: 196 KLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHL 255
           +LK  +  GCQ+ G++ VNRV G+FHI    +       +H I P    A N +H ++ L
Sbjct: 264 RLKQDY-PGCQLSGFIMVNRVPGNFHIEARSA-------LHSIDP---TAANISHVVKTL 312

Query: 256 SFGIKLQDDDER----------RKPLDGTVAKAEEGASMFNYYIKIIPTI---------- 295
            FG ++     R             L+  V   +   +  ++YIK++ T           
Sbjct: 313 KFGTQVPVRGRRVIESGVELEGLPALEDRVYSIDSLHTAPHHYIKVVSTFVGGLAKTDNL 372

Query: 296 -YERLDGSK-LGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
            Y+ +  S+ +      +P   FSY+LSP+ V I ++ +      T ++  + GT+    
Sbjct: 373 QYQMMVSSQTMPYEQDQVPEAKFSYDLSPMSVHIKQRRRKWYDFLTSVLAIVGGTFTVVG 432

Query: 354 LVDALLHSCVKK 365
           ++D +L   VK+
Sbjct: 433 VLDNILFRVVKQ 444



 Score = 45.4 bits (106), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 56/109 (51%), Gaps = 6/109 (5%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE---ELFVDSS 63
           +K  D + K   D  E T+ G AV   C LF   ++ + +C+     T E    + +DS+
Sbjct: 4   IKTFDFYRKIPLDLTETTLQG-AVMSGCALFC--MLILFLCELRAFLTPEVYTTVAIDSN 60

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           + SKL I+ +I +  + CDY ++D +D  G   +++  NI K   D +G
Sbjct: 61  QDSKLRINFNITMLALPCDYASVDVLDLLGTNKVNMTQNIVKWHTDENG 109


>gi|326503558|dbj|BAJ86285.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/115 (27%), Positives = 60/115 (52%), Gaps = 1/115 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M+ S +LK +D + K   D  E ++ G  ++I   L + +L  +++  Y  V+TT  + V
Sbjct: 1   MISSSKLKSVDFYRKIPRDLTEASLSGAGLSIFAALAMVFLFGMELSSYLAVNTTTSVIV 60

Query: 61  D-SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           D SS G  L +  ++  P +SC++ ++D  D  G   L++   + K  +D +  P
Sbjct: 61  DRSSDGEFLRMDFNLSFPALSCEFASVDVSDVLGTNRLNITKTVRKFSIDRNLVP 115



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 90/203 (44%), Gaps = 42/203 (20%)

Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
            T GC+I G++ V +V GS  I+   + S +H        +  +  N +H++   SFG +
Sbjct: 291 MTGGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTTFSFGKR 340

Query: 261 LQ----DDDERRKPLDG-----------TVAKAEEGASM-FNYYIKIIPTI--------- 295
           L     ++ +R  P  G            V   +  A++   +Y++I+ T          
Sbjct: 341 LSSKMFNELKRLFPYVGGHHDRLAGQSYVVKHGDVNANVTIEHYLQIVKTELVTLRYSKE 400

Query: 296 ------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                 YE    S L      +P + F +E SP+ V +TE  KS  H  T +   I G +
Sbjct: 401 LKVLEEYEYTAHSSLVHS-FYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVF 459

Query: 350 ITFMLVDALLHSCVKKISKVEIG 372
               ++D++LH+ ++ + KVE+G
Sbjct: 460 TVAGILDSILHNTLRLVKKVELG 482


>gi|123454020|ref|XP_001314836.1| hypothetical protein [Trichomonas vaginalis G3]
 gi|121897494|gb|EAY02613.1| hypothetical protein TVAG_260730 [Trichomonas vaginalis G3]
          Length = 356

 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 86/377 (22%), Positives = 144/377 (38%), Gaps = 54/377 (14%)

Query: 7   LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS--- 62
           +K  D F K   ++   KT+ GG +TI+ ++ I + +   + D       E L  ++   
Sbjct: 1   MKNFDLFPKVKNEYQGVKTISGGIITILTFILIQFSLIFFIKDALNYKIQESLHQNNTIL 60

Query: 63  SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEV 122
           S  ++L +  +I V    C++L +   D SG         + K+ LD D  P  + Q   
Sbjct: 61  SGDTELWLSFNITVDA-PCNFLQVYITDESGHHRKQSIRALMKQNLDKDYCPYGDFQ--- 116

Query: 123 VNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPEL 182
                           T  + D  +CG CYG + +  +CC TC +V   +     A P L
Sbjct: 117 --------------LFTKNISDNGECGYCYGHKYQ--ECCYTCLDVVYGHIATYRAPPSL 160

Query: 183 DTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP 240
           + I QCK +       N +  G  C + G        G   I+      +    + D   
Sbjct: 161 EGISQCKRDL------NFYNNGSKCLLMGSTRTPYAYGQLIISMNSQNQVPKKTLID-NT 213

Query: 241 YTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPT-IY-- 296
             +   N +H I H  FG   ++    + PLD  +  + +     + Y + +I T IY  
Sbjct: 214 LVTKYLNLSHTIGHFFFG---KESKFIKNPLDSYIQIQNDTKYHQYIYRLSLIQTSIYYP 270

Query: 297 ----------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                            L       PGI F + + P+  KIT     L  L   +   I 
Sbjct: 271 DQIFATTQYSAHFSDKILEKKSEERPGIIFKFSIYPINSKITVTKTKLHFLLLSVCSIIG 330

Query: 347 GTYITFMLVDALLHSCV 363
           G +    ++ +L+HSC+
Sbjct: 331 GGF----MISSLIHSCL 343


>gi|118357982|ref|XP_001012239.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila]
 gi|89294006|gb|EAR91994.1| hypothetical protein TTHERM_00103880 [Tetrahymena thermophila
           SB210]
          Length = 323

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 80/387 (20%), Positives = 148/387 (38%), Gaps = 91/387 (23%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           +  +  DAF K  +D    +  GG  +I+       L C +  ++ + +   +L V S  
Sbjct: 2   QSFRKFDAFQKVNQDIDSSSSVGGLFSIIALAIGFILFCHEFQEWNKYTIVRKLEVQSLN 61

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
            + +  ++D+    + C  ++LD +   G+Q L  +++    R+ LD             
Sbjct: 62  QAIIKANIDLTFFNVPCSLISLDVLYQDGQQVLQ-DYSSTLTRIKLDR------------ 108

Query: 125 AVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDT 184
             + K++ TE  TT  E+E  N                                      
Sbjct: 109 --QNKEIGTE--TTYVEVEQENS------------------------------------- 127

Query: 185 IVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSA 244
             Q K E   E++KN   E C+I+G L +N + GSF            + +  +      
Sbjct: 128 --QQKIEEVLEQIKN--KEQCRIHGQLLLNTIPGSFKFRI--------LQMKGLDEQLLK 175

Query: 245 AFNTTHHIRHLSFGIKLQDDD-ERRKPLDGTVAKAEEGASMFNY--------YIKIIPTI 295
             N  H I  LSFG  ++    E+   LD + ++A +  S +NY        YIKI+P  
Sbjct: 176 QLNINHKINKLSFGDTIKTKKIEKVLGLDKSDSEAFD-ESRYNYEYRCSYDNYIKILPLN 234

Query: 296 --------YERLDGSKLGGGDGGMPG-------IFFSYELSPLMVKITEKSKSLGHLWTK 340
                   Y R +  +       +P        + F+Y++SP+ +    K+KS      +
Sbjct: 235 AENIKELGYIRTNSFRFTMYQQVIPKEQTDIIEVSFNYQVSPINIVYQTKNKSFYSFVVQ 294

Query: 341 IMCNISGTYITFMLVDALLHSCVKKIS 367
           +   I G +  F +++ L+ + +  I+
Sbjct: 295 VCAIIGGIFCVFGVINTLVLNIISSIN 321


>gi|170588701|ref|XP_001899112.1| hypothetical protein [Brugia malayi]
 gi|158593325|gb|EDP31920.1| conserved hypothetical protein [Brugia malayi]
          Length = 430

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 89/397 (22%), Positives = 161/397 (40%), Gaps = 52/397 (13%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
           E ++  DAF K  ++   EK   GG +  + +L I+ L+  ++ +YF           VD
Sbjct: 23  EVVRDFDAFNKTVDEVSEEKRAAGGFLASLSFLIIAALVFGELRNYFYGDEGHYYRFSVD 82

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++      + LD++V T   + +A     +S   H     N +K     D    +  +KE
Sbjct: 83  TAFSEHPELELDMIVATPCTNLMAHLTGTTS---HEFSSMNEFKH----DPTRFEFTEKE 135

Query: 122 VV--NAVKKKKVTTENGTT-------TTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
            +  N +KK +  T+ GTT        T +    + G    AET+ R+  +     K+  
Sbjct: 136 AMYWNELKKVQHRTKEGTTLFKSLDEMTFISGQVEEGLKNEAETKQREEAHAIQLEKKKN 195

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSG-SFHIAPGLSYS 229
             +      L  I    N +      +   EG  C+I+G + VN+V G SF ++ G    
Sbjct: 196 PKESMDGGMLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLG 255

Query: 230 INHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
           ++ +  H      S   N +H I   +FG  +        PL G    +E G   F Y++
Sbjct: 256 VDGIFAH--FGGVSNPGNLSHRIERFNFGPTIYG---LVTPLAGIEQISETGIDEFRYFL 310

Query: 290 KIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKITEK 330
           K++PT   R+  S L GG                         I   YE +  ++++   
Sbjct: 311 KVVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHAAIVIHYEFAATVIEVRRI 367

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
             SL  +  ++   + G + T +L++++   CV+ ++
Sbjct: 368 QSSLLQMLIRLCSAVGGVFATSVLLNSI---CVRVLT 401


>gi|225461068|ref|XP_002281649.1| PREDICTED: protein disulfide isomerase-like 5-4 [Vitis vinifera]
 gi|297735969|emb|CBI23943.3| unnamed protein product [Vitis vinifera]
          Length = 482

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 88/199 (44%), Gaps = 39/199 (19%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           T GC+I G++ V +V G+  I+   + S +H        +  +  N +H I HLSFG K+
Sbjct: 292 TGGCRIEGFVRVKKVPGNLVIS---ARSGSH-------SFDPSQMNMSHVISHLSFGRKI 341

Query: 262 ----QDDDERRKPLDGTVAKAEEGASMFNY------------YIKIIPTI---------- 295
                 D +R  P  G       G S  ++            Y++++ T           
Sbjct: 342 APRVMSDMKRVLPYIGGSHDRLNGRSYISHPSDSNANVTIEHYLQVVKTEVITTRDHKLV 401

Query: 296 --YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
             YE    S L      +P   F +ELSP+ V +TE  KS  H  T +   I G +    
Sbjct: 402 EEYEYTAHSSLVQS-LYIPVAKFHFELSPMQVLVTENRKSFWHFITNVCAIIGGVFTVAG 460

Query: 354 LVDALLHSCVKKISKVEIG 372
           ++D++LH+ ++ + K+E+G
Sbjct: 461 ILDSVLHNTMRLMKKIELG 479


>gi|224013160|ref|XP_002295232.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969194|gb|EED87536.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 488

 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 53/193 (27%), Positives = 86/193 (44%), Gaps = 38/193 (19%)

Query: 195 EKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRH 254
           E+ +     GC I G+L VNRV G F I    + S+NH  +H      SA  N TH +  
Sbjct: 297 EEFEEDHHPGCLISGHLMVNRVPGRFQIE---ARSVNH-ELH------SAMTNLTHRVHD 346

Query: 255 LSFGI----------------KLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---- 294
           L+FG                  + +  +   P+        E    F++++KII T    
Sbjct: 347 LTFGALSGPPGHMLHVLPFFDTVPEKYKHTNPMQDKYYPTYEFHQAFHHHLKIISTHIDY 406

Query: 295 -------IYERLDGSKLGG-GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                  +Y+ L+ S+L    +  +P I FS++LSP+ V ++++ +      T +   I 
Sbjct: 407 LFSRSTVLYQILEQSQLVFYEEVNVPEIQFSFDLSPMSVNVSKEGRKWYEYVTSLCAIIG 466

Query: 347 GTYITFMLVDALL 359
           GTY T  L++A L
Sbjct: 467 GTYTTLGLINATL 479


>gi|426372082|ref|XP_004052960.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Gorilla gorilla
           gorilla]
          Length = 354

 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/151 (28%), Positives = 64/151 (42%), Gaps = 22/151 (14%)

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           H H        ++N +H I HLSFG  +        PLDGT   A +   MF Y+I ++P
Sbjct: 176 HAHLAALVNHESYNFSHRIDHLSFGELVP---AIINPLDGTEKIAIDHNQMFQYFITVVP 232

Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           T               + ER        G  G+ GIF  Y+LS LMV +TE+       +
Sbjct: 233 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFF 292

Query: 339 TKIMCNISGTYITFMLVDALLHSCVKKISKV 369
            ++   + G + T      +LH   K I ++
Sbjct: 293 VRLCGIVGGIFST----TGMLHGIGKFIVEI 319



 Score = 44.7 bits (104), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 31/123 (25%), Positives = 59/123 (47%), Gaps = 2/123 (1%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 22  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 82  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139

Query: 127 KKK 129
           + +
Sbjct: 140 QSR 142


>gi|313227239|emb|CBY22386.1| unnamed protein product [Oikopleura dioica]
          Length = 380

 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 89/382 (23%), Positives = 153/382 (40%), Gaps = 75/382 (19%)

Query: 3   FSERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE-ELFV 60
           F E+ + LDAFTK  E+    +T +GG  T+V +  +  L+  ++  +F  +  + E  V
Sbjct: 21  FLEKFRELDAFTKITEEAESPQTSHGGVCTMVTFTIMLLLLLGEMTVWFTTTKIKYEFDV 80

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           DS   SK+ +++DI   +  C  ++ + VDSSG+         Y  +L  D    +  ++
Sbjct: 81  DSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAW------GYSFQLQEDAADFELTKE 133

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK--EAYRYKKWA 178
           + +   K  K+          + DPN             +     ++VK  E  R K   
Sbjct: 134 KALERAKLLKM-------KESMTDPNM----------RDQLLREGHDVKHLEFSRKKNKK 176

Query: 179 LPE---LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP----------G 225
           + E   +  +VQ         L     +GC+++G +E+ +++G+  I            G
Sbjct: 177 MMEQGMMHKVVQI-------NLDPNEPQGCRVWGSVELQKIAGTIKIQAGGFGGMGGIPG 229

Query: 226 LSYSINHVHVH----------DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
           LS  ++ +              IQ    A F  +H I H SFG            LDG +
Sbjct: 230 LSGGLDAIMGMFMMPMMGMGAQIQDGKKANF--SHRIDHFSFG---DPSSGLVYGLDGDI 284

Query: 276 AKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
              E+      Y +K++PT           Y+      +G  D   P +   Y+ S L V
Sbjct: 285 QIQEKENDDTTYVVKVVPTDLKTFKFQQKAYQYAVTQHVGKSD--KPAVTIKYDFSGLGV 342

Query: 326 KITEKSKSLGHLWTKIMCNISG 347
            ITE  +S   L T++   + G
Sbjct: 343 SITEYRESFVGLLTRLAGILGG 364


>gi|303279378|ref|XP_003058982.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226460142|gb|EEH57437.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 486

 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 58/111 (52%), Gaps = 1/111 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-R 64
           +L+ +D + K   D  E TV G  ++I   L I+ L+  ++  Y   +   ++ VD S  
Sbjct: 8   KLRSVDFYRKIPRDMSEGTVPGSVISIGSALLIALLLVSEIGRYATPTWKTKVVVDRSLD 67

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPI 115
           G  + I+ ++  P +SC++ ++D  D+ G    ++   ++KR L  DG P+
Sbjct: 68  GDMMKINFNVSFPALSCEFASVDVGDAMGLNRYNLTKTVFKRALARDGTPL 118



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 48/226 (21%), Positives = 85/226 (37%), Gaps = 40/226 (17%)

Query: 177 WALPELD-----TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSIN 231
           W + E D      +V  +     E ++     GC + G++   +V       PG  +   
Sbjct: 268 WKIEEADKTESRAVVTREEALRHESVRAVKGPGCSVTGFVLAKKV-------PGHVWITA 320

Query: 232 HVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDD----ERRK---------PLDGTVAKA 278
           + + H   P      N TH + HL FG +L  +     ERR+          L G   ++
Sbjct: 321 NSNSHSFHP---EEMNMTHTVNHLFFGNQLGRNKLKALERRERGASSNWHDKLAGVTFRS 377

Query: 279 EEGASMFNYYIKIIPTI------------YERLDGSKLGGGDGGMPGIFFSYELSPLMVK 326
            +      +Y++ + T             YE    S        +P   F +  SP+ V 
Sbjct: 378 LQTNVTHEHYLQTVLTTLRPAGSYVAYHAYEYTQHSHALVTTRELPRAKFHFNPSPVQVV 437

Query: 327 ITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +TE+ +   H  T +M  + G Y    + D  +H+ +  + K E+G
Sbjct: 438 VTEEREPFYHFITTLMAIVGGVYSVCGIADGFVHNTLNMMRKFELG 483


>gi|402595088|gb|EJW89014.1| hypothetical protein WUBG_00081 [Wuchereria bancrofti]
          Length = 578

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 89/397 (22%), Positives = 161/397 (40%), Gaps = 52/397 (13%)

Query: 5   ERLKGLDAFTKPYEDF-HEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE--ELFVD 61
           E ++  DAF K  ++   EK   GG +  + +L I+ L+  ++ +YF           VD
Sbjct: 172 EVVRDFDAFNKTVDEVSEEKRATGGFLASLSFLIIAALVFGELRNYFYDGEGHYYRFSVD 231

Query: 62  SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKE 121
           ++      + LD++V T  C  L      ++  +   V  N +K     D    +  +KE
Sbjct: 232 TAFSEHPELELDMIVAT-PCTNLMAHLTGTTSHEFSSV--NEFKH----DPTRFEFTEKE 284

Query: 122 VV--NAVKKKKVTTENGTT-------TTELEDPNKCGSCYGAETETRKCCNTCNEVKEAY 172
            +  N +KK +  T+ GTT        T +    + G    AET+ R+  +     K+  
Sbjct: 285 AMYWNELKKVQHRTKEGTTLFKSLDEMTFISGQVEEGLKNEAETKQREEAHAIQLEKKKN 344

Query: 173 RYKKWALPELDTIVQCKNEYSTEKLKNTFTEG--CQIYGYLEVNRVSG-SFHIAPGLSYS 229
             +      L  I    N +      +   EG  C+I+G + VN+V G SF ++ G    
Sbjct: 345 PKESMDGGMLILIGNGFNVFHVVASNSEKNEGTACRIHGRMRVNKVKGDSFVVSTGKGLG 404

Query: 230 INHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYI 289
           ++ +  H      S   N +H I   +FG  +        PL G    +E G   F Y++
Sbjct: 405 VDGIFAHF--GGLSNPGNVSHRIERFNFGPTIYG---LVTPLAGIEQISETGMDEFRYFL 459

Query: 290 KIIPTIYERLDGSKLGGGDG-------------------GMPGIFFSYELSPLMVKITEK 330
           K++PT   R+  S L GG                         I   YE +  ++++   
Sbjct: 460 KVVPT---RIYHSGLFGGSTLTYQYSVTFMKKTPKKDVHKHAAIIIHYEFAATVIEVRRI 516

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
             SL  +  ++   + G + T +L++++   CV+ ++
Sbjct: 517 QSSLLQMLIRLCSAVGGVFATSVLLNSI---CVRVLT 550


>gi|365759132|gb|EHN00939.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
 gi|401842937|gb|EJT44934.1| ERV41-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 285

 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 47/175 (26%), Positives = 76/175 (43%), Gaps = 33/175 (18%)

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINH-VHVHDIQPYTSAAFN 247
           +E   +K K     GC I+G + VNRVSG   I A G  Y+ +H   + D+        N
Sbjct: 79  DENDPDKAKLLDFNGCHIFGSVPVNRVSGVLQITAKGFGYADSHRASLEDL--------N 130

Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKLG- 305
             H I   SFG      D    PLD T     +E  + + YY  ++PT++++L G+++  
Sbjct: 131 FAHVINEFSFGDFYPYID---NPLDNTAQFDQDEPLTTYLYYTSVVPTLFKKL-GAEVDT 186

Query: 306 -----------------GGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                             G+  +PGIFF Y   PL + +++   S      +++ 
Sbjct: 187 NQYSVNDYRYLNKDSSVKGNRRVPGIFFKYNFEPLSIVVSDVRISFIQFLVRLVA 241


>gi|344250048|gb|EGW06152.1| UPF0474 protein C5orf41-like [Cricetulus griseus]
          Length = 745

 Score = 57.8 bits (138), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 66/148 (44%), Gaps = 29/148 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG--IKL 261
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  +++
Sbjct: 133 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHIIHKLSFGDTLQV 180

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
           Q+       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 181 QNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSH 240

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSL 334
            G  +P I+F Y+LSP+ VK TE+ + L
Sbjct: 241 TGRIIPAIWFRYDLSPITVKYTERRQPL 268



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 50/101 (49%), Gaps = 4/101 (3%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS--- 63
           L   D + K  +D  + T  G  ++I C LFI +L   ++  +       EL+VD     
Sbjct: 24  LHRFDIYRKVPKDLTQPTYTGAIISICCCLFILFLFLSELTGFITTEVVNELYVDDPDKD 83

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNI 103
            G K+ + L+I +P + C+ + LD  D  G   + H+++++
Sbjct: 84  SGGKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM 124


>gi|308807242|ref|XP_003080932.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
 gi|116059393|emb|CAL55100.1| Thioredoxin/protein disulfide isomerase (ISS) [Ostreococcus tauri]
          Length = 533

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 65/125 (52%), Gaps = 6/125 (4%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-RG 65
           ++G+D + K   +F E T+ G  ++I+  + + YL   ++  Y   S   ++ VD S  G
Sbjct: 26  IRGMDFYRKVPREFSEGTLGGSIISILSAVLMLYLFLSELGKYSTSSFETKVVVDRSVDG 85

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQ-----K 120
             L I+ ++  P +SC++ ++D  D+ G    ++   ++KR +D +  PI   Q     K
Sbjct: 86  ELLRINFNLSFPALSCEFASVDVGDALGLNRFNLTKTVFKRAIDAEMNPIGPLQWDRAVK 145

Query: 121 EVVNA 125
           EV+ A
Sbjct: 146 EVLKA 150


>gi|145549492|ref|XP_001460425.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428255|emb|CAK93028.1| unnamed protein product [Paramecium tetraurelia]
          Length = 320

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 84/398 (21%), Positives = 142/398 (35%), Gaps = 115/398 (28%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG- 65
           LK +D + K  +   E T  G  V+I+  + ++ +I  +  +Y  +    E+ VD     
Sbjct: 4   LKSIDLYGKVPKGLAEPTSSGAVVSIITLILLALMIINEGIEYITIDVQSEIIVDQKLSK 63

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            ++ ++LDI      CD+L +D  D+ G+        +   RLD + + I E        
Sbjct: 64  DRVQVNLDIKFIKAPCDFLEIDQQDAMGQSLSQQFMELKYYRLDSNERRISE-------- 115

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
                  T N     E+ED              R   N                      
Sbjct: 116 ------YTRNSNNWVEIED-------------ARTAIN---------------------- 134

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAA 245
                    EK      +GC++ G L+VNRV G        SYS           Y  A 
Sbjct: 135 ---------EK------QGCEVIGNLKVNRVRGKISFGAHRSYS-----------YIGAV 168

Query: 246 FNT------THHIRHLSFGIKLQDDDERRK--------PLD---GT--VAKAEEGASMFN 286
            N       +H     SFG    D+D  +K         LD   GT  + K E  +    
Sbjct: 169 GNLNLPLDYSHKFVSFSFG----DEDALKKVKSLFQQGQLDSFAGTQRIKKPELASQSMQ 224

Query: 287 --YYIKIIPTIYERLDG------------SKLGGGDGGMPGIFFSYELSPLMVKITEKSK 332
             ++I IIPT Y  L+             +++   + G   +   Y+ +P  V   +  +
Sbjct: 225 HEHFISIIPTHYTLLNKQVYSVYQYTANHNEVRSNNYG--NVQLRYDFAPTTVTYWQTKE 282

Query: 333 SLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVE 370
            + H + +I   I G +    +++A ++  ++ + KVE
Sbjct: 283 DILHFYVQICAVIGGIFTVSSMIEACVYKVMRMLLKVE 320


>gi|145543941|ref|XP_001457656.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425473|emb|CAK90259.1| unnamed protein product [Paramecium tetraurelia]
          Length = 322

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 78/381 (20%), Positives = 144/381 (37%), Gaps = 87/381 (22%)

Query: 7   LKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           L+ LD F K   D  +  +  GG +T++ +  ++     +   +F      +  +D+   
Sbjct: 3   LRQLDFFRKLNTDIGDTSSSLGGFLTMIAFALVTIFTMNECRLFFSTELNYQTVIDNDTE 62

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
             + ++LD +V    C  L+LD  D  G   + V  N+ K  LD                
Sbjct: 63  QFIKVYLDAIVGA-PCMVLSLDQQDEVGVHVMDVSGNLKKIALD---------------- 105

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTI 185
            K++ V      T    E PN  GS                              EL   
Sbjct: 106 -KERHVLP----TIDNNERPNYRGSD----------------------------QELVDA 132

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIA-PGLSYSINHVHVHDIQPYTSA 244
           ++  N+           E CQ  G+  VN+V G+FHI+     + I  +H  D+  Y   
Sbjct: 133 IEAINQ----------GEQCQFKGFFSVNKVPGNFHISYHAHHHLIQRIHQRDLSTYRKL 182

Query: 245 AFNTTHHIRHLSFGIKLQDDDERRKPLD--------GTVAK-AEEGASM-FNYYIKIIP- 293
             +  H I  L FG        ++ P           ++AK A EG    + YYI  +P 
Sbjct: 183 KLD--HTIYELRFGDNSSSFKMKKYPKSLQKFQSSWNSIAKTAPEGEKQDYEYYINALPV 240

Query: 294 -----------TIYE-RLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
                      T+Y+  ++ +++      +  I+F Y++SP+ +  + + KS+ H   ++
Sbjct: 241 RFYDDKERNYQTLYKYSINEAQMTRSFTEIDSIYFKYQISPVNMVYSIQKKSVYHFIVQL 300

Query: 342 MCNISGTYITFMLVDALLHSC 362
           +  + G +    +V++++   
Sbjct: 301 LAIVGGVFAVIGIVNSIIQKA 321


>gi|396485364|ref|XP_003842153.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
 gi|312218729|emb|CBX98674.1| hypothetical protein LEMA_P079130.1 [Leptosphaeria maculans JN3]
          Length = 486

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 51/204 (25%), Positives = 83/204 (40%), Gaps = 46/204 (22%)

Query: 202 TEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           T+ C+I+G +E N+V G FHI A G  Y    VH+          FN +H IR LSFG  
Sbjct: 268 TDSCRIFGSIEGNKVQGDFHITARGHGYIEYGVHL------DHKTFNFSHIIRELSFGPY 321

Query: 261 LQDDDERRKPLDGTVA---KAEEGASMFNYYIKIIPTIY-------ERLDGSKLGGGDGG 310
                    PLD T+A     ++    F Y++ I+PTIY         LD     G +  
Sbjct: 322 YP---SLTNPLDNTIAITPTPDDHFYKFQYFLSIVPTIYTDDPSLIPYLDILNRYGKNPD 378

Query: 311 M--------------------------PGIFFSYELSPLMVKITEKSKSLGHLWTKIMCN 344
           +                          PG+F  +++ P+M+ + E+      L  +++  
Sbjct: 379 LFNSAHAVKTNQYAVTSQSHPVSEYYVPGVFVKFDIEPIMLNVVEEWGGFWRLLVRLVNV 438

Query: 345 ISGTYITFMLVDALLHSCVKKISK 368
           ISG  +       L+   ++ + +
Sbjct: 439 ISGVMVAGSWAWQLMDWAIEVMGR 462


>gi|323303637|gb|EGA57425.1| Erv41p [Saccharomyces cerevisiae FostersB]
          Length = 284

 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GC I+G + VNRVSG   I A  L Y  +        P     FN  H I   SFG    
Sbjct: 93  GCHIFGSIPVNRVSGELQITAKSLXYVASRK-----APLEELKFN--HVINEFSFGDFYP 145

Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
             D    PLD T     +E  + + YY  ++PT++++L    D ++    D         
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202

Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                MPGIFF Y   PL + +++   S      +++ 
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDXRLSFIQFLVRLVA 240


>gi|302800507|ref|XP_002982011.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
 gi|300150453|gb|EFJ17104.1| hypothetical protein SELMODRAFT_115492 [Selaginella moellendorffii]
          Length = 476

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 47/199 (23%), Positives = 83/199 (41%), Gaps = 39/199 (19%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+I G++   +V       PG      H   H    + ++A N TH++   +FG +L  
Sbjct: 284 GCRIEGFIRAKKV------VPGNIIISAHSGSHS---FDASAMNMTHYVSQFTFGRELNF 334

Query: 264 DDERR----------------KPLDGTVAKAEEGASMFNYYIKIIPT----IYERLDGSK 303
              R                   L G +  ++      ++Y++++ T    + +R + S 
Sbjct: 335 WMRRELYRIYPHLASVYDTVEANLTGRIYVSQHENITHDHYLQVVKTEVVSLRKRKEFSL 394

Query: 304 LGGGD----------GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
           L   D            +P   F YELSP+ V + E  KS  H  T +   I G +    
Sbjct: 395 LEQYDYTSHSNTIQNTNVPVAKFHYELSPMQVLVKENPKSFSHFITNVCAIIGGVFTVAG 454

Query: 354 LVDALLHSCVKKISKVEIG 372
           +VD++LH  ++ + K+E+G
Sbjct: 455 IVDSMLHGAMRMVKKIELG 473



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 27/114 (23%), Positives = 60/114 (52%), Gaps = 1/114 (0%)

Query: 1   MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           M  + ++K +D + K   D  E ++ G  ++++    + +L  +++ +Y  VS+T  + V
Sbjct: 1   MTTASKIKSIDFYRKIPRDLTEASLSGAGLSLIAAFAMIFLFGMELNNYLTVSSTTNVVV 60

Query: 61  DSSR-GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGK 113
           D S+ G  L I  ++  P +SC++ ++D  D+ G    ++   + K  +D + K
Sbjct: 61  DRSKDGEYLRIQFNMSFPALSCEFASVDVSDALGTNRYNLTKTVRKYPIDPNLK 114


>gi|357474783|ref|XP_003607677.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
 gi|355508732|gb|AES89874.1| Endoplasmic reticulum-Golgi intermediate compartment protein
           [Medicago truncatula]
          Length = 156

 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/153 (29%), Positives = 64/153 (41%), Gaps = 26/153 (16%)

Query: 246 FNTTHHIRHLSFGIKLQD----DDERRKPLDGTVAKAEEGASMFN-----------YYIK 290
            N +H I HLSFG K+      D +   P  G       G S  N           +YI+
Sbjct: 1   MNMSHVINHLSFGKKVTPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQ 60

Query: 291 IIPTIYERLDGSKL-----------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWT 339
           ++ T      G KL                 +P   F  ELSP+ V ITE  KS  H  T
Sbjct: 61  VVKTEVITRKGYKLIEEYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFIT 120

Query: 340 KIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            +   I G +    ++D++LH+ +K + K+EIG
Sbjct: 121 NVCAIIGGVFTVAGILDSILHNTIKAMKKIEIG 153


>gi|339233696|ref|XP_003381965.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316979152|gb|EFV61980.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 331

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 47/187 (25%), Positives = 77/187 (41%), Gaps = 23/187 (12%)

Query: 190 NEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDI-QPYTSAAFNT 248
           N  STE       + C+I+GY  +N++ G   I    +  +  V    I     +  FN 
Sbjct: 134 NASSTE---TAIVDACRIHGYFLMNKLRGKLRIKFKETVRLEAVSNFIIFARRQNEGFNF 190

Query: 249 THHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKL---- 304
           +H I    FG ++        PLDG   ++ +   MF YYI+++PT    L+G +     
Sbjct: 191 SHRIEKFGFGPRIAGIIN---PLDGFQKESFDRRDMFYYYIQVVPTKITDLNGMETFTSQ 247

Query: 305 ------------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITF 352
                         G  G  GIF  ++ +P+MV I +   SL     +I   + G +   
Sbjct: 248 YSVTHKRRIIDHDQGSHGSCGIFIYFDFAPMMVLIRKSKTSLFVFALRICAIVGGIFACT 307

Query: 353 MLVDALL 359
             + AL+
Sbjct: 308 DFIIALM 314


>gi|167383125|ref|XP_001736415.1| hypothetical protein [Entamoeba dispar SAW760]
 gi|165901233|gb|EDR27345.1| hypothetical protein, conserved [Entamoeba dispar SAW760]
          Length = 116

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 16/107 (14%)

Query: 269 KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDG----------------SKLGGGDGGMP 312
            P+DG V       SM+ Y+++++P  Y  LD                   L   + G+P
Sbjct: 3   NPMDGIVKVDRTNNSMYQYFVQVVPMTYTSLDNRIINTNGYSVTEHYRPGNLKSPEQGIP 62

Query: 313 GIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALL 359
           G+F  Y++S + V   E+  S GHL T I   I G +  F L+D  +
Sbjct: 63  GVFVIYDISSIEVLYFEEKNSFGHLLTSICGIIGGVFALFSLLDYFI 109


>gi|365991164|ref|XP_003672411.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
 gi|343771186|emb|CCD27168.1| hypothetical protein NDAI_0J02760 [Naumovozyma dairenensis CBS 421]
          Length = 341

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/297 (23%), Positives = 121/297 (40%), Gaps = 64/297 (21%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRG 65
           +L   DAF K  E+  +K+  GG  +I+ +LF+ ++I  +V  YF     ++  VD    
Sbjct: 3   KLGAFDAFPKTEEEHVKKSTRGGLSSILTYLFLLFMIYNEVGRYFGGFIEQQYIVDIEIQ 62

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNA 125
            +  I+ DI + T +CD + +  VD + +   +++ ++    +  +      P    +N 
Sbjct: 63  ERAQINFDIFLNT-TCDLIDVRIVDLTSD---NMKRSV-SDEISFEDLTFYIPYGTRINI 117

Query: 126 VKKKKVTTENGTTTTELEDPNKCGSCY--GAETETRKCCNTCNEVKEAYRYKKWALPELD 183
           +        NG  TTE ++       Y  G   + R                    PE D
Sbjct: 118 L--------NGIYTTEFDEVLTQAIPYEFGMRIDERP-------------------PEDD 150

Query: 184 TIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTS 243
                        + N     C ++G ++VNR+ G   I+   + +IN            
Sbjct: 151 -------------MPN--INACHLFGSVDVNRLPGILEISTNSTGNIND---------NG 186

Query: 244 AAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV-AKAEEGASMFNYYIKIIPTIYERL 299
            +F   H I  LSFG      D    PLD T     ++  + ++YY+ +IPTIYE+L
Sbjct: 187 KSF--AHVINELSFGEFFPFID---NPLDNTAKVLPDQPLTTYSYYLTVIPTIYEKL 238


>gi|397568493|gb|EJK46164.1| hypothetical protein THAOC_35181 [Thalassiosira oceanica]
          Length = 480

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 54/233 (23%), Positives = 100/233 (42%), Gaps = 48/233 (20%)

Query: 172 YRYKKWALPEL--DTIVQCKNEYSTEKL-------KNTFTEGCQIYGYLEVNRVSGSFHI 222
           Y + +  +P+   D  V    EY+T +L             GCQ+ G+L VNRV G+ H+
Sbjct: 258 YEHGRAVMPDYKGDRTVGALVEYATRRLGEGQEDESEDHHPGCQVSGHLMVNRVPGNLHM 317

Query: 223 -APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK----------------LQDDD 265
            A  + + IN           SA  N TH + HLSFG +                + D+ 
Sbjct: 318 EAKSIHHEIN-----------SAMTNLTHRVDHLSFGDERGPQGHFLDRFAFLGGVPDEF 366

Query: 266 ERRKPLDGTVAKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGD-GGMPGI 314
           +   P+ G + +       F++++K++ T          +Y+ L  S+L   +   +P I
Sbjct: 367 KHTNPMKGRLFQTHRFHESFHHHLKVVTTTIDYLFRPTALYQILAESQLVLYELQEVPEI 426

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
            F +++SP+ +++  + +      T  +  + G Y +  L++  L +  K  S
Sbjct: 427 KFLWDMSPMGIEVDVERRPWYDYITTCLAIVGGAYASLGLINRALLAMFKPKS 479


>gi|323307814|gb|EGA61076.1| Erv41p [Saccharomyces cerevisiae FostersO]
          Length = 284

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GC I+G + VNRVSG   I A  L Y  +        P     FN  H I   SFG    
Sbjct: 93  GCHIFGSIPVNRVSGELQITAKSLXYVASRK-----APLEELKFN--HVINEFSFGDFYP 145

Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
             D    PLD T     +E  + + YY  ++PT++++L    D ++    D         
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202

Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                MPGIFF Y   PL + +++   S      +++ 
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDIRLSFIQFLVRLVA 240


>gi|351707253|gb|EHB10172.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Heterocephalus glaber]
          Length = 211

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 56/129 (43%), Gaps = 19/129 (14%)

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           H H        ++N +H I HLSFG  +        PLDGT   A +   MF Y+I ++P
Sbjct: 80  HAHLAALVNHDSYNFSHRIDHLSFGELVPG---IINPLDGTEKIAIDHNQMFQYFITVVP 136

Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
           T               + ER        G  G+ GIF  Y+LS LMV +TE+       +
Sbjct: 137 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEEHMPFWQFF 196

Query: 339 TKIMCNISG 347
            + +C I G
Sbjct: 197 VR-LCGIVG 204


>gi|207342541|gb|EDZ70277.1| YML067Cp-like protein [Saccharomyces cerevisiae AWRI1631]
 gi|323336174|gb|EGA77445.1| Erv41p [Saccharomyces cerevisiae Vin13]
 gi|323347070|gb|EGA81345.1| Erv41p [Saccharomyces cerevisiae Lalvin QA23]
          Length = 284

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/158 (28%), Positives = 66/158 (41%), Gaps = 28/158 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GC I+G + VNRVSG   I A  L Y  +        P     FN  H I   SFG    
Sbjct: 93  GCHIFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELKFN--HVINEFSFGDFYP 145

Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
             D    PLD T     +E  + + YY  ++PT++++L    D ++    D         
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202

Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                MPGIFF Y   PL + +++   S      +++ 
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 240


>gi|313241668|emb|CBY33893.1| unnamed protein product [Oikopleura dioica]
          Length = 380

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 88/382 (23%), Positives = 152/382 (39%), Gaps = 75/382 (19%)

Query: 3   FSERLKGLDAFTKPYEDFHE-KTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTE-ELFV 60
           F E+ + LDAFTK  E+    +T +GG  T+  +  +  L+  ++  +F  +  + E  V
Sbjct: 21  FLEKFRELDAFTKITEEAESPQTSHGGVCTMFTFTIMLLLLLGEMTVWFTTTKIKYEFDV 80

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQK 120
           DS   SK+ +++DI   +  C  ++ + VDSSG+         Y  +L  D    +  ++
Sbjct: 81  DSEYESKMHLNMDITFNS-PCHMISAEIVDSSGDAW------GYSFQLQEDAADFELTKE 133

Query: 121 EVVNAVKKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVK--EAYRYKKWA 178
           + +   K  K+          + DPN             +     ++VK  E  R K   
Sbjct: 134 KALERAKLLKM-------KESMTDPNM----------RDQLLREGHDVKHLEFSRKKNKK 176

Query: 179 LPE---LDTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAP----------G 225
           + E   +  +VQ         L     +GC+++G +E+ +++G+  I            G
Sbjct: 177 MMEQGMMHKVVQI-------NLDPNEPQGCRVWGSVELQKIAGTIKIQAGGFGGMGGIPG 229

Query: 226 LSYSINHVHVH----------DIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTV 275
           LS  ++ +              IQ    A F  +H I H SFG            LDG +
Sbjct: 230 LSGGLDAIMGMFMMPMMGMGAQIQDGKKANF--SHRIDHFSFG---DPSSGLVYGLDGDI 284

Query: 276 AKAEEGASMFNYYIKIIPT----------IYERLDGSKLGGGDGGMPGIFFSYELSPLMV 325
              E+      Y +K++PT           Y+      +G  D   P +   Y+ S L V
Sbjct: 285 QIQEKENDDTTYVVKVVPTDLKTFKFQQKAYQYAVTQHVGKSD--KPAVTIKYDFSGLGV 342

Query: 326 KITEKSKSLGHLWTKIMCNISG 347
            ITE  +S   L T++   + G
Sbjct: 343 SITEYRESFVGLLTRLAGILGG 364


>gi|169614774|ref|XP_001800803.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
 gi|111060809|gb|EAT81929.1| hypothetical protein SNOG_10535 [Phaeosphaeria nodorum SN15]
          Length = 404

 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/382 (19%), Positives = 138/382 (36%), Gaps = 90/382 (23%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            DAF K  + +  +     A T+   L   YL   ++  ++  STT+   V+      + 
Sbjct: 25  FDAFPKTKKTYLVQGRNSSAWTVTLILTCIYLSWSEITRWYAGSTTQSFSVEKGVSHDMQ 84

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAVKKK 129
           I+LDI+V  ++C  L ++  D++G+                         + +   + + 
Sbjct: 85  INLDIIV-AMNCHDLRVNMQDAAGD-------------------------RTLAGDLLRN 118

Query: 130 KVTTENGTTTTELEDP-NKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
             T  +  T  ++E   ++ G   G      +  +   ++ +A + K             
Sbjct: 119 DPTNWSQWTGRKMEKGMHELGKDDGVNPGWEELWDVHEQLGKAKKRK------------- 165

Query: 189 KNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFN 247
              +S         + C+I+G L+ N+V G FHI A G  Y          Q      FN
Sbjct: 166 ---FSKTPRVRGAPDACRIFGSLDGNKVQGDFHITARGHGY-----QEFGEQHLDHKTFN 217

Query: 248 TTHHIRHLSFGIKLQDDDERRKPLDGTVAKA---EEGASMFNYYIKIIPTIYERLDG--- 301
            +H IR +SFG           PLD T+A     ++    F YY+ I+PTIY    G   
Sbjct: 218 FSHIIREMSFGPYYP---SLTNPLDNTIATTPTDQDHFYKFQYYLSIVPTIYTDNPGLLP 274

Query: 302 --------------------------------SKLGGGDGGMPGIFFSYELSPLMVKITE 329
                                                 +  +PG+F  +++ P+M+ + E
Sbjct: 275 LLESVNRDPSAHPAKSIFSTHAIKTNQYAVTSQSHTVPENYVPGVFVKFDIEPIMLAVVE 334

Query: 330 KSKSLGHLWTKIMCNISGTYIT 351
           +      L  +I+  +SG  + 
Sbjct: 335 EWGGFWRLLVRIVNVVSGVMVA 356


>gi|159464951|ref|XP_001690702.1| hypothetical protein CHLREDRAFT_180779 [Chlamydomonas reinhardtii]
 gi|158270379|gb|EDO96229.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 656

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 46/194 (23%), Positives = 78/194 (40%), Gaps = 35/194 (18%)

Query: 208 YGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQP------YTSAAFNTTHHIRHLSFGIKL 261
           Y   +V RV+G  H+      S++   V  + P      +     N +H I+HL FG   
Sbjct: 84  YHTPQVKRVAGRLHL------SVHQNMVFQMLPQLLGTHHIPKILNMSHVIKHLGFGPHY 137

Query: 262 QDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGG-------------- 307
                +  PLDG V         + Y++K++PT Y     ++LG                
Sbjct: 138 PG---QLNPLDGYVRMVGREPFSYKYFLKVVPTEYY----NRLGRATETHQYSVTEYAQP 190

Query: 308 --DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKK 365
              G  P +   Y+LSP+++ I E+  SL H   ++   + G +    L D  +   V+ 
Sbjct: 191 LQRGYAPAVDVHYDLSPIVMTINERPPSLLHFVVRLCAVVGGVFAITRLTDRWVDWLVRL 250

Query: 366 ISKVEIGGKTVTKR 379
           ++K    G +   R
Sbjct: 251 VNKAAARGPSFVDR 264


>gi|558407|emb|CAA86253.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 284

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 66/158 (41%), Gaps = 28/158 (17%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GC ++G + VNRVSG   I A  L Y  +        P     FN  H I   SFG    
Sbjct: 93  GCHVFGSIPVNRVSGELQITAKSLGYVASRK-----APLEELKFN--HVINEFSFGDFYP 145

Query: 263 DDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD--------- 308
             D    PLD T     +E  + + YY  ++PT++++L    D ++    D         
Sbjct: 146 YID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVA 202

Query: 309 ---GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
                MPGIFF Y   PL + +++   S      +++ 
Sbjct: 203 AKGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 240


>gi|323332255|gb|EGA73665.1| Erv41p [Saccharomyces cerevisiae AWRI796]
 gi|323352959|gb|EGA85259.1| Erv41p [Saccharomyces cerevisiae VL3]
 gi|365763687|gb|EHN05213.1| Erv41p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
          Length = 250

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 67/157 (42%), Gaps = 26/157 (16%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC I+G + VNRVSG   I    + S+ +V      P     FN  H I   SFG     
Sbjct: 59  GCHIFGSIPVNRVSGELQIT---AKSLGYVASRK-APLEELKFN--HVINEFSFGDFYPY 112

Query: 264 DDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERL----DGSKLGGGD---------- 308
            D    PLD T     +E  + + YY  ++PT++++L    D ++    D          
Sbjct: 113 ID---NPLDNTAQFNQDEPLTTYVYYTSVVPTLFKKLGAEVDTNQYSVNDYRYLYKDVAA 169

Query: 309 --GGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMC 343
               MPGIFF Y   PL + +++   S      +++ 
Sbjct: 170 KGDKMPGIFFKYNFEPLSIVVSDVRLSFIQFLVRLVA 206


>gi|255082155|ref|XP_002508296.1| predicted protein [Micromonas sp. RCC299]
 gi|226523572|gb|ACO69554.1| predicted protein [Micromonas sp. RCC299]
          Length = 507

 Score = 55.5 bits (132), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 46/221 (20%), Positives = 88/221 (39%), Gaps = 44/221 (19%)

Query: 183 DTIVQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPY 241
           DT +  +    T+ +K     GC + G++ V +V G   + A   S+S           +
Sbjct: 297 DTELAIRQPVETQTVKKIDGPGCSVTGFVLVKKVPGHLWVTATSKSHS-----------F 345

Query: 242 TSAAFNTTHHIRHLSFGIKLQDDDERRKPLD-------------------GTVAKAEEGA 282
            + + N +H + H  FG +L    +R++ LD                   GT    E+  
Sbjct: 346 HAESMNMSHVVHHFYFGQQLTP--QRKRYLDRFHSREKDPKGDWHDKLAGGTFTSEEDNV 403

Query: 283 SMFNYYIKIIPTI-----------YERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKS 331
           +  +Y   ++ TI           YE    S     +  +P   F ++ SP+ + ++E+ 
Sbjct: 404 THEHYLQTVLTTIKPSGSPAPFNVYEYTQHSHSLRSEKELPRAKFHFDPSPVQISVSEER 463

Query: 332 KSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +   H  T +M  + G Y    + D  +H+ ++   K E+G
Sbjct: 464 QKFYHFITTLMAIVGGVYSVMGIADGFVHNSIQAWKKKELG 504



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/139 (21%), Positives = 67/139 (48%), Gaps = 1/139 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SSR 64
           + K +D + K  +D  E T+ G  ++++  L I  L+  +V  Y        + +D S+ 
Sbjct: 8   KFKNVDFYRKIPKDMTEGTIPGSVISMLAALVIGLLLVSEVGSYLTPKFDTRVVIDRSAD 67

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVN 124
           G  + I+ ++  P +SC++ ++D  D+ G    ++   ++KR +D    P+   Q E  +
Sbjct: 68  GEMMRINFNVSFPALSCEFASVDVGDAMGLNRFNLTKTVFKRAIDAKLNPLGPIQWERGH 127

Query: 125 AVKKKKVTTENGTTTTELE 143
             +K+    ++  T   ++
Sbjct: 128 ENRKEPEHADDAATAVAIK 146


>gi|302841900|ref|XP_002952494.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
           nagariensis]
 gi|300262133|gb|EFJ46341.1| hypothetical protein VOLCADRAFT_75374 [Volvox carteri f.
           nagariensis]
          Length = 478

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 26/101 (25%), Positives = 54/101 (53%), Gaps = 1/101 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVD-SSR 64
           +LK +D F K   D  E T+ G  ++I+  + + +L   ++  +   +TT +L VD S +
Sbjct: 7   KLKAIDFFKKIPSDLTEATLTGAWISILAAVIMVFLFTAEMMSFLSTTTTTQLIVDRSPQ 66

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYK 105
              L ++ +I  P +SC++  +D  D+ G + +++   + K
Sbjct: 67  NELLKLNFNISFPALSCEFATVDVSDTLGTKRMNLTKTVRK 107



 Score = 44.7 bits (104), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 43/195 (22%), Positives = 79/195 (40%), Gaps = 30/195 (15%)

Query: 202 TEGCQIYGYLEVNRVSGSFHI---APGLSYS---INHVH-VHDIQPYTSAAFNTTHHIRH 254
           T GC + G++ V +V G+  +   + G S+    +N  H VH     T  +      ++ 
Sbjct: 287 TPGCNLAGFVMVKKVPGTLTVVARSEGHSFDHTWMNMTHLVHTFHVGTRPSPRKYQQLKR 346

Query: 255 LSFGIKLQDD----DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGGGDG- 309
           L    + + D     E+R+       + E   S   +Y++I+ T  E       G  D  
Sbjct: 347 LHPAGEGEGDLFWWREKRE------KRGEHPQSTHEHYLQIVLTSIEPRRSRHSGNYDAY 400

Query: 310 ------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDA 357
                        +P   F+Y+LSP+ + + E ++      T     I G +    ++DA
Sbjct: 401 EYTAHSHTYQSDAIPSARFTYDLSPIQILVQETARPWYQFLTTSCAIIGGVFTVAGILDA 460

Query: 358 LLHSCVKKISKVEIG 372
           LL+   K + K+ +G
Sbjct: 461 LLYQSFKVVKKLNLG 475


>gi|442614645|ref|NP_001259099.1| CG4293, isoform E [Drosophila melanogaster]
 gi|440216271|gb|AGB94945.1| CG4293, isoform E [Drosophila melanogaster]
          Length = 439

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/138 (27%), Positives = 62/138 (44%), Gaps = 16/138 (11%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYERL------------DGSKLGGGDG 309
                 +PL+G      E A+   Y++K++PT I++              +  KL     
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTIYAFQYAVTENVRKLERNSY 314

Query: 310 GMPGIFFSYELSPLMVKI 327
           G PGI+F Y+ S L + +
Sbjct: 315 GSPGIYFKYDWSALKIIV 332


>gi|115472445|ref|NP_001059821.1| Os07g0524100 [Oryza sativa Japonica Group]
 gi|75118816|sp|Q69SA9.1|PDI54_ORYSJ RecName: Full=Protein disulfide isomerase-like 5-4;
           Short=OsPDIL5-4; AltName: Full=Protein disulfide
           isomerase-like 8-1; Short=OsPDIL8-1; Flags: Precursor
 gi|50508559|dbj|BAD30858.1| thioredoxin family-like protein [Oryza sativa Japonica Group]
 gi|113611357|dbj|BAF21735.1| Os07g0524100 [Oryza sativa Japonica Group]
 gi|215704615|dbj|BAG94243.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218199742|gb|EEC82169.1| hypothetical protein OsI_26259 [Oryza sativa Indica Group]
 gi|222637167|gb|EEE67299.1| hypothetical protein OsJ_24505 [Oryza sativa Japonica Group]
          Length = 485

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 49/202 (24%), Positives = 85/202 (42%), Gaps = 40/202 (19%)

Query: 201 FTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
            T GC+I G++ V +V GS  I+   + S +H        +  +  N +H++   SFG +
Sbjct: 291 LTSGCRIEGFVRVKKVPGSVVIS---ARSGSH-------SFDPSQINVSHYVTQFSFGKR 340

Query: 261 LQ----DDDERRKPLDGTVAKAEEGAS------------MFNYYIKIIPTIYERLDGSKL 304
           L     ++ +R  P  G       G S               +Y++I+ T    L  SK 
Sbjct: 341 LSAKMFNELKRLTPYVGGHHDRLAGQSYIVKHGDVNANVTIEHYLQIVKTELVTLRSSKE 400

Query: 305 GG--------------GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYI 350
                               +P + F +E SP+ V +TE  KS  H  T +   I G + 
Sbjct: 401 LKLVEEYEYTAHSSLVHSFYVPVVKFHFEPSPMQVLVTELPKSFSHFITNVCAIIGGVFT 460

Query: 351 TFMLVDALLHSCVKKISKVEIG 372
              ++D++ H+ ++ + KVE+G
Sbjct: 461 VAGILDSIFHNTLRLVKKVELG 482


>gi|410046954|ref|XP_003952285.1| PREDICTED: LOW QUALITY PROTEIN: endoplasmic reticulum-Golgi
           intermediate compartment protein 2 [Pan troglodytes]
          Length = 333

 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 75/363 (20%), Positives = 133/363 (36%), Gaps = 86/363 (23%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 22  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 81

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 82  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 139

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
           + +            L++ +                      K A++    ALP      
Sbjct: 140 QSR------------LQEEHSLQDVI---------------FKSAFKSASTALP------ 166

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAF 246
                   E   +   + C+I+G+L VN+V+G+FHI             + +  Y     
Sbjct: 167 ------PREDDSSQSPDACRIHGHLYVNKVAGNFHITVD----------NQMFQYFITVV 210

Query: 247 NTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
            T  H   +S         ER + ++   A    G S                       
Sbjct: 211 PTKLHTYKISADTHQFSVTERERIINH--AAGSHGVS----------------------- 245

Query: 307 GDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI 366
                 GIF  Y+LS LMV +TE+       + ++   + G + T      +LH   K I
Sbjct: 246 ------GIFMKYDLSSLMVTVTEEHMPFWQFFVRLCGIVGGIFST----TGMLHGIGKFI 295

Query: 367 SKV 369
            ++
Sbjct: 296 VEI 298


>gi|431918151|gb|ELK17379.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Pteropus alecto]
          Length = 313

 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 44/155 (28%), Positives = 66/155 (42%), Gaps = 29/155 (18%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
           GC+  G   +N+V G+FH++          H    QP      + TH I  LSFG  LQ 
Sbjct: 144 GCRFEGQFSINKVPGNFHVS---------THSATAQPQNP---DMTHVIHKLSFGDTLQV 191

Query: 264 DDERR--KPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGD 308
            +       L G         +  +Y +KI+PT+YE   G +             +    
Sbjct: 192 RNVHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQQYSYQYTVANKEYVAYSH 251

Query: 309 GG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKI 341
            G  +P I+F Y+LSP+ VK TE+ + L    T +
Sbjct: 252 TGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTV 286



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/108 (25%), Positives = 55/108 (50%), Gaps = 7/108 (6%)

Query: 9   GLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS---SRG 65
           G D + K  +D  + T  G  ++I C +FI +L   ++  +       EL+VD      G
Sbjct: 37  GFDIYRKVPKDLTQPTYTGAIISICCCVFILFLFLSELTGFLTTEVVNELYVDDPDKDSG 96

Query: 66  SKLPIHLDIVVPTISCDYLALDAVDSSGEQHL-HVEHNIYKRRLDLDG 112
            K+ + L+I +P + C+ + LD  D  G   + H+++++   ++ L+G
Sbjct: 97  GKIDVSLNISLPNLHCELVGLDIQDEMGRHEVGHIDNSM---KIPLNG 141


>gi|145347301|ref|XP_001418112.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578340|gb|ABO96405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 534

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/137 (31%), Positives = 66/137 (48%), Gaps = 5/137 (3%)

Query: 7   LKGLDAFT-KPYE--DFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS 63
           L+ LD +   P E   F E+TV GG  TIV  L    L  + V   F  +   ++ VD +
Sbjct: 27  LRRLDMYAHAPPEISGFTERTVGGGLFTIVVSLIFIALFTMQVSALFAATYVTDIVVDHT 86

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRLDLDGKPIQEPQKEV 122
             +KL +++ +  P + C++L LD VD+ G +  ++   N+YK  L    K +       
Sbjct: 87  ADAKLRVNVRVDFPFVECEFLHLDVVDAIGSRKTNISGENVYKHPLSGPMKYMNIQHAAP 146

Query: 123 VNAVKKKKVTTENGTTT 139
           VNA +      E GTTT
Sbjct: 147 VNA-ETLDDAFEYGTTT 162



 Score = 45.1 bits (105), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 38/128 (29%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAP-GLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           T GC + G   VNRV G+F+  P   S+S+             A  + TH +RHLSFG  
Sbjct: 349 TPGCSVNGQFNVNRVPGAFYFVPRSRSHSL-------------ADVDMTHVVRHLSFGEH 395

Query: 261 LQDDD-----ERRK-----PLD--GTVAKAEEGA------------SMFNYYIKIIPTIY 296
           +           RK     P+D  G  AK + G             + F +Y+K+IP  +
Sbjct: 396 VPGKPSFIPRHLRKAWSLIPVDMGGRFAKKDNGGGGAQFDARENRRTAFEHYMKVIPRTF 455

Query: 297 ERLDGSKL 304
             +DG+ +
Sbjct: 456 APIDGAPI 463


>gi|385302753|gb|EIF46868.1| putative copii secretory vesicle component [Dekkera bruxellensis
           AWRI1499]
          Length = 203

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/95 (40%), Positives = 46/95 (48%), Gaps = 10/95 (10%)

Query: 205 CQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDD 264
           C+I+G L VNRV GS +I  G  +    +     QP T    N TH I   SFG      
Sbjct: 81  CRIFGTLPVNRVRGSLYIT-GKGFGSTFLRS---QPQT---LNFTHQITEFSFGDFYPFF 133

Query: 265 DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERL 299
           D    PLD T    EE A  F Y + +IPT YE+L
Sbjct: 134 D---NPLDMTYQVTEENAHTFQYKLSVIPTQYEKL 165


>gi|195402035|ref|XP_002059616.1| GJ14724 [Drosophila virilis]
 gi|194147323|gb|EDW63038.1| GJ14724 [Drosophila virilis]
          Length = 434

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 35/138 (25%), Positives = 60/138 (43%), Gaps = 15/138 (10%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +  +     N TH I  LSFG   Q
Sbjct: 200 DACRLHGTLGINKVAGVLHLVGGAQPVVGMFEDHWMIEFRRMPANFTHRINRLSFG---Q 256

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT------------IYERLDGSKLGGGDGG 310
                 +PL+G      E ++   Y++K++PT             Y   +         G
Sbjct: 257 YSRRIVQPLEGDETIIHEESTTVQYFLKVVPTEIQHTFSTISTFQYAVTENVHSERNSYG 316

Query: 311 MPGIFFSYELSPLMVKIT 328
            PGI+F Y+ S L + ++
Sbjct: 317 SPGIYFKYDWSALKIVVS 334



 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 1/91 (1%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ +     +   D +
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWNETEIIYQFEPDMA 75

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
              ++ +HLDI V         +D +D + +
Sbjct: 76  LDEQVQMHLDITVAMPCASLSGVDLMDETQQ 106


>gi|195042004|ref|XP_001991346.1| GH12601 [Drosophila grimshawi]
 gi|193901104|gb|EDV99970.1| GH12601 [Drosophila grimshawi]
          Length = 434

 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 40/144 (27%), Positives = 63/144 (43%), Gaps = 24/144 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +  +     N TH I  LSFG   Q
Sbjct: 194 DACRLHGTLGINKVAGVLHLVGGAQPVVGMFDDHWMIEFRRMPANFTHRINRLSFG---Q 250

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE------------------RLDGSKL 304
                 +PL+G      E A+   Y+IK++PT  +                  +LD  + 
Sbjct: 251 YSRRIVQPLEGDETTITEEATTVQYFIKVVPTEIQQTFSTVSTFQYAVTENVRKLDSER- 309

Query: 305 GGGDGGMPGIFFSYELSPLMVKIT 328
                G PGI+F Y+ S L V I+
Sbjct: 310 --NSYGSPGIYFKYDWSALKVVIS 331



 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 45/91 (49%), Gaps = 1/91 (1%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ + +   +   D S
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELRYYWSETNIIYQFEPDMS 75

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
              ++ +H+DI V         +D +D + +
Sbjct: 76  LDEQVQMHVDITVAMPCASLSGVDLMDETQQ 106


>gi|119616999|gb|EAW96593.1| ERGIC and golgi 2, isoform CRA_b [Homo sapiens]
          Length = 215

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/230 (23%), Positives = 96/230 (41%), Gaps = 43/230 (18%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           +K LDAF K  E + E +  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 13  VKELDAFPKVPESYVETSASGGTVSLIAFTTMALLTIMEFSVYQDTWMKYEYEVDKDFSS 72

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
           KL I++DI V  + C Y+  D +D +       +  +Y+  +  D  P Q+  + ++  +
Sbjct: 73  KLRINIDITV-AMKCQYVGADVLDLAETMVASADGLVYEPTV-FDLSPQQKEWQRMLQLI 130

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWAL-PELDTI 185
           + +            L++ +                      K A++    AL P  D  
Sbjct: 131 QSR------------LQEEHSLQDVI---------------FKSAFKSTSTALPPREDDS 163

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHV 235
            Q  N              C+I+G+L VN+V+G+FHI  G  + +  +H+
Sbjct: 164 SQSPN-------------ACRIHGHLYVNKVAGNFHITVGQFHILVVMHI 200


>gi|198468706|ref|XP_001354796.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
 gi|198146533|gb|EAL31851.2| GA18088 [Drosophila pseudoobscura pseudoobscura]
          Length = 445

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 24/150 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 203 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG---Q 259

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G  +   E A+   Y++K++PT I++                 +LD  + 
Sbjct: 260 YSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQTFTTINTFQYAVTENVRKLDSER- 318

Query: 305 GGGDGGMPGIFFSYELSPLMVKITEKSKSL 334
                G PGI+F Y+ S L + ++     L
Sbjct: 319 --NSYGSPGIYFKYDWSALKIVVSNDRDHL 346


>gi|194768867|ref|XP_001966532.1| GF22223 [Drosophila ananassae]
 gi|190617296|gb|EDV32820.1| GF22223 [Drosophila ananassae]
          Length = 448

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/181 (23%), Positives = 77/181 (42%), Gaps = 18/181 (9%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 204 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 260

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT---------------IYERLDGSKLGGG 307
                 +PL+G     +E A+   Y++K++PT               + E +        
Sbjct: 261 YSRRIVQPLEGDETIIQEEATTVQYFLKVVPTEIRQTFSTINTFQYSVTENVRKLDSERN 320

Query: 308 DGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKIS 367
             G PGI+F Y+ S L + +      L     ++   ISG  +    +++LL +  +++ 
Sbjct: 321 SYGSPGIYFKYDWSALKIVVDNDRDHLATFVIRLCSIISGIIVISGAINSLLIAIQRRLL 380

Query: 368 K 368
           +
Sbjct: 381 R 381



 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 44/91 (48%), Gaps = 1/91 (1%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ +     +   D S
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWHETDIVYQFQPDMS 75

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
              ++ +H+DI V         +D +D + +
Sbjct: 76  LDDQVQMHVDITVAMPCASLSGVDLMDETQQ 106


>gi|195165324|ref|XP_002023489.1| GL20164 [Drosophila persimilis]
 gi|194105594|gb|EDW27637.1| GL20164 [Drosophila persimilis]
          Length = 445

 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 39/150 (26%), Positives = 66/150 (44%), Gaps = 24/150 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 203 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWVIELRRMPANFTHRINRLSFG---Q 259

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G  +   E A+   Y++K++PT I++                 +LD  + 
Sbjct: 260 YSRRIVQPLEGDESIIHEEATTVQYFLKVVPTEIHQTFTTINTFQYAVTENVRKLDSER- 318

Query: 305 GGGDGGMPGIFFSYELSPLMVKITEKSKSL 334
                G PGI+F Y+ S L + ++     L
Sbjct: 319 --NSYGSPGIYFKYDWSALKIVVSNDRDHL 346



 Score = 37.7 bits (86), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 26/88 (29%), Positives = 45/88 (51%), Gaps = 4/88 (4%)

Query: 8   KGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSSRGS 66
           + LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ +     +   D S   
Sbjct: 19  RNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELHYYWHETDIVYQFEPDISLDE 78

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGE 94
           ++ +H+DI   T++   +AL  VD   E
Sbjct: 79  QVQMHVDI---TVAMPCVALSGVDLMDE 103


>gi|195469521|ref|XP_002099686.1| GE16580 [Drosophila yakuba]
 gi|194187210|gb|EDX00794.1| GE16580 [Drosophila yakuba]
          Length = 430

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G      E A+   Y++K++PT I++                 +LD  + 
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313

Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
                G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIMV 334


>gi|365986066|ref|XP_003669865.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
 gi|343768634|emb|CCD24622.1| hypothetical protein NDAI_0D03080 [Naumovozyma dairenensis CBS 421]
          Length = 353

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 77/377 (20%), Positives = 141/377 (37%), Gaps = 75/377 (19%)

Query: 7   LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
           LK  DAF K  +   +K+  GG  +I+ ++ I ++   +   YF     ++  VD     
Sbjct: 5   LKVFDAFPKIEDQNKKKSTKGGITSILTYVLIIFIAWSEFGSYFGGFVDQQYIVDGMLRE 64

Query: 67  KLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKPIQEPQKEVVNAV 126
            +PI+LD+ V  + C+++ ++  D +      ++     + L  +  P   P    +N  
Sbjct: 65  TVPINLDLYV-NVPCEWVHVNVRDQT------LDRKFASQELKFEEMPFFIPFDVRLND- 116

Query: 127 KKKKVTTENGTTTTELEDPNKCGSCYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIV 186
             + VT E      E        + +  + +TR   +  N  K         LP+ +   
Sbjct: 117 NPEIVTPELDEILGE-----AIPAEFREKLDTRMFFDENNPDKS-------HLPDFN--- 161

Query: 187 QCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAA 245
                            GC I+G + VN+V+G   + A G  Y+       D        
Sbjct: 162 -----------------GCHIFGSVNVNQVAGELQVTAKGHGYA-------DYHRAPLEK 197

Query: 246 FNTTHHIRHLSFGIKLQDDDERRKPLDGTVA-KAEEGASMFNYYIKIIPTIYERLDGSKL 304
            N  H I   SFG      D    PLD +     ++  + + Y   +IP IY ++ G+++
Sbjct: 198 VNFAHVINEFSFGEFFPYID---NPLDNSAKFNMDDPLTAYVYDTSVIPMIYRKM-GAEV 253

Query: 305 GGGDGG------------------MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNIS 346
                                   +PGIFF Y    L + ++++         +++  +S
Sbjct: 254 DTFQYSVAEHQYKSKESSSSNSFRVPGIFFQYNFENLSIVVSDRRLGFIQFIVRLVAILS 313

Query: 347 -GTYIT---FMLVDALL 359
              YI    F+L D  +
Sbjct: 314 FAVYIASWLFILADMFI 330


>gi|18921097|ref|NP_569847.1| CG4293, isoform A [Drosophila melanogaster]
 gi|24638890|ref|NP_726677.1| CG4293, isoform B [Drosophila melanogaster]
 gi|85724768|ref|NP_001033816.1| CG4293, isoform D [Drosophila melanogaster]
 gi|85724770|ref|NP_001033817.1| CG4293, isoform C [Drosophila melanogaster]
 gi|2961397|emb|CAA18090.1| EG:65F1.1 [Drosophila melanogaster]
 gi|7290051|gb|AAF45518.1| CG4293, isoform A [Drosophila melanogaster]
 gi|7290052|gb|AAF45519.1| CG4293, isoform B [Drosophila melanogaster]
 gi|15292011|gb|AAK93274.1| LD35174p [Drosophila melanogaster]
 gi|84798360|gb|ABC67159.1| CG4293, isoform C [Drosophila melanogaster]
 gi|84798361|gb|ABC67160.1| CG4293, isoform D [Drosophila melanogaster]
 gi|220955778|gb|ACL90432.1| CG4293-PA [synthetic construct]
          Length = 441

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 39/143 (27%), Positives = 62/143 (43%), Gaps = 24/143 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT--------IY----------ERLDGSKL 304
                 +PL+G      E A+   Y++K++PT        IY           +LD  + 
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTIYAFQYAVTENVRKLDSER- 313

Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
                G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIIV 334


>gi|195629654|gb|ACG36468.1| hypothetical protein [Zea mays]
          Length = 76

 Score = 53.5 bits (127), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 27/70 (38%), Positives = 44/70 (62%)

Query: 3  FSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDS 62
          F +RLK LDA+ K  EDF++ T++GG VT+V  + +  L   +   YF  +T  +L VD+
Sbjct: 4  FLQRLKRLDAYPKVNEDFYKWTLFGGIVTLVAAVVMLLLFISETRSYFYSATETKLVVDT 63

Query: 63 SRGSKLPIHL 72
          SR  +L +++
Sbjct: 64 SRRERLRVNV 73


>gi|303275141|ref|XP_003056869.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461221|gb|EEH58514.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 604

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 88/210 (41%), Gaps = 54/210 (25%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           T GC I G + VNRV G+F       Y   H   H+I        N TH +RHLSFG  +
Sbjct: 400 TSGCIIEGSVRVNRVPGAF-------YVTAHSKGHNIN---VDVVNMTHVLRHLSFGKTV 449

Query: 262 QDDDE------RR------KPLDG--TVAKAEEGAS------MFNYYIKIIPTIYERLDG 301
                      RR      K + G   VA AEE  +      +  +Y+K++   +E +DG
Sbjct: 450 PGRPSYVPRHMRRVWSKIPKDMGGRFAVAGAEETFASAEPYTVHEHYLKVVSHAFEPIDG 509

Query: 302 S--------------KLGGGDGG--------MPGIFFSYELSPLMVKITEKSKSLGHLWT 339
                          KL     G         P I FSY++SP+ V + E++K +   WT
Sbjct: 510 DAVQLYEYTFNSNRFKLAPAAYGDEDDAHVDGPMIKFSYDVSPMRVVLREETKPVLD-WT 568

Query: 340 KIMCNI-SGTYITFMLVDALLHSCVKKISK 368
             MC +  G Y    L++A + + V  + +
Sbjct: 569 LGMCALMGGVYTCSGLLEAFISNGVSVVKR 598


>gi|194911936|ref|XP_001982403.1| GG12755 [Drosophila erecta]
 gi|190648079|gb|EDV45372.1| GG12755 [Drosophila erecta]
          Length = 441

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G      E A+   Y++K++PT I++                 +LD  + 
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTIQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313

Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
                G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIVV 334



 Score = 38.5 bits (88), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 24/91 (26%), Positives = 45/91 (49%), Gaps = 1/91 (1%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYF-QVSTTEELFVDSS 63
           E  K LDAF K  E + E T  GG ++++  L I YL+  ++  Y+ + +   +   D +
Sbjct: 16  EFAKNLDAFKKVPEKYTETTEIGGTLSLLSRLLIVYLVYTELYYYWHETAIVYQFEPDIA 75

Query: 64  RGSKLPIHLDIVVPTISCDYLALDAVDSSGE 94
              ++ +H+DI V         +D +D + +
Sbjct: 76  LDEQVQMHVDITVAMPCASLSGVDLMDETQQ 106


>gi|444732203|gb|ELW72509.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Tupaia chinensis]
          Length = 250

 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 18/112 (16%)

Query: 234 HVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIP 293
           H H        ++N +H I HLSFG  +        PLDGT   A +   MF Y+I ++P
Sbjct: 114 HAHLAALVNHDSYNFSHRIDHLSFGELVPG---IINPLDGTEKIAVDHNQMFQYFITVVP 170

Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEK 330
           T               + ER        G  G+ GIF  Y+LS LMV +TE+
Sbjct: 171 TKLHTYKISADTHQFSVTERERIINHAAGSHGVSGIFMKYDLSSLMVTVTEE 222


>gi|195564437|ref|XP_002105825.1| GD16474 [Drosophila simulans]
 gi|194203186|gb|EDX16762.1| GD16474 [Drosophila simulans]
          Length = 441

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 198 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 254

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G      E A+   Y++K++PT I++                 +LD  + 
Sbjct: 255 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 313

Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
                G PGI+F Y+ S L + +
Sbjct: 314 --NSYGSPGIYFKYDWSALKIMV 334


>gi|12006037|gb|AAG44724.1|AF267855_1 HT034 [Homo sapiens]
          Length = 199

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 37/138 (26%), Positives = 64/138 (46%), Gaps = 17/138 (12%)

Query: 252 IRHLSFG--IKLQDDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK------ 303
           I  LSFG  +++Q+       L G         +  +Y +KI+PT+YE   G +      
Sbjct: 59  IHKLSFGDTLQVQNIHGAFNALGGADRLTSNPLASHDYILKIVPTVYEDKSGKQRYSYQY 118

Query: 304 -------LGGGDGG--MPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFML 354
                  +     G  +P I+F Y+LSP+ VK TE+ + L    T I   I GT+    +
Sbjct: 119 TVANKEYVAYSHTGRIIPAIWFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGI 178

Query: 355 VDALLHSCVKKISKVEIG 372
           +D+ + +  +   K+++G
Sbjct: 179 LDSCIFTASEAWKKIQLG 196


>gi|195347402|ref|XP_002040242.1| GM19035 [Drosophila sechellia]
 gi|194121670|gb|EDW43713.1| GM19035 [Drosophila sechellia]
          Length = 437

 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 38/143 (26%), Positives = 63/143 (44%), Gaps = 24/143 (16%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           + C+++G L +N+V+G  H+  G    +     H +        N TH I  LSFG   Q
Sbjct: 194 DACRLHGTLGINKVAGVLHLVGGAQPVVGLFEDHWMIELRRMPANFTHRINRLSFG---Q 250

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPT-IYE-----------------RLDGSKL 304
                 +PL+G      E A+   Y++K++PT I++                 +LD  + 
Sbjct: 251 YSGRIVQPLEGDEIVIHEEATTVQYFLKVVPTEIHQTFTTINAFQYAVTENVRKLDSER- 309

Query: 305 GGGDGGMPGIFFSYELSPLMVKI 327
                G PGI+F Y+ S L + +
Sbjct: 310 --NSYGSPGIYFKYDWSALKIMV 330


>gi|378726952|gb|EHY53411.1| hypothetical protein HMPREF1120_01605 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 326

 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/96 (38%), Positives = 48/96 (50%), Gaps = 9/96 (9%)

Query: 203 EGCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           + C+IYG LE N+V G FHI A G  Y    +  H       + FN +HHI  LSFG   
Sbjct: 86  DSCRIYGSLEGNKVQGDFHITARGHGYMEFGMQQH----LDHSRFNFSHHINELSFGPHY 141

Query: 262 QDDDERRKPLDGTVAKAEEGASM-FNYYIKIIPTIY 296
                   PLD T A   +   M + YY+ I+PTI+
Sbjct: 142 PG---LLNPLDKTSAVTTDVHFMRYQYYLSIVPTIF 174


>gi|412994089|emb|CCO14600.1| predicted protein [Bathycoccus prasinos]
          Length = 528

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/110 (24%), Positives = 57/110 (51%), Gaps = 1/110 (0%)

Query: 6   RLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSS-R 64
           + KG+D + K   D  + T  G  ++I+    I +L+  +   Y + +   ++ VD S  
Sbjct: 6   KAKGMDFYRKIPRDMTQGTYLGTILSILATSLIVFLLIAETRAYLKTTFETKVVVDRSVD 65

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDGKP 114
           G  L I+ ++  P +SC++ ++D  D+ G    ++   ++KR +D + +P
Sbjct: 66  GELLRINFNVSFPALSCEFASVDVGDALGLTRYNLTKTVFKRPIDGNFRP 115


>gi|119928709|ref|XP_001256294.1| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like [Bos taurus]
          Length = 144

 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/118 (26%), Positives = 56/118 (47%), Gaps = 15/118 (12%)

Query: 270 PLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK-------------LGGGDGG--MPGI 314
           P   +V +     +  +Y +KI+PT+YE   G +             +     G  +P I
Sbjct: 24  PTPASVRRTFRALASHDYILKIVPTVYEDKSGKQQFSYQYTVANKEYVAYSHTGRIIPAI 83

Query: 315 FFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           +F Y+LSP+ VK TE+ + L    T I   I GT+    ++D+ + +  +   K+++G
Sbjct: 84  WFRYDLSPITVKYTERRQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKKIQLG 141


>gi|157872987|ref|XP_001685013.1| conserved hypothetical protein [Leishmania major strain Friedlin]
 gi|68128084|emb|CAJ08215.1| conserved hypothetical protein [Leishmania major strain Friedlin]
          Length = 341

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 52/245 (21%), Positives = 94/245 (38%), Gaps = 33/245 (13%)

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQCKNEYSTEKLKNTFTEGCQIYGY 210
           C+   TET        + +       + +P    +      Y + ++ +   +GC + G 
Sbjct: 90  CHRIATETVSVFAHDEQTERDTHVSLYHIPYGSYVSNSSAAYISGEVLSGTEDGCLVTGT 149

Query: 211 LEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD---DDER 267
             +     SF+I            + D +   S  +     I H S G    D      R
Sbjct: 150 APIAAKPSSFNII-----------LKDYRVEDSRKYRPDFQIHHFSGGNAYDDWGVPQVR 198

Query: 268 RK---PLDG-TVAKAEEGASMFNYYIKIIPTIYERLDG--SKLG------------GGDG 309
           R+   P+ G   A+A +G   F +++++IPT  + L G  S+ G             G G
Sbjct: 199 RQTLEPMSGLKSARALQGPYFFQFFLQLIPTTVD-LAGKDSRFGYQYTAFHSMLRYNGHG 257

Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKV 369
             PG++FSY+LSP  +    +  ++ H    +   + G Y    +V+A L    +K    
Sbjct: 258 RAPGLYFSYKLSPFSMDCAVQYDTMSHFVVNLCAVVGGVYTVAEMVEAGLEWLARKRRLR 317

Query: 370 EIGGK 374
           E+  +
Sbjct: 318 EVSAR 322


>gi|388497088|gb|AFK36610.1| unknown [Medicago truncatula]
          Length = 457

 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/174 (27%), Positives = 67/174 (38%), Gaps = 36/174 (20%)

Query: 202 TEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKL 261
           T GC++ GY+ V +V GS  ++             D   + ++  N +H I HLSFG K+
Sbjct: 288 TGGCRVEGYVRVKKVPGSLVVSAR----------SDAHSFDASQMNMSHVINHLSFGKKV 337

Query: 262 QD----DDERRKPLDGTVAKAEEGASMFN-----------YYIKIIPTIYERLDGSKL-- 304
                 D +   P  G       G S  N           +YI+++ T      G KL  
Sbjct: 338 TPRAMIDVKHWIPYLGINHDRLNGRSFINTRDLEGNVTIEHYIQVVKTEVITRKGYKLIE 397

Query: 305 ---------GGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTY 349
                          +P   F  ELSP+ V ITE  KS  H  T +   I G +
Sbjct: 398 EYEYTAHSSVAHSVNIPVARFHLELSPMQVLITENQKSFSHFITNVCAIIGGCF 451


>gi|412989304|emb|CCO15895.1| predicted protein [Bathycoccus prasinos]
          Length = 674

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 56/109 (51%), Gaps = 4/109 (3%)

Query: 4   SERLKGLDAFTK--PYEDFH-EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
           ++ LK +D F +    E+F  E +  GG +T++   FI  L+   V   F  S   +L V
Sbjct: 50  TQTLKTVDVFKRNDALEEFSKEGSNKGGVLTLLFAWFIFGLVTSQVQKLFATSMRTDLSV 109

Query: 61  DSSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVE-HNIYKRRL 108
           D      L +  D+  P I+C++L++D VD+ G +  ++   +IYK  +
Sbjct: 110 DHDMDPTLVMQFDVSFPAINCEHLSVDLVDAVGHRAFNLSGESIYKHSM 158


>gi|323445875|gb|EGB02274.1| hypothetical protein AURANDRAFT_69033 [Aureococcus anophagefferens]
          Length = 329

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/103 (24%), Positives = 53/103 (51%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
           +D + K  ++  E +  GG +++     ++  +  ++  + +     ++ VD+  GS+L 
Sbjct: 1   MDFYRKVPDELKEASRTGGLLSLCACGVVALTLVTEIGAFLRTEVRTKIDVDTFAGSQLR 60

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
           ++ ++  P + CDY ++D  D  G    +V  NI K +LD DG
Sbjct: 61  VNFNLSFPHLHCDYASVDLWDKIGRNQANVTQNIEKWQLDEDG 103



 Score = 44.7 bits (104), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 38/70 (54%), Gaps = 10/70 (14%)

Query: 199 NTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           NT   GC + G+L VNRV G+FH+   +++S +H          +   N +H + HLSFG
Sbjct: 265 NTDHPGCLVSGFLLVNRVPGNFHV---MAHSRHH-------SLNTLRTNLSHTVHHLSFG 314

Query: 259 IKLQDDDERR 268
           + L D   R+
Sbjct: 315 VPLTDAQHRK 324


>gi|323449341|gb|EGB05230.1| hypothetical protein AURANDRAFT_72293 [Aureococcus anophagefferens]
          Length = 221

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 33/105 (31%), Positives = 51/105 (48%), Gaps = 22/105 (20%)

Query: 204 GCQIYGYLEVNRVSGSFHI-APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           GC + G++ VNRV G+FHI A  L +++N           +A  N +H + HLSFG  L 
Sbjct: 128 GCMVSGHVLVNRVPGNFHIEARSLHHNLN-----------AAMTNLSHIVNHLSFGTPLA 176

Query: 263 DDDERR----------KPLDGTVAKAEEGASMFNYYIKIIPTIYE 297
            D +R+           PLDG      +     ++Y K++ T +E
Sbjct: 177 RDLQRKVSKYPQFQSAHPLDGGSFINRDYHQAHHHYSKVVSTHFE 221


>gi|159483443|ref|XP_001699770.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
 gi|158281712|gb|EDP07466.1| protein disulfide isomerase [Chlamydomonas reinhardtii]
          Length = 474

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 45/201 (22%), Positives = 77/201 (38%), Gaps = 39/201 (19%)

Query: 202 TEGCQIYGYLEVNRVSGSFH-IAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIK 260
           T GC + G++ V +V G+ H +A    +S +H  +           N TH I     G +
Sbjct: 284 TPGCNLAGFVMVKKVPGTVHFVARSEGHSFDHTWM-----------NMTHMIHSFHVGTR 332

Query: 261 LQDDD----ERRKP----------LDGTVAKAEEGASMFNYYIKIIPTIYERLDGSKLGG 306
                    +R  P          L   +  +E   S   +Y++++ T  E       G 
Sbjct: 333 PSPRKYQQLKRLHPAGLTADWADKLHDQLFVSEHTQSTHEHYLQVVLTTIEPRHSRHTGN 392

Query: 307 GDG-------------GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFM 353
            D               +P   F+Y+LSP+ + + E SK      T     I G +    
Sbjct: 393 YDAYEYTAHSHSYQSDSIPSARFTYDLSPIQILVHETSKPWYQFLTTSCAIIGGVFTVAG 452

Query: 354 LVDALLHSCVKKISKVEIGGK 374
           ++DALL+   K + K+ +G +
Sbjct: 453 ILDALLYQSFKVVKKLNLGKQ 473


>gi|428175103|gb|EKX43995.1| hypothetical protein GUITHDRAFT_159761 [Guillardia theta CCMP2712]
          Length = 475

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/117 (25%), Positives = 55/117 (47%), Gaps = 7/117 (5%)

Query: 3   FSERLKGLDAFTKPYEDFH----EKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEEL 58
           F + LK +D + K   D      E +V G A++I+  + +  L+  ++  Y  V +   +
Sbjct: 4   FLQGLKSVDFYRKLKRDLQQELTEASVSGAALSIIAAVIMIGLVAAELTAYLTVQSESRV 63

Query: 59  FVD---SSRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRLDLDG 112
            +D   SS    L ++ +   P + CDY ++DA +  G     +   + K RLD +G
Sbjct: 64  VLDHFESSSDDTLQVNFNFTFPHLKCDYASVDATNFMGTHDAGLAARVSKIRLDKNG 120


>gi|388493200|gb|AFK34666.1| unknown [Medicago truncatula]
          Length = 106

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/106 (31%), Positives = 50/106 (47%), Gaps = 20/106 (18%)

Query: 284 MFNYYIKIIPTIY----------------ERLDGSKLGGGDGGMPGIFFSYELSPLMVKI 327
           M  Y+IK++PT+Y                E    S+LG     +PG+FF Y++SP+ V  
Sbjct: 1   MCQYFIKVVPTVYTDIRGRVIHSNQYSVTEHFKSSELGAA---VPGVFFFYDISPIKVNF 57

Query: 328 TEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKI-SKVEIG 372
            E+     H  T I   I G +    +VD+ ++   K I  K+EIG
Sbjct: 58  KEEHIPFLHFLTNICAIIGGIFTIAGIVDSSIYYGQKTIKKKMEIG 103


>gi|32566449|ref|NP_510494.2| Protein C18B12.6 [Caenorhabditis elegans]
 gi|25809204|emb|CAA20929.2| Protein C18B12.6 [Caenorhabditis elegans]
          Length = 428

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 43/167 (25%), Positives = 71/167 (42%), Gaps = 23/167 (13%)

Query: 215 RVSGSFHIAPGLSYSI-----NHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQDDDERRK 269
           R+ G F +  G    I     N + + D Q   S   N +H I   +FG ++        
Sbjct: 227 RLHGKFKVRKGKEEKIVMSISNPMMMFDHQEKQSG--NISHRIEKFNFGPRIPG---LVT 281

Query: 270 PLDGTVAKAEEGASMFNYYIKIIPT-IYERLD------------GSKLGGGDGGMPGIFF 316
           PL G    +E G  ++ Y+IKI+PT IY                  +L  G+    GI F
Sbjct: 282 PLAGAEHISESGQDIYRYFIKIVPTKIYGYFSYTMAYQYSVTFLKKQLKEGEHSHGGILF 341

Query: 317 SYELSPLMVKITEKSKSLGHLWTKIMCNISGTYITFMLVDALLHSCV 363
            YE +  ++++ + S +L     +I   + G Y T  +V+ +L  C+
Sbjct: 342 EYEFTANVIEVHKTSITLISYLIRICSILGGVYATSTIVNNILQFCL 388


>gi|444706692|gb|ELW48018.1| Endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Tupaia chinensis]
          Length = 821

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 51/102 (50%), Gaps = 15/102 (14%)

Query: 286 NYYIKIIPTIYERLDGSK-------------LGGGDGG--MPGIFFSYELSPLMVKITEK 330
           +Y +KI+PT+YE   G +             +     G  +P I+F Y+LSP+ VK TE+
Sbjct: 717 DYILKIVPTVYEDKSGKQRYSYQYTVANKEYVAYSHTGRIIPAIWFRYDLSPITVKYTER 776

Query: 331 SKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
            + L    T I   I GT+    ++D+ + +  +   KV++G
Sbjct: 777 RQPLYRFITTICAIIGGTFTVAGILDSCIFTASEAWKKVQLG 818


>gi|390370794|ref|XP_001186477.2| PREDICTED: endoplasmic reticulum-Golgi intermediate compartment
           protein 1-like, partial [Strongylocentrotus purpuratus]
          Length = 221

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 31/113 (27%), Positives = 53/113 (46%), Gaps = 13/113 (11%)

Query: 193 STEKLKNTFTEGCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHI 252
           +T+K+      GC  Y    +N+V G+FH++            H +      + +  H I
Sbjct: 101 NTKKIPLNNGLGCLFYSAFTINKVPGNFHVS-----------THAVGMNQPQSTDFAHII 149

Query: 253 RHLSFGIKLQDD--DERRKPLDGTVAKAEEGASMFNYYIKIIPTIYERLDGSK 303
             +SFG  +Q+        PL+G   +  +     +YY+KI+PT+YE L G+K
Sbjct: 150 HEVSFGDDIQNKTLGASFNPLEGRDKRDSKSDLSHDYYMKIVPTVYEDLWGTK 202



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 26/96 (27%), Positives = 48/96 (50%), Gaps = 3/96 (3%)

Query: 1  MVFSERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFV 60
          MVF  R   LD + K  +D  + T  G  V+++  LFI++L+  +   + +     EL+V
Sbjct: 1  MVFDFR--RLDVYRKIPKDLTQPTYAGACVSLLSMLFITFLLLSEFMSFIRPEVVSELYV 58

Query: 61 DS-SRGSKLPIHLDIVVPTISCDYLALDAVDSSGEQ 95
          D+     +L + +++ +P + C  + LD  D  G  
Sbjct: 59 DNPGEIERLTVRVNLSLPKLHCGVVGLDIQDDMGRH 94


>gi|50545267|ref|XP_500171.1| YALI0A17600p [Yarrowia lipolytica]
 gi|49646036|emb|CAG84103.1| YALI0A17600p [Yarrowia lipolytica CLIB122]
          Length = 337

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 62/158 (39%), Gaps = 25/158 (15%)

Query: 205 CQIYGYLEVNRVSGSFHI--APGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQ 262
           C+I G + +N V G+  I   P   Y IN +   D         N TH I  LSFG    
Sbjct: 151 CRISGSVPINHVEGALQIFNLPDNQYFINPMKASD-------GLNLTHAIHELSFGDYF- 202

Query: 263 DDDERRKPLDGTVAKAEEGASMFNYYIKIIPTIYE-------------RLDGSKLGGGDG 309
              +   PLDG     +E    + Y++  +P  Y              +   + L     
Sbjct: 203 --PKVLNPLDGVSTVTDEPLMSYQYFLSAVPVEYSSGRKKIHTYQYAVKKQTTNLQEHFV 260

Query: 310 GMPGIFFSYELSPLMVKITEKSKSLGHLWTKIMCNISG 347
             P IFF Y+  P+ +KI +  ++L     K++  + G
Sbjct: 261 TRPAIFFHYKYEPVTLKIQDSRETLTVFVVKLLSILGG 298


>gi|339244785|ref|XP_003378318.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Trichinella spiralis]
 gi|316972786|gb|EFV56437.1| endoplasmic reticulum-Golgi intermediate compartment protein 1
           [Trichinella spiralis]
          Length = 334

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/103 (26%), Positives = 49/103 (47%), Gaps = 15/103 (14%)

Query: 285 FNYYIKIIPTIYERLDGS---------------KLGGGDGGMPGIFFSYELSPLMVKITE 329
           ++Y +KI+PT+YE + G+               ++       P ++F Y+ +P+ VK  E
Sbjct: 155 YDYILKIVPTVYENIAGNMKHAYQYTYARKTYIEMSFTGQTNPTLWFRYDFTPITVKYHE 214

Query: 330 KSKSLGHLWTKIMCNISGTYITFMLVDALLHSCVKKISKVEIG 372
           + + L    T I   I GT+    L+D+   +  +   KVE+G
Sbjct: 215 RRQPLYIFLTSICAIIGGTFTVAGLIDSFFFTASQLYKKVELG 257


>gi|145544034|ref|XP_001457702.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124425520|emb|CAK90305.1| unnamed protein product [Paramecium tetraurelia]
          Length = 463

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 30/98 (30%), Positives = 47/98 (47%), Gaps = 3/98 (3%)

Query: 93  GEQHLHVEHNIYKR-RLDLDGKPIQEPQKEVVNAVKKKKVTTENGTTTTELEDPN-KCGS 150
           G+  L V+  I  R R++LD   IQ P + +   ++         T    +  PN    S
Sbjct: 44  GKISLLVDSTIDSRIRVNLDAT-IQAPCQALFQHIRYDGFLFIRSTFEEAIFKPNVNFTS 102

Query: 151 CYGAETETRKCCNTCNEVKEAYRYKKWALPELDTIVQC 188
           CYGAE    + C +C +V  A+  ++W  P  ++IVQC
Sbjct: 103 CYGAELIVDQRCYSCQDVMMAFAQRRWTQPNFESIVQC 140


>gi|223646904|gb|ACN10210.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
 gi|223672767|gb|ACN12565.1| Endoplasmic reticulum-Golgi intermediate compartment protein 2
           [Salmo salar]
          Length = 238

 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 39/71 (54%), Gaps = 3/71 (4%)

Query: 204 GCQIYGYLEVNRVSGSFHIAPGLSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFGIKLQD 263
            C+I+G+L VN+V+G+FHI  G +      H H     +   +N +H I HLSFG   ++
Sbjct: 169 ACRIHGHLYVNKVAGNFHITVGKAIPHPRGHAHLAALVSHDTYNFSHRIDHLSFG---EE 225

Query: 264 DDERRKPLDGT 274
                 PLDGT
Sbjct: 226 IPGIINPLDGT 236



 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 7  LKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGS 66
          +K LDAF K  E + E T  GG V+++ +  ++ L  ++   Y       E  VD    S
Sbjct: 14 VKELDAFPKVPESYVETTATGGTVSLIAFTAMALLAFLEFFVYRDTWMQYEYEVDKDFSS 73

Query: 67 KLPIHLDIVVPTISCDYLALDAVD 90
          KL I++DI V  + C ++  D +D
Sbjct: 74 KLRINIDITV-AMRCQFVGADVLD 96


>gi|312374049|gb|EFR21698.1| hypothetical protein AND_16520 [Anopheles darlingi]
          Length = 252

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 29/91 (31%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 5   ERLKGLDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSR 64
           E +  LDAF K  E+F E T  GG ++++  L I +LI  +V  Y           D+  
Sbjct: 12  EAVSQLDAFPKVKEEFVEATRVGGTLSLISRLVIIFLIYHEVTYYLDSRLVFTFKPDTDL 71

Query: 65  GSKLPIHLDIVVPTISCDYLALDAVDSSGEQ 95
            SKL +H+D+ V  + C  +  D +DS+ + 
Sbjct: 72  HSKLKVHIDLTV-AMPCKSIGADILDSTNQN 101



 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 36/58 (62%), Gaps = 4/58 (6%)

Query: 203 EGCQIYGYLEVNRVSGSFHIAPG--LSYSINHVHVHDIQPYTSAAFNTTHHIRHLSFG 258
           + C+I+G L +N+V+G+FHI  G  + ++  H+H++ I  + +   N +H I   SFG
Sbjct: 169 DACRIHGVLTLNKVAGNFHITVGKTIHFARGHIHLNSI--FANTQTNFSHRINRFSFG 224


>gi|325185550|emb|CCA20033.1| thioredoxinlike protein putative [Albugo laibachii Nc14]
          Length = 503

 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 47/217 (21%), Positives = 80/217 (36%), Gaps = 42/217 (19%)

Query: 186 VQCKNEYSTEKLKNTFTEGCQIYGYLEVNRVSGSFHI---APGLSYSINHVHVHDIQPYT 242
           +   N     KL     EGC++ G L VNRV         +  LS+ +  +         
Sbjct: 299 LNANNPEKNVKLPVGSVEGCEVSGSLNVNRVPSRLVFTARSKDLSFDLRGI--------- 349

Query: 243 SAAFNTTHHIRHLSFGIKLQDDDERRK---------PLDGTVAKAEEGASMFNYYIKIIP 293
               N TH + HLSFG   +    +           PLDG   + E       +++ +I 
Sbjct: 350 ----NVTHVVHHLSFGQVTRKQSTKSTQLSMSFDHFPLDGKTFRTENENITVEHFLSVIG 405

Query: 294 T---------------IYERLDGSKLGGGDGGMPGIFFSYELSPLMVKITEKSKSLGHLW 338
                            Y+ +  S        +P   F++++SPL+++++  S       
Sbjct: 406 VDHMEAKSKHMGLVERTYQIVARSNQYNATDMLPAALFTFDISPLVIQMSSDSTPFYRFL 465

Query: 339 TKIMCNISGTYITFM-LVDALLHSCVKKISKVEIGGK 374
           T  +C I G  +T +  VDA  +  +  I +    GK
Sbjct: 466 TS-LCAIVGGMVTIIGFVDAGAYHAMNSIKRKRQLGK 501



 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/99 (24%), Positives = 46/99 (46%)

Query: 10  LDAFTKPYEDFHEKTVYGGAVTIVCWLFISYLICVDVCDYFQVSTTEELFVDSSRGSKLP 69
            D F K  E   E++  G   T++  +   YLI V+   Y   S    + +D  +  +L 
Sbjct: 11  FDLFRKVPEHLSERSSLGTVFTVLTLVLSVYLITVNFRSYQDTSIHSIVVMDDHQEDQLR 70

Query: 70  IHLDIVVPTISCDYLALDAVDSSGEQHLHVEHNIYKRRL 108
           I+ +I +  I C + ++D  D  G Q +++  ++   +L
Sbjct: 71  INFNISLLAIPCQFASVDVSDYIGMQLINITRHLRHFQL 109


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,192,350,207
Number of Sequences: 23463169
Number of extensions: 267442456
Number of successful extensions: 589453
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 795
Number of HSP's successfully gapped in prelim test: 284
Number of HSP's that attempted gapping in prelim test: 585261
Number of HSP's gapped (non-prelim): 1757
length of query: 379
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 235
effective length of database: 8,980,499,031
effective search space: 2110417272285
effective search space used: 2110417272285
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)