BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 020832
         (321 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224109688|ref|XP_002315277.1| predicted protein [Populus trichocarpa]
 gi|222864317|gb|EEF01448.1| predicted protein [Populus trichocarpa]
          Length = 322

 Score =  515 bits (1326), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 236/321 (73%), Positives = 286/321 (89%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSR+RP+R  LPPA+ENI KLE ++NEGN+YGAQQMYKSISARYV+AQR+SEALD+LH+G
Sbjct: 1   MSRERPRRATLPPAEENIKKLENVINEGNYYGAQQMYKSISARYVSAQRHSEALDILHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           ACLQLKN Q+TCG+ELAV +VE LVK KV YD++ LD +RKIY+ FPQIP+P +L +D+D
Sbjct: 61  ACLQLKNGQVTCGSELAVIYVETLVKAKVPYDDDVLDCIRKIYKTFPQIPLPQDLGEDED 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L EALGAAK+RV+ C SFLKAAIKWS EFGA + GSPELHVMLAEY+YSESPELDM 
Sbjct: 121 MQQLNEALGAAKIRVDCCLSFLKAAIKWSAEFGAHRNGSPELHVMLAEYVYSESPELDMT 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           RVS+HFVRGNNP+KFAST+VNFMGKCYPGEDDLA+ARAILMYL++GN++DAN++MDE+KK
Sbjct: 181 RVSYHFVRGNNPKKFASTLVNFMGKCYPGEDDLAIARAILMYLAMGNLRDANFLMDELKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
             + K+  L  SDLIQFV ++L TL+RDALPLFNMLR  YK+SI+REPAFNE LD+IAE 
Sbjct: 241 HAQYKEHDLHRSDLIQFVNHLLPTLQRDALPLFNMLRTKYKSSIDREPAFNERLDEIAEL 300

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           F+GV+RRNP+QG+FGDIFKMM
Sbjct: 301 FYGVQRRNPLQGMFGDIFKMM 321


>gi|225449158|ref|XP_002278274.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Vitis vinifera]
          Length = 322

 Score =  511 bits (1315), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 237/321 (73%), Positives = 282/321 (87%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           M RQR ++  LPPAQE+IDKLEK VN+G++YGAQQMYKSISARY +A+R  EALD+L +G
Sbjct: 1   MLRQRSRKGPLPPAQEHIDKLEKTVNDGDYYGAQQMYKSISARYASAERCYEALDILESG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           AC+QL+  Q+TCGAELA+ FVE LVKGK  YD+  LDRVRKIY+ FPQI VP  LEDDDD
Sbjct: 61  ACIQLEKGQVTCGAELAILFVETLVKGKFPYDDNTLDRVRKIYKNFPQISVPQQLEDDDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L+EALGAAK RVE CSSFLKAA+KWS EFG  + GSPE+H MLAEYLYSESPELDM+
Sbjct: 121 MQKLSEALGAAKTRVEVCSSFLKAAMKWSAEFGFHRQGSPEIHDMLAEYLYSESPELDMS 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           R+S HFVRGNNPEKFAST+VNFMGKCYPGEDDLA+ARA+LMYLSLGN++DANY+MDEVKK
Sbjct: 181 RISLHFVRGNNPEKFASTLVNFMGKCYPGEDDLAIARAVLMYLSLGNLRDANYLMDEVKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
           QVE+K+L    SDL +F+ Y+L TL+RDALPLFNMLR +YK+SI+REPAFNE+LD+IAEK
Sbjct: 241 QVESKELDYPESDLTEFIDYLLLTLQRDALPLFNMLRQSYKSSIDREPAFNELLDEIAEK 300

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           F+GV+RRNPMQG+FGD FK+M
Sbjct: 301 FYGVRRRNPMQGMFGDFFKLM 321


>gi|255565818|ref|XP_002523898.1| conserved hypothetical protein [Ricinus communis]
 gi|223536828|gb|EEF38467.1| conserved hypothetical protein [Ricinus communis]
          Length = 324

 Score =  510 bits (1314), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 233/324 (71%), Positives = 287/324 (88%), Gaps = 3/324 (0%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSR+RP+R  LPP QENI+KLE ++NEGN+YGAQQMYKSISARY +AQRY EALDLLH+G
Sbjct: 1   MSRERPRRANLPPVQENIEKLENVINEGNYYGAQQMYKSISARYSSAQRYFEALDLLHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLED--- 117
           ACLQLKN Q+TCG+ELAV FVE LVKGKV YD+  LDRVR+IY++FPQIP+P  L+D   
Sbjct: 61  ACLQLKNGQVTCGSELAVLFVETLVKGKVPYDDNTLDRVREIYKMFPQIPLPQRLDDFGD 120

Query: 118 DDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL 177
           D+++ +LTEA+GAAK RVE CSSFL+AAI+WS EFGA K GS +LH MLAEYL+SESPE+
Sbjct: 121 DEEVQQLTEAIGAAKTRVECCSSFLRAAIRWSAEFGASKNGSLQLHAMLAEYLFSESPEV 180

Query: 178 DMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE 237
           DM R+S+HFVRG++P +FAST+VNFMGKCYPGEDDLA+ARAIL+YL+LGN++DAN +MDE
Sbjct: 181 DMTRISYHFVRGDDPMQFASTLVNFMGKCYPGEDDLAIARAILLYLALGNLRDANRLMDE 240

Query: 238 VKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
           VKKQVE+K+L    SDLIQF+ Y+L TL+RDA PLFNMLRA+YK+SI+REP FNE+LD+I
Sbjct: 241 VKKQVESKKLDFPKSDLIQFITYLLPTLQRDAFPLFNMLRASYKSSIDREPVFNELLDEI 300

Query: 298 AEKFFGVKRRNPMQGIFGDIFKMM 321
           AEK +G++RRNP+QG+FGDIFKM+
Sbjct: 301 AEKLYGIQRRNPLQGMFGDIFKMI 324


>gi|356576185|ref|XP_003556214.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Glycine max]
          Length = 323

 Score =  503 bits (1295), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 228/321 (71%), Positives = 284/321 (88%), Gaps = 1/321 (0%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSRQR +R  LPPAQENI+KLEK+VN+GN+YGAQQMYKS+SARYV+AQRYSEALD+LH+G
Sbjct: 1   MSRQRSRRAELPPAQENIEKLEKVVNDGNYYGAQQMYKSVSARYVSAQRYSEALDILHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           AC+QL + Q+TCGAELA+ FVE L KGK+ YD E L+R++KIY+ FP++ +P NL D DD
Sbjct: 61  ACIQLSHGQVTCGAELALLFVETLGKGKIPYDEEILERLKKIYKSFPRVSLPQNLWDVDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L+E +G+AK RVEGCSSFLKAAIKWS E GA   GSPELH+MLAEY++SESPE+DMA
Sbjct: 121 MQQLSENIGSAKTRVEGCSSFLKAAIKWSAEHGANNNGSPELHIMLAEYIFSESPEVDMA 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           +V++HFVRGNNP+KFAST+V+F+GKCYPGEDDLA+ARA+L YLS GN+KDAN +M+E+KK
Sbjct: 181 KVTYHFVRGNNPKKFASTLVSFLGKCYPGEDDLAIARAVLRYLSSGNLKDANILMEEIKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
           Q E+ ++    ++L+QF+ Y+LQTLERDALPLFNMLR N+K SIEREPAFNEMLDDIAE+
Sbjct: 241 QTESAEVAFPKTELMQFITYLLQTLERDALPLFNMLRTNFKPSIEREPAFNEMLDDIAER 300

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           FFGV+RRNPM G+FGDIFK+M
Sbjct: 301 FFGVQRRNPM-GMFGDIFKLM 320


>gi|357443413|ref|XP_003591984.1| hypothetical protein MTR_1g095910 [Medicago truncatula]
 gi|355481032|gb|AES62235.1| hypothetical protein MTR_1g095910 [Medicago truncatula]
 gi|388501440|gb|AFK38786.1| unknown [Medicago truncatula]
          Length = 323

 Score =  495 bits (1274), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 223/321 (69%), Positives = 283/321 (88%), Gaps = 1/321 (0%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           M+RQR KR  LPPA EN++KLEK+V +GN YGAQQMYKSISARYV AQRY+EALD+LH+G
Sbjct: 1   MARQRSKRFELPPADENVEKLEKVVKDGNCYGAQQMYKSISARYVTAQRYTEALDILHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           AC+QL + Q+TCGAELA+ FVE LVKGK+ YD+  LDR++KIYE FP++P+P +L D DD
Sbjct: 61  ACIQLAHGQVTCGAELALLFVETLVKGKIPYDDGTLDRLKKIYEGFPRVPLPQHLWDVDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L+E LG+AK RVEGCSSFLKAAIKWS EFG   +G+PELH+MLAEY++SESPE+DM 
Sbjct: 121 MQQLSENLGSAKTRVEGCSSFLKAAIKWSAEFGTSGSGAPELHIMLAEYIFSESPEVDMN 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           RV++HFVRGN+ +KFAST+VNF+GKCYPGEDDLA+ARA+L YLSLGN+KDAN ++DE+KK
Sbjct: 181 RVTYHFVRGNDTQKFASTLVNFLGKCYPGEDDLAIARAVLRYLSLGNLKDANILVDEIKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
           Q E+ +++   +DL+QF+ ++LQT+ERDA PLFNMLRANYK SIEREP+FNE+LD+IA+K
Sbjct: 241 QTESTEVEFPKTDLMQFLNFLLQTMERDAFPLFNMLRANYKPSIEREPSFNELLDEIAQK 300

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           F+GV+RRNPM G+FGDIFK+M
Sbjct: 301 FYGVQRRNPM-GMFGDIFKLM 320


>gi|358249304|ref|NP_001239772.1| uncharacterized protein LOC100780059 [Glycine max]
 gi|255636423|gb|ACU18550.1| unknown [Glycine max]
          Length = 323

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 225/321 (70%), Positives = 281/321 (87%), Gaps = 1/321 (0%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSRQR +R  LPPAQENI+KLEK+VN+GN+YGAQQMYKS+SARYV+AQRY EALD+LH+G
Sbjct: 1   MSRQRSRRVELPPAQENIEKLEKVVNDGNYYGAQQMYKSVSARYVSAQRYCEALDILHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           AC+QL + Q+TCGAELA+ FVE L KGK+ Y+ E L+R+ KIY+ FP++ +P NL D DD
Sbjct: 61  ACIQLSHGQVTCGAELALLFVETLGKGKIPYNEEILERLDKIYKSFPRVSLPQNLWDVDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L+E +G+AK RVEGCSSFLKAAIKWS   G  K GSPELH+MLAEY++SESPE+DMA
Sbjct: 121 MQQLSENIGSAKTRVEGCSSFLKAAIKWSAGNGVNKNGSPELHIMLAEYIFSESPEVDMA 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           +V+ HFVRGNNP+KFAST+V+F+GKCYPGEDDLA+ARA+L YLS GN+KDAN +M+E+KK
Sbjct: 181 KVTHHFVRGNNPKKFASTLVSFLGKCYPGEDDLAIARAVLRYLSSGNLKDANILMEEIKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
           Q E+ ++    ++L+QF+ Y+LQTLERDALPLF+MLRAN+K SIEREPAFNEMLDDIAE+
Sbjct: 241 QTESAEVAFPKTELMQFIAYLLQTLERDALPLFSMLRANFKPSIEREPAFNEMLDDIAER 300

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           FFGV+RRNPM G+FGDIFK+M
Sbjct: 301 FFGVQRRNPM-GMFGDIFKLM 320


>gi|30697809|ref|NP_201127.2| uncharacterized protein [Arabidopsis thaliana]
 gi|48958523|gb|AAT47814.1| At5g63220 [Arabidopsis thaliana]
 gi|51970596|dbj|BAD43990.1| putative protein [Arabidopsis thaliana]
 gi|332010336|gb|AED97719.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 324

 Score =  457 bits (1175), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 210/324 (64%), Positives = 269/324 (83%), Gaps = 4/324 (1%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSR+R KR  LPP QE+IDKL K++ EGN+YGA QMYKSISARYV AQR+SEALD+L +G
Sbjct: 1   MSRERIKRE-LPPVQEHIDKLRKVIEEGNYYGALQMYKSISARYVTAQRFSEALDILFSG 59

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL---ED 117
           AC++L++  + CGA+LA+ FV+ LVK K   ++E LDR+R I++LFP++PVP +L    D
Sbjct: 60  ACIELEHGLVNCGADLAILFVDTLVKAKSPCNDETLDRIRCIFKLFPRVPVPPHLVDVSD 119

Query: 118 DDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL 177
           D+D+  L E+LG A+ RVE  +SFL+AAIKWS EFG P+TG PELH ML +YLY+E PEL
Sbjct: 120 DEDVQNLQESLGEARSRVENLTSFLRAAIKWSAEFGGPRTGYPELHAMLGDYLYTECPEL 179

Query: 178 DMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE 237
           DM R+S HFVR  +PEKFAS +VNFMG+CYPGEDDLA+ARA+LMYLS+GNMKDAN++MDE
Sbjct: 180 DMVRISRHFVRAEDPEKFASMLVNFMGRCYPGEDDLAIARAVLMYLSMGNMKDANFMMDE 239

Query: 238 VKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
           +KKQ E K  +L  SDLIQF+ Y+L+TL+RDALPLFNMLR  YK+SI+R+   NE+LD+I
Sbjct: 240 IKKQAETKNPELSESDLIQFISYLLETLQRDALPLFNMLRVKYKSSIDRDQLLNELLDEI 299

Query: 298 AEKFFGVKRRNPMQGIFGDIFKMM 321
           AE+F+GV+R+NP+QG+FGDIFKMM
Sbjct: 300 AERFYGVQRKNPLQGMFGDIFKMM 323


>gi|297797313|ref|XP_002866541.1| hypothetical protein ARALYDRAFT_496506 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297312376|gb|EFH42800.1| hypothetical protein ARALYDRAFT_496506 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 324

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 211/324 (65%), Positives = 269/324 (83%), Gaps = 4/324 (1%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSR+R KR  LPP QE+IDKL K+V EGN+YGA QMYKSISARYV AQR+SEALD+L +G
Sbjct: 1   MSRERIKRE-LPPVQEHIDKLRKVVEEGNYYGALQMYKSISARYVTAQRFSEALDILFSG 59

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL---ED 117
           AC++L++  + CGA+LA+ FV+ LVK K   ++E LDR+R I++LFP++PVP +L    D
Sbjct: 60  ACIELEHGLVNCGADLAILFVDTLVKAKSPCNDEALDRIRCIFKLFPRVPVPPHLVDVSD 119

Query: 118 DDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL 177
           D+D+  L E+LG A+ RVE  +SFL+AAIKWS EFG P+TG PELH ML +YLY+E PEL
Sbjct: 120 DEDVQNLQESLGEARSRVENLTSFLRAAIKWSAEFGGPRTGYPELHAMLGDYLYTECPEL 179

Query: 178 DMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE 237
           DM R+S HFVR  +PEKFAS +VNFMG+CYPGEDDLA+ARA+LMYLS+GNMKDAN++MDE
Sbjct: 180 DMVRISRHFVRAEDPEKFASMLVNFMGRCYPGEDDLAIARAVLMYLSMGNMKDANFMMDE 239

Query: 238 VKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
           +KKQ E K  +L  SDLIQF+ Y+L+TL+RDALPLFNMLR  YK+SI+R+   NE+LD+I
Sbjct: 240 IKKQAETKYPELSESDLIQFISYLLETLQRDALPLFNMLRVKYKSSIDRDQLLNELLDEI 299

Query: 298 AEKFFGVKRRNPMQGIFGDIFKMM 321
           AE+F+GV+R+NP+QG+FGDIFKMM
Sbjct: 300 AERFYGVQRKNPLQGMFGDIFKMM 323


>gi|224100845|ref|XP_002312037.1| predicted protein [Populus trichocarpa]
 gi|222851857|gb|EEE89404.1| predicted protein [Populus trichocarpa]
          Length = 282

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 208/282 (73%), Positives = 249/282 (88%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSRQRP +  LPPA+ NI+KLE ++NEGN+YGAQQMYKSISARYV+AQR+SEALD+LH+G
Sbjct: 1   MSRQRPGKAPLPPAEVNIEKLENVINEGNYYGAQQMYKSISARYVSAQRHSEALDILHSG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           ACLQLKN+Q+TCG+ELAV FVE LVK KV YD++ LDR+RKIY+ FPQIP+P +L +DDD
Sbjct: 61  ACLQLKNSQVTCGSELAVIFVETLVKAKVPYDDDVLDRIRKIYKTFPQIPLPQDLGEDDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMA 180
           + +L EALGAAK RVE C SFLKAAIKWS EFGA + GSPE+H MLAEY+YSESPELDM 
Sbjct: 121 MQQLNEALGAAKTRVECCLSFLKAAIKWSAEFGAHRNGSPEIHFMLAEYVYSESPELDMT 180

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
           RVS+HFVRGNNP+KFASTIVNFMGKCYPGE+DLA+ARAILMYLS+GN++DAN++MDE+KK
Sbjct: 181 RVSYHFVRGNNPKKFASTIVNFMGKCYPGEEDLAIARAILMYLSMGNLRDANFLMDELKK 240

Query: 241 QVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKA 282
             + K+  L  SDLIQF+ Y+L TL+RDALPLFNMLR NYK+
Sbjct: 241 HAQYKEHDLPQSDLIQFINYLLPTLQRDALPLFNMLRTNYKS 282


>gi|147819146|emb|CAN78083.1| hypothetical protein VITISV_021302 [Vitis vinifera]
          Length = 391

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 206/285 (72%), Positives = 244/285 (85%), Gaps = 8/285 (2%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           ++IDKLEK VN+G++YGAQQMYKSISARY +A+RY EALD+L +GAC+QL+  Q+TCGAE
Sbjct: 61  KHIDKLEKTVNDGDYYGAQQMYKSISARYASAERYYEALDILESGACIQLEKGQVTCGAE 120

Query: 76  LAVSFVEALVKGKVAYDNENLD--------RVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
           LA+ FVE LVKGK  YD+  LD        RVRKIY+ FPQI VP  LEDDDD+ +L+EA
Sbjct: 121 LAILFVETLVKGKFPYDDNTLDYDKIFMLDRVRKIYKNFPQISVPQQLEDDDDMQKLSEA 180

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
           LGAAK RVE CSSFLKAA+KWS EFG  + GSPE+H MLAEYLYSESPELDM+R+S HFV
Sbjct: 181 LGAAKTRVEVCSSFLKAAMKWSAEFGFHRQGSPEIHDMLAEYLYSESPELDMSRISLHFV 240

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQL 247
           RGNNPEKFAST+VNFMGKCYPGEDDLA+ARA+LMYLSLGN++DANY+MDEVKKQVE+K+L
Sbjct: 241 RGNNPEKFASTLVNFMGKCYPGEDDLAIARAVLMYLSLGNLRDANYLMDEVKKQVESKEL 300

Query: 248 QLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNE 292
               SDL +F+ Y+L TL+RDALPLFNMLR +YK+SI+REPAFNE
Sbjct: 301 DYPESDLTEFIDYLLLTLQRDALPLFNMLRQSYKSSIDREPAFNE 345


>gi|357134948|ref|XP_003569076.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Brachypodium
           distachyon]
          Length = 326

 Score =  419 bits (1078), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 190/312 (60%), Positives = 253/312 (81%), Gaps = 2/312 (0%)

Query: 11  LPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL 70
           LPP Q+ ID+LE +V  GN+Y AQQMYKS S+RY+  QRYSEALD+L +GA +QLK+ Q+
Sbjct: 15  LPPPQKTIDRLENMVEGGNYYEAQQMYKSTSSRYITCQRYSEALDILQSGATVQLKHVQV 74

Query: 71  TCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELTEAL 128
           TCGAELAV FV+ LVKG+  Y+ E   R++K+YE FP+I +PH L DD  DD  +L+EA+
Sbjct: 75  TCGAELAVLFVDTLVKGQFPYNEETFGRIKKMYEAFPRIHMPHFLGDDYDDDGQKLSEAM 134

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
            +AK+R EGCSSFLKAA++WS EFG  + GSPELHVMLAEY+YSESPE DM +VS HFVR
Sbjct: 135 SSAKVRAEGCSSFLKAALRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSHFVR 194

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           GN+P+KFA+ +VNFMGKCYPGEDD A+AR++LMYLS GN++DAN +MDE+K+ +++  L+
Sbjct: 195 GNDPKKFAAMLVNFMGKCYPGEDDTAMARSVLMYLSQGNLRDANLLMDEMKELLKSADLE 254

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
              +DLIQFV Y+L TLERDA PLF  LR  YK S +R+P F+E+LD+IA  F+G++++N
Sbjct: 255 FPKTDLIQFVKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFDELLDEIAAIFYGIRQQN 314

Query: 309 PMQGIFGDIFKM 320
           P++G+FG++FK+
Sbjct: 315 PLEGLFGEMFKI 326


>gi|242056289|ref|XP_002457290.1| hypothetical protein SORBIDRAFT_03g005010 [Sorghum bicolor]
 gi|241929265|gb|EES02410.1| hypothetical protein SORBIDRAFT_03g005010 [Sorghum bicolor]
          Length = 328

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 190/315 (60%), Positives = 248/315 (78%), Gaps = 2/315 (0%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP ++ I++LE +V+ GNFY AQQMYKS SARY+ AQ+YSEALD+L +GA +QLK+
Sbjct: 14  RRDLPPPEKTIERLESMVDGGNFYEAQQMYKSTSARYITAQKYSEALDILQSGALVQLKH 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELT 125
            Q+TCG ELAV FVE L+ G+  Y  +  DR+RK+YE FP+I VPH L DD  D+ H+L+
Sbjct: 74  GQVTCGGELAVLFVETLITGEFPYSEQIFDRIRKMYEAFPRITVPHFLGDDYDDEGHQLS 133

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
           EA+ AAK+R E CSSFLKAAI+WS EFG  + GSPELHVMLAEY+YSESPE DM +VS H
Sbjct: 134 EAISAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSH 193

Query: 186 FVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
           FVRGN+P+KFAS +VNFM KCYPGEDD A+AR +LMYLS  N++DAN +MDE+K+Q+++ 
Sbjct: 194 FVRGNDPKKFASMLVNFMSKCYPGEDDTAIARGVLMYLSQENLRDANLLMDEMKEQLKSA 253

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
                 +DLIQF+ Y+L TLERDA PLF  LR  YK S +R+P F E+LD+IA KF+ ++
Sbjct: 254 NSDFPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDPVFQELLDEIAAKFYNIQ 313

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP++G+F ++FK+
Sbjct: 314 RQNPLEGLFSEMFKI 328


>gi|218187572|gb|EEC69999.1| hypothetical protein OsI_00529 [Oryza sativa Indica Group]
          Length = 330

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 189/312 (60%), Positives = 248/312 (79%), Gaps = 2/312 (0%)

Query: 11  LPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL 70
           LPP Q+ I+KLE +V EGN+Y AQQMYKS  ARY+AAQ+Y EALD+L +GA +QLK+ Q+
Sbjct: 19  LPPPQQTIEKLENMVAEGNYYEAQQMYKSTGARYIAAQKYLEALDILQSGALVQLKHGQV 78

Query: 71  TCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELTEAL 128
           TCG ELA+ FV+ LVK  + Y+ E  DR+RK+Y  FP+I VPH L DD  DD  +L+EA+
Sbjct: 79  TCGGELAIMFVDTLVKAALPYNEETFDRIRKMYSAFPRISVPHFLGDDYDDDGQKLSEAI 138

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
            AAK+R E CSSFL+AAI+WS E GA ++GSPELHVMLAEY+YSESPE DM +VS HFVR
Sbjct: 139 SAAKVRSESCSSFLRAAIRWSAEVGASRSGSPELHVMLAEYIYSESPETDMTKVSSHFVR 198

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           GN+P+KFAS + NFMGKCYPGEDD A+AR +LMYLS GN++DAN +MDE+K Q+++  L+
Sbjct: 199 GNDPKKFASMLANFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDELKDQLKSADLE 258

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
           +  +DLIQF+ Y+L TLERDA PLF  LR  YK S +RE  F E+LD+IA KF+G++ ++
Sbjct: 259 IPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDREAVFEELLDEIAAKFYGIRSQS 318

Query: 309 PMQGIFGDIFKM 320
            ++G+FGD+F++
Sbjct: 319 ALEGLFGDMFRV 330


>gi|195635135|gb|ACG37036.1| hypothetical protein [Zea mays]
          Length = 328

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 187/315 (59%), Positives = 250/315 (79%), Gaps = 2/315 (0%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP ++ I++LE +V+ GNFY AQQMYKS SARY+AAQ+YSEALD+L +GA +QLK+
Sbjct: 14  RRDLPPPEKTIERLESMVDGGNFYEAQQMYKSTSARYIAAQKYSEALDILQSGALVQLKH 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELT 125
            Q+TCG ELAV FV+ L+ G++ Y  +  DR+RK+YE FP++ VPH L DD  D+ H+L+
Sbjct: 74  GQVTCGGELAVLFVDTLITGELPYSEQIFDRIRKMYEAFPRVTVPHFLGDDYDDEGHQLS 133

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
           EA+ AAK+R E CSSF+KAAI+WS EFG  + GS ELHVMLAEY+YSESPE DM +VS H
Sbjct: 134 EAISAAKVRAESCSSFMKAAIRWSAEFGTSRNGSSELHVMLAEYIYSESPETDMTKVSSH 193

Query: 186 FVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
           FVRGN+P+KFAS +VNFMGKCYPGEDD A+AR +LMYLS GN++DAN +MDE+ +Q+++ 
Sbjct: 194 FVRGNDPKKFASMLVNFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDEMNEQLKSA 253

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
             +   +DLIQF+ Y+L TLERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++
Sbjct: 254 NSEFPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYSIQ 313

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP++G+F ++FK+
Sbjct: 314 RQNPLEGLFSEMFKI 328


>gi|222617794|gb|EEE53926.1| hypothetical protein OsJ_00497 [Oryza sativa Japonica Group]
          Length = 330

 Score =  417 bits (1071), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 188/312 (60%), Positives = 249/312 (79%), Gaps = 2/312 (0%)

Query: 11  LPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL 70
           LPP Q+ I+KLE +V EGN+Y AQQMYKS  ARY+AAQ+Y EALD+L +GA +QLK+ Q+
Sbjct: 19  LPPPQQTIEKLENMVAEGNYYEAQQMYKSTGARYIAAQKYLEALDILQSGALVQLKHGQV 78

Query: 71  TCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELTEAL 128
           TCG ELA+ FV+ LVK  + Y+ E  DR+RK+Y+ FP+I VPH L DD  DD  +L+EA+
Sbjct: 79  TCGGELAIMFVDTLVKAALPYNEETFDRIRKMYDAFPRISVPHFLGDDYDDDGQKLSEAI 138

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
            AAK+R E CSSFL+AAI+WS E GA ++GSPELHVMLAEY+YSESPE DM +VS HFVR
Sbjct: 139 SAAKVRSESCSSFLRAAIRWSAEVGASRSGSPELHVMLAEYIYSESPETDMTKVSSHFVR 198

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           GN+P+KFAS + NFMGKCYPGEDD A+AR +LMYLS GN++DAN +MDE+K Q+++  L+
Sbjct: 199 GNDPKKFASMLANFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDELKDQLKSADLE 258

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
           +  +DLIQF+ Y+L TLERDA PLF  LR  YK S + +P F E+LD+IA KF+G++ ++
Sbjct: 259 IPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDHDPVFEELLDEIAAKFYGIRSQS 318

Query: 309 PMQGIFGDIFKM 320
            ++G+FGD+F++
Sbjct: 319 ALEGLFGDMFRV 330


>gi|212276184|ref|NP_001130267.1| uncharacterized protein LOC100191361 [Zea mays]
 gi|194688706|gb|ACF78437.1| unknown [Zea mays]
 gi|413947455|gb|AFW80104.1| hypothetical protein ZEAMMB73_498554 [Zea mays]
          Length = 328

 Score =  416 bits (1070), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 187/315 (59%), Positives = 250/315 (79%), Gaps = 2/315 (0%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP ++ I++LE +V+ GNFY AQQMYKS SARY+AAQ+YSEALD+L +GA +QLK+
Sbjct: 14  RRDLPPPEKTIERLESMVDGGNFYEAQQMYKSTSARYIAAQKYSEALDILQSGALVQLKH 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD--DDLHELT 125
            Q+TCG ELAV FV+ L+ G++ Y  +  DR+RK+YE FP++ VPH L DD  D+ H+L+
Sbjct: 74  GQVTCGGELAVLFVDTLITGELPYSEQIFDRIRKMYEAFPRVTVPHFLGDDYDDEGHQLS 133

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
           EA+ AAK+R E CSSF+KAAI+WS EFG  + GS ELHVMLAEY+YSESPE DM +VS H
Sbjct: 134 EAISAAKVRAESCSSFMKAAIRWSAEFGTSRNGSSELHVMLAEYIYSESPETDMTKVSSH 193

Query: 186 FVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
           FVRGN+P+KFAS +VNFMGKCYPGEDD A+AR +LMYLS GN++DAN +MDE+ +Q+++ 
Sbjct: 194 FVRGNDPKKFASMLVNFMGKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDEMNEQLKSA 253

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
             +   +DLIQF+ Y+L TLERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++
Sbjct: 254 NSEFPKTDLIQFIKYLLPTLERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYNIQ 313

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP++G+F ++FK+
Sbjct: 314 RQNPLEGLFSEMFKI 328


>gi|226495807|ref|NP_001143710.1| uncharacterized protein LOC100276450 [Zea mays]
 gi|195625230|gb|ACG34445.1| hypothetical protein [Zea mays]
          Length = 328

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 190/315 (60%), Positives = 247/315 (78%), Gaps = 2/315 (0%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP  + I++LE +V+ GNFY AQQMYKS SARY+A  +YSEALD+L +GA +QLK+
Sbjct: 14  RRDLPPPGKTIERLESMVDGGNFYEAQQMYKSTSARYIAVHKYSEALDILQSGALVQLKH 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL-ED-DDDLHELT 125
            Q+TCG ELAV FV+ L+ G+  Y  +  DR+RKIYE FP+I VPH L ED DD+ H+L+
Sbjct: 74  GQVTCGGELAVLFVDILITGEFPYSEQFFDRIRKIYEAFPRITVPHFLGEDYDDEGHKLS 133

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
           EA+ AAK+R E CSSFLKAAI+WS EFG  + GSPELHVMLAEY+YSESPE DM +VS H
Sbjct: 134 EAISAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSH 193

Query: 186 FVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
           FVRGN+P+KFAS +VNFM KCYPGEDD A+AR ILMYLS GN++DAN +MDE+K+Q+++ 
Sbjct: 194 FVRGNDPKKFASMLVNFMSKCYPGEDDTAIARGILMYLSQGNLRDANLLMDEMKEQLKSV 253

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
                 +DLIQF+ Y+L TLERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++
Sbjct: 254 NSDFPKTDLIQFIMYLLPTLERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYNIQ 313

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP++G+F ++F++
Sbjct: 314 RQNPLEGLFSEMFRI 328


>gi|194697920|gb|ACF83044.1| unknown [Zea mays]
 gi|195633827|gb|ACG36758.1| hypothetical protein [Zea mays]
          Length = 328

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 189/315 (60%), Positives = 247/315 (78%), Gaps = 2/315 (0%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP  + I++LE +V+ GNFY AQQMYKS SARY+A  +YSEALD+L +GA +QLK+
Sbjct: 14  RRDLPPPGKTIERLESMVDGGNFYEAQQMYKSTSARYIAVHKYSEALDILQSGALVQLKH 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL-ED-DDDLHELT 125
            Q+TCG ELAV FV+ L+ G+  Y  +  DR+RKIYE FP+I VPH L ED DD+ H+L+
Sbjct: 74  GQVTCGGELAVLFVDILITGEFPYSEQFFDRIRKIYEAFPRITVPHFLGEDYDDEGHKLS 133

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
           EA+ AAK+R E CSSFLKAAI+WS EFG  + GSPELHVMLAEY+YSESPE DM +VS H
Sbjct: 134 EAISAAKVRAESCSSFLKAAIRWSAEFGTSRNGSPELHVMLAEYIYSESPETDMTKVSSH 193

Query: 186 FVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
           FVRGN+P+KFAS +VNFM KCYPGEDD A+AR +LMYLS GN++DAN +MDE+K+Q+++ 
Sbjct: 194 FVRGNDPKKFASMLVNFMSKCYPGEDDTAIARGVLMYLSQGNLRDANLLMDEMKEQLKSV 253

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
                 +DLIQF+ Y+L TLERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++
Sbjct: 254 NSDFPKTDLIQFIMYLLPTLERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYNIQ 313

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP++G+F ++F++
Sbjct: 314 RQNPLEGLFSEMFRI 328


>gi|10177303|dbj|BAB10564.1| unnamed protein product [Arabidopsis thaliana]
          Length = 316

 Score =  393 bits (1010), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 185/315 (58%), Positives = 240/315 (76%), Gaps = 30/315 (9%)

Query: 36  MYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNEN 95
           MYKSISARYV AQR+SEALD+L +GAC++L++  + CGA+LA+ FV+ LVK K   ++E 
Sbjct: 1   MYKSISARYVTAQRFSEALDILFSGACIELEHGLVNCGADLAILFVDTLVKAKSPCNDET 60

Query: 96  LDRVRKIYELFPQIPVPHNL---EDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEF 152
           LDR+R I++LFP++PVP +L    DD+D+  L E+LG A+ RVE  +SFL+AAIKWS EF
Sbjct: 61  LDRIRCIFKLFPRVPVPPHLVDVSDDEDVQNLQESLGEARSRVENLTSFLRAAIKWSAEF 120

Query: 153 GAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGK------- 205
           G P+TG PELH ML +YLY+E PELDM R+S HFVR  +PEKFAS +VNFMG+       
Sbjct: 121 GGPRTGYPELHAMLGDYLYTECPELDMVRISRHFVRAEDPEKFASMLVNFMGRNFKKCLD 180

Query: 206 --------------------CYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
                               CYPGEDDLA+ARA+LMYLS+GNMKDAN++MDE+KKQ E K
Sbjct: 181 FNLTSRGQKYGVSYLNTTLQCYPGEDDLAIARAVLMYLSMGNMKDANFMMDEIKKQAETK 240

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
             +L  SDLIQF+ Y+L+TL+RDALPLFNMLR  YK+SI+R+   NE+LD+IAE+F+GV+
Sbjct: 241 NPELSESDLIQFISYLLETLQRDALPLFNMLRVKYKSSIDRDQLLNELLDEIAERFYGVQ 300

Query: 306 RRNPMQGIFGDIFKM 320
           R+NP+QG+FGDIFK+
Sbjct: 301 RKNPLQGMFGDIFKV 315


>gi|294464803|gb|ADE77907.1| unknown [Picea sitchensis]
          Length = 321

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 179/315 (56%), Positives = 243/315 (77%), Gaps = 1/315 (0%)

Query: 7   KRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK 66
           KR++   A+  ++KLEK V  GN+Y AQQMYKSI ARY+AAQ+Y EALD+L +GA +QL+
Sbjct: 2   KRSSAGTAK-TLEKLEKSVQAGNYYEAQQMYKSIYARYMAAQKYLEALDILQSGATIQLR 60

Query: 67  NNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTE 126
           + Q+TCGAELA+ FVE L K K+ Y+ E LDR+  I+  FPQ+ VP    D+DDL +L+E
Sbjct: 61  HGQVTCGAELALLFVETLAKAKIPYEQETLDRIGAIFHEFPQVAVPRQFFDEDDLQKLSE 120

Query: 127 ALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHF 186
            + AAK + +GC+SFLKAAIKWS E G P  G+PELH MLA+ +  +SPELD+ + S HF
Sbjct: 121 TILAAKAQYDGCTSFLKAAIKWSAEAGGPSKGAPELHDMLAQCICLQSPELDIVKASTHF 180

Query: 187 VRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQ 246
           VRG+ P+ FA  +++FM KCYPGEDDLA+AR ILMYL+ GN++DAN +MDE+K+Q  +KQ
Sbjct: 181 VRGSQPKAFAFALLDFMDKCYPGEDDLAIARGILMYLAAGNLRDANCLMDELKEQSVSKQ 240

Query: 247 LQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKR 306
            QL  + L++F+ Y+L TLERDALPLF ML+ NYK+SIER+ +F E++D+IAE+F+GV+R
Sbjct: 241 TQLPDTPLLEFIGYLLLTLERDALPLFRMLQQNYKSSIERDSSFTELVDEIAERFYGVRR 300

Query: 307 RNPMQGIFGDIFKMM 321
           R+ +  I G +FKMM
Sbjct: 301 RSGLHSILGGMFKMM 315


>gi|449445467|ref|XP_004140494.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Cucumis sativus]
 gi|449532306|ref|XP_004173123.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Cucumis sativus]
          Length = 232

 Score =  363 bits (933), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 162/229 (70%), Positives = 205/229 (89%)

Query: 93  NENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEF 152
           N  LDRVRKIY+ FPQIP+P +L +DDD+ +L+EALGAAK RVEGCSSFLKAA+KWS+EF
Sbjct: 3   NIMLDRVRKIYKNFPQIPLPQHLGEDDDMQQLSEALGAAKTRVEGCSSFLKAALKWSMEF 62

Query: 153 GAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDD 212
           G+ ++GSPE+H+MLA Y+YSESPE+DM RVS+HF+RG+NP+KFAS +VNFMGKCYPGEDD
Sbjct: 63  GSQRSGSPEIHIMLATYIYSESPEVDMTRVSYHFIRGDNPKKFASILVNFMGKCYPGEDD 122

Query: 213 LAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPL 272
           +A+ARA+LMYLSLGN++DAN + DE+KK  E ++L+L  S+LI+F+ Y+L TL+RDALPL
Sbjct: 123 MAIARAVLMYLSLGNLRDANVLFDELKKVEEREELELPDSELIEFIVYLLLTLQRDALPL 182

Query: 273 FNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
           FNMLRANYK+S+EREP  NE+LD+IAEKF+GV+RRNP+QGIFGD  KMM
Sbjct: 183 FNMLRANYKSSLEREPVLNELLDEIAEKFYGVRRRNPLQGIFGDFLKMM 231


>gi|217073790|gb|ACJ85255.1| unknown [Medicago truncatula]
          Length = 221

 Score =  336 bits (862), Expect = 8e-90,   Method: Compositional matrix adjust.
 Identities = 149/216 (68%), Positives = 193/216 (89%), Gaps = 1/216 (0%)

Query: 106 FPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVM 165
           FP++P+P +L D DD+ +L+E LG+AK RVEGCSSFLKAAIKWS EFG   +G+PELH+M
Sbjct: 4   FPRVPLPQHLWDVDDMQQLSENLGSAKTRVEGCSSFLKAAIKWSAEFGTSGSGAPELHIM 63

Query: 166 LAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSL 225
           LAEY++SESPE+DM RV++HFVRGN+ +KFAST+VNF+GKCYPGEDDLA+ARA+L YLSL
Sbjct: 64  LAEYIFSESPEVDMNRVTYHFVRGNDTQKFASTLVNFLGKCYPGEDDLAIARAVLRYLSL 123

Query: 226 GNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIE 285
           GN+KDAN ++DE+KKQ E+ +++   +DL+QF+ ++LQT+ERDA PLFNMLRANYK SIE
Sbjct: 124 GNLKDANILVDEIKKQTESTEVEFPKTDLMQFLNFLLQTMERDAFPLFNMLRANYKPSIE 183

Query: 286 REPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
           REP+FNE+LD+IA+KF+GV+RRNPM G+FGDIFK+M
Sbjct: 184 REPSFNELLDEIAQKFYGVQRRNPM-GMFGDIFKLM 218


>gi|168015668|ref|XP_001760372.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162688386|gb|EDQ74763.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 326

 Score =  316 bits (809), Expect = 9e-84,   Method: Compositional matrix adjust.
 Identities = 148/312 (47%), Positives = 218/312 (69%), Gaps = 5/312 (1%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
             + ++ L K V+EGN+Y AQQMYK++ +RY+ A+++ EA +LL++GA   LK+ ++TCG
Sbjct: 10  TSKQLETLRKKVHEGNYYEAQQMYKAVHSRYMHAKKHKEASELLNSGAVAMLKHGEVTCG 69

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIP-VPHNLEDDDDLH----ELTEAL 128
            ELA+  ++     K  +D+  LD++R+IY  +P+       L  D        E +EA 
Sbjct: 70  TELALLLIQTFHVAKTPFDSSTLDQIREIYNAYPRASSTDQKLSGDKGASYSDTEASEAA 129

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
             A+ RVEGC+SFL+AA+KWS E G    G+PELH ML EY+++E+P  +M +VS HFVR
Sbjct: 130 VIARTRVEGCTSFLRAALKWSTETGGFSRGAPELHDMLGEYIWTEAPTPEMTKVSQHFVR 189

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           GN+PE FA  +V+ M KCY GE DL VARAIL+YLSLGN++DAN + D VK+ + +   +
Sbjct: 190 GNHPETFAKVVVDCMDKCYSGEADLVVARAILLYLSLGNLRDANRLWDSVKQTLSDNFQE 249

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
           L  + L+QF+ ++L TLERDALPLF ML+  YK+S++R+P+++E L++I E+F+ ++RRN
Sbjct: 250 LPDTPLLQFIKFLLLTLERDALPLFRMLKQEYKSSLDRDPSYDEYLNEIGERFYNLQRRN 309

Query: 309 PMQGIFGDIFKM 320
            MQGI GD  KM
Sbjct: 310 GMQGILGDFMKM 321


>gi|168010909|ref|XP_001758146.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162690602|gb|EDQ76968.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 326

 Score =  314 bits (805), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 217/312 (69%), Gaps = 5/312 (1%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
             + ++KLEK V+EGN+Y AQQMYK+I +RY+ A++Y+EA++LL++GA   L + ++TCG
Sbjct: 10  TSKQLEKLEKRVHEGNYYEAQQMYKTIHSRYMHAKKYNEAIELLNSGAVAMLNHGEVTCG 69

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPH----NLEDDDDLH-ELTEAL 128
            EL V  ++     K+ +D+  LD++R IY  +P+         +  D  +L  E +EA 
Sbjct: 70  TELGVLLIQTFNVAKLPFDSSTLDQIRTIYNAYPRTAGTRQEKSSSNDTSNLGVESSEAA 129

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
             AK RVEGC+SFL++A+KWSIE G    G+PELH ML EY+++ESP  +  + S HFVR
Sbjct: 130 VIAKTRVEGCTSFLRSALKWSIETGGFPRGAPELHDMLGEYIWTESPHPETMKASQHFVR 189

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           GN PE FA  +V  M KCYPGE DL VARA+L+YLSLGN++DAN + D V++++ +   +
Sbjct: 190 GNQPETFAKVVVECMDKCYPGEADLVVARAVLLYLSLGNLRDANRLWDSVRQRLSDSSQE 249

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
           L  + L+ F+ ++L TLERDALPLF MLR  YK S++R+ + +E LD+I E+FF ++RR+
Sbjct: 250 LPDTPLLHFIKFLLLTLERDALPLFKMLRQKYKPSLDRDTSLDEYLDEIGERFFNLQRRS 309

Query: 309 PMQGIFGDIFKM 320
            +QG+ GD  +M
Sbjct: 310 GVQGMLGDFMRM 321


>gi|302773055|ref|XP_002969945.1| hypothetical protein SELMODRAFT_92949 [Selaginella moellendorffii]
 gi|300162456|gb|EFJ29069.1| hypothetical protein SELMODRAFT_92949 [Selaginella moellendorffii]
          Length = 303

 Score =  301 bits (771), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 146/306 (47%), Positives = 213/306 (69%), Gaps = 18/306 (5%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + ++KLEK V +G FY AQQMYK+I ARY+A+++++EALDLL +GA +QL++ Q+TCG E
Sbjct: 1   QTLEKLEKNVRDGKFYEAQQMYKTIFARYMASKKHTEALDLLQSGASIQLRHGQVTCGVE 60

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           L +  VE +   K+AY  E LDR++ +++ FP+ P               EA  + K  +
Sbjct: 61  LGLLLVEGMTTIKLAYGPEALDRIKAVFKEFPRSPT-------------LEAQYSTKTLI 107

Query: 136 EGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKF 195
           EG S+F+KAAI+WS EFGAP  GSPELH +LA++L  ESP+ D+A+  +H+VRGNNP  F
Sbjct: 108 EGVSTFMKAAIRWSSEFGAPSRGSPELHELLADFLARESPQPDLAKAFYHYVRGNNPSAF 167

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV-ENKQLQLQPSDL 254
           AS I++  G   P E DL +AR + +YLS+GN++DAN + DE+  Q+ ++ Q  +  S L
Sbjct: 168 ASAIIS--GMLEPEEGDLLLARGVFLYLSMGNLRDANKLFDELHSQLKDSSQASVLASPL 225

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIF 314
                ++L TLERDALPLF MLR NYK+S++R+P+ NE+LD +AEKF+ V++R    G+F
Sbjct: 226 THLAKFILLTLERDALPLFRMLRENYKSSLDRDPSLNELLDVVAEKFYNVQQRK--NGLF 283

Query: 315 GDIFKM 320
           GD+ K+
Sbjct: 284 GDLMKV 289


>gi|302799312|ref|XP_002981415.1| hypothetical protein SELMODRAFT_420863 [Selaginella moellendorffii]
 gi|300150955|gb|EFJ17603.1| hypothetical protein SELMODRAFT_420863 [Selaginella moellendorffii]
          Length = 312

 Score =  297 bits (760), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 213/312 (68%), Gaps = 23/312 (7%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL----- 70
           + ++KLEK V +G FY AQQMYK+I ARY+A+++++EALDLL +GA +QL++ QL     
Sbjct: 13  KTLEKLEKNVRDGKFYEAQQMYKTIFARYMASKKHTEALDLLQSGASIQLRHGQLILRKV 72

Query: 71  TCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGA 130
           TCG EL +  VE +   K+AY  E LDR++ +++ FP+ P               EA  +
Sbjct: 73  TCGVELGLLLVEGMTTIKLAYGPEALDRIKAVFKEFPRSPT-------------LEAQYS 119

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN 190
            K  +EG S+F+K AI+WS EFGAP  GSPELH +LA++L  ESP+ D+A+  +H+VRGN
Sbjct: 120 TKTLIEGVSTFMKVAIRWSSEFGAPSRGSPELHELLADFLARESPQPDLAKAFYHYVRGN 179

Query: 191 NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV-ENKQLQL 249
           NP  FAS I++  G   P E DL +AR + +YLS+GN++DAN + DE+  Q+ ++ Q  +
Sbjct: 180 NPSAFASAIIS--GMLEPEEGDLLLARGVFLYLSMGNLRDANKLFDELHSQLKDSSQASV 237

Query: 250 QPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNP 309
             S L     ++L TLERDALPLF MLR NYK+S++R+P+ NE+LD +AEKF+ V++R  
Sbjct: 238 LASPLTHLAKFILLTLERDALPLFRMLRENYKSSLDRDPSLNELLDVVAEKFYNVRQRR- 296

Query: 310 MQGIFGDIFKMM 321
             G+FGD+ KM+
Sbjct: 297 -NGLFGDLMKML 307


>gi|296086073|emb|CBI31514.3| unnamed protein product [Vitis vinifera]
          Length = 684

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 126/177 (71%), Positives = 148/177 (83%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           M RQR ++  LPPAQE+IDKLEK VN+G++YGAQQMYKSISARY +A+R  EALD+L +G
Sbjct: 1   MLRQRSRKGPLPPAQEHIDKLEKTVNDGDYYGAQQMYKSISARYASAERCYEALDILESG 60

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDD 120
           AC+QL+  Q+TCGAELA+ FVE LVKGK  YD+  LDRVRKIY+ FPQI VP  LEDDDD
Sbjct: 61  ACIQLEKGQVTCGAELAILFVETLVKGKFPYDDNTLDRVRKIYKNFPQISVPQQLEDDDD 120

Query: 121 LHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL 177
           + +L+EALGAAK RVE CSSFLKAA+KWS EFG  + GSPE+H MLAEYLYSESPEL
Sbjct: 121 MQKLSEALGAAKTRVEVCSSFLKAAMKWSAEFGFHRQGSPEIHDMLAEYLYSESPEL 177



 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 113/149 (75%), Positives = 137/149 (91%)

Query: 173 ESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN 232
           E+  LDM+R+S HFVRGNNPEKFAST+VNFMGKCYPGEDDLA+ARA+LMYLSLGN++DAN
Sbjct: 535 ENWALDMSRISLHFVRGNNPEKFASTLVNFMGKCYPGEDDLAIARAVLMYLSLGNLRDAN 594

Query: 233 YIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNE 292
           Y+MDEVKKQVE+K+L    SDL +F+ Y+L TL+RDALPLFNMLR +YK+SI+REPAFNE
Sbjct: 595 YLMDEVKKQVESKELDYPESDLTEFIDYLLLTLQRDALPLFNMLRQSYKSSIDREPAFNE 654

Query: 293 MLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
           +LD+IAEKF+GV+RRNPMQG+FGD FK+M
Sbjct: 655 LLDEIAEKFYGVRRRNPMQGMFGDFFKLM 683


>gi|388490986|gb|AFK33559.1| unknown [Medicago truncatula]
          Length = 154

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 103/146 (70%), Positives = 134/146 (91%)

Query: 165 MLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLS 224
           MLAEY++SESPE+DM RV++HFVRGN+ +KFAST+VNF+GKCYPGEDDLA+ARA+L YLS
Sbjct: 1   MLAEYIFSESPEVDMNRVTYHFVRGNDTQKFASTLVNFLGKCYPGEDDLAIARAVLRYLS 60

Query: 225 LGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASI 284
           LGN+KDAN ++DE+KKQ E+ +++   +DL+QF+ ++LQT+ERDA PLFNMLRANYK SI
Sbjct: 61  LGNLKDANILVDEIKKQTESTEVEFPKTDLMQFLNFLLQTMERDAFPLFNMLRANYKPSI 120

Query: 285 EREPAFNEMLDDIAEKFFGVKRRNPM 310
           EREP+FNE+LD+IA+KF+GV+RRNPM
Sbjct: 121 EREPSFNELLDEIAQKFYGVQRRNPM 146


>gi|414876206|tpg|DAA53337.1| TPA: hypothetical protein ZEAMMB73_977843, partial [Zea mays]
          Length = 153

 Score =  171 bits (433), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 84/140 (60%), Positives = 106/140 (75%), Gaps = 2/140 (1%)

Query: 8   RTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           R  LPP  + I++LE +V+ GNFY AQQMYKS SARY+A  +YSEALD+L +GA +QLK 
Sbjct: 14  RRDLPPPGKTIERLESMVDGGNFYEAQQMYKSTSARYIAVHKYSEALDILQSGALVQLKL 73

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL-ED-DDDLHELT 125
            Q+TCG ELAV FV+ L+ G+  Y  +  DR+RKIYE FP+I VPH L ED DD+ H+L+
Sbjct: 74  GQVTCGGELAVLFVDILITGEFPYSEQFFDRIRKIYEAFPRITVPHFLGEDYDDEGHKLS 133

Query: 126 EALGAAKLRVEGCSSFLKAA 145
           EA+ AAK+R E CSSFLKAA
Sbjct: 134 EAISAAKVRAESCSSFLKAA 153


>gi|328873318|gb|EGG21685.1| DUF410 family protein [Dictyostelium fasciculatum]
          Length = 308

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 102/291 (35%), Positives = 144/291 (49%), Gaps = 34/291 (11%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           +  I  LE    +GN+Y   Q YK++  R  AA+++ E + LL +GA   LK  Q  C A
Sbjct: 3   ERQIASLENKFQQGNYYDILQSYKALYFRNSAAKKFKETVALLMSGATNFLKYEQWNCAA 62

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           E A   V+     K  Y++E+ + + KI++ F +                     AAK  
Sbjct: 63  ECAQLLVDTYKNFKTQYNDESKENIVKIFKGFGETEC------------------AAK-- 102

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
               ++F++ AIKWS E GA K GSPE H +LA  L  E   +D  +   H++ G++P  
Sbjct: 103 ----TNFMRDAIKWSAENGAEKKGSPEFHTLLANSLAREGDYIDAQK---HYIFGDDPIA 155

Query: 195 FASTIVNFMGKCYPGED-DLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD 253
           FAS +  +        D DL + RA+L YL  G + DA  +      QV         S 
Sbjct: 156 FASMLKTWAEDSPNKSDADLYIVRAVLGYLCTGKLNDATTLFKSFTSQVT------IDSP 209

Query: 254 LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           LI F  ++L TL RDALPLF  LRA Y  SI R+P F++ LD IA  F+ V
Sbjct: 210 LINFTNFLLMTLTRDALPLFKTLRAKYAPSIGRDPDFSKYLDQIAFVFYKV 260


>gi|66812876|ref|XP_640617.1| DUF410 family protein [Dictyostelium discoideum AX4]
 gi|74855310|sp|Q54TH4.1|GET4_DICDI RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|60468666|gb|EAL66669.1| DUF410 family protein [Dictyostelium discoideum AX4]
          Length = 309

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 93/291 (31%), Positives = 142/291 (48%), Gaps = 32/291 (10%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
           A+  +  LE    EGN+Y   Q YK++  R+   ++Y E + LL +G    L+  Q  C 
Sbjct: 2   AERVLANLEAKFTEGNYYDILQSYKALYNRFSTQKKYKETVTLLESGCNKFLEYKQWNCA 61

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKL 133
           A+LA   +E     K+ Y +E+ + + KI++ F                         K 
Sbjct: 62  ADLAKLLIECYKNFKIQYSDESKEPIIKIFKNF-------------------------KG 96

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
              G  SF++ AI+WS + G    GS E H +LA  L  E   +D  +   HF+ GN+  
Sbjct: 97  ECAGKISFMRDAIEWSSKNGGDSKGSEEFHTLLAITLSEEGDYIDAQK---HFIFGNDYF 153

Query: 194 KFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD 253
            F   + N+       E DL + RAI   L L  +K A+ + +    +V    ++  PS 
Sbjct: 154 SFCEMLKNWTEDVDEEEKDLYITRAIFGLLCLKKLKQASDLYNLFTTKV----IKGDPSP 209

Query: 254 LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           L+ F  ++L TLERDALPLFN+LR  Y+ S++R+P F + LD IA  F+ V
Sbjct: 210 LLNFDRFLLLTLERDALPLFNLLRQKYERSLKRDPQFKKFLDQIANIFYNV 260


>gi|449532591|ref|XP_004173264.1| PREDICTED: Golgi to ER traffic protein 4 homolog, partial [Cucumis
           sativus]
          Length = 105

 Score =  141 bits (355), Expect = 5e-31,   Method: Composition-based stats.
 Identities = 64/96 (66%), Positives = 80/96 (83%)

Query: 1   MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
           MSR R +R  LPP QE+I K+E +++ G++YGAQQMYKS+SARYVAA+RYSEALD+L +G
Sbjct: 10  MSRARSRRIVLPPVQEHIIKIEDVIDTGDYYGAQQMYKSVSARYVAAERYSEALDILQSG 69

Query: 61  ACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENL 96
           AC QLK+ Q+TCG+ELAV FVE LVKGKV YD+  L
Sbjct: 70  ACTQLKHEQITCGSELAVLFVETLVKGKVPYDDNTL 105


>gi|320166122|gb|EFW43021.1| hypothetical protein CAOG_08153 [Capsaspora owczarzaki ATCC 30864]
          Length = 293

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 90/303 (29%), Positives = 140/303 (46%), Gaps = 36/303 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KLEK +  G++Y A QMY+S+ ARY A   +  A+ L++ G+ L  K+ Q     +LA
Sbjct: 11  LAKLEKSIEAGDYYEAHQMYRSVYARYAAQNDFDSAILLVYTGSTLFFKHGQTGSAGDLA 70

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           V  ++A        D ++L R   I ELF      H   D                    
Sbjct: 71  VLMLDAYTAQPTPLDEKSLAR---IAELFSGYKEDHATRD-------------------- 107

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE--KF 195
               ++ ++KWS   G    G P LH +     + E    +      H V G        
Sbjct: 108 --KLVRQSLKWSTAVGRNTRGDPSLHHLFGTAFWREH---NYQLAESHLVLGTEASALSL 162

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPS 252
           A  ++ +  K  P E +L + RA+L YL LGN++DAN +      Q   + N   +L   
Sbjct: 163 AEMLLEWSQKGSPSELELFITRAVLKYLELGNLRDANVLFRTFTAQHPSIRNPPFRLP-- 220

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQG 312
            L+ F  ++L TLERDA PLF MLR  Y+ S++R+ +F+  LD I + FF ++  + +  
Sbjct: 221 -LLNFCHFLLATLERDAAPLFKMLREKYRPSLQRDASFDASLDAIGQCFFNIRSSSGVSS 279

Query: 313 IFG 315
           + G
Sbjct: 280 MPG 282


>gi|301093450|ref|XP_002997571.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110534|gb|EEY68586.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 315

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/293 (32%), Positives = 145/293 (49%), Gaps = 35/293 (11%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYV-----AAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           KLE  +  G+FYGA QMYK++  R +     +A++ ++A+ L    A   ++++Q T   
Sbjct: 34  KLELQLEAGDFYGALQMYKTLFMRLLKGDEPSAEQQNKAVVLAQEAALKLIEHDQNTAAT 93

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           E+A   V        A D+ +  R+R+I   F   P       D                
Sbjct: 94  EMANLMVSVFSDFHHAVDDAHKQRIRQIDAAFQSKP---QFSAD---------------- 134

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
               + FLK  +KWS E GA K G PEL ++LA    +     D      HF+   NP++
Sbjct: 135 ---VAVFLKNTVKWSAEEGARKRGDPELQLLLARAYRTAG---DFTHALKHFLHAENPQE 188

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDL 254
            A T+  +  + YP E DL +ARA+L  LSL N++DAN + +     V   Q   +P DL
Sbjct: 189 LADTLFQWSTQGYPSESDLYLARAVLQLLSLENLRDANKVYEAY---VAKCQSVGRPVDL 245

Query: 255 IQFVF--YVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
             F F  ++L TLERDALPLF ML+  Y  ++ R+ +    L  I +KF+G++
Sbjct: 246 PLFNFTRFLLLTLERDALPLFQMLQERYAPALARDSSLKNYLSVIGQKFYGLQ 298


>gi|330790201|ref|XP_003283186.1| hypothetical protein DICPUDRAFT_25422 [Dictyostelium purpureum]
 gi|325086867|gb|EGC40250.1| hypothetical protein DICPUDRAFT_25422 [Dictyostelium purpureum]
          Length = 305

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 88/294 (29%), Positives = 141/294 (47%), Gaps = 36/294 (12%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFV 81
           E    +GN Y   Q Y+++  R  + ++  E ++LL +G     +  Q  C AELA   V
Sbjct: 9   ENKFKDGNSYDILQSYRALYFRLSSQKKIQETINLLESGCKKFQEYAQWNCAAELAKLIV 68

Query: 82  EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSF 141
           E      + YD ++ +++  I++ F                         K   +G   F
Sbjct: 69  ETFRNFNIPYDTDSKNKIETIFKGF-------------------------KGECQGKIGF 103

Query: 142 LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVN 201
           ++ AI WS +  + K G PE H +LA  L  E   +D  +   HF+ GN+P +F   +  
Sbjct: 104 MREAIDWSSKNSSNKKGVPEFHTLLAISLTEEGDFIDAQK---HFIFGNSPSEFNKMLKE 160

Query: 202 FMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYV 261
           +  +    E DL + RAI  YL + N+KDA  + D   + +       + S L+ +  Y+
Sbjct: 161 WTSEVDEEEKDLYITRAIFGYLCVNNLKDAQTLFDLFTEGI------TEFSPLLNYDRYL 214

Query: 262 LQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFG 315
           L TL+RDALPLFN+L+  Y+ S++R+P F + L+ IA  F+ V   N  QG  G
Sbjct: 215 LLTLQRDALPLFNILKQKYERSLKRDPQFFKYLEQIANVFYKVPISN--QGGLG 266


>gi|321458040|gb|EFX69115.1| hypothetical protein DAPPUDRAFT_62606 [Daphnia pulex]
          Length = 321

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 95/322 (29%), Positives = 167/322 (51%), Gaps = 41/322 (12%)

Query: 10  ALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQ 69
           A P A+  + KL K + +GN+Y A QMY+++  RY+  ++Y   L LL+ G+ L  ++ Q
Sbjct: 7   AGPGAERVLQKLRKSIEDGNYYEAHQMYRTLYFRYMGQEKYESLLKLLYDGSLLLFQHQQ 66

Query: 70  LTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALG 129
            + GA+LA  +VE L K K+   N   D +  + ++  +IP                   
Sbjct: 67  HSSGADLAKLYVEVLQKAKM---NVRKDIINNLADMMSKIP------------------A 105

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKT----GSPELHVMLAEYLYSESPELDMARVSFH 185
           AA  R     +FL +A+ WS +   P++    G P+LH M+A   + E+    MAR  +H
Sbjct: 106 AAPER----QAFLMSALSWS-QSENPESKKFEGHPQLHQMVANIYWRET-NYHMAR--YH 157

Query: 186 FVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQ 241
           F+R ++ +   S +V   +   YP E DL V +A+L  L + N+  AN + D   E    
Sbjct: 158 FIRSDDMQNLGSMLVEIQVDLGYPSEIDLFVTQAVLQILCIRNLSFANSLFDNYLECHPL 217

Query: 242 VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKF 301
           ++N+     P  L+ F + +L T++   +  + +L   Y+ S++R+P++ + LD I + +
Sbjct: 218 LDNEPPFKYP--LLNFTWMLLLTIQSTKISSYAILCEYYQPSLKRDPSYKDYLDRIGQIY 275

Query: 302 FGV--KRRNPMQGIFGDIFKMM 321
           FG+   R  P QG+FG++ + +
Sbjct: 276 FGLPPPRGMPNQGMFGNLIQSL 297


>gi|156404518|ref|XP_001640454.1| predicted protein [Nematostella vectensis]
 gi|156227588|gb|EDO48391.1| predicted protein [Nematostella vectensis]
          Length = 310

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 92/312 (29%), Positives = 154/312 (49%), Gaps = 32/312 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KL K + EGN+Y A QM +++  RY + ++Y +A++LLH G  L  K  Q+  G 
Sbjct: 11  QRVLQKLNKSIEEGNYYEAHQMIRTLYFRYTSQKKYQDAIELLHNGTLLFFKYKQMGSGT 70

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +LA+  ++ L  GK   D+E L ++  I++ F       + E D D  E           
Sbjct: 71  DLAMLMLDCLKAGKDTIDSECLQKIISIFKAF-------DPEADTDRQE----------- 112

Query: 135 VEGCSSFLKAAIKWSIEFG-APKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
                 F++ A++ + +   + K GS +LH M A   + E    +     +HF+   +  
Sbjct: 113 ------FIQKALRITADKDPSQKFGSTDLHHMCARIYWQEK---NYGESRYHFLYTQDGF 163

Query: 194 KFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS 252
           + AS +V F   K +  E DL V + +L YL L N   A+ +  +   +  +        
Sbjct: 164 QCASMLVEFATTKGFKSEQDLFVTQTVLQYLCLQNSSTASIVFFKFTNKHPDFSGPPFQQ 223

Query: 253 DLIQFVFYVLQTLERDA-LPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNP-- 309
            L+ FV+ +L  +ER   L +F +L   Y+ SIER+P + + LD IA+ FFG+    P  
Sbjct: 224 PLLNFVWLLLMAIERQGPLSMFTVLCEKYQPSIERDPTYKQYLDRIAQLFFGLPPPQPTG 283

Query: 310 MQGIFGDIFKMM 321
           +QGI GD+ + +
Sbjct: 284 LQGIMGDLVQSL 295


>gi|4680679|gb|AAD27729.1|AF132954_1 CGI-20 protein [Homo sapiens]
          Length = 320

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 93/329 (28%), Positives = 170/329 (51%), Gaps = 39/329 (11%)

Query: 1   MSRQRPKRTALPP--AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLH 58
           M+ +R   TAL P   Q    KL   V +G++Y A QMY+++  RY++  +++EA +L++
Sbjct: 1   MADRRAPATALQPRAVQRVEGKLRASVEKGDYYEAHQMYRTLFFRYMSQSKHTEARELMY 60

Query: 59  AGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD 118
           +GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ L      P++ E  
Sbjct: 61  SGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFSLMD----PNSPE-- 114

Query: 119 DDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELD 178
                          RV    +F+  A+KWS   G+ K G P LH +LA  L+ E    +
Sbjct: 115 ---------------RV----TFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---N 151

Query: 179 MARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE 237
                +HF+   + E  A  +V +   + +  E D+ VA+A+L +L L N   A+ +   
Sbjct: 152 YCESRYHFLHSADGEGCADMLVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTT 211

Query: 238 VKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEML 294
             ++   +E+    ++P  L+ F++++L  ++   L +F +L   Y+ S+ R+P +NE L
Sbjct: 212 YTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYL 269

Query: 295 DDIAEKFFGV--KRRNPMQGIFGDIFKMM 321
           D I + FFGV  K+ +   G+ G++   +
Sbjct: 270 DRIGQLFFGVPPKQTSSYGGLLGNLLTSL 298


>gi|348673567|gb|EGZ13386.1| hypothetical protein PHYSODRAFT_316672 [Phytophthora sojae]
          Length = 316

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/298 (32%), Positives = 145/298 (48%), Gaps = 36/298 (12%)

Query: 16  ENI-DKLEKIVNEGNFYGAQQMYKSISARYV-----AAQRYSEALDLLHAGACLQLKNNQ 69
           EN+  KLE  +  G+FYGA QMYK++  R++     +A++  +A  L    A   ++++Q
Sbjct: 29  ENVGKKLELRLAAGDFYGALQMYKTLFMRFLKGDEPSAEQQEKAAALALDAALKLVEHDQ 88

Query: 70  LTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALG 129
            T   E+A   V          D+ +  RVR +   F   P P    D            
Sbjct: 89  NTAATEMANLMVSVFADFHHPVDDAHKQRVRLVAAAFE--PKPQFSAD------------ 134

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                    + FLK A+KWS   GA K G PEL ++LA    +     D      HF+  
Sbjct: 135 --------LAVFLKNAVKWSAAEGARKRGDPELQLLLARAYRTAG---DFTHAMKHFLHS 183

Query: 190 NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL 249
            NP++ A  +  +  + YP E DL +ARA+L  LSL N++DAN + +     V   Q   
Sbjct: 184 ENPQELADALFQWSTQGYPSESDLYLARAVLQLLSLENLRDANKVYEAY---VAKCQSAG 240

Query: 250 QPSDLIQFVF--YVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
           +P DL  F F  ++L TLERDALPLF ML+  Y  ++ R+ +    L  I +KF+G++
Sbjct: 241 RPVDLPLFNFTRFLLLTLERDALPLFQMLQERYTPALARDSSLKSYLSVIGQKFYGLQ 298


>gi|281205376|gb|EFA79568.1| DUF410 family protein [Polysphondylium pallidum PN500]
          Length = 303

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/287 (31%), Positives = 138/287 (48%), Gaps = 38/287 (13%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I  LE   + GN+Y   Q YK++  R  +A++Y E + LL +GA   LK  Q  C AEL 
Sbjct: 6   IASLESKFSAGNYYDLLQSYKALYFRNASAKKYKETISLLLSGATNFLKYQQWNCAAELG 65

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              +E        YD+E+ + V  +++ F     P N               AA +    
Sbjct: 66  QLLIETYKSFGTKYDDESKNTVSTLFKGFN---CPEN---------------AAMI---- 103

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             +F++ AIKWS   GA   GS E H +LA  +      +D  +   HF+ GN+P  FA 
Sbjct: 104 --TFMRDAIKWSESAGAGDKGSDEFHALLAIAMAKNGDYIDAQK---HFIFGNDPISFAD 158

Query: 198 TIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQF 257
            +V +       E DL +ARA           DA  +     +++  +     P  L+ +
Sbjct: 159 MLVAWTRDASQSEVDLYIARA---------SADAEQLFQSFVQKLSIQTADFTP--LLNY 207

Query: 258 VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + ++L TL+RDALPLFN+L+  Y+ SI R+P F++ L+ IA  ++ V
Sbjct: 208 IRFLLLTLQRDALPLFNLLKQKYEPSIARDPMFSKYLEQIANLYYNV 254


>gi|332376763|gb|AEE63521.1| unknown [Dendroctonus ponderosae]
          Length = 326

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 90/305 (29%), Positives = 147/305 (48%), Gaps = 34/305 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++KLE  V  G +Y A QMY+++  RYV  + Y +  D+L+ G+ L L  NQ   GA+L 
Sbjct: 13  LEKLENSVKNGEYYEAHQMYRTLYFRYVGQKNYKDLKDMLYQGSILFLDANQKASGADLG 72

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++ L+K +   D       RK+  LF +I   +  E D                   
Sbjct: 73  LLLIDVLIKSE---DKSTEIWCRKLSTLFAKIGTTYATERD------------------- 110

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             SFL  A+KWS   G+ + G P LH  +A+  +    EL+  +   H++   + +  A 
Sbjct: 111 --SFLVQAVKWS-SLGSSR-GDPALHQYIAKVYWD---ELNYNQARHHYIHSQDGKNCAK 163

Query: 198 TIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS--DL 254
            ++ F M K +  E DL V +A+L +L L N   A+       +     +    P    L
Sbjct: 164 LLIEFQMTKGFKCEIDLFVTQAVLQFLCLRNQITASQTFTTYTENHPTIKKAEPPYLLPL 223

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQ--G 312
           + F++++LQ LE   L  F +L   Y+ S+ER+P + + LD I + FFG+K   P +  G
Sbjct: 224 LNFLWFLLQALETKKLQTFAVLCEQYQKSLERDPCYLQYLDKIGQLFFGLKPPQPKKSGG 283

Query: 313 IFGDI 317
           +FG +
Sbjct: 284 LFGSL 288


>gi|260824231|ref|XP_002607071.1| hypothetical protein BRAFLDRAFT_68138 [Branchiostoma floridae]
 gi|229292417|gb|EEN63081.1| hypothetical protein BRAFLDRAFT_68138 [Branchiostoma floridae]
          Length = 300

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 147/290 (50%), Gaps = 50/290 (17%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   +  G++Y A QMY+++  RY A ++YSEAL+L ++GA L L++NQL  GA+LA+ 
Sbjct: 17  KLRACIERGDYYEAHQMYRTLYFRYSAQKKYSEALELTYSGASLLLEHNQLCSGADLAML 76

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
           ++E L +  +    EN++R+                                        
Sbjct: 77  WLEVLTQANIHVSKENIERI---------------------------------------- 96

Query: 140 SFLKAAIKWSIEFGAPKT-GSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
           S++ AA+KW+      +T G P+LH  LA   + E    +  +  +HF+  ++ +  A+ 
Sbjct: 97  SYINAALKWTKTVHPDETSGHPQLHNQLAIVFWKEK---NYIQSRYHFLHCSDGQGCATM 153

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDA--NYIM-DEVKKQVENKQLQLQPSDL 254
           ++   + + +P E DL +A+ +  YL L N   A   Y+   E   Q+E+     +P  L
Sbjct: 154 LIELSLQQGFPSEVDLFIAQTVFQYLCLNNKSTAKVTYMTYTESHPQIEDGPPFQKP--L 211

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + FV+Y+L  +E   + +F++L   Y+ SI R+P++ + LD I + FFG+
Sbjct: 212 LNFVWYLLLAVEDGKVTVFSVLCDVYQPSIRRDPSYLQYLDRIGQLFFGL 261


>gi|443715115|gb|ELU07266.1| hypothetical protein CAPTEDRAFT_220393 [Capitella teleta]
          Length = 316

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/313 (29%), Positives = 154/313 (49%), Gaps = 38/313 (12%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KL+K ++EGN+Y A QMY+++  RY    +++EA+DLL+ G+   L   Q   G 
Sbjct: 8   QRVLAKLQKSLDEGNYYEAHQMYRTLYFRYNGQSKFTEAIDLLYNGSLGLLNAKQFASGT 67

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +LA+  VE L K      +E L+++ K+          H+L + D +   T         
Sbjct: 68  DLALLLVETLDKSSTKPSSETLEKLGKL----------HSLMEKDSVERPT--------- 108

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+ AA++WS      K G PELH  LA  L+ E      +   +H+VR N+ E+
Sbjct: 109 ------FVGAAVRWSS--AEQKFGHPELHKYLAHNLWLER---SYSESRYHYVRSNDGEE 157

Query: 195 FASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIM---DEVKKQVENKQLQLQ 250
            AS ++ + +   +P E DL +A+ ++  L L N   AN       +   QVE+    ++
Sbjct: 158 CASMLIEYQITAGFPSEVDLFIAQTVMQCLCLKNKLTANSCFVSYTQQHPQVESGPPFVR 217

Query: 251 PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRN 308
           P     ++  +L   +   + +F +L   Y+ ++ R+P +NE LD I + FFGV   R N
Sbjct: 218 PLLNFLWL--LLLACDGGTVAVFTILVEQYQPTLRRDPTYNEYLDRIGQLFFGVPPPRSN 275

Query: 309 PMQGIFGDIFKMM 321
              G+ G++   +
Sbjct: 276 NPSGMIGNLLSSL 288


>gi|185135778|ref|NP_001117086.1| Golgi to ER traffic protein 4 homolog [Salmo salar]
 gi|317376175|sp|A0MT11.1|GET4_SALSA RecName: Full=Golgi to ER traffic protein 4 homolog; AltName:
           Full=Conserved edge expressed protein
 gi|117643981|gb|ABK35126.2| Cee [Salmo salar]
          Length = 322

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 157/289 (54%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++++A +L++ GA L    NQL   A+L++ 
Sbjct: 24  KLRASVEKGDYYEAHQMYRTLFFRYISQAKHTDARELMYNGAQLFFSYNQLNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +   ++E+L+ + K++ L      P++ E                 RV    
Sbjct: 84  VLESLEKSEAKVEDEDLEHLAKLFSLMD----PNSPE-----------------RV---- 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G+P+LH +LA  L+ E    + +   +HF+  ++ E  A  +
Sbjct: 119 AFVSRALKWSTG-GSGKLGAPKLHQLLAVTLWKEQ---NYSESRYHFLHSSDGEGCAQML 174

Query: 200 VNFMG-KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L+
Sbjct: 175 VEYSAQRGFRSEVDMFVAQAVLQFLCLKNKNSASVVFSTYTQKHPSIEKDPPFVQP--LL 232

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 233 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 281


>gi|38570062|ref|NP_057033.2| Golgi to ER traffic protein 4 homolog [Homo sapiens]
 gi|74738593|sp|Q7L5D6.1|GET4_HUMAN RecName: Full=Golgi to ER traffic protein 4 homolog; AltName:
           Full=Conserved edge-expressed protein; AltName:
           Full=Transmembrane domain recognition complex 35 kDa
           subunit; Short=TRC35
 gi|32425304|gb|AAH03550.2| Chromosome 7 open reading frame 20 [Homo sapiens]
 gi|51094447|gb|EAL23708.1| chromosome 7 open reading frame 20 [Homo sapiens]
          Length = 327

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 160/309 (51%), Gaps = 39/309 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHTEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -TFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 236

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQG 312
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV  K+ +   G
Sbjct: 237 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGG 296

Query: 313 IFGDIFKMM 321
           + G++   +
Sbjct: 297 LLGNLLTSL 305


>gi|157278561|ref|NP_001098381.1| Golgi to ER traffic protein 4 homolog [Oryzias latipes]
 gi|317376174|sp|A1Z3X3.1|GET4_ORYLA RecName: Full=Golgi to ER traffic protein 4 homolog; AltName:
           Full=Conserved edge expressed protein
 gi|121281842|gb|ABM53480.1| conserved edge expressed protein [Oryzias latipes]
          Length = 323

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 148/289 (51%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V  G++Y A QMY+++  RY++  +++EA +L++ GA L   +NQ    A+L++ 
Sbjct: 24  KLRASVERGDYYEAHQMYRTLFFRYMSQAKHAEARELMYRGALLFFSHNQQNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E L K     ++E L+ + K++ L  Q   P                          +
Sbjct: 84  VLEVLEKSDAKVEDEILEHLAKLFSLMDQ-NSPER------------------------A 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    + +   +HF+  ++ E  A  +
Sbjct: 119 AFVSRALKWSTG-GSSKLGHPKLHQLLALTLWKEQ---NYSESRYHFLHSSDGEGCAQML 174

Query: 200 VNFMG-KCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQPSDLI 255
           V +   + Y  E DL VA+A+L +L L N   A+ +     E    ++     +QP  L+
Sbjct: 175 VEYWASRGYRNEVDLFVAQAVLQFLCLKNKSSASVVFSTYTEKHPSIQKGPPFVQP--LL 232

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 233 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 281


>gi|332864320|ref|XP_518924.3| PREDICTED: Golgi to ER traffic protein 4 homolog [Pan troglodytes]
 gi|410219512|gb|JAA06975.1| golgi to ER traffic protein 4 homolog [Pan troglodytes]
 gi|410291500|gb|JAA24350.1| golgi to ER traffic protein 4 homolog [Pan troglodytes]
 gi|410328889|gb|JAA33391.1| golgi to ER traffic protein 4 homolog [Pan troglodytes]
          Length = 327

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 160/309 (51%), Gaps = 39/309 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 236

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQG 312
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV  K+ +   G
Sbjct: 237 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGG 296

Query: 313 IFGDIFKMM 321
           + G++   +
Sbjct: 297 LLGNLLTSL 305


>gi|426355262|ref|XP_004045046.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Gorilla gorilla
           gorilla]
          Length = 327

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 160/309 (51%), Gaps = 39/309 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 236

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQG 312
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV  K+ +   G
Sbjct: 237 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGG 296

Query: 313 IFGDIFKMM 321
           + G++   +
Sbjct: 297 LLGNLLTSL 305


>gi|325191408|emb|CCA26185.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 324

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/306 (30%), Positives = 157/306 (51%), Gaps = 26/306 (8%)

Query: 14  AQENID-KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC 72
           A +NI  KL++ +  G++YGA QMYK++  R +     SEA               Q T 
Sbjct: 39  AIDNISRKLDEQLQSGDYYGALQMYKTLFMRTLK----SEA------------SVKQPTE 82

Query: 73  GAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
             ELA    + LVK K A  N   +    +  +F       +    + +  + +A  A  
Sbjct: 83  SIELASDAAKILVKHKQA--NAAAEMANMMLTVFTDYHCKVDEYTKNLIFSINDAFAAEG 140

Query: 133 LRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNP 192
              +  +SFL+ A+KWS   G+ K G PE+ ++LA   +     ++  + + H++    P
Sbjct: 141 GFSKELASFLRNAVKWSAAEGSRKQGHPEIQLLLARAYFKG---MEYKQAAKHYLHAEKP 197

Query: 193 EKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS 252
           E+FA  ++ +  + YP E DL +AR++L  L L N+KDAN + +  + +  N  L+   +
Sbjct: 198 EEFAEFLLTWSKEGYPSEFDLYIARSVLQLLCLENLKDANKLFETFQIKA-NCSLE---T 253

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQG 312
            L  F+ ++L+TLERDALPL+ +L+  Y A++ R+ +F + LD I E F+GV+ +     
Sbjct: 254 PLCHFIRFLLRTLERDALPLYQLLKDRYAAALTRDKSFEKYLDKIGENFYGVQPQRSGLS 313

Query: 313 IFGDIF 318
              D+F
Sbjct: 314 SLLDLF 319


>gi|317419159|emb|CBN81196.1| Conserved edge expressed protein [Dicentrarchus labrax]
          Length = 323

 Score =  124 bits (310), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 150/289 (51%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L++ GA L    NQ    A+L++ 
Sbjct: 24  KLRASVEKGDYYEAHQMYRTLFFRYMSQAKHAEARELMYNGALLFFSYNQQNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E L K +   D E L+ + K++      P+  N  +                RV    
Sbjct: 84  VLEVLEKSEAKVDEEILENLAKLFS-----PMDQNSPE----------------RV---- 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    + +   +HF+  ++ E  A  +
Sbjct: 119 AFVSRALKWSTG-GSSKLGHPKLHQLLAVTLWKEQ---NYSESRYHFLHSSDGEGCAQML 174

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     E    +E     +QP  L+
Sbjct: 175 VEYSASRGFRSEVDMFVAQAVLQFLCLKNKNSASVVFSTYTEKHPSIEKGPPFVQP--LL 232

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 233 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 281


>gi|134288855|ref|NP_001077091.1| Golgi to ER traffic protein 4 homolog [Takifugu rubripes]
 gi|317376177|sp|A4GWN3.1|GET4_TAKRU RecName: Full=Golgi to ER traffic protein 4 homolog; AltName:
           Full=Conserved edge expressed protein
 gi|130774548|gb|ABO32372.1| conserved edge expressed protein [Takifugu rubripes]
          Length = 323

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/289 (29%), Positives = 149/289 (51%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L++ GA L    NQ    A+L++ 
Sbjct: 24  KLRASVEKGDYYEAHQMYRTLYFRYMSQAKHAEARELMYNGALLFFSYNQQNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E L K K   ++E L+ + K++ L  Q                      +  RV    
Sbjct: 84  VLEVLEKSKGKVEDEILECLVKLFSLMDQ---------------------NSPERV---- 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P LH +LA  L+ E    + +   +HF+  ++ E  A  +
Sbjct: 119 AFVSRALKWSTG-GSGKLGHPRLHQLLALTLWKEQ---NYSESXYHFLHSSDGEGCAQML 174

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     E    +E     +QP  L+
Sbjct: 175 VEYSASRGFHSEVDMFVAQAVLQFLCLKNKNGASVVXSTYTEKHPSIEKGPPFVQP--LL 232

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   YK S++R+P +NE LD I + FFGV
Sbjct: 233 NFIWFLLLAVDGGKLTVFTVLCEQYKPSLKRDPMYNEYLDRIGQLFFGV 281


>gi|427785099|gb|JAA58001.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 340

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 148/314 (47%), Gaps = 46/314 (14%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++KLE  ++ GN+Y A QMY+++  RY   +RY+E  +LL+ GA   L+ +Q   GA+LA
Sbjct: 29  LNKLEACIDSGNYYEAHQMYRTLYFRYRGQKRYAELKELLYNGAVKLLRLDQCNSGADLA 88

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              V+ LV+      +E ++R+ +++ L      PH+ E                     
Sbjct: 89  NLLVDVLVQSDTEPCDEQIERLGRLFALL----APHSPER-------------------- 124

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
            S+FL  A++WS     P  G P+LH ++A  L+ E    +      HFV   + +  A+
Sbjct: 125 -SAFLARALQWSAGKEQPARGHPQLHRLVALTLWKEK---NYPEARHHFVHSTDGDGCAA 180

Query: 198 TIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQ 256
            ++ F   K Y  E DL +A+ +  YL L N   A+ +     ++         PS    
Sbjct: 181 MLIEFQTTKGYSSEIDLFIAQTVFQYLCLRNPSTASVVFLAYTRR--------HPSVHPG 232

Query: 257 FVFY---------VLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRR 307
             +Y         +L  +E   L +F +L   Y+ +I R+P + + LD I + FFG+   
Sbjct: 233 PPYYLPLLNFLWFLLLAVETRKLAVFTVLCEQYQPTISRDPTYPQYLDKIGQLFFGLPPP 292

Query: 308 NPMQGIFGDIFKMM 321
              QG+FG++ + +
Sbjct: 293 PKPQGMFGNLIQTL 306


>gi|384251705|gb|EIE25182.1| hypothetical protein COCSUDRAFT_65140 [Coccomyxa subellipsoidea
           C-169]
          Length = 304

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/311 (27%), Positives = 145/311 (46%), Gaps = 27/311 (8%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + ++E        Y AQQM K++  R  + ++ +E+ DLL  GA LQLK +QLTCG+ELA
Sbjct: 7   LSRIENATTTEAAYEAQQMLKTVYHRLRSRKKLTESYDLLQQGARLQLKGDQLTCGSELA 66

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  +  L       D   L  + +I +L   I  P+   +  D  E+ EA          
Sbjct: 67  LLLIHDL-----EADGFKLSELHRITDLLECIRKPYT--ESRDSPEIKEA---------- 109

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
            +    AAI+W  + G  +  + +LH  LA  +        +   S HF RG++P  FA+
Sbjct: 110 -TKITSAAIRWVKKMG-DEEAAKKLHSTLAALICKCQGAEGLLAASEHFSRGDDPAAFAA 167

Query: 198 TIVNFMGKCYPGEDDLAVARAILMYLSLGNMKD-------ANYIMDEVKKQVENKQLQLQ 250
            + +   K    E DL + RA+L  L++G           A  +M     +  +    + 
Sbjct: 168 YLADVSSKGSHSEHDLILTRAVLQVLAVGRASQLDMQLAFARELMGGFSNRTASAAASI- 226

Query: 251 PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM 310
            + L  F+ ++LQ L+  +L L  +L+  Y  S+ R+P   EML+ + + +F       M
Sbjct: 227 STPLTHFLEFLLQALDGRSLSLLELLQKEYSHSLARDPDLGEMLETVKQTYFPSMGGQGM 286

Query: 311 QGIFGDIFKMM 321
             + G+IF+M+
Sbjct: 287 SSLLGNIFQMI 297


>gi|115495649|ref|NP_001069993.1| Golgi to ER traffic protein 4 homolog [Bos taurus]
 gi|122142187|sp|Q0P5I8.1|GET4_BOVIN RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|112362160|gb|AAI19984.1| Chromosome 7 open reading frame 20 ortholog [Bos taurus]
          Length = 325

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 152/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY+A  +++EA +L+ +GA L   + Q    A+L++ 
Sbjct: 28  KLRASVEKGDYYEAHQMYRTLFFRYMAQSKHAEARELMCSGALLFFSHGQQNSAADLSML 87

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V   +E L+ + K++ L      P++ E                 RV    
Sbjct: 88  VLESLEKAEVEVADELLESLAKLFSLMD----PNSPE-----------------RV---- 122

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+  ++ E  A+ +
Sbjct: 123 AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSSDGEGCANML 178

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L+
Sbjct: 179 VEYSTARGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEGGPPFVQP--LL 236

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 237 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 285


>gi|444515974|gb|ELV11032.1| Golgi to ER traffic protein 4 like protein [Tupaia chinensis]
          Length = 728

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 83/289 (28%), Positives = 153/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 430 KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 489

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V   +E L+ + +++ L      P++ E                 RV    
Sbjct: 490 VLESLEKAEVDVADELLENLAQVFSLMD----PNSPE-----------------RV---- 524

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ +
Sbjct: 525 AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANML 580

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    +QP  L+
Sbjct: 581 VEYSTSRGFRSEVDMFVAQAVLQFLCLKNKNSASVVFTTYTQKHPSIEDGPPFVQP--LL 638

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 639 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 687


>gi|359319665|ref|XP_003639138.1| PREDICTED: LOW QUALITY PROTEIN: Golgi to ER traffic protein 4
           homolog [Canis lupus familiaris]
          Length = 325

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 85/290 (29%), Positives = 149/290 (51%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+ +GA L   + Q    A+L++ 
Sbjct: 28  KLLASVEKGDYYEAHQMYRTLFFRYMSQSKHTEARELMCSGALLFFSHGQQNSAADLSML 87

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 88  VLESLEKAEVEVADELLENLAKLFSLMDPNSPE----------------------RV--- 122

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A  
Sbjct: 123 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCAHM 177

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +EN    +QP  L
Sbjct: 178 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIENGPPFVQP--L 235

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 236 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 285


>gi|350537871|ref|NP_001232308.1| Golgi to ER traffic protein 4 homolog [Taeniopygia guttata]
 gi|317376176|sp|B5KFI0.1|GET4_TAEGU RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|197129830|gb|ACH46328.1| putative comparative gene identification 20 [Taeniopygia guttata]
          Length = 328

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 153/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   +NQ    A+L++ 
Sbjct: 30  KLRASVEKGDYYEAHQMYRTLFFRYMSQGKHAEARELMYSGALLFFSHNQQNSAADLSML 89

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K       + L+ + K++ L      P++ E                 RV    
Sbjct: 90  VLESLEKSDAKVAEDLLENLAKLFSLMD----PNSPE-----------------RV---- 124

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    + +   +HF+   + E  A+ +
Sbjct: 125 AFVSRALKWS-SGGSGKLGHPKLHQLLAITLWKEQ---NYSESRYHFLHSTDGEGCANML 180

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + Y  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L+
Sbjct: 181 VEYSSSRGYRSEVDMFVAQAVLQFLCLKNKTSASVVFTTYTQKHPSIEKGPPFVQP--LL 238

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 239 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 287


>gi|62859731|ref|NP_001016708.1| Golgi to ER traffic protein 4 homolog [Xenopus (Silurana)
           tropicalis]
 gi|317376178|sp|A4QNE0.1|GET4_XENTR RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|140833033|gb|AAI35490.1| hypothetical protein LOC549462 [Xenopus (Silurana) tropicalis]
          Length = 325

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/290 (29%), Positives = 151/290 (52%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++ EA +L+++GA L   ++Q    A+L++ 
Sbjct: 27  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHIEARELMYSGALLFFSHSQQNSAADLSML 86

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V    E L+ + K++ L  P  P                       RV   
Sbjct: 87  VLESLEKHEVKVTEELLENLAKLFSLMDPNSPE----------------------RV--- 121

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P+LH  LA  L+ E    +     +HF+  ++ E  A+ 
Sbjct: 122 -AFVSRALKWS-SGGSGKFGHPKLHQFLAITLWKEQ---NYYESRYHFLHSSDGEGCANM 176

Query: 199 IVNFMG-KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + Y  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L
Sbjct: 177 LVEYSSTRGYRSEVDMFVAQAVLQFLCLKNKTSASVVFTTYTQKHPSIERGPPFVQP--L 234

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  +E   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 235 LNFIWFLLLAVEGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 284


>gi|340720776|ref|XP_003398806.1| PREDICTED: LOW QUALITY PROTEIN: Golgi to ER traffic protein 4
           homolog [Bombus terrestris]
          Length = 318

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 94/308 (30%), Positives = 152/308 (49%), Gaps = 31/308 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSTLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + FV  L +  +     + D   KI  LF  I  P + E D                
Sbjct: 70  DLGILFVNVLTQSGI---EPSQDYFEKITNLFS-IMSPLSPERD---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A++WSI+    KTG P+LH  +A+  + E   + MAR   HF+   +   
Sbjct: 110 -----VFVQSALRWSIKGTEYKTGHPDLHQKIAQVFWREKNYI-MARQ--HFIYSRDGSG 161

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN-KQLQLQPS 252
            A+ +V     + Y  E DL +A+A+L YL L N   A  + +    Q            
Sbjct: 162 CAAMLVELHEQRGYKNEIDLFIAQAVLQYLCLQNKTTAQEVFNSYTSQHPKINSGPPYLL 221

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-Q 311
            L+ F+F+ L+T++   L +F +L   Y+ S+ R+P + + LD I + FF V    P  Q
Sbjct: 222 PLLNFLFFXLKTVDSGKLAVFTVLCEQYRISLNRDPCYRQYLDKIGQLFFNVPSPRPRNQ 281

Query: 312 GIFGDIFK 319
           G+FG + +
Sbjct: 282 GLFGTLLQ 289


>gi|27229052|ref|NP_080545.2| Golgi to ER traffic protein 4 homolog isoform 1 [Mus musculus]
 gi|81904741|sp|Q9D1H7.2|GET4_MOUSE RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|26385805|dbj|BAB22851.2| unnamed protein product [Mus musculus]
 gi|148687218|gb|EDL19165.1| RIKEN cDNA 1110007L15, isoform CRA_b [Mus musculus]
          Length = 327

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 153/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V   +E L+ + K++ L      P++ E                 RV    
Sbjct: 89  VLESLEKAEVDVADELLENLAKVFSLMD----PNSPE-----------------RV---- 123

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+  ++ E  A+ +
Sbjct: 124 AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSSDGEGCANML 179

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A  +     ++   +E+    +QP  L+
Sbjct: 180 VEYSTARGFRSEVDMFVAQAVLQFLCLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LL 237

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 238 NFIWFLLLAVDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 286


>gi|254281180|ref|NP_001156792.1| conserved edge expressed protein [Rattus norvegicus]
 gi|149035059|gb|EDL89779.1| similar to RIKEN cDNA 1110007L15 (predicted) [Rattus norvegicus]
          Length = 330

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 152/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 32  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 91

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V   +E L+ + K++ L      P++ E                 RV    
Sbjct: 92  VLESLEKAEVDVADELLENLAKVFSLMD----PNSPE-----------------RV---- 126

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ +
Sbjct: 127 AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANML 182

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A  +     ++   +E+    +QP  L+
Sbjct: 183 VEYSTARGFRSEVDMFVAQAVLQFLCLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LL 240

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 241 NFIWFLLLAVDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 289


>gi|402862692|ref|XP_003895680.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Papio anubis]
          Length = 328

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 151/290 (52%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 30  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 89

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 90  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 124

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 125 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 179

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 180 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 237

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 238 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 287


>gi|380813638|gb|AFE78693.1| Golgi to ER traffic protein 4 homolog [Macaca mulatta]
 gi|383419065|gb|AFH32746.1| Golgi to ER traffic protein 4 homolog [Macaca mulatta]
 gi|384947600|gb|AFI37405.1| Golgi to ER traffic protein 4 homolog [Macaca mulatta]
          Length = 327

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 151/290 (52%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 236

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 237 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 286


>gi|126334393|ref|XP_001378223.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Monodelphis
           domestica]
          Length = 331

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 85/289 (29%), Positives = 151/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G +Y A QMY+++  RY++  ++ EA +L+++GA L   ++Q    A+L++ 
Sbjct: 33  KLRASVEKGEYYEAHQMYRTLFFRYMSQGKHIEARELMYSGALLFFSHSQQNSAADLSML 92

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K      +E L+ + K++ L      P++ E                 RV    
Sbjct: 93  VLESLEKSDAKVTDELLENLAKVFSLMD----PNSPE-----------------RV---- 127

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    +     +HF+   + E  A+ +
Sbjct: 128 AFVSRALKWS-SGGSGKLGHPKLHQLLAITLWKEQ---NYCESRYHFLHSTDGEGCANML 183

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ---LQPSDLI 255
           V +   + Y  E D+ VA+A+L +L L N   A+ +     ++  + Q     +QP  L+
Sbjct: 184 VEYSASRGYRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIQKGPPFVQP--LL 241

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  +E   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 242 NFIWFLLLAVEGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 290


>gi|91093461|ref|XP_967372.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
           castaneum]
 gi|270012681|gb|EFA09129.1| hypothetical protein TcasGA2_TC015990 [Tribolium castaneum]
          Length = 320

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 86/306 (28%), Positives = 149/306 (48%), Gaps = 34/306 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++KLE  V  GN+Y A QMY+++  RY+A ++Y E LD+L+ G+ L L+++Q   GA+LA
Sbjct: 13  LEKLENSVKNGNYYEAHQMYRTLYFRYLAQKKYKETLDMLYNGSLLFLEHDQQISGADLA 72

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  V+ LVK +    +    ++ KI+                       ++ +     E 
Sbjct: 73  ILLVDVLVKSEEKDYDGWTPKLCKIFS----------------------SISSTTPERE- 109

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             +FL  A++WS    +   G P LH  +A+  + E    +  +   H++   + +  A 
Sbjct: 110 --TFLAQALRWSSNHRS--FGHPFLHQCIAQVYWDEK---NYFQARHHYIYSKDGQGCAK 162

Query: 198 TIVNFMG-KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS--DL 254
            ++ F   + +  E DL +A+A+  YL L N   A        +Q    +    P    L
Sbjct: 163 LLIEFQSTQGFQCEADLFIAQAVFQYLCLSNKITAYQTFTSYTEQHPRIKRSGPPYLLPL 222

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQ-GI 313
           + F+F++LQ +E   LP F +L   Y+ SI+R+P +   LD I + FFG+K     Q G+
Sbjct: 223 LNFIFFLLQAIESRKLPTFVILCEQYEPSIKRDPCYVGYLDKIGQIFFGLKPPQQKQGGL 282

Query: 314 FGDIFK 319
           F + F+
Sbjct: 283 FDNFFE 288


>gi|355690021|gb|AER99021.1| golgi to ER traffic protein 4-like protein [Mustela putorius furo]
          Length = 324

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 149/290 (51%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+ +GA L   + Q    A+L++ 
Sbjct: 28  KLRASVEKGDYYEAHQMYRTLFFRYMSQNKHTEARELMCSGALLFFSHGQQNSAADLSML 87

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 88  VLESLEKAEVEVADELLENLAKLFSLMDPNSPE----------------------RV--- 122

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A  
Sbjct: 123 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCAHM 177

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    +QP  L
Sbjct: 178 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEHGPPFVQP--L 235

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 236 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 285


>gi|351709992|gb|EHB12911.1| hypothetical protein GW7_14514 [Heterocephalus glaber]
          Length = 393

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 151/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 95  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 154

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V   +E L+ + K++ L      P++ E                 RV    
Sbjct: 155 VLESLEKAEVDVADELLENLAKVFSLMD----PNSPE-----------------RV---- 189

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
            F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ +
Sbjct: 190 VFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANML 245

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A  +     ++   +E+    +QP  L+
Sbjct: 246 VEYSTARGFRSEVDMFVAQAVLQFLCLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LL 303

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 304 NFIWFLLLAVDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 352


>gi|348568392|ref|XP_003469982.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Cavia porcellus]
          Length = 383

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 83/290 (28%), Positives = 150/290 (51%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 85  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 144

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   ++ L+ + K++ L  P  P                       RV   
Sbjct: 145 VLESLEKAEVDVADDLLENLAKVFSLMDPNSPE----------------------RV--- 179

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 180 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANM 234

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A  +     ++   +E+    +QP  L
Sbjct: 235 LVEYSTARGFRSEVDMFVAQAVLQFLCLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--L 292

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 293 LNFIWFLLLAVDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 342


>gi|350398037|ref|XP_003485066.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Bombus impatiens]
          Length = 318

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 152/308 (49%), Gaps = 31/308 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSTLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + F+  L +        + D   KI  LF  I  P + E D                
Sbjct: 70  DLGILFINVLTQSGT---EPSQDYFEKITNLFS-IMSPLSPERD---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A++WSI+    KTG P+LH  +A+  + E   + MAR   HF+   +   
Sbjct: 110 -----VFVQSALRWSIKGTEYKTGHPDLHQKVAQVFWREKNYI-MARQ--HFIYSRDGSG 161

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN-KQLQLQPS 252
            A+ +V     + Y  E DL +A+A+L YL L N   A  + +    Q            
Sbjct: 162 CAAMLVELHEQRGYKNEIDLFIAQAVLQYLCLQNKTTAQEVFNSYTSQHPKINSGPPYLL 221

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-Q 311
            L+ F+F++L+T++   L +F +L   Y+ S+ R+P + + LD I + FF V    P  Q
Sbjct: 222 PLLNFLFFLLKTVDSGKLAVFTVLCEQYRISLNRDPCYRQYLDKIGQLFFNVPSPRPRNQ 281

Query: 312 GIFGDIFK 319
           G+FG + +
Sbjct: 282 GLFGTLLQ 289


>gi|348502393|ref|XP_003438752.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Oreochromis
           niloticus]
          Length = 322

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 87/289 (30%), Positives = 152/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L++ GA L    NQ    A+L++ 
Sbjct: 24  KLRASVEKGDYYEAHQMYRTLFFRYMSQGKHAEARELMYNGALLFFSYNQQNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E L K +   ++E L+ + K   LF Q+        D +  E          RV    
Sbjct: 84  VLEVLEKSETKVEDEILECLAK---LFSQM--------DQNSPE----------RV---- 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    + +   +HF+  ++ E  A  +
Sbjct: 119 AFVSRALKWSTG-GSGKLGHPKLHQLLAVTLWKEQ---NYSESRYHFLHSSDGEGCAHML 174

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQPSDLI 255
           V +   + +  E D+ VA+A+L +L L N   A+ +     E    +E     +QP  L+
Sbjct: 175 VEYSASRGFRSEVDMFVAQAVLQFLCLKNKNSASVVFSTYTEKHPSIERGPPFVQP--LL 232

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 233 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 281


>gi|147903645|ref|NP_001086312.1| Golgi to ER traffic protein 4 homolog B [Xenopus laevis]
 gi|82183867|sp|Q6GLK9.1|GET4B_XENLA RecName: Full=Golgi to ER traffic protein 4 homolog B
 gi|49256514|gb|AAH74468.1| MGC84763 protein [Xenopus laevis]
          Length = 325

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 152/289 (52%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++ EA +L+++GA L   ++Q    A+L++ 
Sbjct: 27  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHIEARELMYSGALLFFSHSQRNSAADLSML 86

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K +V    E L+ + K++ L      P++ E                 RV    
Sbjct: 87  VLESLEKHEVKVTEELLENLAKLFSLMD----PNSPE-----------------RV---- 121

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH  LA  L+ E    +     +HF+  ++ E  A+ +
Sbjct: 122 AFVSRALKWSTG-GSGKFGDPKLHQFLAITLWKEQ---NYYESRYHFLHSSDGEGCANML 177

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + Y  E D+ V +A+L +L L N   A+ +     ++   +E     +QP  L+
Sbjct: 178 VEYSSSRGYRSEVDMFVVQAVLQFLCLKNKTSASVVFTTYTQKHPSIERGPPFVQP--LL 235

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  +E   L +F +L   Y+ S++R+P +NE LD I + FFG+
Sbjct: 236 NFIWFLLLAVEGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGL 284


>gi|395845583|ref|XP_003795508.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Otolemur
           garnettii]
          Length = 326

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 150/290 (51%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++ EA +L+++GA L   + Q    A+L++ 
Sbjct: 28  KLRASVEKGDYYEAHQMYRTLFFRYMSQGKHVEARELMYSGALLFFSHGQQNSAADLSML 87

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   ++ L+ + K++ L  P  P                       RV   
Sbjct: 88  VLESLEKAEVDVADDLLENLAKVFSLMDPNSPE----------------------RV--- 122

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 123 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANM 177

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 178 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKNSASVVFTTYTQKHPSIEDGPPFVEP--L 235

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 236 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 285


>gi|449445706|ref|XP_004140613.1| PREDICTED: Golgi to ER traffic protein 4-like [Cucumis sativus]
          Length = 91

 Score =  119 bits (297), Expect = 3e-24,   Method: Composition-based stats.
 Identities = 52/78 (66%), Positives = 67/78 (85%)

Query: 1  MSRQRPKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAG 60
          MSR R +R  LPP QE+I K+E +++ G++YGAQQMYKS+SARYVAA+RYSEALD+L +G
Sbjct: 10 MSRARSRRIVLPPVQEHIIKIEDVIDTGDYYGAQQMYKSVSARYVAAERYSEALDILQSG 69

Query: 61 ACLQLKNNQLTCGAELAV 78
          AC QLK+ Q+TCG+ELAV
Sbjct: 70 ACTQLKHEQITCGSELAV 87


>gi|440801616|gb|ELR22628.1| DUF410 family protein [Acanthamoeba castellanii str. Neff]
          Length = 271

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/273 (30%), Positives = 129/273 (47%), Gaps = 32/273 (11%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           +A +RY EA   L  G    L ++Q    +EL    ++  V    +  +  ++ + + + 
Sbjct: 1   MAQKRYDEAETTLVTGMEALLTHHQTVSASELGQQLIKLYVDTHKSPTDATIETIVRHFH 60

Query: 105 LFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFG-APKTGSPELH 163
            +        L+ DD                 G  SF+KAAIKW+I  G   K G P LH
Sbjct: 61  GY-------GLDVDD-----------------GRVSFMKAAIKWTIGEGCGGKHGEPRLH 96

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYL 223
             LA+Y  S+    D  R   H+VRG NP KF+S +V +     P E DL + RA+LMYL
Sbjct: 97  DALADYYASKG---DYGRAQQHYVRGTNPSKFSSMLVEYAKGGNPDEADLFITRAVLMYL 153

Query: 224 SLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLER-DALPLFNMLRANYKA 282
            L    +A  ++     Q  N  L+  P+ LI F   +L +  R +A  LF++LR +Y  
Sbjct: 154 CLQKQDEAKQVLHAYLAQ--NSALKTHPTPLINFTHLLLDSFTRSNAQALFSLLRKHYTP 211

Query: 283 SIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFG 315
           S+ R+P F + +D I   ++ ++ R    G FG
Sbjct: 212 SLSRDPEFLQYMDQIGSVYYRLQPRANAGG-FG 243


>gi|395514690|ref|XP_003761547.1| PREDICTED: LOW QUALITY PROTEIN: Golgi to ER traffic protein 4
           homolog [Sarcophilus harrisii]
          Length = 328

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 150/289 (51%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G +Y A QMY+++  RY++  ++ EA +L+++GA L   ++Q    A+L++ 
Sbjct: 30  KLRASVEKGEYYEAHQMYRTLFFRYMSQGKHIEARELMYSGALLFFSHSQQNSAADLSML 89

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L        +E L+ + K++ L      P++ E                 RV    
Sbjct: 90  VLESLEISDAKVTDELLENLAKVFSLMD----PNSPE-----------------RV---- 124

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    +     +HF+   + E  A+ +
Sbjct: 125 AFVSRALKWS-SGGSGKLGHPKLHQLLAVTLWKEQ---NYCESRYHFLHSTDGEGCANML 180

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ---LQPSDLI 255
           V +   + Y  E D+ VA+A+L +L L N   A+ +     ++  + Q     +QP  L+
Sbjct: 181 VEYSASRGYRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIQKGPPFVQP--LL 238

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  +E   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 239 NFIWFLLLAVEGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 287


>gi|72166355|ref|XP_786360.1| PREDICTED: Golgi to ER traffic protein 4 homolog B-like
           [Strongylocentrotus purpuratus]
          Length = 326

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 86/307 (28%), Positives = 153/307 (49%), Gaps = 40/307 (13%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL+K +  G++Y A QMY+++  RY+  ++++EA+++L++GA L L++NQ   GA+L+
Sbjct: 21  LGKLKKSIEAGDYYEAHQMYRTLYFRYMTQKKHAEAIEMLYSGASLLLQHNQHMSGADLS 80

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYE-LFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  +EAL   +     EN+ R+  + + L P+ P                          
Sbjct: 81  LLLIEALESAEAPISKENISRIGSLGKYLLPEAPER------------------------ 116

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
               F+ AAIKWS +    K G PELH   A  L+ E    +     +HF+R  + +  A
Sbjct: 117 --GKFIVAAIKWSQKDN--KKGDPELHKEFANMLWQEK---NYFEARYHFLRSTDGKGCA 169

Query: 197 STIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLI 255
             +V + + K Y  E D+ VA+A+L YL L N   A+  +    +   +  ++  P  ++
Sbjct: 170 LMLVEYQLSKGYGSEADMFVAQAVLQYLCLRNKTTASLTLQVYTEN--HPAIKAGPPFIL 227

Query: 256 ---QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-- 310
               F++ +L  ++   L  F +L   Y+ S++R+P +   LD I + FFG+    P   
Sbjct: 228 PLLNFMWLLLLAIKTQKLTAFTVLCEKYQPSLQRDPTYAAYLDKIGQVFFGLPPPMPTGR 287

Query: 311 QGIFGDI 317
           QG  G++
Sbjct: 288 QGFIGNL 294


>gi|291231439|ref|XP_002735673.1| PREDICTED: conserved edge expressed protein-like [Saccoglossus
           kowalevskii]
          Length = 312

 Score =  117 bits (294), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 89/313 (28%), Positives = 161/313 (51%), Gaps = 39/313 (12%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL+  V  G++Y A QMY+++  RY+A ++++EA++L+++GA + L +NQ   G +L 
Sbjct: 14  LKKLKASVEAGSYYEAHQMYRTLYFRYMAQKKHAEAIELIYSGANVLLTHNQHGSGTDLC 73

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  +E L     A   +NLD+V     LF Q+  P + E                     
Sbjct: 74  MLLIEVLNTSNTAPAADNLDKVSS---LFTQLG-PDSTER-------------------- 109

Query: 138 CSSFLKAAIKWSIEFG-APKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
              +  AA+KWS +   + K+G PELH +LA  L+ E    + AR  +HF+   + +  A
Sbjct: 110 -PLYKNAALKWSCKNDPSHKSGHPELHTLLALILWKEKSYYE-AR--YHFLHSFDGQSCA 165

Query: 197 STIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDAN---YIMDEVKKQVENKQLQLQPS 252
           + ++ + + +    E D+ VA+ +L YL L N + A+   Y   E    ++     +QP 
Sbjct: 166 TMLIEYSLRQGLRSEYDMFVAQIVLQYLCLKNEETASVVFYTYTEKHPDIQESPPFMQP- 224

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK--RRNPM 310
            L+ F++++L  L      +F++L   Y+ +I+R+P++ + L  I   FFG+       M
Sbjct: 225 -LLNFIWFLLLALPGGKTTIFSVLCDKYEPAIKRDPSYVDYLSRIGHLFFGMPPLAEESM 283

Query: 311 Q--GIFGDIFKMM 321
           Q  G+FG+I + +
Sbjct: 284 QPGGVFGNILQSL 296


>gi|157129645|ref|XP_001655435.1| hypothetical protein AaeL_AAEL002521 [Aedes aegypti]
 gi|157134022|ref|XP_001656303.1| hypothetical protein AaeL_AAEL012936 [Aedes aegypti]
 gi|108870630|gb|EAT34855.1| AAEL012936-PA [Aedes aegypti]
 gi|108882036|gb|EAT46261.1| AAEL002521-PA [Aedes aegypti]
          Length = 345

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 88/314 (28%), Positives = 147/314 (46%), Gaps = 36/314 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KLE  +  GN+Y A QMY+++  RY++ Q+Y E LDLL+ GA   L + Q T GA+L 
Sbjct: 21  LAKLESSIEAGNYYEAHQMYRTLYFRYLSQQKYEELLDLLYQGALTLLSHEQNTSGADLG 80

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  V+ L K K   +      ++KI  L  +I  P N+ + + L                
Sbjct: 81  LLLVDTLEKSKTVAEGAE-KWMQKIALLLSKI--PQNVVERETL---------------- 121

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
               L  A+KWS      + G P +H ++A+ +++E    ++A+   HF+   +      
Sbjct: 122 ----LVKAVKWSASVSKSQVGHPLMHKLIAQIMWNED---NLAQARHHFLLSKDGSGCGH 174

Query: 198 TIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQP--SDL 254
            ++     K +P E DL +A+ IL  L L     A        K          P  + L
Sbjct: 175 MLIQLSQTKGFPSEMDLFIAQVILQQLCLKETSTAAETFATYTKYHPKIACSEPPFITPL 234

Query: 255 IQFVFYVLQTLERD--ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKR-RNP-- 309
           + F+F++L+ +E +   L +F  L   YK S+ER+P++ + L  I   FF   + + P  
Sbjct: 235 LNFIFFLLRAIEANQRKLIVFRTLCELYKPSLERDPSYEKYLQKIGMIFFEASQPQRPQS 294

Query: 310 --MQGIFGDIFKMM 321
             M GIFG++   +
Sbjct: 295 EGMGGIFGNLLNQL 308


>gi|148223313|ref|NP_001084795.1| Golgi to ER traffic protein 4 homolog A [Xenopus laevis]
 gi|82185251|sp|Q6NRL4.1|GET4A_XENLA RecName: Full=Golgi to ER traffic protein 4 homolog A
 gi|47124919|gb|AAH70733.1| MGC83720 protein [Xenopus laevis]
          Length = 325

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 150/290 (51%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++ EA +L+++GA L   ++Q    A+L++ 
Sbjct: 27  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHIEARELMYSGALLFFSHSQQNSAADLSML 86

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V    E L+ + K++ L  P  P                       RV   
Sbjct: 87  VLESLEKHEVKVTEELLENLAKLFSLMDPNSPE----------------------RV--- 121

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G  +LH  LA  L+ E    +     +HF+  ++ E  A+ 
Sbjct: 122 -AFVSRALKWS-SGGSGKFGHQKLHQFLAITLWKEQ---NYYESRYHFLHSSDGEGCANM 176

Query: 199 IVNFMG-KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + Y  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L
Sbjct: 177 LVEYSSTRGYRSEVDMFVAQAVLQFLCLKNKTSASVVFTTYTQKHPSIERGPPFVQP--L 234

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  +E   L +F +L   Y+ S++R+P +NE LD I + FFG+
Sbjct: 235 LNFIWFLLLAVEGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGL 284


>gi|125524555|gb|EAY72669.1| hypothetical protein OsI_00535 [Oryza sativa Indica Group]
          Length = 120

 Score =  117 bits (292), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 62/122 (50%), Positives = 79/122 (64%), Gaps = 4/122 (3%)

Query: 102 IYELFPQIPVPHNLEDD--DDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGS 159
           +Y  FP+I VPH L DD  DD  +L+EA+ AAK+R E CSSFL+AAI+   E        
Sbjct: 1   MYSAFPRISVPHFLGDDYDDDGQKLSEAISAAKVRSESCSSFLRAAIR--AETLCLFNCY 58

Query: 160 PELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAI 219
             + +  A+         DM +VS HFVRGN+P+KFAS + NFMGKCYPGEDD A+AR +
Sbjct: 59  SLMPLWFAKVRVDCRWMQDMTKVSSHFVRGNDPKKFASMLANFMGKCYPGEDDTAIARGV 118

Query: 220 LM 221
           LM
Sbjct: 119 LM 120


>gi|380029039|ref|XP_003698190.1| PREDICTED: LOW QUALITY PROTEIN: Golgi to ER traffic protein 4
           homolog [Apis florea]
          Length = 315

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 90/307 (29%), Positives = 149/307 (48%), Gaps = 32/307 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSTLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + FV  L +        + D   KI  LF  I  P + E D                
Sbjct: 70  DLGILFVNVLTQSGT---EPSQDYFEKITNLFS-IMSPLSPERD---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A++WSI+    K G P+LH  +A+  + E   + MAR   HF+   +   
Sbjct: 110 -----VFVQSALRWSIKNTDYKIGHPDLHQKIAQVFWREKNYV-MARQ--HFIYSRDGSG 161

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD 253
            A+ +V     + Y  E DL +A+A+L YL L N   A  + +    Q  + ++   P  
Sbjct: 162 CAAMLVELHEQRGYKNEIDLFIAQAVLQYLCLQNKTTAQEVFNSYTSQ--HPKINRGPPY 219

Query: 254 LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-QG 312
           L+  + ++    +   L +F +L   Y+ S+ R+P + + LD I + FF V    P  QG
Sbjct: 220 LLPLLNFLFFYXKYGKLAVFTVLCEQYRISLNRDPCYRQYLDKIGQLFFNVPSPRPRNQG 279

Query: 313 IFGDIFK 319
           +FG + +
Sbjct: 280 LFGTLLQ 286


>gi|66520405|ref|XP_395262.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Apis mellifera]
          Length = 318

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 92/308 (29%), Positives = 151/308 (49%), Gaps = 31/308 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSTLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + FV  L +        + D   KI  LF  I  P + E D                
Sbjct: 70  DLGILFVNVLTQSGT---EPSQDYFEKITNLFS-IMSPLSPERD---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A++WSI+    K G P+LH  +A+  + E   + MAR   HF+   +   
Sbjct: 110 -----VFVQSALRWSIKNTDYKIGHPDLHQKIAQVFWREKNYV-MARQ--HFIYSRDGSG 161

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN-KQLQLQPS 252
            A+ +V     + Y  E DL +A+A+L YL L N   A  + +    Q     +      
Sbjct: 162 CAAMLVELHEQRGYKNEIDLFIAQAVLQYLCLQNKTTAQEVFNSYTSQHPKINRGPPYLL 221

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-Q 311
            L+ F+F++L+ ++   L +F +L   Y+ S+ R+P + + LD I + FF V    P  Q
Sbjct: 222 PLLNFLFFLLKIVDSGKLAVFTVLCEQYRISLNRDPCYRQYLDKIGQLFFNVPSPRPRNQ 281

Query: 312 GIFGDIFK 319
           G+FG + +
Sbjct: 282 GLFGTLLQ 289


>gi|315507147|ref|NP_001186836.1| golgi to ER traffic protein 4 homolog [Danio rerio]
          Length = 322

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 84/290 (28%), Positives = 147/290 (50%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V  G++Y A QMY+++  RY++  +++EA +L++ GA L    NQ    ++L++ 
Sbjct: 24  KLRASVERGDYYEAHQMYRTLFFRYLSQAKHAEARELMYNGALLFFSYNQQNSASDLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +   D+E L+ + K++ L  P  P                       RV   
Sbjct: 84  VLESLEKSEAKVDDETLEHLAKLFSLMDPNSPE----------------------RV--- 118

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  AIKWS   G+ K G P+LH +LA  L+ E    + +   +HF+R ++ E  A  
Sbjct: 119 -AFVSRAIKWS-SGGSGKLGHPKLHQLLALTLWKEQ---NYSESRYHFLRSSDGEGCAQM 173

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + + GE D+ VA+A+L +L L N   A  +     ++   +E     +QP   
Sbjct: 174 LVEYSSARGFHGEVDMFVAQAVLQFLCLKNKTSALVVFTTYTQKHPSIEKGPPFVQPLLN 233

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
             +   +L  ++   L +F +L   Y+ S++++P +NE LD I + FFGV
Sbjct: 234 FLWF--LLLAVDGGKLTVFTVLCEQYQPSLKKDPMYNEYLDRIGQLFFGV 281


>gi|289741337|gb|ADD19416.1| uncharacterized conserved protein [Glossina morsitans morsitans]
          Length = 332

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 90/311 (28%), Positives = 142/311 (45%), Gaps = 37/311 (11%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KLE  +N GN+Y A QMY+++  RY+A ++Y E +DLL++GA   ++N Q    A+L + 
Sbjct: 18  KLEHSINTGNYYEAHQMYRTLYFRYIAQRKYFECMDLLYSGALRLVRNAQEPSAADLGLL 77

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            V+ L K             R   EL+ +            L  L E L    +  E   
Sbjct: 78  IVDTLEKSGH----------RTNIELWIE-----------RLSALIENLSPTTVERE--- 113

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR-GNNPEKFAST 198
           + +   IKWS E      G P LH  +A  L+SE    + AR      R GN   +F   
Sbjct: 114 TLINRCIKWSGEVSGHNLGHPSLHKRIARLLWSEG-NTEQARHHVLLCRDGNMCGRF--- 169

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD--LIQ 256
           +V    K Y  E DL + +A+L  L L + K A        +Q  + + +  P    L+ 
Sbjct: 170 LVKISEKSYTSEVDLIIVQAVLQQLCLKDRKTAEETFTTYIRQHPDIKRKAAPYKEILLN 229

Query: 257 FVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNP------M 310
           F++++   ++ +   +F  L   YK S++R+P+F + L  I   FF +    P      M
Sbjct: 230 FIYFLFLCIDINRSAVFRSLCDLYKPSLDRDPSFQKYLVKIGVLFFDINPSPPSSGANMM 289

Query: 311 QGIFGDIFKMM 321
            G+FGD+F  +
Sbjct: 290 GGMFGDLFSRL 300


>gi|307166166|gb|EFN60415.1| UPF0363 protein C7orf20 [Camponotus floridanus]
          Length = 319

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/309 (27%), Positives = 152/309 (49%), Gaps = 32/309 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY++ ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLSQRKYSELLELLYNGSMLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + F+  L +          +++  ++ L      P + E +                
Sbjct: 70  DLGILFINVLTQSGTKPTQSFFEKITSLFSLMS----PASPERE---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAP-KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
                +F++ A+KWS++     KTG P+LH  +A+  + E   + MAR   HF+   +  
Sbjct: 110 -----AFVQLALKWSVKSAVSNKTGHPDLHQKIAQVFWREKNYI-MARQ--HFIHSRDGS 161

Query: 194 KFASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVK-KQVENKQLQLQP 251
             A+ +V     + Y  E DL +A+A+L YL L N   A    +    +  +        
Sbjct: 162 GCAAMLVELHEQRGYTNEIDLFIAQAVLQYLCLQNKATAQEAFNSYTLRHPKINSGPPYL 221

Query: 252 SDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM- 310
             L+ F+F++L+T++   L +F +L   Y+ S+ R+P + + LD I + FF +    P  
Sbjct: 222 LPLLNFLFFLLKTIDSGKLAVFTILCEQYQISLNRDPYYRQYLDKIGQLFFNIPPPRPRN 281

Query: 311 QGIFGDIFK 319
           QG+FG I +
Sbjct: 282 QGLFGSILQ 290


>gi|391337680|ref|XP_003743194.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Metaseiulus
           occidentalis]
          Length = 316

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 86/316 (27%), Positives = 146/316 (46%), Gaps = 46/316 (14%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KLEK    G+FY   QMY++I  R+   ++Y+E  D++  GA   L   +L  GA+LA  
Sbjct: 15  KLEKCFESGDFYHTHQMYRTIYFRFKRQEKYAECKDIMFEGAERLLSKGELNSGADLAKL 74

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
           ++E L KG +  D+E + ++ K++ +  P  P                       RV+  
Sbjct: 75  YLEVLSKGSIQADSEIMKKLAKLHSMMKPSSPE----------------------RVD-- 110

Query: 139 SSFLKAAIKWSIEFG----APKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
             F    +++S   G        G P +H +LA   ++E+   D     +HF R  +   
Sbjct: 111 --FTGECLQYSTVLGDLSDKKAFGHPNMHRLLALNYWAETNFRD---AKYHFTRSLDGTS 165

Query: 195 FASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDAN---YIMDEVKKQVENKQLQLQ 250
            A  ++ + + K Y  E DL +A+ +L YL L N + A    Y+       V+     L 
Sbjct: 166 CALMLIEYHLSKGYKSEVDLFIAQTVLQYLCLNNQRTATVAFYVYTRWHPNVDGPPFVLP 225

Query: 251 PSDLIQFVFYVLQTLE-RDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV----K 305
              L+ F++++L  +E R  L +F +L   YK+ + R+P +N  LD I + FFG      
Sbjct: 226 ---LLNFIWFLLLAVETRANLAIFTVLCERYKSFLARDPMYNVYLDKIGQLFFGKAVAPS 282

Query: 306 RRNPMQGIFGDIFKMM 321
            R+P  G+ G++   +
Sbjct: 283 ERHPAGGLLGNMISQL 298


>gi|324513879|gb|ADY45685.1| Golgi to ER traffic protein 4 [Ascaris suum]
          Length = 385

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 84/310 (27%), Positives = 151/310 (48%), Gaps = 29/310 (9%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   +  G++Y A Q+Y+++  R  A  ++ +  DLL+ GA   L  N+ T   +LA  
Sbjct: 21  KLADRLESGDYYEAHQIYRTLYYRMSAQGKWQQLQDLLYNGAIKLLDVNEPTSAVDLAEL 80

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
           FVE+L +  +   +  LDR  +++   P +     LE D      T   G    RV+  +
Sbjct: 81  FVESLQQSNMPVSSATLDRFEQLFTRLPPV-----LEKD------TTVDGN---RVDRRN 126

Query: 140 SFLKAAIKWSIEFGAPK----TGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKF 195
             L  A+KW++  G  K     G P+ H  +A+ L+ E    +      HF+  ++ EKF
Sbjct: 127 ELLSRALKWTMSVGEKKRYREKGHPDFHARVAKILWHEG---NYVAARNHFILSDDAEKF 183

Query: 196 ASTIVNFMGKC-YPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQP 251
           AS +++F  +C +  E DL +A+A L  L     K A+ ++    E   Q+  +     P
Sbjct: 184 ASFLIDFQQRCGFKSEKDLFIAQATLQVLCTRRAKMASVLLQLYCEKHPQIGGEAPYELP 243

Query: 252 SDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRNP 309
             L+ F++ +   ++   +  F++L   Y+  +ER+P++   LD I + +FG+   +++ 
Sbjct: 244 --LLNFIWLLTLCVQLKKVIYFSILVEKYQKCLERDPSYRVYLDKIGQLYFGLPPAQKSS 301

Query: 310 MQGIFGDIFK 319
             G+ G I K
Sbjct: 302 CGGLLGSILK 311


>gi|410984253|ref|XP_003998444.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Felis catus]
          Length = 368

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 80/282 (28%), Positives = 145/282 (51%), Gaps = 35/282 (12%)

Query: 27  EGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVK 86
           +G++Y A QMY+++  RY++  +++EA +L+ +GA L   + Q    A+L++  +E+L K
Sbjct: 78  KGDYYEAHQMYRTLFFRYMSQSKHTEARELMCSGALLFFSHGQQNSAADLSMLVLESLEK 137

Query: 87  GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAI 146
            +V    + L+ + K++ L      P++ E                 RV    +F+  A+
Sbjct: 138 AEVEVAADLLENLAKLFSLMD----PNSPE-----------------RV----AFVSRAL 172

Query: 147 KWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGK 205
           KWS   G+ K G P LH +LA  L+            +HF+   + E  A  +V +   +
Sbjct: 173 KWS-SGGSGKLGHPRLHQLLALTLWKGR---XYCESRYHFLHSADGEGCAHMLVEYSTSR 228

Query: 206 CYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVL 262
            +  E D+ VA+A+L +L L N   A+ +     ++   +EN    +QP  L+ F++++L
Sbjct: 229 GFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIENGPPFVQP--LLNFIWFLL 286

Query: 263 QTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
             ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 287 LAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 328


>gi|322794363|gb|EFZ17467.1| hypothetical protein SINV_03292 [Solenopsis invicta]
          Length = 338

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/308 (27%), Positives = 151/308 (49%), Gaps = 31/308 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  ++  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   GA
Sbjct: 30  QRVLAKLEASISSENYYEAHQMYRTLYFRYLGQRKYSELLELLYNGSMLLLQHEQHASGA 89

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + FV  L +          +++  ++ L      P + E +                
Sbjct: 90  DLGILFVNVLTQSGTVPAQGFFEKITSLFSLMS----PTSPERE---------------- 129

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A+KWS++    KTG P+LH  +A+  + E   + MAR   HF+   +   
Sbjct: 130 -----IFVQSALKWSVKGTDYKTGHPDLHQKIAQVFWREKNYI-MARQ--HFIHSRDGSG 181

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE-VKKQVENKQLQLQPS 252
            A+ +V     + Y  E DL +A+A+L YL L N   A    +    +  +         
Sbjct: 182 CAAMLVELHEQRGYMNEIDLFIAQAVLQYLCLQNKATAQEAFNSYTSRHPKINSGPPYLL 241

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM-Q 311
            L+ F+F++L+T++   L +F +L   Y+ S+ R+P + + LD I + FF +    P  Q
Sbjct: 242 PLLNFLFFLLKTIDSGKLAVFTVLCEQYQISLNRDPCYRQYLDKIGQLFFNIPPPRPRNQ 301

Query: 312 GIFGDIFK 319
           G+FG + +
Sbjct: 302 GLFGSLLQ 309


>gi|119607585|gb|EAW87179.1| chromosome 7 open reading frame 20, isoform CRA_a [Homo sapiens]
 gi|119607586|gb|EAW87180.1| chromosome 7 open reading frame 20, isoform CRA_a [Homo sapiens]
          Length = 283

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 79/292 (27%), Positives = 152/292 (52%), Gaps = 37/292 (12%)

Query: 36  MYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNEN 95
           MY+++  RY++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E 
Sbjct: 1   MYRTLFFRYMSQSKHTEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADEL 60

Query: 96  LDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAP 155
           L+ + K++ L      P++ E                 RV    +F+  A+KWS   G+ 
Sbjct: 61  LENLAKVFSLMD----PNSPE-----------------RV----TFVSRALKWS-SGGSG 94

Query: 156 KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLA 214
           K G P LH +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ 
Sbjct: 95  KLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMF 151

Query: 215 VARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALP 271
           VA+A+L +L L N   A+ +     ++   +E+    ++P  L+ F++++L  ++   L 
Sbjct: 152 VAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLT 209

Query: 272 LFNMLRANYKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQGIFGDIFKMM 321
           +F +L   Y+ S+ R+P +NE LD I + FFGV  K+ +   G+ G++   +
Sbjct: 210 VFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGGLLGNLLTSL 261


>gi|417409672|gb|JAA51331.1| Putative golgi to er traffic protein 4, partial [Desmodus rotundus]
          Length = 318

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 152/290 (52%), Gaps = 37/290 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+ +GA L  ++ Q    A+L++ 
Sbjct: 21  KLRASVEKGDYYEAHQMYRTLFFRYMSQNKHAEARELMCSGALLFFRHGQQNSAADLSML 80

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V    E L+ + +++ L  P  P                       RV   
Sbjct: 81  VLESLEKAEVEVAEELLESLARLFSLMDPNSPE----------------------RV--- 115

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ + G P LH +LA  L+ E    +     +HF+  ++ E  A+ 
Sbjct: 116 -AFVSRALKWSGG-GSGRLGHPRLHQLLALTLWREQ---NYCESRYHFLHSSDGEGCANM 170

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    +QP  L
Sbjct: 171 LVEYSTARGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVQP--L 228

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           + F++++L  ++   L +F +L  +Y+ S+ R+P +NE LD I + FFGV
Sbjct: 229 LNFIWFLLLAVDGGKLTVFTVLCEHYQPSLRRDPMYNEYLDRIGQLFFGV 278


>gi|57525036|ref|NP_001006159.1| Golgi to ER traffic protein 4 homolog [Gallus gallus]
 gi|82082134|sp|Q5ZKG8.1|GET4_CHICK RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|53130909|emb|CAG31775.1| hypothetical protein RCJMB04_10o20 [Gallus gallus]
          Length = 312

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 82/288 (28%), Positives = 143/288 (49%), Gaps = 48/288 (16%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +Y EA +L+++GA L   +NQ    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQGKYVEARELMYSGALLFFSHNQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K      ++ L+ + K++ L      P++ E                 RV    
Sbjct: 89  VLESLEKSDAKVTDDLLENLAKLFSLMD----PNSPE-----------------RV---- 123

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ +        V +   RG          
Sbjct: 124 AFVSRALKWS-SGGSGKLGHPKLHQLLAITLWKDGEGCANMLVEYSSSRG---------- 172

Query: 200 VNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQ 256
                  Y  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L+ 
Sbjct: 173 -------YRSEVDMFVAQAVLQFLCLKNKTSASVVFTTYTQKHPSIEKGPPFVQP--LLN 223

Query: 257 FVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
           F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 224 FIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 271


>gi|307206359|gb|EFN84411.1| UPF0363 protein C7orf20 [Harpegnathos saltator]
          Length = 318

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/313 (26%), Positives = 150/313 (47%), Gaps = 39/313 (12%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
            Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L+++Q   G
Sbjct: 9   VQRVLAKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSMLLLQHDQHASG 68

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIY----ELFPQIPVPHNLEDDDDLHELTEALG 129
           A+L + F+  L + +        +++  ++     +FP+  V                  
Sbjct: 69  ADLGILFINVLTQSETEPTQSFFEKITTLFGTMSPMFPEREV------------------ 110

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                      F+++A+KWSI+    KTG P+LH  +A+  + E   + MAR   HF+  
Sbjct: 111 -----------FIQSALKWSIKGTDYKTGHPDLHQKIAQIFWREKNYI-MARQ--HFMHS 156

Query: 190 NNPEKFASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE-VKKQVENKQL 247
            +    A+ +V     + Y  E DL +A A+  YL L N   A    +    +  +    
Sbjct: 157 RDGSGCAAMLVELHEQRGYMNEIDLFIAHAVFQYLCLQNKATAKEAFNSYTSRHPKISSG 216

Query: 248 QLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRR 307
                 L+ F+F++L+T++   L +F +L   Y+ S+ R+P + + LD I + FF +   
Sbjct: 217 PPYLLPLLNFLFFLLKTIDSGKLAVFTVLCDQYQISLNRDPCYRQYLDKIGQLFFNIPPP 276

Query: 308 NPM-QGIFGDIFK 319
            P  QG+F  + +
Sbjct: 277 RPRNQGLFSSLLQ 289


>gi|332030784|gb|EGI70460.1| UPF0363 protein C7orf20 [Acromyrmex echinatior]
          Length = 320

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 85/310 (27%), Positives = 147/310 (47%), Gaps = 33/310 (10%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL++G+ L L++ Q   GA
Sbjct: 10  QRVLAKLEASINSENYYEAHQMYRTLYFRYLGQRKYSELLELLYSGSMLLLQHEQHASGA 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L + FV  L +          +++  ++ L      P + E +                
Sbjct: 70  DLGILFVNVLTQSGTVPAQGFFEKIMSLFSLMS----PASPERE---------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+++A+KWSI+    KTG P+LH  +A+  + E   + +AR   HF+   +   
Sbjct: 110 -----VFVQSALKWSIKGTDYKTGHPDLHQKIAQIFWREKNYI-LARQ--HFIYSRDGSG 161

Query: 195 FASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK---QVENKQLQLQ 250
            A+ +V     + Y  E DL +A+A+L YL L N   A    +       ++ +    L 
Sbjct: 162 CAAMLVELHEQRGYMNEIDLFIAQAVLQYLCLQNKATAQEAFNSYTSRHPKINSGPPYLL 221

Query: 251 PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM 310
           P     F    +    + A P+F +L   Y+ S+ R+P + + LD I + FF +    P 
Sbjct: 222 PLLNFLFFLLKIIDSGKLAYPVFAVLCEQYQISLNRDPCYRQYLDKIGQLFFNIPPPRPR 281

Query: 311 -QGIFGDIFK 319
            QG+FG + +
Sbjct: 282 NQGLFGSLLQ 291


>gi|345305114|ref|XP_001511707.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Ornithorhynchus
           anatinus]
          Length = 386

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 82/289 (28%), Positives = 147/289 (50%), Gaps = 35/289 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y    MY  +   Y++  ++ EA +L+++GA L   + Q    A+L++ 
Sbjct: 88  KLRASVEKGDYYKLTVMYWILFFPYMSQGKHIEARELMYSGALLFFSHGQQNSAADLSML 147

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E+L K      +E L+ + K++ L      P++ E                 RV    
Sbjct: 148 VLESLEKSDAKVTDELLENLAKVFSLMD----PNSPE-----------------RV---- 182

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
           +F+  A+KWS   G+ K G P+LH +LA  L+ E    +     +HF+  ++ E  A+ +
Sbjct: 183 AFVSRALKWS-SGGSGKLGHPKLHQLLAITLWKEQ---NYCESRYHFLHSSDGEGCANML 238

Query: 200 VNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLI 255
           V +   + Y  E DL VA+A+L +L L N   A+ +     ++   +E     +QP  L+
Sbjct: 239 VEYSASRGYRSEVDLFVAQAVLQFLCLKNKSSASMVFTTYTQKHPSIEKGPPFVQP--LL 296

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            F++++L  ++   L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 297 NFIWFLLLAVDGGKLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 345


>gi|300117072|ref|NP_001177864.1| UPF0363 protein CG9853 homolog [Nasonia vitripennis]
          Length = 319

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 81/314 (25%), Positives = 145/314 (46%), Gaps = 44/314 (14%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KLE  +N G++Y A QMY+++  RY+  ++Y E + +L+ G+ L L+  Q T GA+L 
Sbjct: 13  LSKLESSINAGDYYEAHQMYRTLYFRYLGQKKYHELIQMLYNGSLLLLQCEQYTSGADLG 72

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++   K ++                    P     +   DL  L   +   +     
Sbjct: 73  ILLIDVFKKAEIK-------------------PTHSYFQKMIDLFTLMNPVCPER----- 108

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             +F++ A++WS+     KTG P+ H  LA+ +Y +     +AR   H++   +    AS
Sbjct: 109 -ETFVQNALRWSMTGSTYKTGHPDFHRKLAQ-VYWQEKNYPLARQ--HYLYTKDGFGCAS 164

Query: 198 TIVNFMGK-CYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS---D 253
            ++    K  Y  E D  + + +L YL L N + A         Q  + ++  QP     
Sbjct: 165 MLIELHQKHGYSNEIDFFITQTVLQYLCLQNAEAAEEAFTSFINQ--HPKIAKQPPFILP 222

Query: 254 LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK------RR 307
           L+ F F++L+T++   + +F +L   Y  S+ R+P F + L+ I + FF V+      RR
Sbjct: 223 LLNFSFFLLKTIQFGKISMFTILCDQYFTSLSRDPCFLQYLEKIGQIFFDVQPSQIYSRR 282

Query: 308 ----NPMQGIFGDI 317
               + +Q  F DI
Sbjct: 283 GLFSSFLQSFFNDI 296


>gi|296472999|tpg|DAA15114.1| TPA: golgi to ER traffic protein 4 homolog [Bos taurus]
          Length = 278

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 78/278 (28%), Positives = 142/278 (51%), Gaps = 37/278 (13%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY+A  +++EA +L+ +GA L   + Q    A+L++ 
Sbjct: 28  KLRASVEKGDYYEAHQMYRTLFFRYMAQSKHAEARELMCSGALLFFSHGQQNSAADLSML 87

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 88  VLESLEKAEVEVADELLESLAKLFSLMDPNSPE----------------------RV--- 122

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+  ++ E  A+ 
Sbjct: 123 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSSDGEGCANM 177

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E     +QP  L
Sbjct: 178 LVEYSTARGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEGGPPFVQP--L 235

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNE 292
           + F++++L  ++   L +F +L   Y+ S+ R+P +NE
Sbjct: 236 LNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNE 273


>gi|170052456|ref|XP_001862230.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873385|gb|EDS36768.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 360

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 86/314 (27%), Positives = 145/314 (46%), Gaps = 40/314 (12%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KLE  +  GN+Y A QMY+++  RY++ ++Y E L+LL+ G+   L + Q T GA+L 
Sbjct: 23  LAKLESSIETGNYYEAHQMYRTLYFRYLSQRKYEELLELLYKGSLTLLNHEQHTSGADLG 82

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  V+ L K     + E    ++KI  L  +I  P N+ + + L                
Sbjct: 83  LLIVDTLEKA-AKPEQEAEPWMQKIAMLLSKI--PQNVVERETL---------------- 123

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
               L  A+KWS      + G P +H ++A+ +++E    ++A+   HF+   +      
Sbjct: 124 ----LVKAVKWSATVCKSQVGHPLMHKLIAQIMWNED---NLAQARHHFLLSKDGSGCGH 176

Query: 198 TIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQP---SD 253
            ++     K  P E DL +A+ IL  L L     A        K    K    +P   + 
Sbjct: 177 MLIQLAQTKGVPSEMDLFIAQVILQQLCLKETTTAAETFATYTK-YHPKIASSEPPFITP 235

Query: 254 LIQFVFYVLQTLE----RDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK---- 305
           L+ F F++L+ +E    +  L +F  L   Y+ S+ER+P++ + L  I   FFG      
Sbjct: 236 LLNFCFFLLRAIETQQNQRKLAVFRTLCDLYRPSLERDPSYEKYLQKIGMIFFGASLPQR 295

Query: 306 -RRNPMQGIFGDIF 318
            +   M GIFGD+ 
Sbjct: 296 PQGEGMGGIFGDLL 309


>gi|449281445|gb|EMC88525.1| hypothetical protein A306_02731, partial [Columba livia]
          Length = 279

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 77/269 (28%), Positives = 139/269 (51%), Gaps = 35/269 (13%)

Query: 40  ISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRV 99
           I  RY++  +++EA +L+++GA L   +NQ    A+L++  +E+L K       E L+ +
Sbjct: 1   IVFRYMSQGKHAEARELMYSGALLFFSHNQQNSAADLSMLVLESLEKSDAKVAEELLEHL 60

Query: 100 RKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGS 159
            K++ L      P++ E                 RV    +F+  A+KWS   G+ K G 
Sbjct: 61  AKLFSLMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGH 94

Query: 160 PELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARA 218
           P+LH +LA  L+ E    + +   +HF+   + E  A+ +V +   + Y  E D+ VA+A
Sbjct: 95  PKLHQLLAITLWKEQ---NYSESRYHFLHSTDGEGCANMLVEYSSSRGYRSEVDMFVAQA 151

Query: 219 ILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNM 275
           +L +L L N   A+ +     ++   +E     +QP  L+ F++++L  ++   L +F +
Sbjct: 152 VLQFLCLKNKTSASVVFTTYTQKHPSIEKGPPFVQP--LLNFIWFLLLAVDGGKLTVFTV 209

Query: 276 LRANYKASIEREPAFNEMLDDIAEKFFGV 304
           L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 210 LCEQYQPSLKRDPMYNEYLDRIGQLFFGV 238


>gi|383865327|ref|XP_003708126.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Megachile
           rotundata]
          Length = 312

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 78/293 (26%), Positives = 142/293 (48%), Gaps = 30/293 (10%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
            Q  + KLE  +N  N+Y A QMY+++  RY+  ++YSE L+LL+ G+ L L++ Q   G
Sbjct: 9   VQRVLTKLEASINSENYYEAHQMYRTLYFRYLGQKKYSELLELLYNGSTLLLQHEQHASG 68

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKL 133
           A+L + F+  L +          +++  +      + +  +   + D+            
Sbjct: 69  ADLGILFINVLTQSGTEPSQVYFEKITNL------LCIMSSSSPERDI------------ 110

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
                  F+++A++WSI     KTG P+LH  +A+  + E   + MAR   HF+   +  
Sbjct: 111 -------FIQSALRWSIRGNEYKTGHPDLHQKIAQVFWKEKNYI-MARQ--HFIHSKDGS 160

Query: 194 KFASTIVNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN-KQLQLQP 251
            +A+ +V     + Y  E DL +A+A+L YL L N   A  + +    Q     +     
Sbjct: 161 GYAAMLVELHEQRGYMNEIDLFIAQAVLQYLCLQNKTAAQEVFNSYTSQHPKINRGPPYL 220

Query: 252 SDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
             L+ F+F++L+ ++   L  F +L   Y+ S+ R+P + + LD I + FF +
Sbjct: 221 LPLLNFLFFLLKIIDSGKLTAFKVLCEQYEISLNRDPCYRQYLDKIEQVFFNI 273


>gi|357622878|gb|EHJ74239.1| hypothetical protein KGM_03680 [Danaus plexippus]
          Length = 363

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 81/338 (23%), Positives = 161/338 (47%), Gaps = 45/338 (13%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           +DKLE  +N G +Y A QMY+++  RY+  ++Y++ L+LL+ G+ L L+ +Q   GA+LA
Sbjct: 13  LDKLEASINSGQYYEAHQMYRTLYFRYLGQKKYADLLNLLYKGSTLLLQRDQQGSGADLA 72

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIP---------VPHNLEDDDDLHELTEA 127
           +  VE L K ++   +E ++++ K++E+    IP         V  +++++   H     
Sbjct: 73  ILLVEVLNKSEMPACHEWMEKIAKLFEIMNSSIPEREIFLTNAVKWSMDENKKGHHFLH- 131

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVS---- 183
                 + + C    K+ + +    G  +  + ++  +L +        L + +++    
Sbjct: 132 ------KRDYCMGLFKSDVLYLNTCGVLRVRNTQVMPLLLQTYLGRYDLLLLNKIAEVYW 185

Query: 184 ---------FHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMK---D 230
                     HF+   +   +A+ ++     K    E DL + +A+L  L L N+K   D
Sbjct: 186 KEKVYTAAHRHFLHSRDGATYAAMLIELHTTKGLKSEIDLFITQAVLQCLCLRNVKMATD 245

Query: 231 ANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
           A Y   E   +++N +       L+ F++++L+ +E   +  F +LR  Y  SI+R+P +
Sbjct: 246 AFYKYTENHPKIKNDKGPPYLFPLLNFLWFLLKAIEEKHVMRFKVLRDCYAVSIKRDPNY 305

Query: 291 NEMLDDIAEKFFGV----KRRNP-------MQGIFGDI 317
           +  LD I   +FG+      RN        ++ I GD+
Sbjct: 306 SVYLDTIGRIWFGIGIPQTNRNSNSVIGGLLKSIIGDV 343


>gi|384487961|gb|EIE80141.1| hypothetical protein RO3G_04846 [Rhizopus delemar RA 99-880]
          Length = 324

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 145/329 (44%), Gaps = 54/329 (16%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
            ++ + KL + VN GN+Y A QMY++++ RY   Q+Y + + LLH GA L L++ Q   G
Sbjct: 6   TEKTLQKLRESVNGGNYYEAHQMYRTVARRYNKQQKYEDTIRLLHDGAILLLQHKQNGSG 65

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKL 133
           ++LA   ++      +     +LDR+ +I  LFP   V                      
Sbjct: 66  SDLANYMLDTYKLANLPVSETSLDRIVEILNLFPVNEV---------------------- 103

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
              G   F+  AI+W+ +      G PELH  +   LY+E      ++   H + G    
Sbjct: 104 ---GRKPFISNAIRWTKDDEHYPDGDPELHDFVGTMLYNEKK---YSQAEEHLLLGTEHS 157

Query: 194 K--FASTIVNFMGKCYPGEDDLAVARAILMYLS-----LGNMKDANYIMDEVKKQVENKQ 246
                +    +  +       + +AR +L Y++       N+   N+I  +   Q    Q
Sbjct: 158 AGLLGTLAYEWATEQKSTSKGIYLARVVLQYVASKDIHYANLAYTNFI--KAGNQPTAAQ 215

Query: 247 LQL------QPSDL-------IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEM 293
           L++      +PSD+       I F   +L T++RDA  LF  L++ Y+     E  F E+
Sbjct: 216 LKVRRAPADEPSDIDVYGDSWINFTQLLLLTIQRDASDLFKELKSKYQGLYGSESNFIEL 275

Query: 294 LDDIAEKFFGV----KRRNPMQGIFGDIF 318
           +DDI   +F +    K+ N +Q +  ++F
Sbjct: 276 MDDIGLVYFNIPKPRKQSNMLQEMMANLF 304


>gi|344289797|ref|XP_003416627.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Loxodonta
           africana]
          Length = 320

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 76/269 (28%), Positives = 138/269 (51%), Gaps = 35/269 (13%)

Query: 40  ISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRV 99
           +  RY++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L  +
Sbjct: 42  LDKRYMSQGKHTEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVDVADELLGNL 101

Query: 100 RKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGS 159
            K++ L      P++ E                 RV    +F+  A+KWS   G+ K G 
Sbjct: 102 AKLFSLMD----PNSPE-----------------RV----AFVSRALKWS-GGGSGKLGH 135

Query: 160 PELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARA 218
           P LH +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A
Sbjct: 136 PRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQA 192

Query: 219 ILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNM 275
           +L +L L N   A+ +     ++   +EN    +QP  L+ F++++L  ++   L +F +
Sbjct: 193 VLQFLCLKNKSSASVVFTTYTQKHPSIENGPPFVQP--LLNFIWFLLLAVDGGKLTVFTV 250

Query: 276 LRANYKASIEREPAFNEMLDDIAEKFFGV 304
           L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 251 LCEQYQPSLRRDPMYNEYLDRIGQLFFGV 279


>gi|242008205|ref|XP_002424901.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212508481|gb|EEB12163.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 252

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 78/270 (28%), Positives = 136/270 (50%), Gaps = 39/270 (14%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
            KLE  +N GN+Y A Q+Y+++  RY++ ++Y+  LDLL  GA + LK++Q + GA+LA+
Sbjct: 14  SKLETCINSGNYYEAHQLYRTLYFRYLSQEKYTALLDLLCDGALVLLKHDQQSSGADLAI 73

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
            +VE LVK            +  ++++   +IP     E +                   
Sbjct: 74  LYVEVLVKSTTKPGEAYFSTLANLFQMINSEIP-----ERE------------------- 109

Query: 138 CSSFLKAAIKWSIEFGAP--KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE-K 194
             +FL  A++WS+  G+   K+G P+LH  +AE  + E   + +AR  +H++  ++ + K
Sbjct: 110 --TFLTMALRWSVT-GSEIYKSGHPQLHQFIAEIFWKEKNYV-LAR--YHYLHSSDGQGK 163

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQL---QLQP 251
               I   + K Y  E DL +A+A+L YL L N   A    +    Q  N +     L P
Sbjct: 164 LKFQIELHILKGYQSEVDLFIAQAVLQYLCLQNKNTAAVAFNSYTTQHPNIKTGFPYLLP 223

Query: 252 SDLIQFVFYVLQTLERDALPLFNMLRANYK 281
             L+ F++++L+ +E   L +F +L   Y+
Sbjct: 224 --LLNFIWFLLKAVESGKLTVFTVLCERYQ 251


>gi|355747461|gb|EHH51958.1| Conserved edge-expressed protein, partial [Macaca fascicularis]
          Length = 276

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 74/266 (27%), Positives = 138/266 (51%), Gaps = 35/266 (13%)

Query: 43  RYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKI 102
           RY++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K+
Sbjct: 1   RYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKV 60

Query: 103 YELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPEL 162
           + L      P++ E                 RV    +F+  A+KWS   G+ K G P L
Sbjct: 61  FSLMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPRL 94

Query: 163 HVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILM 221
           H +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L 
Sbjct: 95  HQLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQ 151

Query: 222 YLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRA 278
           +L L N   A+ +     ++   +E+    ++P  L+ F++++L  ++   L +F +L  
Sbjct: 152 FLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCE 209

Query: 279 NYKASIEREPAFNEMLDDIAEKFFGV 304
            Y+ S+ R+P +NE LD I + FFGV
Sbjct: 210 QYQPSLRRDPMYNEYLDRIGQLFFGV 235


>gi|149759031|ref|XP_001488970.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Equus caballus]
          Length = 310

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/266 (28%), Positives = 135/266 (50%), Gaps = 35/266 (13%)

Query: 43  RYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKI 102
           RY +  +++EA +L+ +GA L   + Q    A+L++  +E+L K +V   +E L+ + K+
Sbjct: 36  RYTSQSKHAEARELMCSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKL 95

Query: 103 YELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPEL 162
           + L      P++ E                 RV    +F+  A+KWS   G+ K G P L
Sbjct: 96  FSLMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPRL 129

Query: 163 HVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILM 221
           H +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L 
Sbjct: 130 HQLLALTLWKEQ---NYCESRYHFLHSTDGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQ 186

Query: 222 YLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRA 278
           +L L N   A+ +     ++   +E     +QP  L+ F++++L  ++   L +F +L  
Sbjct: 187 FLCLKNKSSASVVFTTYTQKHPSIEGGPPFVQP--LLNFIWFLLLAVDGGKLTVFTVLCE 244

Query: 279 NYKASIEREPAFNEMLDDIAEKFFGV 304
            Y+ S+ R+P +NE LD I + FFGV
Sbjct: 245 QYQPSLRRDPMYNEYLDRIGQLFFGV 270


>gi|298714388|emb|CBJ27445.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 311

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/346 (25%), Positives = 155/346 (44%), Gaps = 63/346 (18%)

Query: 1   MSRQR-PKRTALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHA 59
           MSR+R P   +L    + + +L++ V+EG FY A Q+YK+  +R  A  R  +A +LL A
Sbjct: 1   MSRRRKPGGNSL----DVVQRLQQRVSEGEFYEALQLYKTSYSRLKAQGRLDDAEELLTA 56

Query: 60  GACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKI-YELFPQIPVPHNLEDD 118
           GA    K N+L    EL +  V +  +     D   L +++++   + P + +       
Sbjct: 57  GAIAMSKENELNAATELGLLIVTSFEENGREVDEARLSQLKRVAVAMTPGVEM------- 109

Query: 119 DDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSP-ELHVMLAEYLYSESPEL 177
                               S FLK AI+W+      KT +  EL +++A    +     
Sbjct: 110 --------------------SEFLKRAIRWTRH---TKTEAAFELRLIMARSAVTTG--- 143

Query: 178 DMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE 237
           ++A  + +F     P ++++ +  +  K YPGE DL + RA+L  LS    +DA  ++D 
Sbjct: 144 NLAEAARYFAVSGCPTEYSALVKQWAAKGYPGETDLFICRAVLHTLSYDRTEDAGVLLDA 203

Query: 238 VKKQVENKQLQLQP-------------SDLIQFVFYVLQTLERDAL---------PLFNM 275
               V  +     P             S L  F  +V++  +  +L           FN 
Sbjct: 204 CASWVLGEDAASSPGSEASSGAQAVLESPLWHFTRFVVELCQVKSLAGSPSQEMAAAFNK 263

Query: 276 LRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
           LR  Y+AS+ER+   +++L+ I +  FG+K   P  G+ G + ++M
Sbjct: 264 LREVYRASLERDVNLDDLLNRIGDVHFGIKPPQP-GGLMGMLGQLM 308


>gi|296421968|ref|XP_002840535.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636753|emb|CAZ84726.1| unnamed protein product [Tuber melanosporum]
          Length = 325

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 89/332 (26%), Positives = 142/332 (42%), Gaps = 60/332 (18%)

Query: 14  AQENIDK----LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQ 69
           A +N++K    LEK + EG +Y A Q  + +S RY+ A  Y  A+D+L +GA   LK  Q
Sbjct: 2   ASKNVEKTRARLEKRIEEGAYYEAHQQLRVVSQRYIKAHNYDAAIDILCSGAEALLKAGQ 61

Query: 70  LTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALG 129
            + G +L +  V     G +A    +  R+ ++  L P    P                 
Sbjct: 62  NSSGCDLCLMLVGVYKTGSLAPSATSRARLIQLISLIP----PEE--------------- 102

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                  G   F+  AI WS + G    G PELH  + + L  E    D      H V G
Sbjct: 103 ------PGRKRFINEAIAWSSKHGEYPAGDPELHHFIGKLLAQED---DTYEAEKHLVVG 153

Query: 190 N--NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE-VKKQVEN-- 244
              + E FA  +  +  +  P    +  ARA+L Y+ +GN++DA  ++    +K +EN  
Sbjct: 154 TKESAETFADMLYEWYTEDEPHTAPIYAARAVLPYMLIGNLRDATRLLTTFTRKLIENNS 213

Query: 245 ----KQLQLQPSDL-----------IQFVFYVLQTLERDALPLFNMLRANYKASIEREPA 289
               +++Q    D            +  +    QT  +D   +F  L+++Y  S+   P 
Sbjct: 214 SLAVQEVQTASVDCKVVPSLPLLNFLNLLLLACQTGGQD---MFRNLKSHYSNSLMEVPH 270

Query: 290 FNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
           +NE L+ I E + G++ R P       IF MM
Sbjct: 271 WNEPLEQIGEIYLGIQIRRP-----ASIFDMM 297


>gi|148687217|gb|EDL19164.1| RIKEN cDNA 1110007L15, isoform CRA_a [Mus musculus]
          Length = 301

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/269 (27%), Positives = 136/269 (50%), Gaps = 37/269 (13%)

Query: 41  SARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVR 100
           S  Y++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + 
Sbjct: 24  SGSYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVDVADELLENLA 83

Query: 101 KIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGS 159
           K++ L  P  P                       RV    +F+  A+KWS   G+ K G 
Sbjct: 84  KVFSLMDPNSPE----------------------RV----AFVSRALKWS-SGGSGKLGH 116

Query: 160 PELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARA 218
           P LH +LA  L+ E    +     +HF+  ++ E  A+ +V +   + +  E D+ VA+A
Sbjct: 117 PRLHQLLALTLWKEQ---NYCESRYHFLHSSDGEGCANMLVEYSTARGFRSEVDMFVAQA 173

Query: 219 ILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNM 275
           +L +L L N   A  +     ++   +E+    +QP  L+ F++++L  ++   L +F +
Sbjct: 174 VLQFLCLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LLNFIWFLLLAVDGGKLAVFTV 231

Query: 276 LRANYKASIEREPAFNEMLDDIAEKFFGV 304
           L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 232 LCEQYQPSLRRDPMYNEYLDRIGQLFFGV 260


>gi|6453428|emb|CAB61355.1| hypothetical protein [Homo sapiens]
 gi|193783635|dbj|BAG53546.1| unnamed protein product [Homo sapiens]
 gi|193785105|dbj|BAG54258.1| unnamed protein product [Homo sapiens]
 gi|193785854|dbj|BAG51289.1| unnamed protein product [Homo sapiens]
          Length = 274

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 143/284 (50%), Gaps = 39/284 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHTEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFS 60

Query: 105 LF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
           L  P  P                       RV    +F+  A+KWS   G+ K G P LH
Sbjct: 61  LMDPNSPE----------------------RV----TFVSRALKWS-SGGSGKLGHPRLH 93

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMY 222
            +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L +
Sbjct: 94  QLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQF 150

Query: 223 LSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRAN 279
           L L N   A+ +     ++   +E+    ++P  L+ F++++L  ++   L +F +L   
Sbjct: 151 LCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQ 208

Query: 280 YKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQGIFGDIFKMM 321
           Y+ S+ R+P +NE LD I + FFGV  K+ +   G+ G++   +
Sbjct: 209 YQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGGLLGNLLTSL 252


>gi|397497940|ref|XP_003819758.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Pan paniscus]
          Length = 274

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 143/284 (50%), Gaps = 39/284 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFS 60

Query: 105 LF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
           L  P  P                       RV    +F+  A+KWS   G+ K G P LH
Sbjct: 61  LMDPNSPE----------------------RV----AFVSRALKWS-SGGSGKLGHPRLH 93

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMY 222
            +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L +
Sbjct: 94  QLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQF 150

Query: 223 LSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRAN 279
           L L N   A+ +     ++   +E+    ++P  L+ F++++L  ++   L +F +L   
Sbjct: 151 LCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQ 208

Query: 280 YKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQGIFGDIFKMM 321
           Y+ S+ R+P +NE LD I + FFGV  K+ +   G+ G++   +
Sbjct: 209 YQPSLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGGLLGNLLTSL 252


>gi|343958498|dbj|BAK63104.1| UPF0363 protein C7orf20 [Pan troglodytes]
          Length = 274

 Score =  101 bits (252), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 75/284 (26%), Positives = 143/284 (50%), Gaps = 39/284 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFS 60

Query: 105 LF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
           L  P  P                       RV    +F+  A+KWS   G+ K G P LH
Sbjct: 61  LMDPNSPE----------------------RV----AFVSRALKWS-SGGSGKLGHPRLH 93

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMY 222
            +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L +
Sbjct: 94  QLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQF 150

Query: 223 LSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRAN 279
           L L N K A+ +     ++   +E+    ++P  L+ F++++L  ++   L +F +L   
Sbjct: 151 LCLKNKKSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQ 208

Query: 280 YKASIEREPAFNEMLDDIAEKFFGV--KRRNPMQGIFGDIFKMM 321
           Y+  + R+P +NE LD I + FFGV  K+ +   G+ G++   +
Sbjct: 209 YQPPLRRDPMYNEYLDRIGQLFFGVPPKQTSSYGGLLGNLLTSL 252


>gi|327283665|ref|XP_003226561.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Anolis
           carolinensis]
          Length = 274

 Score =  100 bits (249), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 73/260 (28%), Positives = 131/260 (50%), Gaps = 35/260 (13%)

Query: 49  RYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQ 108
           +++EA +L+++GA L   +NQ    A+L++  +E+L K      +E L+ + K++ L  Q
Sbjct: 5   KHAEARELMYSGALLFFSHNQQNSAADLSMLVLESLEKSDAKVTDELLENLAKVFSLMDQ 64

Query: 109 IPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAE 168
                                 +  RV    +F+  A+KWS   G+ K G P+LH +LA 
Sbjct: 65  ---------------------NSPERV----AFVSRALKWS-SGGSGKLGHPKLHQLLAI 98

Query: 169 YLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG-KCYPGEDDLAVARAILMYLSLGN 227
            L+ E    +     +HF+   + E  A+ +V +   + Y  E D+ VA+A+L +L L N
Sbjct: 99  TLWKEQ---NYCESRYHFLHSTDGEGCANMLVEYSSSRGYRSEVDMFVAQAVLQFLCLKN 155

Query: 228 MKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASI 284
              A  +     ++   +E     +QP  L+ F++++L  ++   L +F +L   Y+ S+
Sbjct: 156 KTSALVVFTTYTQKHPSIERGPPFVQP--LLNFIWFLLLAVDGGKLTVFTVLCEQYQPSL 213

Query: 285 EREPAFNEMLDDIAEKFFGV 304
           +R+P +NE LD I + FFGV
Sbjct: 214 KRDPMYNEYLDRIGQLFFGV 233


>gi|347963206|ref|XP_311026.4| AGAP000119-PA [Anopheles gambiae str. PEST]
 gi|333467309|gb|EAA06358.4| AGAP000119-PA [Anopheles gambiae str. PEST]
          Length = 326

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 82/306 (26%), Positives = 136/306 (44%), Gaps = 29/306 (9%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL   V   NFY A QMY+++  RYV+  +Y++ L+LL+ GA   L++ Q + GA+L 
Sbjct: 24  LGKLRASVESKNFYEAHQMYRTLYFRYVSQGKYADVLELLYDGALTMLEHEQYSSGADLG 83

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++ L K     +      ++++ EL  +I                      K  V  
Sbjct: 84  LLIIQTLEKAGSTVETPE-QWIKRLAELVSKI----------------------KPTVVD 120

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             + L  A+KWS    +  TG P +H + A+ LY E  +L +AR  F   +      F  
Sbjct: 121 REALLDRAMKWSGSLVSSPTGHPLMHKLFAQILYREG-DLTLARRHFALAKDGVSCGFLL 179

Query: 198 TIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQP--SDLI 255
             ++   K +PGE DL VA  +L  L+L     A        K   +      P    L+
Sbjct: 180 IEIS-CAKGFPGEVDLFVAHTVLQQLALKEPATAASTFATYCKFHASIACTEPPFAMPLL 238

Query: 256 QFVFYVLQTLERD--ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGI 313
            F+ ++L  +E++      F  L   YK S++R+P++ + L  I  KFF   R      +
Sbjct: 239 NFLHFLLALVEQNNRMYATFRALCELYKPSLDRDPSYEKYLQKIGVKFFDGSRYEQRNFM 298

Query: 314 FGDIFK 319
           F D+ +
Sbjct: 299 FSDLLQ 304


>gi|407922725|gb|EKG15820.1| hypothetical protein MPH_06959 [Macrophomina phaseolina MS6]
          Length = 327

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 144/329 (43%), Gaps = 58/329 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q+ I + ++ + EG FY A Q  + I  RYV    +  A+D+L +GA L L+  Q   G 
Sbjct: 6   QKTIARQQEKIAEGQFYEAHQQLRVIGTRYVKQSNWEAAIDILASGAQLLLQAGQGGSGG 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L +  +E   K ++  D     ++  +   FP               E T+        
Sbjct: 66  DLCLFLLEVYGKAELKPDAGAKAKLLGLLRSFPP-------------DEPTK-------- 104

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNP 192
                 F+   I WS +FG    G PELH +    L ++    D  R   H V G  ++P
Sbjct: 105 ----KRFIGEMIGWSSKFGEYPAGDPELHHVAGTVLAADGDAYDAER---HLVLGTRDSP 157

Query: 193 EKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENK 245
           E     +V    + Y  EDD     L VARA+L YL  GN++ AN  ++    +    N 
Sbjct: 158 EH----LVKLEYEWYT-EDDSHTAPLYVARAVLPYLIAGNIRAANQCFLQFTTRLSQANP 212

Query: 246 QLQLQ--PSDLIQFVFY-----------VLQTLERDALPLFNMLRANYKASIEREPAFNE 292
            L +Q   S  +    Y           +L  + R +  LF ML+++YK  +++ P +N+
Sbjct: 213 SLGVQEVSSSSLDLRIYPSLPLLNFLGLLLLAVTRGSADLFRMLKSHYKTHLDQVPGWND 272

Query: 293 MLDDIAEKFFGVK---RRNPMQGIFGDIF 318
            LD I E +FG++   + NP+  +   +F
Sbjct: 273 ALDQIGEMYFGIRIPSQSNPLMDMMSSMF 301


>gi|440640508|gb|ELR10427.1| hypothetical protein GMDG_00839 [Geomyces destructans 20631-21]
          Length = 327

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 83/328 (25%), Positives = 139/328 (42%), Gaps = 62/328 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L+  + EG FY AQQ  + +++RY+ A+ +  A+D+L+  A   LK  Q   G +L+
Sbjct: 9   ITRLQSKITEGQFYEAQQQTRVVASRYIKAKDWPAAIDILYNVASSLLKAEQGGSGGDLS 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  V+   + ++  D  N  ++  +  LF P+ P                          
Sbjct: 69  ILLVDVYKQAELVPDAANKGKLLTLLRLFDPEEPT------------------------- 103

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
               F+   I WS +FG+   G PELH +      SE    D  R   H + G      A
Sbjct: 104 -RKKFITEMIGWSAKFGSYPAGEPELHHVTGSLYASEHDVYDAER---HLLLGTKDS--A 157

Query: 197 STIVNFMGKCYPGEDDLA----VARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS 252
             + N     Y  +D        ARA+L YL LGN++DA+  +     ++ +      PS
Sbjct: 158 PLLANLEYDWYATDDSHTAATYAARAVLPYLLLGNVRDASTALQLFTTRLASS----NPS 213

Query: 253 DLIQFV-------------------FYVLQTLERDALPLFNMLRANYKASIEREPAFNEM 293
            ++Q V                     ++  +E+ A  LF  L+ +Y   I+    ++E 
Sbjct: 214 LVVQDVKASKSGTKVYPSLPLLNFLTLLVAAVEKGAADLFRQLKIHYALLIKEATGWDEA 273

Query: 294 LDDIAEKFFGVKR---RNPMQGIFGDIF 318
           L+ IAE +FG+ R    NP+  + G +F
Sbjct: 274 LESIAEMYFGIARPRQGNPLFDMMGSMF 301


>gi|254281178|ref|NP_001156788.1| Golgi to ER traffic protein 4 homolog isoform 2 [Mus musculus]
 gi|18043415|gb|AAH19557.1| 1110007L15Rik protein [Mus musculus]
          Length = 274

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/264 (27%), Positives = 136/264 (51%), Gaps = 35/264 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVDVADELLENLAKVFS 60

Query: 105 LFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHV 164
           L      P++ E                 RV    +F+  A+KWS   G+ K G P LH 
Sbjct: 61  LMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPRLHQ 94

Query: 165 MLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYL 223
           +LA  L+ E    +     +HF+  ++ E  A+ +V +   + +  E D+ VA+A+L +L
Sbjct: 95  LLALTLWKEQ---NYCESRYHFLHSSDGEGCANMLVEYSTARGFRSEVDMFVAQAVLQFL 151

Query: 224 SLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANY 280
            L N   A  +     ++   +E+    +QP  L+ F++++L  ++   L +F +L   Y
Sbjct: 152 CLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LLNFIWFLLLAVDGGKLAVFTVLCEQY 209

Query: 281 KASIEREPAFNEMLDDIAEKFFGV 304
           + S+ R+P +NE LD I + FFGV
Sbjct: 210 QPSLRRDPMYNEYLDRIGQLFFGV 233


>gi|354489936|ref|XP_003507116.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Cricetulus
           griseus]
 gi|344246725|gb|EGW02829.1| UPF0363 protein C7orf20-like [Cricetulus griseus]
          Length = 274

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 73/264 (27%), Positives = 135/264 (51%), Gaps = 35/264 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVDVADELLENLAKVFS 60

Query: 105 LFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHV 164
           L      P++ E                 RV    +F+  A+KWS   G+ K G P LH 
Sbjct: 61  LMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPRLHQ 94

Query: 165 MLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYL 223
           +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L +L
Sbjct: 95  LLALTLWKEQ---NYCESRYHFLHSTDGEGCANMLVEYSTARGFRSEVDMFVAQAVLQFL 151

Query: 224 SLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANY 280
            L N   A  +     ++   +E+    +QP  L+ F++++L  ++   L +F +L   Y
Sbjct: 152 CLKNKNSALVVFTTYTQKHPSIEDGPPFVQP--LLNFIWFLLLAVDGGKLAVFTVLCEQY 209

Query: 281 KASIEREPAFNEMLDDIAEKFFGV 304
           + S+ R+P +NE LD I + FFGV
Sbjct: 210 QPSLRRDPMYNEYLDRIGQLFFGV 233


>gi|297679709|ref|XP_002817666.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Pongo abelii]
          Length = 274

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 136/264 (51%), Gaps = 35/264 (13%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFS 60

Query: 105 LFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHV 164
           L      P++ E                 RV    +F+  A+KWS   G+ K G P LH 
Sbjct: 61  LMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPRLHQ 94

Query: 165 MLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYL 223
           +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L +L
Sbjct: 95  LLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQFL 151

Query: 224 SLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANY 280
            L N   A+ +     ++   +E+    ++P  L+ F++++L  ++   L +F +L   Y
Sbjct: 152 CLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQY 209

Query: 281 KASIEREPAFNEMLDDIAEKFFGV 304
           + S+ R+P +NE LD I + FFG+
Sbjct: 210 QPSLRRDPMYNEYLDRIGQLFFGI 233


>gi|170586936|ref|XP_001898235.1| chromosome 7 open reading frame 20 [Brugia malayi]
 gi|158594630|gb|EDP33214.1| chromosome 7 open reading frame 20, putative [Brugia malayi]
          Length = 361

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 80/310 (25%), Positives = 138/310 (44%), Gaps = 28/310 (9%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
           +KL      GN Y A Q+Y+++  R     ++ E  D+L +G    L   +     +LA 
Sbjct: 6   EKLRLCFENGNCYEAHQIYRTLYNRLSNQGKWQELQDMLFSGILRLLAEREAASAIDLAE 65

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            FVEAL K K    +  LDR  ++  L     +P  LE+D         L A   R +  
Sbjct: 66  LFVEALEKSKTPVSSVVLDRFDELLNL-----LPAQLEND---------LEANSEREDRR 111

Query: 139 SSFLKAAIKWSIEFGAPK----TGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
             ++   IKWS+  G  K     G P LH  +A+ L+ E    +      HF+  +NPE 
Sbjct: 112 LQYISLGIKWSMAVGDRKRYRRRGHPGLHFRIAKLLWREG---NYTNARSHFMYSDNPET 168

Query: 195 FASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMD---EVKKQVENKQLQLQ 250
           FA  +  + M   Y  E DL +A+A++  L     K A  ++    ++   + +      
Sbjct: 169 FAVFLTEYQMKYGYTTEKDLFIAQAVMQMLCDQKPKSAFKLLQCYCDIHPDIRSG--FPY 226

Query: 251 PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM 310
           P  L+ F+ + +  +    +  F++L   Y++ I  +P + ++LD I + + G+ +  P 
Sbjct: 227 PFPLLNFLHFTIICIANKEITYFSVLIDQYESEISHDPEYKQLLDRIGQIYMGLPQPQPA 286

Query: 311 -QGIFGDIFK 319
             G  G++ K
Sbjct: 287 PAGFLGNMIK 296


>gi|405959974|gb|EKC25941.1| UPF0363 protein C7orf20 [Crassostrea gigas]
          Length = 333

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 81/310 (26%), Positives = 147/310 (47%), Gaps = 36/310 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + K +  V  GN+Y A QMY+++  RY   ++Y+EA  LL+ GA    K +Q++   +L 
Sbjct: 14  LAKCQSCVEAGNYYEAHQMYRTLYFRYKGQKKYAEAASLLYHGALTLKKYDQISSFTDLC 73

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              +E L + +        D + KI  LF ++  P++ E                     
Sbjct: 74  SLLIELLNQSETPVSE---DIIEKIITLFKEM-TPNSEERQ------------------- 110

Query: 138 CSSFLKAAIKWSIEFGAP-KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
             +F+ +A+ W+++  A  K G P+LH  L    + E    +  +  +H V   + +  A
Sbjct: 111 --TFVVSAVHWTMKVSAEHKRGHPDLHQKLGLSFWEEK---NYVQARYHLVHSQDGKNCA 165

Query: 197 STIVN-FMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDL- 254
             +V   + K +P E DL +A+A+L +L L N   A      VK    +  +Q  P  + 
Sbjct: 166 LMLVECHVQKGFPNEVDLFIAQAVLQFLCLRNKDTAQVAF--VKYTENHPDVQSGPPYVH 223

Query: 255 --IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV-KRRNPMQ 311
             + F++++L  +E   + +F++L   Y+ SI R+P++ E LD I + FFG+       Q
Sbjct: 224 PLLNFLWFLLLAVEGGRVAVFSILCEKYQTSINRDPSYREYLDQIGQLFFGLPPPPKQSQ 283

Query: 312 GIFGDIFKMM 321
           G  G++ + +
Sbjct: 284 GFLGNLLQSI 293


>gi|169626335|ref|XP_001806568.1| hypothetical protein SNOG_16452 [Phaeosphaeria nodorum SN15]
 gi|160705815|gb|EAT76150.2| hypothetical protein SNOG_16452 [Phaeosphaeria nodorum SN15]
          Length = 350

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 86/334 (25%), Positives = 150/334 (44%), Gaps = 61/334 (18%)

Query: 7   KRTALPPAQENIDKLEKI---------VNEGNFYGAQQMYKSISARYVAAQRYSEALDLL 57
           +++ +P  +  + ++ K+         + EG FY A Q  + I++RYV AQ +  A  +L
Sbjct: 12  EKSNVPKQKRKVHRIHKLPHFLANPARIAEGQFYEAHQQLRVIASRYVKAQDWENATSIL 71

Query: 58  HAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLED 117
           ++GA   L+  Q   G +L +  ++   KG+V  D  +  ++  +   FP+         
Sbjct: 72  YSGAQSLLQAGQGGSGGDLCIFLLDVFNKGEVKPDASSKGKLLGLLRAFPK--------- 122

Query: 118 DDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL 177
                E T+              F+   I WS +FG    G PE+H  +A  LY+E  EL
Sbjct: 123 ----DEPTK------------KKFVGEMIAWSSKFGEYPAGDPEIH-NVAGSLYAE--EL 163

Query: 178 DMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANY 233
           +      H + GN  +   ST+ +     Y  +D     L  AR +L YL  GN++ AN 
Sbjct: 164 EPYEAERHLLLGN--QDSPSTLASLEYAWYEADDSHTAPLYCARGVLPYLLTGNLRGANK 221

Query: 234 I------MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRAN 279
                      K  +  +Q+    SDL  +        +  +L+++ER    LF  L+++
Sbjct: 222 FFLLFTSFLSKKPGLNTQQISTSASDLRTYPTLPLLNFLGLLLRSVERGDPSLFRQLKSH 281

Query: 280 YKASIEREPAFNEMLDDIAEKFFGVK---RRNPM 310
           Y  ++ ++  +NE LD + E +FG+K   + NPM
Sbjct: 282 YATNL-KDVNWNEALDQLGEMYFGIKVPRQGNPM 314


>gi|297287801|ref|XP_002803231.1| PREDICTED: UPF0363 protein C7orf20-like [Macaca mulatta]
          Length = 323

 Score = 94.7 bits (234), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 79/287 (27%), Positives = 139/287 (48%), Gaps = 35/287 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQF 257
           +V +   + +  E D+ VA+A+L       +  A      + K +    +   P + + F
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQAELQALLLQALASASFLMKAL-FVSINFFPQNNVDF 237

Query: 258 VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
              V +   +  L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 238 KAGVGELTGK--LTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 282


>gi|312089220|ref|XP_003146163.1| hypothetical protein LOAG_10591 [Loa loa]
 gi|307758674|gb|EFO17908.1| hypothetical protein LOAG_10591 [Loa loa]
          Length = 359

 Score = 94.4 bits (233), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 78/308 (25%), Positives = 136/308 (44%), Gaps = 24/308 (7%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
           +KL      GN Y A Q+Y+++  R     R+ E  D+L++G    L   +     +LA 
Sbjct: 6   EKLRLCFENGNCYEAHQIYRTLYNRLSNQGRWQELQDMLYSGVLRLLAEREAASAIDLAE 65

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            F+EAL K K    +  LDR  ++  L P               +L + L A   R +  
Sbjct: 66  LFMEALEKSKTPVSSVVLDRFDELLNLLPA--------------QLEKDLEANSEREDRR 111

Query: 139 SSFLKAAIKWSIEFGAPK----TGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
             ++   IKW++  G  K     G P LH  +A+ L+ E   ++      HF+  +NPE 
Sbjct: 112 LQYISLGIKWTMAVGDKKRYRRRGHPGLHFRIAKSLWREGNYINARN---HFMYSDNPEA 168

Query: 195 FASTIVNFMGK-CYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS- 252
           FA  +  +  K  Y  E DL +A+A++  L     K A  ++        + +     S 
Sbjct: 169 FAIFLTEYQLKYGYTTEKDLFIAQAVMQMLCDRKPKSALKLLQSYCDMHPDIRSGFPYSF 228

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQG 312
            L+ F+ + +  +    +  F +L   Y+  I R+P + ++LD I + + G+ +    +G
Sbjct: 229 PLLNFLHFAIICIANKEITYFTVLVEQYETEISRDPEYKQLLDRIGQIYMGLPQPQTTRG 288

Query: 313 IF-GDIFK 319
            F G++ K
Sbjct: 289 GFLGNMIK 296


>gi|241672708|ref|XP_002400321.1| conserved hypothetical protein [Ixodes scapularis]
 gi|215506298|gb|EEC15792.1| conserved hypothetical protein [Ixodes scapularis]
          Length = 333

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 83/321 (25%), Positives = 138/321 (42%), Gaps = 59/321 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++KL   ++ GN+Y A QMY++I  RY + ++Y E  +LL+ GA ++L   Q  C +   
Sbjct: 25  LNKLHASIDSGNYYEAHQMYRTIYFRYQSQKKYDELKELLYDGA-IKLLTLQQVCASS-- 81

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNL---EDDDDLHELTEALGAAKLR 134
                                VR     +P +P  H+L   E    LH L  A    ++ 
Sbjct: 82  ---------------------VRACLASYP-LPQTHSLPLPERLGRLHSLLAAHSPERM- 118

Query: 135 VEGCSSFLKAAIKWSIEFGAPKT----GSPELHVMLAEYLYSESPELDMARVSFHFVRGN 190
                +FL  A++WS    +       G P LH ++A  L+ E    +     +HFV   
Sbjct: 119 -----TFLTRALQWSSPPSSSPDAPSRGHPALHRLVALTLWKEK---NYQESRYHFVHST 170

Query: 191 NPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL 249
           + +  A+ +V F   K Y  E DL +A+ +  YL L N   A        ++        
Sbjct: 171 DGDGCAAMLVEFQTTKGYSSEIDLFIAQTVFQYLCLRNPSTATLAFWAYTRR-------- 222

Query: 250 QPSDLIQFVFYVLQ---------TLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
            PS      FY+            +E   L +F +L   Y+ +I R+P++ + LD I + 
Sbjct: 223 HPSVPRGPPFYLPLLNFLWFLLLAVESRKLAVFTVLCEEYQPTISRDPSYPQYLDKIGQL 282

Query: 301 FFGVKRRNPMQGIFGDIFKMM 321
           FFG+      QG+FG++ + +
Sbjct: 283 FFGLPPPPKPQGVFGNLIQTL 303


>gi|453083935|gb|EMF11980.1| DUF410-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 337

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 79/304 (25%), Positives = 135/304 (44%), Gaps = 45/304 (14%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + EG++Y A Q  ++I+ RYV +     A+D+L+ GA + LK  Q   GA+L++  +E  
Sbjct: 16  IEEGDYYEAHQQLRTIANRYVKSDP-PAAIDILYNGALMLLKAGQGGSGADLSIYLIEVY 74

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
            KG++  D  N  R+  +   FP    P+                       G   F+  
Sbjct: 75  GKGELKPDTANKARLMSLLRAFP----PNE---------------------PGKKKFVTG 109

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG 204
            ++WS + G    G PELH ++   LY+E  E+  A         ++ + FA     +  
Sbjct: 110 IVEWSSKNGEFPAGDPELHHVIGT-LYAEEGEVYDAERHLTLGTSDSAQVFADLEYTWYS 168

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ----PSDLIQF- 257
              P       ARA+  YL +GN + AN   ++   +    +  L +Q    PS  I+  
Sbjct: 169 DDEPSTASHYAARAVFPYLVIGNTRAANKALLLFTSRLSTSHPGLGVQSISSPSSDIRIY 228

Query: 258 --------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK---R 306
                   +  +L  ++R +  LF  L+ +Y A ++  PA++E L  + E +FG+K   +
Sbjct: 229 PSLPLLNFLGLLLLAVQRGSSDLFKQLKQHYAAHLKEVPAWDEPLLQVGEMYFGIKIPSQ 288

Query: 307 RNPM 310
            NPM
Sbjct: 289 SNPM 292


>gi|310792378|gb|EFQ27905.1| hypothetical protein GLRG_03049 [Glomerella graminicola M1.001]
          Length = 329

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/330 (28%), Positives = 144/330 (43%), Gaps = 61/330 (18%)

Query: 19  DKLEKIV-------NEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DK+EKI+        EG FY AQQ  + ++ARY+ A  +  A D+L+  A   LK  Q  
Sbjct: 6   DKIEKIIARLQARIGEGQFYEAQQQTRVVAARYIKAANWPAATDILYNVAQSLLKAGQGG 65

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LAV  V+   + ++  D  +  ++  +  LFP               E T      
Sbjct: 66  SGGDLAVMLVDVFRQAELKPDAASKGKLLTLLRLFPA-------------QEPTR----- 107

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F+   I WS +FG    G  +LH +       E    +  R   H V G  
Sbjct: 108 -------KKFIGEMIAWSAKFGEYPAGDADLHHVTGSLYAEEHEAYEAER---HLVLGT- 156

Query: 192 PEKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
             K ++ ++  M   +  EDD     L  +RAIL YL +GN++ AN  Y +       +N
Sbjct: 157 --KDSAEVLAKMEYVWYKEDDSHTAPLYASRAILPYLLVGNVRAANTCYRVFTSSLSDDN 214

Query: 245 KQLQLQ-----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
           K L +Q      +DL  F        +  +L  ++R A   +  L A Y ++I    A+N
Sbjct: 215 KGLGVQDVSSNSADLRIFPGLPLLNFLGLLLLAVQRGAPDAYKQLLAKYSSNIAEVGAWN 274

Query: 292 EMLDDIAEKFFGVKR---RNPMQGIFGDIF 318
           E L+ IAE +FG++R    NP+  + G +F
Sbjct: 275 EALEIIAEMYFGIQRPKQSNPLMDMMGSLF 304


>gi|402085668|gb|EJT80566.1| hypothetical protein GGTG_00561 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 334

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 91/324 (28%), Positives = 138/324 (42%), Gaps = 52/324 (16%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L++ + EG FY AQQ  + ++ARY  A  +S A D+L+  +   LK  Q   G +L+
Sbjct: 12  IARLQQRITEGQFYEAQQQTRVVAARYTKASNWSAATDILYNVSQSLLKAGQGGSGGDLS 71

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           V  V+   + ++  D  N  ++     LFP               E T            
Sbjct: 72  VLLVDVFKQAELVPDAVNKGKLLACLRLFPS-------------EEPTR----------- 107

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPEL-DMARVSFHFVRG--NNPEK 194
              F+   I WS +FG    G PELH  +A  LY+E  E  D  R   H V G  ++PE 
Sbjct: 108 -KKFIGEMIAWSSKFGEFPAGDPELH-HVAGSLYAEEHETYDAER---HLVLGTKDSPEI 162

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ-- 250
            A     +  +          ARA+L YL +GN++ AN  Y     +   +N+ L +Q  
Sbjct: 163 LARMEYEWYKEDESHTAPQYAARAVLPYLLVGNLRAANACYRAFTSRLSADNQGLGVQDV 222

Query: 251 -----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
                  DL  F        +  +L  + R +   F  L A Y       PA+ E L+ +
Sbjct: 223 SSSGGGGDLRVFPSLPLLNFLGLLLLAITRGSPDSFRQLAAKYAPHFAESPAWAEPLEIV 282

Query: 298 AEKFFGVKR---RNPMQGIFGDIF 318
           AE +FG+ R    NP+  + G +F
Sbjct: 283 AEMYFGIARPRQSNPLLDMMGSLF 306


>gi|398393194|ref|XP_003850056.1| hypothetical protein MYCGRDRAFT_74836 [Zymoseptoria tritici IPO323]
 gi|339469934|gb|EGP85032.1| hypothetical protein MYCGRDRAFT_74836 [Zymoseptoria tritici IPO323]
          Length = 333

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 87/318 (27%), Positives = 139/318 (43%), Gaps = 56/318 (17%)

Query: 20  KLEKI-------VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC 72
           K+EK+       + EG FY A Q  ++I+ RYV A  +  A+DLL +GA + LK  Q   
Sbjct: 5   KIEKVLARQRTKIEEGEFYEAHQQLRTIANRYVKASDWPAAVDLLSSGASMLLKAGQGGS 64

Query: 73  GAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
           GA+L +  +E   KG++  D  N  R+  + + FP    P+                 AK
Sbjct: 65  GADLCIYLMEVYQKGELKADVANKARILGLLKAFP----PNE---------------PAK 105

Query: 133 LRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--N 190
            +      F+ + ++WS +      G PELH ++      E    D  R   H   G  +
Sbjct: 106 KK------FVGSIVEWSSKCSEFPAGDPELHHVIGTLFAEEGEPYDAER---HLTLGTSD 156

Query: 191 NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQ 248
           +   FA     +     P      VARA+  YL +GN + AN   ++   K    +  L 
Sbjct: 157 SASVFADMEYEWYSSDEPSTASHYVARAVFPYLLIGNTRGANKALLLFTSKLSSAHPGLG 216

Query: 249 LQ----PSDLIQF---------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLD 295
           +Q    PS  I+          +  +L  ++R +  LF  L+ +Y A + +E A++E L 
Sbjct: 217 VQSISSPSSDIRIYPSLPLLNFLGLLLLAIQRGSADLFKQLKTHYAAHL-KEVAWDETLA 275

Query: 296 DIAEKFFGVK---RRNPM 310
            I E +F +K   + NPM
Sbjct: 276 QIGEMYFAIKIPSQSNPM 293


>gi|326928879|ref|XP_003210600.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Meleagris
           gallopavo]
          Length = 295

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 72/264 (27%), Positives = 127/264 (48%), Gaps = 48/264 (18%)

Query: 44  YVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIY 103
           Y++  +Y EA +L+++GA L   +NQ    A+L++  +E+L K      ++ L+ + K++
Sbjct: 36  YMSQGKYVEARELMYSGALLFFSHNQQNSAADLSMLVLESLEKSDAKVTDDLLENLAKLF 95

Query: 104 ELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
            L      P++ E                 RV    +F+  A+KWS   G+ K G P+LH
Sbjct: 96  SLMD----PNSPE-----------------RV----AFVSRALKWS-SGGSGKLGHPKLH 129

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYL 223
            +LA  L+ +        V +   RG                 Y  E D+ VA+A+L +L
Sbjct: 130 QLLAITLWKDGEGCANMLVEYSSSRG-----------------YRSEVDMFVAQAVLQFL 172

Query: 224 SLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANY 280
            L N   A+ +     ++   +E     +QP  L+ F++++L  ++   L +F +L   Y
Sbjct: 173 CLKNKTSASVVFTTYTQKHPSIEKGPPFVQP--LLNFIWFLLLAVDGGKLTVFTVLCEQY 230

Query: 281 KASIEREPAFNEMLDDIAEKFFGV 304
           + S++R+P +NE LD I + FFGV
Sbjct: 231 QPSLKRDPMYNEYLDRIGQLFFGV 254


>gi|328703129|ref|XP_001944175.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Acyrthosiphon
           pisum]
          Length = 322

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 76/310 (24%), Positives = 136/310 (43%), Gaps = 38/310 (12%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           L   + + NFY A Q+YKS+ +RY++      A+  L  G  +   N+Q+    +LA  F
Sbjct: 18  LNNAIRDENFYMAHQLYKSLYSRYISMHNKDAAIMTLQEGIIVLYDNHQVGIATDLAQLF 77

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
            ++L    +    +  D +R I+  F                    +L A +       +
Sbjct: 78  AKSLSTQHIVPSEQMFDFIRDIFRKF------------------GPSLSAER------DT 113

Query: 141 FLKAAIKWSIEFGAPKT-GSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
            L   I+W+IE     + G P LH  +  Y+Y      ++A+   H++R  +   FA+ +
Sbjct: 114 MLNTIIQWTIEHCPEYSMGHPGLHKRIG-YIYWSEANFELAQ--HHYLRSTDGLAFAAML 170

Query: 200 VNFM-GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK---QVENKQLQLQPSDLI 255
           V     +    E DL + + +L  L + N   A        K    + ++   + P  L+
Sbjct: 171 VEMHKTRGLKYEVDLFITQVVLQGLCMRNTSIAQSTFQSYTKLHPAINDEPPYILP--LL 228

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK--RRNPMQ-- 311
            F+ Y+L+ L+   L  + +L   Y++S+ R+P++ E LD I + FF VK   R P Q  
Sbjct: 229 NFICYLLKILDGGKLKTYIVLCEQYQSSLIRDPSYTEYLDKIGQIFFHVKPFERRPRQQH 288

Query: 312 GIFGDIFKMM 321
           G  G++   +
Sbjct: 289 GFLGNLISTL 298


>gi|225710676|gb|ACO11184.1| UPF0363 protein C7orf20 [Caligus rogercresseyi]
          Length = 318

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 84/308 (27%), Positives = 139/308 (45%), Gaps = 40/308 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   ++EGNFY A Q+Y+ I+ R ++ ++Y++  + L  GA    + NQ + G +L+  
Sbjct: 16  KLRASLDEGNFYEAHQLYRIINFRLLSCKKYADCEETLFEGAVKLFEANQTSSGVDLSKL 75

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
           +++AL +G +  +     RV  +Y+  P      N                         
Sbjct: 76  YMQALTEGSMQPEERIFHRVSILYKDIPSDSPEKN------------------------- 110

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSES--PELDMARVSFHFVRGNNPEKFAS 197
           +FL  AI+WS     P  G   LH + A  L++    PE        HF+   +     S
Sbjct: 111 TFLSIAIQWSANDSHP-NGHSRLHQLFAHSLWNSKRYPE-----SRHHFLYSADGVGCGS 164

Query: 198 TIVNFMGKC-YPGEDDLAVARAILMYLSLGNMKDANY-IMDEVKKQVENKQLQLQPSDLI 255
            +  F     +P E DL V   +L +L L     A + ++  V+K    K        L+
Sbjct: 165 MLAEFHSHLGFPREIDLFVVGTVLQFLCLRKHIAAAFALLTYVEKHPGIKTGPPYSHPLL 224

Query: 256 QFVFYVLQTLE-RDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV----KRRNPM 310
            FV+ +L  +E + +L  F +L   YK S+ER+P++ E LD I + FFGV    K+ +  
Sbjct: 225 NFVWLLLLAIESKQSLSTFMVLIEKYKTSLERDPSYLEYLDKIGQTFFGVPAPKKKPDVF 284

Query: 311 QGIFGDIF 318
            G+F  +F
Sbjct: 285 SGLFDTLF 292


>gi|189190128|ref|XP_001931403.1| hypothetical protein PTRG_01070 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187973009|gb|EDU40508.1| hypothetical protein PTRG_01070 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 324

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 140/317 (44%), Gaps = 52/317 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           ++ + +  + + EG +Y A Q  + I++RY  A  ++ A  LL++GA   L+  Q   G 
Sbjct: 6   EKTLQRQREKIAEGQYYEAHQQLRVIASRYTKASDWTNATSLLYSGALSLLQAGQGGSGG 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L +  ++   KG+V  D E+  R+  +   FP+              E T+        
Sbjct: 66  DLCIFLIDVYNKGEVKVDAESKGRLLGLLRAFPK-------------EEPTK-------- 104

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+   I WS  FG    G PE+H  +A  LY+E  +L+      H + G   + 
Sbjct: 105 ----KKFVGEMIAWSSRFGEFPAGDPEIH-HVAGGLYAE--DLEPYEAERHLLLGT--QD 155

Query: 195 FASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVE------N 244
            AST+ +     Y    P    L  AR +L YL  GN++ AN         +E       
Sbjct: 156 SASTLASLHYDWYTSDAPHTAPLYCARGVLPYLLTGNLRAANKFFLLFTSSLEKAPNPPT 215

Query: 245 KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
           + +  + SDL  +        +  +L ++ER    L+  L+++Y   + ++  +NE LD 
Sbjct: 216 QPVSTKSSDLRVYPSLPLLNFLGLLLLSVERGDPSLWRQLKSHYGGDL-KDVNWNEALDQ 274

Query: 297 IAEKFFGVK---RRNPM 310
           I E +FG+K   + NPM
Sbjct: 275 IGEMYFGIKIPRQGNPM 291


>gi|330921918|ref|XP_003299615.1| hypothetical protein PTT_10654 [Pyrenophora teres f. teres 0-1]
 gi|311326620|gb|EFQ92287.1| hypothetical protein PTT_10654 [Pyrenophora teres f. teres 0-1]
          Length = 324

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 82/317 (25%), Positives = 140/317 (44%), Gaps = 52/317 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           ++ + +  + + EG +Y A Q  + I++RY  A  ++ A  LL++GA   L+  Q   G 
Sbjct: 6   EKTLQRQREKIAEGQYYEAHQQLRVIASRYTKASDWTNATSLLYSGALSLLQAGQGGSGG 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L +  ++   KG+V  D E+  R+  +   FP+              E T+        
Sbjct: 66  DLCIFLIDVYNKGEVKVDAESKGRLLGLLRAFPK-------------EEPTK-------- 104

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+   I WS  FG    G PE+H  +A  LY+E  +L+      H + G   + 
Sbjct: 105 ----KKFVGEMIAWSSRFGDFPAGDPEIH-HVAGGLYAE--DLEPYEAERHLLLGT--QD 155

Query: 195 FASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVE------N 244
            AST+ +     Y    P    L  AR +L YL  GN++ AN         +E       
Sbjct: 156 SASTLASLHYDWYTSDAPHTAPLYCARGVLPYLLTGNLRAANKFFLLFTSSLEKAPNPPT 215

Query: 245 KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
           + +  + SDL  +        +  +L ++ER    L+  L+++Y   + ++  +NE LD 
Sbjct: 216 QPVSTKSSDLRVYPSLPLLNFLGLLLLSVERGDPSLWRQLKSHYAGDL-KDVNWNEALDQ 274

Query: 297 IAEKFFGVK---RRNPM 310
           I E +FG+K   + NPM
Sbjct: 275 IGEMYFGIKIPRQGNPM 291


>gi|195153252|ref|XP_002017543.1| GL22353 [Drosophila persimilis]
 gi|198454470|ref|XP_001359604.2| GA22074 [Drosophila pseudoobscura pseudoobscura]
 gi|194112600|gb|EDW34643.1| GL22353 [Drosophila persimilis]
 gi|198132807|gb|EAL28754.2| GA22074 [Drosophila pseudoobscura pseudoobscura]
          Length = 336

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 83/312 (26%), Positives = 141/312 (45%), Gaps = 37/312 (11%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   +  G FY A  MY+++  RY A +RY + LDLL  GA   +   Q    A+L + 
Sbjct: 27  KLSHSLAGGAFYEAHMMYRTLYFRYTAQKRYQDCLDLLFDGAQKLIAKQQEGSAADLCLL 86

Query: 80  FVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            V+ L K G +  D ENL            + VP        L  L + L +  +  E  
Sbjct: 87  LVDTLEKRGPLPEDTENL------------LWVPR-------LGTLIQGLSSGIVERE-- 125

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            + ++ AIKWSI     + G P LH ++A   ++E   ++ AR   H++   +       
Sbjct: 126 -TLIQRAIKWSISLHG-QYGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCGRV 180

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ---PSDL 254
           ++     + +  E DL + +A+L  LSL + K A     E  +    K L+ +      L
Sbjct: 181 LIEISQSRGFQSEVDLFLVQAVLQQLSLKDRKSAEDTFIEYTR-YHPKLLRHEFPYKEPL 239

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN-----P 309
           + F++++ + +E  ++  F  LR  Y  S++R+ +F + +  I   +F  +         
Sbjct: 240 VNFLYFLFRLIETKSVTGFRSLRKLYDPSLKRDTSFLKYVTKIGVIYFDEQPETNAGPPG 299

Query: 310 MQGIFGDIFKMM 321
           + G+FGDIF  M
Sbjct: 300 LGGMFGDIFSRM 311


>gi|390459510|ref|XP_003732328.1| PREDICTED: LOW QUALITY PROTEIN: Golgi to ER traffic protein 4
           homolog [Callithrix jacchus]
          Length = 334

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 67/248 (27%), Positives = 126/248 (50%), Gaps = 37/248 (14%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  +++EA +L+++GA L   + Q    A+L++ 
Sbjct: 29  KLRASVEKGDYYEAHQMYRTLFFRYMSQSKHAEARELMYSGALLFFSHGQQNSAADLSML 88

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +E+L K +V   +E L+ + K++ L  P  P                       RV   
Sbjct: 89  VLESLEKAEVEVADELLENLAKVFSLMDPNSPE----------------------RV--- 123

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ 
Sbjct: 124 -AFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANM 178

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDL 254
           +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L
Sbjct: 179 LVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFVEP--L 236

Query: 255 IQFVFYVL 262
           + F++++L
Sbjct: 237 LNFIWFLL 244


>gi|320586960|gb|EFW99623.1| duf410 domain containing protein [Grosmannia clavigera kw1407]
          Length = 331

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 82/321 (25%), Positives = 142/321 (44%), Gaps = 48/321 (14%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +++K ++EG FY AQQ  + ++ R++    ++ A+D+L++ A   LK  Q   G +L+
Sbjct: 12  ISRMQKRISEGQFYEAQQQTRVVAVRHIKTANWAAAIDILYSVAQSLLKAGQGGSGGDLS 71

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  V+   + ++  D  N  R+     LF             D  E T            
Sbjct: 72  IFLVDVYKQAELKPDAANKGRLLTCLRLF-------------DQEEPTR----------- 107

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN--NPEKF 195
              ++   I WS  FGA   G PELH ++      E    +  R   H V G   +PE  
Sbjct: 108 -RKYIAEIIGWSARFGAYPAGDPELHHVIGSLYAEEHEAYEAER---HLVTGTKESPEVL 163

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK--QVENKQLQLQ--- 250
           A    ++  +       L   RAIL YL + N++ AN+          V+N  L +Q   
Sbjct: 164 AHMEYDWYKQDDAHTAPLYAGRAILPYLLMANVRAANHCYRTFTSALTVDNPALAVQDVS 223

Query: 251 ---------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
                    PS  L+ F+  +L  ++R +  LF  L + Y   +    +++E L+ +AE 
Sbjct: 224 SASNDARVFPSLPLLNFLGLLLLAIQRGSPELFRQLASRYSNQLREVESWSEPLELVAEM 283

Query: 301 FFGV---KRRNPMQGIFGDIF 318
           +FG+   ++RNP+  + G++F
Sbjct: 284 YFGIQQPRQRNPLFDMVGNLF 304


>gi|428186163|gb|EKX55014.1| hypothetical protein GUITHDRAFT_99654 [Guillardia theta CCMP2712]
          Length = 210

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 109/228 (47%), Gaps = 27/228 (11%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           +D++ KIV+EG +Y A Q YKS   RY   +     +DLL +GA   L++ Q   G EL 
Sbjct: 1   MDRISKIVDEGRYYEALQQYKSTYFRY-KLRNNPATIDLLRSGAVKLLEHKQYEGGLELG 59

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              VE LV    A  +++L  V+ I E F                  +E  GA   +V+ 
Sbjct: 60  SLLVEHLVSADPAASSDSLGAVKSIAEAF------------------SECSGAEGHQVQ- 100

Query: 138 CSSFLKAAIKWSIEFGAPKT-GSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
              +L  AIKWS      K+ G   LH  +AE  Y +   L +A +  HF  GN+P++FA
Sbjct: 101 ---YLTRAIKWSKGTRESKSVGDATLHQYIAE-AYLKLNNLPLAHM--HFACGNDPKRFA 154

Query: 197 STIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN 244
           + +     +C   E DL  ARAIL  L   N++ A  +++E  K   N
Sbjct: 155 TLLRAMSSECRGEEQDLVWARAILQSLCAKNLELAVGLLEESMKTGNN 202


>gi|195113147|ref|XP_002001130.1| GI22138 [Drosophila mojavensis]
 gi|193917724|gb|EDW16591.1| GI22138 [Drosophila mojavensis]
          Length = 342

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 79/312 (25%), Positives = 140/312 (44%), Gaps = 36/312 (11%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   + EG FY A  MY+++  RY A +RY + LDLL+ GA   ++  Q +  A+L + 
Sbjct: 32  KLSHSLAEGEFYEAHMMYRTLYFRYTAQKRYQDCLDLLYDGAQKLIEKEQESSAADLCLL 91

Query: 80  FVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            V+ L K G    D +N       +E  P+            L  L   L A  +  E  
Sbjct: 92  LVDTLEKRGPQPEDADN-------FEWVPR------------LGGLIRGLSATTVERE-- 130

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            + +   +KWS      + G P LH ++A   ++E   ++ AR   H++   +       
Sbjct: 131 -TLIHRVVKWSTSVHG-QFGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCGRV 185

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD--LI 255
           ++     K +  E DL + +A+L  LSL + K A     E  +          P +  L+
Sbjct: 186 LIEISQSKGFHREVDLFLVQAVLQQLSLKDRKTAEDTFTEYTRYHPKLLRHEFPYNEPLV 245

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRR------NP 309
            F++++ + ++  ++  F  LR  Y+ S++R+ +F + L  I   +F  +        + 
Sbjct: 246 NFLYFLFRLIDTKSVVGFRALRKLYEPSLKRDISFTKYLTKIGVTYFDEQPEQVHAGPSG 305

Query: 310 MQGIFGDIFKMM 321
           + G+FGDIF  +
Sbjct: 306 LGGMFGDIFNQL 317


>gi|452841330|gb|EME43267.1| hypothetical protein DOTSEDRAFT_80727 [Dothistroma septosporum
           NZE10]
          Length = 339

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/314 (24%), Positives = 129/314 (41%), Gaps = 44/314 (14%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q+ + + +  + EG+FY A Q  ++I+ RYV +Q +  A DLL  GA + LK  Q   G+
Sbjct: 7   QKVLARQQAKIEEGDFYEAHQQLRTIANRYVKSQDWPAAADLLSNGALMLLKAGQGGSGS 66

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    +E   K +   D  N  R+  +   FP    P+                     
Sbjct: 67  DLCTYLMEVYQKAEFRADVANKARILSLLRAFP----PNEPSKKK--------------- 107

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+   ++WS ++     G PELH ++   LY+E  E   A         ++   
Sbjct: 108 ------FVTGIVEWSAKYSDFPAGDPELHHVIGS-LYAEEGETYDAERHLTLGTSDSASA 160

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL----- 249
            A     +     P       ARA+  YL +GN + AN  +     ++ +    L     
Sbjct: 161 LADMEYEWYASDEPSTASHYAARAVFPYLLIGNTRAANKALLLFTSRLSSLHPGLGVQSI 220

Query: 250 -QPSDLIQF---------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAE 299
             PS  I+          +  +L  ++R    LFN L+ +Y A ++  P ++E L  I E
Sbjct: 221 SSPSSDIRIYPSLPLLNFLGLLLLAIQRGTKDLFNQLKTHYAAHLKEVPHWDEPLLQIGE 280

Query: 300 KFFGVK---RRNPM 310
            +FG+K   + NPM
Sbjct: 281 MYFGIKIPSQSNPM 294


>gi|195390931|ref|XP_002054120.1| GJ24257 [Drosophila virilis]
 gi|194152206|gb|EDW67640.1| GJ24257 [Drosophila virilis]
          Length = 343

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 141/313 (45%), Gaps = 38/313 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   + EG FY A  MY+++  RY A +RY + LDLL  GA   ++  Q    A+L + 
Sbjct: 32  KLSHSLAEGEFYEAHMMYRTLYFRYTAQKRYQDCLDLLFDGAQKLIEKEQEGSAADLCLL 91

Query: 80  FVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            V+ L K G    D +N       +E  P+               L   L A  +  E  
Sbjct: 92  LVDTLEKRGPRPDDTDN-------FEWVPRFGA------------LIRGLSATTVERE-- 130

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            + +   +KWS      + G P LH ++A   ++E   ++ AR   H++   +       
Sbjct: 131 -TLIHRVVKWSTSVHG-QFGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCGRV 185

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ---PSDL 254
           ++     K +  E DL + +A+L  LSL + K A+ +  E  +    K L+ +      L
Sbjct: 186 LIEISQSKGFHKEVDLFLVQAVLQQLSLKDRKTADDMFSEYTR-YHPKILRHEFPYKEPL 244

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRR------N 308
           I F++++ + ++  ++  F  LR  Y  S++R+ +F++ L  I   +F  +        +
Sbjct: 245 INFLYFLFRLIDTKSVMGFRALRKLYDPSLKRDASFHKYLTKIGVIYFDEQPEQVHAGPS 304

Query: 309 PMQGIFGDIFKMM 321
            + G+FGDIF  +
Sbjct: 305 GLGGMFGDIFNQL 317


>gi|389634739|ref|XP_003715022.1| hypothetical protein MGG_08069 [Magnaporthe oryzae 70-15]
 gi|351647355|gb|EHA55215.1| hypothetical protein MGG_08069 [Magnaporthe oryzae 70-15]
 gi|440470523|gb|ELQ39590.1| DUF410 domain-containing protein [Magnaporthe oryzae Y34]
 gi|440488504|gb|ELQ68229.1| DUF410 domain-containing protein [Magnaporthe oryzae P131]
          Length = 333

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 89/329 (27%), Positives = 138/329 (41%), Gaps = 63/329 (19%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L++ + EG FY AQQ  + ++ARY+    +  A D+L   +   LK  Q   G +LA
Sbjct: 12  IARLQQRIAEGQFYEAQQQTRVVAARYIKESSWGPATDILFNVSQSLLKAGQGGSGGDLA 71

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
              V+   + ++  D+ N  ++  +  LF P  P                          
Sbjct: 72  TLLVDVYKQAELKPDSANKGKLLTLLRLFAPGEPTKKK---------------------- 109

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
               F+   I WS +FG    G PELH  +A  LY+E  E + +    H + G    K +
Sbjct: 110 ----FVTEMIGWSSKFGEYPAGDPELH-HVAGSLYAE--EHETSEAERHLILGT---KDS 159

Query: 197 STIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDANYIMDEVKK--QVENKQLQL 249
           +  +  M   +  EDD     L  ARA+L YL +GN++ AN            +N+ + +
Sbjct: 160 AEALARMEYSWYKEDDSHTAPLYAARAVLPYLLVGNVRAANACYRSFASALSADNQGMGV 219

Query: 250 Q-----PSDLIQF-----------VFYVLQTLERDALPLFNMLRANYKASI-EREPAFNE 292
           Q      +DL  F           +   +Q    DA   F  L   Y   I E  PA+ E
Sbjct: 220 QDVSTNSADLRVFPSLPLLNFLGMLLLAIQKGNPDA---FRKLTMKYGPQIAEAAPAWQE 276

Query: 293 MLDDIAEKFFGVKR---RNPMQGIFGDIF 318
            L+ IAE +FG+ R    NP+  + G +F
Sbjct: 277 SLESIAEMYFGIARPRQSNPLMDMMGSLF 305


>gi|298705302|emb|CBJ48992.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 328

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 78/317 (24%), Positives = 135/317 (42%), Gaps = 26/317 (8%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           +E+ + EG  Y A Q+Y+ + +R  A    + AL +   G  + L         EL    
Sbjct: 20  VEQKLGEGLQYEALQLYRGVVSRKSARGNDNAALTIAEQGIGVLLARGYADSATELGNVL 79

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
           V  L   ++    E L+  R++   +        L ++D      E  G      EG  +
Sbjct: 80  VGILNDHELPVTEERLELFRRVNAKYEDAKRAAVLAEEDRRTAAQEGQGGVGEVEEGKEA 139

Query: 141 -------------FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                        FLK A+ W+   G   +G P L  +L   L++E    D+ R + + V
Sbjct: 140 GRGMYSVGVLQGRFLKLAVAWTAARGQRLSGDPGLCGLLGRCLWNEG---DLERGTRYLV 196

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQL 247
            G  P++    I+  +      E  LAVA+ +L + S  +++DAN ++   K Q   KQ 
Sbjct: 197 LGERPQELCDLILQDVSDANARE--LAVAKGVLQFASKASLRDANALLAAYKTQTGGKQ- 253

Query: 248 QLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRR 307
                 ++ F   +L+T E DA P+F  L   Y+A + ++ A    ++ IA  +F ++ R
Sbjct: 254 --PTKGVLGFCAMLLKTCEYDATPVFQKLLDAYQADLSKDEALFGSVESIAMTYFNLRLR 311

Query: 308 NP-----MQGIFGDIFK 319
            P     + G+ G + K
Sbjct: 312 PPSMLDNIMGMLGGMPK 328


>gi|290977337|ref|XP_002671394.1| predicted protein [Naegleria gruberi]
 gi|284084963|gb|EFC38650.1| predicted protein [Naegleria gruberi]
          Length = 304

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 76/313 (24%), Positives = 147/313 (46%), Gaps = 40/313 (12%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAEL 76
           N+DKL+++ ++G FY AQQM+  +  +    ++Y +   +L +G     +  Q+    +L
Sbjct: 2   NVDKLKELCDDGKFYEAQQMFVVLFNKTFKQEKYDKCAQILLSGLNKMHQFKQVALLIDL 61

Query: 77  AVSFVEALVKGKVAYDNE---------NLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
           A   +    K   +  +           +D +R I E +      H LED          
Sbjct: 62  AKYMLSLFTKLNASTTHTFTIKEDKITAIDLIRHIVESY------HQLED---------- 105

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKT----GSPELHVMLAEYLYSESPELDMARVS 183
               K  ++   SFL   + WS E   PK     G P +H+ L +  +      +  +  
Sbjct: 106 -ITPKEDIQRMISFLDKVVTWSAEHQVPKISDSHGEPFIHMELGKLHFKLGANEESNK-- 162

Query: 184 FHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVE 243
            +F++ N+P++FA  I  ++ K   GE DL + RA+L  L L  ++ A  + + ++++ E
Sbjct: 163 -NFLKSNSPDEFAEMISEWIKKGEDGEHDLFITRAVLQLLQLKKLEMAKRVFELLQQKTE 221

Query: 244 NKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPA-FNEM-LDDIAEKF 301
               +L  + L+ FV +++ ++E+  L L++ +  +Y   + R  A F ++ LD I   +
Sbjct: 222 ----KLSSTPLLHFVKFLIISIEKGNLELYHHINQSYATELTRRDASFKDVYLDKIGSIY 277

Query: 302 FGVKRRNPM-QGI 313
           FG+++  P  QG+
Sbjct: 278 FGLQKSKPQNQGL 290


>gi|168068458|ref|XP_001786081.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162662174|gb|EDQ49109.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 175

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 38/74 (51%), Positives = 52/74 (70%)

Query: 176 ELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIM 235
           E +M + S HFV GN PE FA  +V  M KCYPGE DL VARA+L+YLSLGN++D N + 
Sbjct: 29  EAEMMKASQHFVGGNQPETFAKVVVECMDKCYPGEADLVVARAVLLYLSLGNLRDVNRLW 88

Query: 236 DEVKKQVENKQLQL 249
           D V++++ +   +L
Sbjct: 89  DSVRQRLSDSSQEL 102


>gi|451999570|gb|EMD92032.1| hypothetical protein COCHEDRAFT_1173541 [Cochliobolus
           heterostrophus C5]
          Length = 325

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 132/308 (42%), Gaps = 49/308 (15%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFV 81
           EKI  EG +Y A Q  + I++RY  A  ++ A  +L  GA   L+  Q   G +L +  +
Sbjct: 14  EKIA-EGQYYEAHQQLRVIASRYTKASDWTNATSILFQGAQSLLQAGQGGSGGDLCIFLL 72

Query: 82  EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSF 141
           +   KG++  D E+  R+  +   FP+              E T+              F
Sbjct: 73  DVYNKGEIKVDAESKGRLLTLLRAFPK-------------DEPTK------------KKF 107

Query: 142 LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKFASTI 199
           +   I WS  FG    G PELH  +A  LY+E  EL+      H + G  ++    AS  
Sbjct: 108 VGEMIVWSSRFGDFPAGDPELH-HVAGGLYAE--ELEPYEAERHLLLGTQDSAATLASLH 164

Query: 200 VNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---YIMDEVKKQVENKQLQLQPSDLIQ 256
            N+     P    L  AR +L YL  GN++ AN    +     ++  N   Q   +   +
Sbjct: 165 YNWYTSDAPHTAPLYCARGVLPYLLTGNLRAANKFFLLFTSAVEKTPNPPTQAVSTKSSE 224

Query: 257 FVFY-----------VLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
           F  Y           +L ++ER    L+  L+++Y   + ++  + E LD I E +FG+K
Sbjct: 225 FRVYPSLPLLNFLGLLLLSVERGDPSLWRQLKSHYAGDL-KDVNWGEALDQIGEMYFGIK 283

Query: 306 ---RRNPM 310
              + NPM
Sbjct: 284 IPRQGNPM 291


>gi|452982488|gb|EME82247.1| hypothetical protein MYCFIDRAFT_197421 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 340

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 73/306 (23%), Positives = 128/306 (41%), Gaps = 41/306 (13%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q+ + + +  + EG+FY A Q   +I+ RYV +  Y++A++LL +GA L LK    + GA
Sbjct: 6   QKVLARQKAKIEEGDFYEAHQQLLTIANRYVKSSSYADAIELLSSGATLLLKAGAGSSGA 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    ++   K +V  D  N  R+  +   FP                         L 
Sbjct: 66  DLCTYLIDIYQKSEVKPDTANKARLLSLLRAFP-------------------------LN 100

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
             G   F+   ++WS + G    G PELH ++   LY+E  E+  A         ++   
Sbjct: 101 EPGKKKFVAGIVEWSSKNGEFPAGDPELHHVIGT-LYAEEGEVYDAERHLTLGTSDSAAV 159

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL----- 249
           FA     +     P       ARA+  YL +GN + AN  +     ++      L     
Sbjct: 160 FADMEYRWYETDEPSTASHYAARAVFPYLLVGNTRAANKTLLLFTSRLSTSHPGLGVQSI 219

Query: 250 -QPSDLIQF---------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAE 299
             PS  I+          +  +L  ++R +  LF  L+++Y   ++    ++E +  + E
Sbjct: 220 SSPSSDIRIYPSLPLLNFLGLLLLAIQRGSADLFKTLKSHYGTHLKEVSTWDEPMLQVGE 279

Query: 300 KFFGVK 305
            +FG+K
Sbjct: 280 MYFGIK 285


>gi|195061282|ref|XP_001995963.1| GH14232 [Drosophila grimshawi]
 gi|193891755|gb|EDV90621.1| GH14232 [Drosophila grimshawi]
          Length = 348

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/314 (25%), Positives = 146/314 (46%), Gaps = 40/314 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   + EG +Y A  MY+++  RY A +R+ + LDLL+ GA   ++  Q +  A+L + 
Sbjct: 33  KLSHSLAEGEYYEAHMMYRTLYFRYTAQKRFEDCLDLLYDGAQKLIERQQESSAADLCLL 92

Query: 80  FVEALVKG-KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            V+ L K  +   D +N   + ++ +L   +  P  +E +  +H +              
Sbjct: 93  LVDTLEKRPQTDNDTDNFQWIPRLGDLIRDLS-PSTVERETLIHRV-------------- 137

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
                  +KWSI       G P LH ++A   ++E   ++ AR  +   +  +   +   
Sbjct: 138 -------VKWSIGVHG-HFGHPMLHKLIANVFWTEE-NIEAARHHYLLCQDGSLCGYVLI 188

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVK----KQVENKQLQLQPSDL 254
           +++   K +  E DL V + +L  LSL + K A+    E      K + N+    +P  L
Sbjct: 189 MIS-ERKGFRSEVDLFVVQVVLQQLSLEDRKSADDTFSEYTRYHPKLLRNEFPFKEP--L 245

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQ--- 311
           + F++++L+ +E  +   F  LR  Y  S++R+ +FN+ L  I   +F  ++  P+    
Sbjct: 246 LNFLYFLLRLIEAKSALGFRALRKLYDPSLKRDVSFNKYLTKIGMIYFD-EQPEPVHAGP 304

Query: 312 ----GIFGDIFKMM 321
               G+FGDIF  +
Sbjct: 305 TGLGGMFGDIFNHL 318


>gi|451854495|gb|EMD67788.1| hypothetical protein COCSADRAFT_137125 [Cochliobolus sativus
           ND90Pr]
          Length = 325

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 83/308 (26%), Positives = 131/308 (42%), Gaps = 49/308 (15%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFV 81
           EKI  EG +Y A Q  + I++RY  A  +  A  +L  GA   L+  Q   G +L +  +
Sbjct: 14  EKIA-EGQYYEAHQQLRVIASRYTKASDWPNATSILFQGAQSLLQAGQGGSGGDLCIFLL 72

Query: 82  EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSF 141
           +   KG++  D E+  R+  +   FP+              E T+              F
Sbjct: 73  DVYNKGEIKVDAESKGRLLTLLRAFPK-------------DEPTK------------KKF 107

Query: 142 LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKFASTI 199
           +   I WS  FG    G PELH  +A  LY+E  EL+      H + G  ++    AS  
Sbjct: 108 VGEMIVWSSRFGDFPAGDPELH-HVAGGLYAE--ELEPYEAERHLLLGTQDSAATLASLH 164

Query: 200 VNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---YIMDEVKKQVENKQLQLQPSDLIQ 256
            N+     P    L  AR +L YL  GN++ AN    +     ++  N   Q   +   +
Sbjct: 165 YNWYTSDAPHTAPLYCARGVLPYLLTGNLRAANKFFLLFTSAVEKTPNPPTQAVSTKSSE 224

Query: 257 FVFY-----------VLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
           F  Y           +L ++ER    L+  L+++Y   + ++  + E LD I E +FG+K
Sbjct: 225 FRVYPSLPLLNFLGLLLLSVERGDPSLWRQLKSHYAGDL-KDVNWGEALDQIGEMYFGIK 283

Query: 306 ---RRNPM 310
              + NPM
Sbjct: 284 IPRQGNPM 291


>gi|429851076|gb|ELA26293.1| duf410 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 328

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 90/322 (27%), Positives = 141/322 (43%), Gaps = 61/322 (18%)

Query: 19  DKLEKIV-------NEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DK+EKI+        EG +Y AQQ  + ++ARY+ A  +  A D+L+  A   LK  Q  
Sbjct: 6   DKIEKIIARLQVRIEEGQYYEAQQQTRVVAARYIKAANWPAATDILYNVAQSLLKAGQGG 65

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +L V  V+   + ++  D  N  ++     LFP                      A 
Sbjct: 66  SGGDLCVMLVDVFKQAELKPDATNKGKLLTCLRLFP---------------------AAE 104

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
             R      F+   + WS +FG    G  +LH  +A  LY+E  E +      H V G  
Sbjct: 105 PTR----KKFVGEMVAWSSKFGEYPAGDSDLH-HVAGSLYAE--EHEPYEAERHLVLGT- 156

Query: 192 PEKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
             K ++ ++  M   +  EDD     L  ARAIL YL +GN++ AN  Y +       +N
Sbjct: 157 --KDSAEVLARMEYAWYKEDDSHTAPLYAARAILPYLLVGNVRAANTSYRLFTSALSDDN 214

Query: 245 KQLQLQ-----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
           K L +Q      +DL  F        +  +L  ++R    ++  L   Y ++I    A++
Sbjct: 215 KGLGVQDVSSNSADLRIFPGLPLLNFLGLLLLAIQRGTPDVYKQLVTKYSSNISEVGAWS 274

Query: 292 EMLDDIAEKFFGVKR---RNPM 310
           E L+ IAE +FG++R    NP+
Sbjct: 275 EALELIAEMYFGIQRPRQSNPL 296


>gi|300123559|emb|CBK24831.2| unnamed protein product [Blastocystis hominis]
          Length = 123

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/122 (36%), Positives = 72/122 (59%), Gaps = 4/122 (3%)

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFV 258
           ++++    Y  E DL + R+IL  ++  N++DAN +   V+ Q+E K   +  S L  F 
Sbjct: 2   LLDWASHGYQSELDLFITRSILWLIAKQNLRDANDLFSNVQSQLEAKG-AIMSSPLFHFD 60

Query: 259 FYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIF 318
            ++LQT+ RDA PLFN+L+  Y   +ER+PA  + ++ I E +FG+K   P   +F D+ 
Sbjct: 61  SFILQTVTRDAAPLFNLLKEKYTPELERDPALLQTMEKIGEVYFGIK---PKGSLFSDML 117

Query: 319 KM 320
           KM
Sbjct: 118 KM 119


>gi|355560402|gb|EHH17088.1| Conserved edge-expressed protein [Macaca mulatta]
          Length = 332

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 74/297 (24%), Positives = 140/297 (47%), Gaps = 43/297 (14%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENL-------- 96
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L        
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLAGKVGAGS 60

Query: 97  -------------------DRVRK--IYELFPQIP--VPHNLED--DDDLHELTEALGAA 131
                                V K  ++  + ++P  VP  L D  + +   L +     
Sbjct: 61  GWGTILSRDLTCRHVLSGEREVSKALLWAAWLRVPLLVPAALGDAPEPEPENLAKVFSLM 120

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                   +F+  A+KWS   G+ K G P LH +LA  L+ E    +     +HF+   +
Sbjct: 121 DPNSPERVAFVSRALKWS-SGGSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSAD 176

Query: 192 PEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQL 247
            E  A+ +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+   
Sbjct: 177 GEGCANMLVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPP 236

Query: 248 QLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            ++P  L+ F++++L  ++   L +F +L   Y+ S+ R+P +NE LD I + FFGV
Sbjct: 237 FVEP--LLNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGV 291


>gi|312378833|gb|EFR25292.1| hypothetical protein AND_09506 [Anopheles darlingi]
          Length = 303

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 76/305 (24%), Positives = 127/305 (41%), Gaps = 68/305 (22%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL   +   NFY A QMY+++  RY A  +Y E L+LL++GA   L + Q   GA+L 
Sbjct: 23  LQKLRTSMEAKNFYEAHQMYRTLYFRYSAQAKYDELLELLYSGAVTMLDHLQYGSGADLG 82

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++ L   +                         NL  ++ +  L E +G  K  V  
Sbjct: 83  LLIIQTLESAETT-----------------------NLPAEEWMKRLGELIGKIKPCVNE 119

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
             + L+ AIKWS        G P +H ++A+ +Y+E+   ++ +  +H+    +    A 
Sbjct: 120 RETVLEKAIKWSGTVAQSPMGHPLMHKLIAQIMYNEN---NLPQARYHYPLSRDGFSCAF 176

Query: 198 TIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQ 256
            ++     K Y GE DL VA+ +L  L L             K+  E ++          
Sbjct: 177 LLIELSQAKGYAGEVDLFVAQMVLQLLCL-------------KELTEGRK---------- 213

Query: 257 FVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQG---I 313
                          +F  L   YK ++ER+P++++ L  I   FFG  +  P Q     
Sbjct: 214 -------------YAMFRALCDLYKPALERDPSYDKYLRKIGVMFFGASQ--PEQSRSYA 258

Query: 314 FGDIF 318
           FGD+ 
Sbjct: 259 FGDLL 263


>gi|449301078|gb|EMC97089.1| hypothetical protein BAUCODRAFT_147254 [Baudoinia compniacensis
           UAMH 10762]
          Length = 331

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 82/304 (26%), Positives = 130/304 (42%), Gaps = 45/304 (14%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + EG +Y A Q  ++I+ R V +  +  A+DLL  GA L LK  Q   GA+L    VE  
Sbjct: 16  IEEGEYYEAHQQLRTIANRNVKSNDWESAIDLLSNGAALLLKAGQGGSGADLCNYLVEVY 75

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
            K ++  D  N  R+  +   FP    P+                       G   F+  
Sbjct: 76  QKAELKPDVTNKARILGLLRAFP----PNE---------------------PGKKKFVSG 110

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG 204
            I+WS ++G    G PELH ++   L++E  E+  A         ++ +  A     +  
Sbjct: 111 IIEWSSKYGEFPAGDPELHHVIGS-LFAEEGEVYDAERHLALGTSDSAQIMADMEYEWYA 169

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ----PSDLIQF- 257
              P       ARA++ YL +GN + AN   ++   K    +  L +Q    PS  I+  
Sbjct: 170 SDEPSTAPSYAARAVIPYLLVGNTRAANKSLLLFTSKLSTLHPGLGVQSISSPSSDIRIY 229

Query: 258 --------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK---R 306
                   +  +L  ++R    LF  L+++Y AS  +E A+ E L  I E +FG+K   +
Sbjct: 230 PSLPLLNFLGLLLLAVQRGTSDLFKQLKSHY-ASHLKEAAWEEPLAQIGEMYFGIKIPSQ 288

Query: 307 RNPM 310
            NPM
Sbjct: 289 SNPM 292


>gi|121703558|ref|XP_001270043.1| DUF410 domain protein [Aspergillus clavatus NRRL 1]
 gi|119398187|gb|EAW08617.1| DUF410 domain protein [Aspergillus clavatus NRRL 1]
          Length = 347

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 83/336 (24%), Positives = 144/336 (42%), Gaps = 61/336 (18%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLT 71
           + I + ++ +  GN+Y A Q  + I+ARY+    Y  A ++L  GA   L+         
Sbjct: 7   KTIARQQEKIASGNYYEAHQQLRVIAARYIKQSNYDAAAEILAGGATALLRAGSQQGASA 66

Query: 72  CGAELAVSFVEALVKGKVAY-------DNENLDRVRKIYELFPQIPVPHNLEDDDDLHEL 124
            G +LA+  V+  V  K  +       D E   R +++ EL  + P       D+   + 
Sbjct: 67  SGGDLAIMLVDD-VYIKAGWEITGGDEDAEGRARKKRLIELLREFP------SDEPTRK- 118

Query: 125 TEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSF 184
                           +++  I WS  FG  + G PELH  +   +Y+E  E   A    
Sbjct: 119 ---------------RYIQEMISWSGRFGPLERGDPELHHAVGS-VYAEDSEPYEAEK-- 160

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLAVA----RAILMYLSLGNMKDAN--YIMDEV 238
           H V G +  + A T+     + Y  +D    A    RA+  YL +GN+++AN  +++   
Sbjct: 161 HLVLGTS--ESAETLAKLEYEWYTNDDPHTAAIYASRAVFPYLLVGNLRNANKAFLIFTS 218

Query: 239 KKQVENKQLQLQ-----PSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIE 285
           +    N  L +Q      SD        L+ F+  +L T++R    LF  L A+Y + I 
Sbjct: 219 RLSASNTSLGVQQVSSASSDVRVFPALPLLNFISMLLLTVQRSNADLFKQLTAHYASQIR 278

Query: 286 REPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
               +++ L  I E++F +   K+ NP+  + G + 
Sbjct: 279 EVGIWDDALSQIGEQYFAIKIPKQGNPLLDMMGSML 314


>gi|345560134|gb|EGX43260.1| hypothetical protein AOL_s00215g593 [Arthrobotrys oligospora ATCC
           24927]
          Length = 333

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 74/330 (22%), Positives = 143/330 (43%), Gaps = 56/330 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           ++ I +L++ + EG  Y A Q  + +SARY+ +     A+D+L +GA   L+ +Q   G 
Sbjct: 6   EKTIARLKQRIEEGQHYEAHQQLRVVSARYIKSSNPQSAIDILFSGAQSLLQASQYASGG 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L ++ +E      +  D+ +  R+  +  L P+              E T         
Sbjct: 66  DLTLTLLEVYTNSSIPCDSASKGRLYTLLSLIPET-------------EPTR-------- 104

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                  + +A+ WS +    ++G PELH         E    D  +   H + G   + 
Sbjct: 105 ----KKIINSAVIWSSKNSEYESGDPELHHFAGSIYAKEHEPYDAEK---HLLLGT--KD 155

Query: 195 FASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV--ENK--Q 246
             + + N +   Y    P    L ++RA+L YL +GN+++A   +D    ++  +NK  +
Sbjct: 156 SCTVLTNLLYTWYLQDEPSTAPLYISRAVLCYLLIGNVREAVAALDLFTSRLLEDNKGGK 215

Query: 247 LQLQPSD-------------LIQFVFYVLQTLER--DALPLFNMLRANYKASIEREPAFN 291
           L +Q  +             L+ ++  ++ T+ R      +F  LR  Y+  +     ++
Sbjct: 216 LAVQSVETNALDARIFPALPLMNYLRLLVLTVTRGKGGADVFRGLRQQYQNLLAEVGGWD 275

Query: 292 EMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
           E L+ I E +FG+   ++RN M  + G +F
Sbjct: 276 EALEQIGEMYFGIQIPRQRNFMADMMGSLF 305


>gi|24643965|ref|NP_649464.1| CG9853, isoform A [Drosophila melanogaster]
 gi|161078013|ref|NP_001097680.1| CG9853, isoform B [Drosophila melanogaster]
 gi|74948256|sp|Q9VN19.1|GET4_DROME RecName: Full=Golgi to ER traffic protein 4 homolog
 gi|7296857|gb|AAF52132.1| CG9853, isoform A [Drosophila melanogaster]
 gi|25012242|gb|AAN71235.1| LD21829p [Drosophila melanogaster]
 gi|158030157|gb|ABW08596.1| CG9853, isoform B [Drosophila melanogaster]
 gi|220943870|gb|ACL84478.1| CG9853-PA [synthetic construct]
 gi|220953748|gb|ACL89417.1| CG9853-PA [synthetic construct]
          Length = 339

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 80/313 (25%), Positives = 140/313 (44%), Gaps = 38/313 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL + +  G FY A  MY+++  RY A +RY + L+LL  GA   +   Q +  A+L + 
Sbjct: 28  KLSQSLAGGEFYEAHMMYRTLYFRYTAQKRYQDCLELLFDGAQQLIAKEQESSAADLCLL 87

Query: 80  FVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            V+ L K G  A D +N          F  +P          L  L   L +A +  E  
Sbjct: 88  LVDTLEKRGPQAEDTDN----------FLWVP---------RLGALIRGLNSATVERE-- 126

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            + ++  IKWS      + G P LH ++A   ++E   ++ AR   H++   +       
Sbjct: 127 -TLIQRTIKWSTALHG-QYGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCGRV 181

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ---PSDL 254
           ++     + + GE DL + +A+L  LSL + K A     E  +    K L+ +      L
Sbjct: 182 LIEISQSRGFQGEMDLFLVQAVLQQLSLKDRKTAEDTFTEYTR-YHAKLLKHEFPYKEPL 240

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFG----VKRRNP- 309
           + F++++ + ++   +  F  LR  Y  S++R+ +F + +  I   +F          P 
Sbjct: 241 VNFLYFLFRLIDAKRVAGFRALRKLYDPSLKRDTSFLKYVAKIGVLYFDEHPEAAHSGPP 300

Query: 310 -MQGIFGDIFKMM 321
            + G+FGDIF  +
Sbjct: 301 GLGGMFGDIFNRL 313


>gi|297719663|ref|NP_001172193.1| Os01g0164700 [Oryza sativa Japonica Group]
 gi|255672902|dbj|BAH90923.1| Os01g0164700 [Oryza sativa Japonica Group]
          Length = 130

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 38/60 (63%), Positives = 49/60 (81%)

Query: 11 LPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL 70
          LPP Q+ I+KLE +V EGN+Y AQQMYKS  ARY+AAQ+Y EALD+L +GA +QLK+ Q+
Sbjct: 19 LPPPQQTIEKLENMVAEGNYYEAQQMYKSTGARYIAAQKYLEALDILQSGALVQLKHGQV 78


>gi|426255498|ref|XP_004021385.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Ovis aries]
          Length = 400

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 69/254 (27%), Positives = 130/254 (51%), Gaps = 28/254 (11%)

Query: 44  YVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIY 103
           Y+A  +++EA +L+ +GA L   + Q    A+L++  +E+L K +V   +E L+ + K++
Sbjct: 65  YMAQSKHAEARELMCSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLESLAKLF 124

Query: 104 ELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
                I  PH                A   RV    +F+  A+KWS   G+ K G P LH
Sbjct: 125 TGL-CIRHPHR--------------PAGWTRV----AFVSRALKWS-SGGSGKLGHPRLH 164

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMY 222
            +LA  L+  S  L  +++S   +  ++ E  A+ +V +   + +  E D+ VA+A+L +
Sbjct: 165 QLLALTLWKGSAFLACSQLS--PLHSSDGEGCANMLVEYSTARGFRSEADMFVAQAVLQF 222

Query: 223 LSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRAN 279
           L L N   A+ +     ++   +E+    +QP  L+ F++++L  ++   L +F +L   
Sbjct: 223 LCLKNKSSASVVFTTYTQKHPSIESGPPFVQP--LLNFIWFLLLAVDGGKLTVFTVLCEQ 280

Query: 280 YKASIEREPAFNEM 293
           Y+ S+ R+P +NE+
Sbjct: 281 YQPSLRRDPMYNEV 294


>gi|340959852|gb|EGS21033.1| hypothetical protein CTHT_0028730 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 329

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 145/326 (44%), Gaps = 54/326 (16%)

Query: 18  IDKLEKIVNEGN---FYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           I +L++ + EG     Y A Q  + ++ARY     ++ A+D+L + +   L++ Q   G 
Sbjct: 9   IARLQRRIAEGQPEEQYEAAQETRLVAARYSKQGNWAAAVDILASVSQTLLRSGQGGSGG 68

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKL 133
           +LAV  V+   +     D  +  ++     LF P  PV                      
Sbjct: 69  DLAVLLVDTFRQAGQRVDGASRGKLLGCLRLFQPGEPVRKR------------------- 109

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN--N 191
                  F+K  I WS +FG    G PELH ++   LY E  E + A    H V G   +
Sbjct: 110 -------FVKEMIDWSKKFGDYPAGDPELHHVVGT-LYVEEGEFEAAEK--HLVLGTKES 159

Query: 192 PEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---------YIMDEVKKQV 242
           PE  A     +  +       L  ARA+L YL + N++ AN          + D     V
Sbjct: 160 PEVLARMEYEWYKQDESHTAPLYCARAVLPYLLVANVRAANTAYRIFTSALVEDNKGLTV 219

Query: 243 EN-----KQLQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASI-EREPAFNEMLD 295
           +N      +L++ PS  L+ F+  +L ++++ +  LF  L++ Y+A++ E    ++  L+
Sbjct: 220 QNIGSQSAELRIFPSLPLLNFISMLLLSVQKGSPDLFRQLKSKYEANLNELNGIWDTALE 279

Query: 296 DIAEKFFGVKR---RNPMQGIFGDIF 318
            IAE +FG++R    NP+  + G +F
Sbjct: 280 LIAEMYFGIQRPRQSNPLLDMMGSLF 305


>gi|291191289|pdb|3LPZ|A Chain A, Crystal Structure Of C. Therm. Get4
          Length = 336

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 145/326 (44%), Gaps = 54/326 (16%)

Query: 18  IDKLEKIVNEGN---FYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           I +L++ + EG     Y A Q  + ++ARY     ++ A+D+L + +   L++ Q   G 
Sbjct: 16  IARLQRRIAEGQPEEQYEAAQETRLVAARYSKQGNWAAAVDILASVSQTLLRSGQGGSGG 75

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKL 133
           +LAV  V+   +     D  +  ++     LF P  PV                      
Sbjct: 76  DLAVLLVDTFRQAGQRVDGASRGKLLGCLRLFQPGEPVRKR------------------- 116

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN--N 191
                  F+K  I WS +FG    G PELH ++   LY E  E + A    H V G   +
Sbjct: 117 -------FVKEMIDWSKKFGDYPAGDPELHHVVGT-LYVEEGEFEAAEK--HLVLGTKES 166

Query: 192 PEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---------YIMDEVKKQV 242
           PE  A     +  +       L  ARA+L YL + N++ AN          + D     V
Sbjct: 167 PEVLARMEYEWYKQDESHTAPLYCARAVLPYLLVANVRAANTAYRIFTSALVEDNKGLTV 226

Query: 243 EN-----KQLQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASI-EREPAFNEMLD 295
           +N      +L++ PS  L+ F+  +L ++++ +  LF  L++ Y+A++ E    ++  L+
Sbjct: 227 QNIGSQSAELRIFPSLPLLNFISMLLLSVQKGSPDLFRQLKSKYEANLNELNGIWDTALE 286

Query: 296 DIAEKFFGVKR---RNPMQGIFGDIF 318
            IAE +FG++R    NP+  + G +F
Sbjct: 287 LIAEMYFGIQRPRQSNPLLDMMGSLF 312


>gi|358389687|gb|EHK27279.1| hypothetical protein TRIVIDRAFT_166273 [Trichoderma virens Gv29-8]
          Length = 329

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 88/327 (26%), Positives = 148/327 (45%), Gaps = 62/327 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + EG+FY AQQ  +  ++RY+  Q +  A+D+L++ A   LK  Q   G +L 
Sbjct: 9   VARLQEKIAEGDFYEAQQQTRVAASRYIKTQNWPAAIDILYSVAQSLLKAAQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           V  V+   + ++  D  +  R+     LF P+ P             +TE++G       
Sbjct: 69  VMMVDVYKQAELKPDATSKGRLLTCLRLFDPEEPTRKKF--------ITESMG------- 113

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEK 194
                      WS +FG    G PELH  +A  LY+E  E +      H + G  ++PE 
Sbjct: 114 -----------WSSKFGEYPAGDPELH-HVAGSLYAE--EHETYEAERHLILGTKDSPE- 158

Query: 195 FASTIVNFMGKCYP-GEDDLA---VARAILMYLSLGNMKDA--NYIMDEVKKQVENKQLQ 248
               + N     Y  G+  +A    ARA+L YL +GN++ A  +Y +       +N  L 
Sbjct: 159 ---VLFNMEYAWYKEGDAHMAPHFAARAVLPYLLVGNVRAAATSYRLFASALTNDNPTLG 215

Query: 249 LQ-----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLD 295
           +Q      SD+  F        +  +L  ++R A  ++  L + Y   I+   A+ E L+
Sbjct: 216 VQEVSSATSDVRIFPSLPLLNFLGLLLLAIQRGAPEVYKGLVSKYATQIDEAEAWAEPLE 275

Query: 296 DIAEKFFGVKR---RNP----MQGIFG 315
            I E +FG+++    NP    M G+FG
Sbjct: 276 MIGEMYFGIQKPRQSNPLMDMMSGLFG 302


>gi|340522311|gb|EGR52544.1| predicted protein [Trichoderma reesei QM6a]
          Length = 329

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 89/325 (27%), Positives = 147/325 (45%), Gaps = 58/325 (17%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + EG++Y AQQ  +  ++RY+ AQ +  A+D+L++ A   LK  Q   G +L 
Sbjct: 9   VARLQEKIAEGDYYEAQQQTRVAASRYIKAQNWPAAIDVLYSVAQSLLKAAQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           V  V+   + ++  D  +  R+     LF             D  E T            
Sbjct: 69  VMMVDVYKQAELRPDATSKGRLLTCLRLF-------------DPAEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+  ++ WS +FG    G PELH  +A  LY+E  E +      H + G  ++PE  
Sbjct: 105 -KKFIAESMGWSSKFGEYPAGDPELH-HVAGSLYAE--EHETYEAERHLILGTKDSPEVL 160

Query: 196 ASTIVNFMGKCYPGEDDLA---VARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ 250
                 +  +   G+  LA    ARAIL YL +GN++ AN  Y +       +N  L +Q
Sbjct: 161 LKMEYAWYRE---GDAHLAPHFAARAILPYLLVGNVRAANTCYRLFISALTTDNPSLGVQ 217

Query: 251 ------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
                       PS  L+ F+  +L  ++R A  ++  L + Y   I+   A+ E L+ I
Sbjct: 218 DVSSATCDARIFPSLPLLNFLGLLLLAIQRGAPEVYKGLVSKYATQIDEAEAWAEPLEMI 277

Query: 298 AEKFFGV---KRRNP----MQGIFG 315
            E +FG+   ++ NP    M G+FG
Sbjct: 278 GEMYFGIAKPRQSNPLMDMMSGLFG 302


>gi|322705460|gb|EFY97046.1| hypothetical protein MAA_07592 [Metarhizium anisopliae ARSEF 23]
          Length = 334

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 82/324 (25%), Positives = 142/324 (43%), Gaps = 54/324 (16%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + +G+FY AQQ  +  ++RY+  + +  A+D+L + A   LK  Q   G +L 
Sbjct: 9   VARLQQKITDGDFYEAQQQTRVAASRYIKTKNWDAAIDILSSVAQSLLKAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              V+   + ++A D+ +  R+     LF             D  E T            
Sbjct: 69  TMMVDVYKQAELAPDSASKGRLLTCLRLF-------------DGEEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
              F+   + WS +FG    G PELH  +A  +Y+E  E D      H + G    K + 
Sbjct: 105 -KKFITEMLAWSSKFGEYPAGDPELH-HVAGSIYAE--EHDTYEAERHLLLGT---KDSP 157

Query: 198 TIVNFMGKCYPGEDDLAVA-----RAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ 250
            ++  M   +  E D  +A     RA+L YL +GN++ AN  Y +       +N  L +Q
Sbjct: 158 QVLFKMEYTWYKEGDAHLAPHFAGRAVLPYLLVGNVRAANNCYRLFTSALSSDNPSLGVQ 217

Query: 251 -----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
                 SD+  F        +  +L  ++R A  ++  L + Y   I    A+ E L+ I
Sbjct: 218 DVSSASSDIRIFPSLPLLNFLGLLLLAIQRAAPEVYKGLVSKYATQINEADAWTEPLEMI 277

Query: 298 AEKFFGV---KRRNPMQGIFGDIF 318
           AE +FG+   ++ NP+  +   +F
Sbjct: 278 AEMYFGIAKPRQSNPLMDMMSGLF 301


>gi|358401387|gb|EHK50693.1| hypothetical protein TRIATDRAFT_52992 [Trichoderma atroviride IMI
           206040]
          Length = 326

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 147/326 (45%), Gaps = 60/326 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + +G+FY AQQ  +  ++RY+  Q +S A+D+L++ A   LK  Q   G +L 
Sbjct: 9   VARLQEKITDGDFYEAQQQTRVAASRYIKTQNWSAAIDILYSVAQSLLKAQQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              ++   + ++  D  +  R+     LF             D  E T            
Sbjct: 69  AMMLDVYKQAELKPDATSKGRLLTCLRLF-------------DPLEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+  ++ WS +FG    G PE+H  +A  LY+E  E +      H V G  ++PE  
Sbjct: 105 -KKFITESVGWSAKFGEYPAGDPEIH-HVAGSLYAE--EHETYEAERHLVLGTKDSPE-- 158

Query: 196 ASTIVNFMGKCYP-GEDDLA---VARAILMYLSLGNMK--DANYIMDEVKKQVENKQLQL 249
              ++N     Y  G+  +A    ARA+L YL +GN++   A Y +       +N  L +
Sbjct: 159 --VLINMEYAWYKEGDAHMAPHFAARAVLPYLLVGNVRAATATYRLFISALTNDNPALGV 216

Query: 250 Q-----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
           Q      SD+  F        +  +L  ++R A  ++  L + Y + I+   A+ E L+ 
Sbjct: 217 QDVSSNTSDIRIFPSLPLLNFLGLLLLAIQRGAPEVYKGLISKYASQIDEAEAWAEPLEM 276

Query: 297 IAEKFFGVKR---RNP----MQGIFG 315
           I E +FG+++    NP    M G+FG
Sbjct: 277 IGEMYFGIQKPRQSNPLMDMMSGLFG 302


>gi|195451005|ref|XP_002072727.1| GK13758 [Drosophila willistoni]
 gi|194168812|gb|EDW83713.1| GK13758 [Drosophila willistoni]
          Length = 342

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 76/315 (24%), Positives = 143/315 (45%), Gaps = 39/315 (12%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL   +  G FY A  MY+++  RY A +RY + L+LL  GA   +  +Q +  A+LA
Sbjct: 30  LHKLANSLEAGEFYEAHMMYRTLYFRYTAQKRYQDCLNLLFDGAEKLIAKDQHSSAADLA 89

Query: 78  VSFVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  ++ L K G +  DN N   V ++  L      P  +E D                  
Sbjct: 90  LLLIDTLEKRGSLPGDN-NFYWVPRLGSLIRGFS-PTTVERD------------------ 129

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
              +F++  +KWS      K G P LH ++A ++Y     +++AR   H++   +     
Sbjct: 130 ---TFIQRVVKWSSGLHG-KYGHPVLHKLIA-HVYWADGNVELAR--HHYLLCQDGSLCG 182

Query: 197 STIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK---QVENKQLQLQPS 252
             ++   + K +  E DL + +A+L  LSL + + A  +  E  +   ++   +   +  
Sbjct: 183 RVLIEISVSKGFRSEVDLFLVQAVLQQLSLKDQRTAEVMFYEYTRYHPKILRHEFPFREP 242

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNP--- 309
            L+ F++ + + ++  ++  F  LR  Y  S++R+ +F + L  I   +F  +   P   
Sbjct: 243 -LVNFLYCLFRLIDTKSVNGFRALRKLYDPSLKRDTSFLKYLTKIGIVYFDEQPEVPHSG 301

Query: 310 ---MQGIFGDIFKMM 321
              + G+ GDIF  +
Sbjct: 302 PSGLGGMIGDIFNRL 316


>gi|378730451|gb|EHY56910.1| hypothetical protein HMPREF1120_04974 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 345

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 84/329 (25%), Positives = 135/329 (41%), Gaps = 54/329 (16%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + I + ++ + EGNFY A Q  + I+ARY+ A  Y+ A D+LH G+ + L+  Q   G +
Sbjct: 8   KTIARQKEKIAEGNFYEAHQQLRVITARYLKAHDYASASDVLHTGSLMLLRAGQGGSGGD 67

Query: 76  LAVSFV-EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           LA+  + +  VKG+   + EN  R+ +I   FP               E T         
Sbjct: 68  LALMLLNDVYVKGEWVCNQENKSRLVEILRAFPT-------------GEPTR-------- 106

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+   + WS +FG  + G PELH    +    E    D  R   H + G     
Sbjct: 107 ----KRFITEMVAWSAKFGDIERGDPELHFEAGKVYAEEGEVYDAER---HLILGTRESA 159

Query: 195 --FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL--- 249
              AS   N+     P    +  +R++L YL +G++  AN  +      +      L   
Sbjct: 160 PILASLHYNWYRLDQPHLAAIYASRSVLPYLLVGDLASANQALSTFTSHLTTSNPALLAM 219

Query: 250 -QP-----SDLIQFVFYVL----------QTLERDALPLFNMLRANYKASI-EREPAFNE 292
            QP     S L   VF  +             ++    LF  L   Y + + + E  +++
Sbjct: 220 SQPLESSKSGLSLRVFPSIPLLNFLALLLLAAQKADAGLFRQLAKYYASHLKDTEGLWSD 279

Query: 293 MLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
            L +I + +FG+  R P QG    +F MM
Sbjct: 280 ALANIGDIWFGI--RLPRQG-GNPLFDMM 305


>gi|302909297|ref|XP_003050041.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256730978|gb|EEU44328.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 328

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 78/320 (24%), Positives = 131/320 (40%), Gaps = 48/320 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L+  + EG++Y AQQ  +  ++RY+    +  A+D+L   A   L+  Q   G +L 
Sbjct: 9   VARLQAKIEEGDYYEAQQQTRVAASRYIKTSNWPAAIDILSNVAQALLRAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++   + ++  D  +  R+     LF             D  E T            
Sbjct: 69  IMLLDVYKQAELKPDATSKGRLLTCLRLF-------------DAEEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
              F+   I WS +FG    G PELH  +A  LY+E  E   A         ++PE    
Sbjct: 105 -KKFVGEIIAWSAKFGEYPAGDPELH-HVAGSLYAEEHETYEAEKHLILGTKDSPEVLTK 162

Query: 198 TIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ----- 250
               +  +  P    L   RA+L YL +GN++ AN  Y +       +N  + +Q     
Sbjct: 163 MEYTWYKEGEPHTAALFAGRAVLPYLLIGNVRAANTCYRIFTSSLSSDNPNIGVQDVSSA 222

Query: 251 PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
             D+  F        + ++L  ++R A  LF  L + Y   I     + E L+ + E +F
Sbjct: 223 AGDIRIFPSIPLLNFLGFLLLAVQRGAPELFRALVSKYATQINEAGTWGEALEMVGEMYF 282

Query: 303 GV---KRRNP----MQGIFG 315
           G+   ++ NP    M G FG
Sbjct: 283 GLAKPRQSNPLMDMMSGFFG 302


>gi|396495705|ref|XP_003844611.1| similar to DUF410 domain protein [Leptosphaeria maculans JN3]
 gi|312221191|emb|CBY01132.1| similar to DUF410 domain protein [Leptosphaeria maculans JN3]
          Length = 339

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/323 (25%), Positives = 142/323 (43%), Gaps = 56/323 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           ++ + +  + + EG +Y A Q  + +++RY  AQ +S A  +L+ GA   L+  Q   G 
Sbjct: 6   EKTLQRQREKIAEGQYYEAHQQLRVVASRYTKAQDWSNATAILYEGAQSLLQAGQGGSGG 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L V  ++   KG++  D  +  ++  +   FP+              E T+        
Sbjct: 66  DLCVFLLDVYSKGEIKPDASSKGKLLSLLRAFPK-------------DEPTK-------- 104

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSES------PELDMARVSFHFVR 188
                 F+   I WS +FG    G PELH  +A  L++E         L+      H + 
Sbjct: 105 ----KKFVGEMIAWSSKFGEFPAGDPELH-HVAGTLFAEGKFFVPHTHLEPYDAERHLLL 159

Query: 189 GNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYIMDEVKKQVEN 244
           G   +    T+ +     Y  +D     L  AR +L YL  GN++ AN        Q+  
Sbjct: 160 GT--QDSPDTLSSLEYAWYEADDSHTAPLYCARGVLPYLLTGNLRAANKFFLLFTSQLAK 217

Query: 245 K-------------QLQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
           K             +L++ PS  L+ F+  +L ++ER    L+  L+++Y  ++ ++  +
Sbjct: 218 KPNLNTQNVSTNSTELRVYPSLPLLNFLGLLLLSVERGDPSLYRQLKSHYVQNL-KDVNW 276

Query: 291 NEMLDDIAEKFFGV---KRRNPM 310
           NE LD I E +FG+   K+ NPM
Sbjct: 277 NEALDQIGEVYFGIRIPKQGNPM 299


>gi|119467816|ref|XP_001257714.1| hypothetical protein NFIA_051620 [Neosartorya fischeri NRRL 181]
 gi|119405866|gb|EAW15817.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 346

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/325 (24%), Positives = 135/325 (41%), Gaps = 52/325 (16%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLTCGAELA 77
           EKI + G +Y A Q  + I+ARY+    Y  A ++L  GA   L+          G +LA
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQSNYEAAAEILAGGATALLRAGSQQGASASGGDLA 72

Query: 78  VSFVEALVK----GKVAYDNENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALGAA 131
           +  V+ +      G    D++   R RK  + EL  + P            E T      
Sbjct: 73  IMLVDEVYTKAGWGITGGDDDAEGRARKKRLIELLREFPS----------EEPTR----- 117

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F++  I WS  FG  + G  ELH   A  +Y+E  E   A          +
Sbjct: 118 -------KRFIQEMIGWSGRFGPVERGDAELH-HAAGSVYAEDNEPYDAEKHLILGTSES 169

Query: 192 PEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQL 249
            E  A     +     P    +  +RA+  YL +GN+++AN  +++   +    N  L +
Sbjct: 170 AETLAKLEYEWYTNDEPHTAAIYASRAVFPYLLVGNLRNANKAFLIFTSRLSSSNTSLGV 229

Query: 250 Q------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
           Q            PS  L+ F+  +L T++R +  LF  L A+Y + I+    +++ L  
Sbjct: 230 QEVSSASSDVRVFPSLPLLNFISMLLLTIQRGSADLFKQLTAHYASQIQEVGIWDDALSQ 289

Query: 297 IAEKFFGVK---RRNPMQGIFGDIF 318
           I E++F +K   + NP+  + G + 
Sbjct: 290 IGEQYFAIKVPRQGNPLLDMMGSML 314


>gi|70984104|ref|XP_747572.1| DUF410 domain protein [Aspergillus fumigatus Af293]
 gi|66845199|gb|EAL85534.1| DUF410 domain protein [Aspergillus fumigatus Af293]
 gi|159122358|gb|EDP47479.1| DUF410 domain protein [Aspergillus fumigatus A1163]
          Length = 346

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 85/330 (25%), Positives = 141/330 (42%), Gaps = 62/330 (18%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLTCGAELA 77
           EKI + G +Y A Q  + I+ARY+    Y  A ++L  GA   L+          G +LA
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQSNYEAAAEILAGGATALLRAGSQQGASASGGDLA 72

Query: 78  VSFVEALVK----GKVAYDNENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALGAA 131
           +  V+ +      G    D++   R RK  + EL  + P            E T      
Sbjct: 73  IMLVDEVYTKAGWGITGGDDDAEGRARKKRLIELLREFPS----------EEPTR----- 117

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPE-LDMARVSFHFVRGN 190
                    F++  I WS  FG  + G  ELH   A  +Y+E  E  D  +   H V G 
Sbjct: 118 -------KRFIQEMIGWSGRFGPVERGDAELH-HAAGSVYAEDNEPYDAEK---HLVLGT 166

Query: 191 NPEKFASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
           +  + A T+     + Y    P    +  +RA+  YL +GN+++AN  +++   +    N
Sbjct: 167 S--ESAETLAKLEYEWYTNDEPHTAAIYASRAVFPYLLVGNLRNANKAFLVFTSRLSSSN 224

Query: 245 KQLQLQ------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
             L +Q            PS  L+ F+  +L T++R +  LF  L A+Y + I     ++
Sbjct: 225 TSLGVQEVSSASSDVRVFPSLPLLNFISMLLLTIQRGSADLFKQLTAHYASQIREVGIWD 284

Query: 292 EMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
           + L  I E++F +K   + NP+  + G + 
Sbjct: 285 DALSQIGEQYFAIKVPRQGNPLLDMMGSML 314


>gi|322693066|gb|EFY84941.1| hypothetical protein MAC_09016 [Metarhizium acridum CQMa 102]
          Length = 328

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 134/324 (41%), Gaps = 54/324 (16%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + +G+FY AQQ  +  ++RY+  + +  A+D+L   A   LK  Q   G +L 
Sbjct: 9   VARLQQKITDGDFYEAQQQTRVAASRYIKTKNWDAAIDILSNVAQSLLKAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  V+   + ++  D+ +  R+     LF             D  E T            
Sbjct: 69  IMMVDVYKQAELTPDSASKGRLLTCLRLF-------------DGEEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
              F+   + WS +FG    G PELH  +A  +Y+E  E D      H V G    K + 
Sbjct: 105 -KKFITEMLAWSSKFGEYPAGDPELH-HVAGSIYAE--EHDTYEAERHLVLGT---KDSP 157

Query: 198 TIVNFMGKCYPGEDDLAVA-----RAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS 252
            ++  M   +  E D  +A     RA+L YL +GN++ AN         + N    L   
Sbjct: 158 EVLFKMEYAWYKEGDAHLAPHFAGRAVLPYLLVGNVRAANNCYRLFTSALSNDNPSLGVQ 217

Query: 253 DLIQFVFYV---------------LQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
           D+      +               L  ++R A  ++  L + Y   I    A+ E L+ I
Sbjct: 218 DVSSSSSDIRIFPSLPLLNFLGLLLLAIQRAAPEVYKGLISKYATQINETDAWAEPLEMI 277

Query: 298 AEKFFGV---KRRNPMQGIFGDIF 318
           AE +FG+   ++ NP+  +   +F
Sbjct: 278 AEMYFGITKPRQSNPLMDMMSGLF 301


>gi|302421282|ref|XP_003008471.1| DUF410 domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261351617|gb|EEY14045.1| DUF410 domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 334

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 82/330 (24%), Positives = 138/330 (41%), Gaps = 62/330 (18%)

Query: 19  DKLEKIVN-------EGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DK+EKI+        EG +Y AQQ  + ++ARYV A  +  A+D+L + A   LK  Q  
Sbjct: 6   DKIEKIIARLQARTAEGQYYEAQQQTRVVAARYVKAANWPAAVDILASVAQSLLKAGQGG 65

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGA 130
            G +L +  V+   + ++  D  +  R+     LF P  P                    
Sbjct: 66  SGGDLCLMLVDVYKQAELRPDAASKGRLLTCLRLFGPGEPTRK----------------- 108

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN 190
                    +F+   I WS +FG    G  ELH +       E    +  R   H V G 
Sbjct: 109 ---------TFISEMIAWSSKFGDYPAGDAELHHVAGSLYAEEHEAYEAER---HLVLGT 156

Query: 191 NPEKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDANYIMDEVKKQVEN- 244
              + ++ ++  M   +  EDD     L  +RA+L YL +GN++ AN        Q+ + 
Sbjct: 157 ---RDSAEVLARMEYAWYKEDDSHTAALYASRAVLPYLLVGNLRAANAAYRLFASQLADA 213

Query: 245 -----KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
                + +    +D+  F        +  +L  ++R     +  L   Y A+I    A+N
Sbjct: 214 GAGGVQDVSTATADVRIFPGLPLLNFLGLLLLAIQRGTPDAYRTLVNKYAANIAELGAWN 273

Query: 292 EMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
           E L+ +AE +FG+   ++ NP+  + G +F
Sbjct: 274 EALEMVAEMYFGIVKPRQSNPLMDMMGSLF 303


>gi|194898490|ref|XP_001978818.1| GG11609 [Drosophila erecta]
 gi|190650521|gb|EDV47776.1| GG11609 [Drosophila erecta]
          Length = 339

 Score = 78.2 bits (191), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 85/328 (25%), Positives = 145/328 (44%), Gaps = 42/328 (12%)

Query: 9   TALPPAQENID----KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQ 64
           +AL   Q  +     KL + +  G FY A  MY+++  RY A +RY + LDLL  GA   
Sbjct: 13  SALSTGQRGVSRVLAKLSQSLAGGEFYEAHMMYRTLYFRYTAQKRYQDCLDLLFDGAQQL 72

Query: 65  LKNNQLTCGAELAVSFVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHE 123
           +   Q +  A+L +  ++ L K G  A D EN          F  +P          L  
Sbjct: 73  IAKEQDSSAADLCLLLLDTLEKRGPQAEDTEN----------FLWVP---------RLGA 113

Query: 124 LTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVS 183
           L   L AA +  E   + ++  IKWS      + G P LH ++A   ++E   ++ AR  
Sbjct: 114 LIRGLNAATVERE---TLIQRTIKWSTALHG-QYGHPVLHKLIAHVFWTEG-NIESAR-- 166

Query: 184 FHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV 242
            H++   +       ++     +   GE DL + +A+L  LSL + K A     E  +  
Sbjct: 167 HHYLLCQDGSLCGRVLIEISQSRGLQGEVDLFLVQAVLQQLSLKDRKTAEDTFTEYTR-Y 225

Query: 243 ENKQLQLQ---PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAE 299
             K L+ +      L+ F++++ + ++   +  F  LR  Y  S++R+ +F + +  I  
Sbjct: 226 HAKLLKHEFPYKEPLVNFLYFLFRLIDAKRVAGFRALRKLYDPSLKRDTSFLKYVAKIGV 285

Query: 300 KFFGVK----RRNP--MQGIFGDIFKMM 321
            +F  +       P  + G+FGDIF  +
Sbjct: 286 LYFDEQPEPAHSGPPGLGGMFGDIFNRL 313


>gi|431903029|gb|ELK09209.1| hypothetical protein PAL_GLEAN10002642, partial [Pteropus alecto]
          Length = 274

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 70/267 (26%), Positives = 124/267 (46%), Gaps = 39/267 (14%)

Query: 43  RYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKI 102
           RY +  R++EA +L+ +GA L   + Q    A+L++  +E+L K +V   +E L+ + K+
Sbjct: 2   RYTSQSRHAEARELMCSGALLFFGHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKL 61

Query: 103 YELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPE 161
           + L  P  P                       RV    +F+  A+KWS   G+ K G P 
Sbjct: 62  FSLMDPSSPE----------------------RV----AFVSRALKWS-SGGSGKLGHPR 94

Query: 162 LHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAIL 220
           LH +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L
Sbjct: 95  LHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVL 151

Query: 221 MYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLR 277
            +L L N   A+ +     ++   +E     +QP     +   ++  ++   L +F +L 
Sbjct: 152 QFLCLKNKSSASVVFTTYTQKHPSIEGGPPFVQPLLNFLWFLLLV--VDGGKLTVFTVLC 209

Query: 278 ANYKASIEREPAFNEMLDDIAEKFFGV 304
             Y+ S+   P  +  LD I + FFGV
Sbjct: 210 EQYQPSL--RPKTHSYLDRIGQLFFGV 234


>gi|46125885|ref|XP_387496.1| hypothetical protein FG07320.1 [Gibberella zeae PH-1]
          Length = 329

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 75/313 (23%), Positives = 135/313 (43%), Gaps = 48/313 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L+  + EG++Y AQQ  +  ++RY+    +  A+D+L + +   L+  Q   G +L 
Sbjct: 9   VARLQAKIEEGDYYEAQQQTRVAASRYIKTSNWPAAIDILSSVSQALLRAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              V+   + ++  D ++  R+     LF             +  E T            
Sbjct: 69  AMLVDVYKQAELKPDADSKGRLLTCLRLF-------------EAGEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+   I WS +FG    G PELH  +A  LY+E  E D      H + G  ++PE  
Sbjct: 105 -KKFVGEIIGWSAKFGDYPAGDPELH-HVAGSLYAE--EHDTYEAEKHLLLGTKDSPEVL 160

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ--- 250
                 +  +  P +  L   RA+L YL +GN++ AN  Y +       +N  L +Q   
Sbjct: 161 TKMEYTWYKESEPHQAALFAGRAVLPYLLVGNVRAANTCYRLFTSSLSSDNPNLGVQDVS 220

Query: 251 -PSDLIQF---------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
            P+  I+          + ++L  ++R A  L+  L + Y   I    ++ E L+ + E 
Sbjct: 221 SPAGDIRIFPSLPLLNFLGFLLLAIQRGAPELYRALVSKYATQINEAGSWAEALEMVGEM 280

Query: 301 FFGV---KRRNPM 310
           +FG+   ++ NP+
Sbjct: 281 YFGLSKPRQSNPL 293


>gi|195343411|ref|XP_002038291.1| GM10704 [Drosophila sechellia]
 gi|195568181|ref|XP_002102096.1| GD19681 [Drosophila simulans]
 gi|194133312|gb|EDW54828.1| GM10704 [Drosophila sechellia]
 gi|194198023|gb|EDX11599.1| GD19681 [Drosophila simulans]
          Length = 339

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 81/313 (25%), Positives = 140/313 (44%), Gaps = 38/313 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL + +  G FY A  MY+++  RY A +RY + LDLL  GA   +   Q +  A+L + 
Sbjct: 28  KLSQSLAGGEFYEAHMMYRTLYFRYTAQKRYQDCLDLLFDGAQQLIAKEQESSAADLCLL 87

Query: 80  FVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            ++ L K G  A D +N             + VP        L  L   L AA +  E  
Sbjct: 88  LLDTLEKRGPQAEDTDNF------------LWVPR-------LGALIRGLNAATVERE-- 126

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
            + ++  IKWS      + G P LH ++A   ++E   ++ AR   H++   +       
Sbjct: 127 -TLIQRTIKWSTALHG-QYGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCGRV 181

Query: 199 IVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ---PSDL 254
           ++     + + GE DL + +A+L  LSL + K A     E  +    K L+ +      L
Sbjct: 182 LIEISQSRGFQGEVDLFLVQAVLQQLSLKDRKTAEDTFTEYTR-YHAKLLKHEFPYKEPL 240

Query: 255 IQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFG----VKRRNP- 309
           + F++++ + ++   +  F  LR  Y  S++R+ +F + +  I   +F          P 
Sbjct: 241 VNFLYFLFRLIDAKRVAGFRALRKLYDPSLKRDTSFLKYIAKIGVLYFDEQPEAAHSGPP 300

Query: 310 -MQGIFGDIFKMM 321
            + G+FGDIF  +
Sbjct: 301 GLGGMFGDIFNRL 313


>gi|342879081|gb|EGU80356.1| hypothetical protein FOXB_09153 [Fusarium oxysporum Fo5176]
          Length = 329

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 73/313 (23%), Positives = 125/313 (39%), Gaps = 48/313 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L+  + EG++Y AQQ  +  ++RY+    +  A+D+L   +   L+  Q   G +L 
Sbjct: 9   VARLQAKIEEGDYYEAQQQTRVAASRYIKTSNWPAAIDILSNVSQALLRAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              V+   + ++  D     R+     LF             +  E T            
Sbjct: 69  AMLVDVYKQAELKPDATAKGRLLTCLRLF-------------EAGEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+   I WS +FG    G PELH  +A  LY+E  E D      H + G  ++PE  
Sbjct: 105 -KKFIGEIIAWSAKFGEYPAGDPELH-HVAGSLYAE--EHDTYEAEKHLILGTKDSPEIL 160

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLI 255
                N+  +  P +  L   RA+L YL +GN++ AN         + N    L   D+ 
Sbjct: 161 TKMEYNWYKESEPHQAALFAGRAVLPYLLVGNVRAANTCYRLFTSSLSNDNPNLGVQDVS 220

Query: 256 QFVFYV---------------LQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
                +               L  ++R A  L+  L + Y   I    ++ E L+ + E 
Sbjct: 221 SSAGDIRIFPSLPLLNFLGFLLLAIQRGAPELYRALVSKYATQINEAGSWAEALEMVGEM 280

Query: 301 FFGV---KRRNPM 310
           +FG+   ++ NP+
Sbjct: 281 YFGLSKPRQSNPL 293


>gi|449688805|ref|XP_002158335.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Hydra
           magnipapillata]
          Length = 280

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 70/286 (24%), Positives = 129/286 (45%), Gaps = 39/286 (13%)

Query: 43  RYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKI 102
           RY + ++Y +A  +L+    + L++NQ     +L    V+   + K    +E+L+++ KI
Sbjct: 4   RYKSQKKYEDAEYILYDSVIILLEHNQHESAFDLGRLLVKLYTETKKNISDESLEKICKI 63

Query: 103 YELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPEL 162
             L  +     N                          ++  A+ WS + G  K G P+L
Sbjct: 64  LSLSSKSSESRN-------------------------DYISEALSWSSQEGY-KYGHPQL 97

Query: 163 HVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILM 221
           H ++A   + E    D      HF   N+  + A  ++ + +   +PGE DL V +A+L 
Sbjct: 98  HSVIANKYWQEK---DYHSARKHFSYTNDGCQCAMFLIEYHILSGFPGEIDLFVTQAVLQ 154

Query: 222 YLSLGNMKDAN-----YIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNML 276
           +L L N   AN     Y  +       N   + QP  L+ F+ ++L  +E+  +  + +L
Sbjct: 155 FLCLRNQPTANTTFINYTSNHPGIHRTNYPYK-QP--LLNFIAFLLLIIEKGNVEQYAIL 211

Query: 277 RANYKASIEREPAFNEMLDDIAEKFFGV-KRRNPMQGIFGDIFKMM 321
             +Y+ SI ++P +++ LD I + FFG+   +    GIFG +  +M
Sbjct: 212 CEHYQPSIHKDPTYSQYLDKIGQIFFGIPPPKKEKSGIFGLLDNLM 257


>gi|403306602|ref|XP_003943815.1| PREDICTED: Golgi to ER traffic protein 4 homolog [Saimiri
           boliviensis boliviensis]
          Length = 309

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 46/160 (28%), Positives = 85/160 (53%), Gaps = 9/160 (5%)

Query: 153 GAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGED 211
           G+ K G P LH +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E 
Sbjct: 118 GSGKLGHPRLHQLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEV 174

Query: 212 DLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERD 268
           D+ VA+A+L +L L N   A+ +     ++   +E+    ++P  L+ F++++L  ++  
Sbjct: 175 DMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPFMEP--LLNFIWFLLLAVDGG 232

Query: 269 ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
            L +F +L   Y+ S+ R+P +NE LD I + FFGV  + 
Sbjct: 233 KLTVFTVLCEQYQPSLRRDPMYNEYLDRIGQLFFGVPPKQ 272


>gi|198419075|ref|XP_002119478.1| PREDICTED: similar to CG9853 CG9853-PB [Ciona intestinalis]
          Length = 315

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 74/309 (23%), Positives = 136/309 (44%), Gaps = 41/309 (13%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
           +KL   +   + Y A Q+Y ++  RY A  ++  A +LL+ GA    ++ +    A L +
Sbjct: 13  EKLRLSIERNDLYEAHQLYNTLFYRYSAKNKHDLARNLLYKGALHLFEHEETGSAAHLCL 72

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFP-QIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
            ++E+L         E  + + K++   P +IP            EL             
Sbjct: 73  LYLESLENNVHEVTTELAEILGKMHHELPKEIP------------EL------------- 107

Query: 138 CSSFLKAAIKWSIEF-GAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
             +F   AI WS    G P+ GS +L    A   + E    D  +   HF+  ++ +KF 
Sbjct: 108 -ETFENRAITWSAGCTGMPRFGSKDLRFKFAMNYWEEKMFRDSRQ---HFLYSDDGQKFG 163

Query: 197 STIVNFMGKC-YPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSD-- 253
             +  F+  C  P E DL + +A+L  L +   K A  + +   K +     ++ P    
Sbjct: 164 EMLKEFVILCGRPEEIDLFLTQAVLQLLCIKCEKAAEDMFNTYTKIIS----KISPGPPY 219

Query: 254 ---LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPM 310
              +I F+F++L+ ++   L LF +L  NY+ S++++ +F + L+ I E +F V      
Sbjct: 220 NYPIINFLFFLLRAVKERRLILFTILCDNYQPSLKKDASFLKYLEKIGEIYFDVPAVQSK 279

Query: 311 QGIFGDIFK 319
            G F ++ +
Sbjct: 280 PGFFENLMQ 288


>gi|346974654|gb|EGY18106.1| DUF410 domain-containing protein [Verticillium dahliae VdLs.17]
          Length = 333

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 81/330 (24%), Positives = 137/330 (41%), Gaps = 62/330 (18%)

Query: 19  DKLEKIVN-------EGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DK+EKI+        EG +Y AQQ  + ++ARYV A  +  A+D+L + A   LK  Q  
Sbjct: 6   DKIEKIIARLQARTAEGQYYEAQQQTRVVAARYVKAANWPAAVDILASVAQSLLKAGQGG 65

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGA 130
            G +L +  V+   +  +  D  +  R+     LF P  P                    
Sbjct: 66  SGGDLCLMLVDVYKQAALTPDAASKGRLLTCLRLFGPGEPTRK----------------- 108

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN 190
                    +F+   I WS +FG    G  ELH +       E    +  R   H V G 
Sbjct: 109 ---------TFISEMIAWSSKFGEYPAGDAELHHVAGSLYAEEHEAYEAER---HLVLGT 156

Query: 191 NPEKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDANYIMDEVKKQVEN- 244
              + ++ ++  M   +  EDD     L  +RA+L YL +GN++ AN        Q+ + 
Sbjct: 157 ---RDSAEVLARMEYAWYREDDSHTAALYASRAVLPYLLVGNLRAANAAYRLFASQLADA 213

Query: 245 -----KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
                + +    +D+  F        +  +L  ++R     +  L   Y A+I    A+N
Sbjct: 214 GAGGVQDVSTATADVRIFPGLPLLNFLGLLLLAIQRGTPDAYRTLVNKYAANIAELGAWN 273

Query: 292 EMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
           + L+ +AE +FG+   ++ NP+  + G +F
Sbjct: 274 DALEMVAEMYFGIVKPRQSNPLMDMMGSLF 303


>gi|67540690|ref|XP_664119.1| hypothetical protein AN6515.2 [Aspergillus nidulans FGSC A4]
 gi|40738665|gb|EAA57855.1| hypothetical protein AN6515.2 [Aspergillus nidulans FGSC A4]
 gi|259480085|tpe|CBF70895.1| TPA: DUF410 domain protein (AFU_orthologue; AFUA_6G05050)
           [Aspergillus nidulans FGSC A4]
          Length = 345

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/325 (25%), Positives = 135/325 (41%), Gaps = 52/325 (16%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLTCGAELA 77
           EKI + G +Y A Q  + I+ARY+    Y  A ++L  GA   L+          G +LA
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQNNYDAAAEILAGGATALLRAGSQQGASASGGDLA 72

Query: 78  VSFV-EALVKGK---VAYDNENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALGAA 131
           +  V E  VK     V  D++   R RK  + EL  + P            E T      
Sbjct: 73  IMLVTEVYVKAGWEIVGGDDDAEGRARKKRLIELLREFPS----------EEPTR----- 117

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F++ AI WS  FG+ + G PELH  +   +Y+E  E   A          +
Sbjct: 118 -------KRFIQEAIGWSGRFGSVERGDPELHHAVGS-VYAEDQEPYEAEKHLVLGTSES 169

Query: 192 PEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQL 249
            E  A     +     P    +  ARA++ +L +GN++ AN  +++   +    N  L +
Sbjct: 170 AETLAKLEYEWYTSDEPHTAGIYAARAVIPFLLIGNLRSANKAFLIFTSRLSSANPSLGV 229

Query: 250 Q-----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
           Q      SD+  F        +  +L  ++R    LF  L A+Y   I+     ++ L  
Sbjct: 230 QDVSSATSDVRVFPALPLLNFLNMLLLAIQRGTADLFKQLTAHYATHIKEAGILDDALPQ 289

Query: 297 IAEKFFGV---KRRNPMQGIFGDIF 318
           I E++FG+   ++ NP+  + G + 
Sbjct: 290 IGEQYFGINIPRQGNPLLDMMGSML 314


>gi|408396466|gb|EKJ75623.1| hypothetical protein FPSE_04124 [Fusarium pseudograminearum CS3096]
          Length = 329

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 74/313 (23%), Positives = 133/313 (42%), Gaps = 48/313 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L+  + EG++Y AQQ  +  ++RY+    +  A+D+L + +   L+  Q   G +L 
Sbjct: 9   VARLQAKIEEGDYYEAQQQTRVAASRYIKTSNWPAAIDILSSVSQALLRAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              V+   + ++  D     R+     LF             +  E T            
Sbjct: 69  AMLVDVYKQAELKPDANTKGRLLTCLRLF-------------EAGEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+   + WS +FG    G PELH  +A  LY+E  E D      H + G  ++PE  
Sbjct: 105 -KKFVGEIVGWSAKFGDYPAGDPELH-HVAGSLYAE--EHDTYEAEKHLLLGTKDSPEVL 160

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ--- 250
                 +  +  P +  L   RA+L YL +GN++ AN  Y +       +N  L +Q   
Sbjct: 161 TKMEYTWYKESEPHQAALFAGRAVLPYLLVGNVRAANTCYRLFTSSLSSDNPNLGVQDVS 220

Query: 251 -PSDLIQF---------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
            P+  I+          + ++L  ++R A  L+  L + Y   I    ++ E L+ + E 
Sbjct: 221 SPAGDIRIFPSLPLLNFLGFLLLAIQRGAPELYRALVSKYATQINEAGSWAEALEMVGEM 280

Query: 301 FFGV---KRRNPM 310
           +FG+   ++ NP+
Sbjct: 281 YFGLSKPRQNNPL 293


>gi|400596728|gb|EJP64484.1| DUF410 domain-containing protein [Beauveria bassiana ARSEF 2860]
          Length = 333

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/324 (24%), Positives = 142/324 (43%), Gaps = 54/324 (16%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + +G++Y AQQ  +  ++R++  Q +  A+D+L + A   LK  Q   G +L 
Sbjct: 9   VARLQQKIADGDYYEAQQQTRVAASRHIKTQNWPAAIDILSSVAQSLLKAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           V  V+   +  +  D  N  R+     LF             D  E T            
Sbjct: 69  VMLVDVYKQAALKPDATNKGRLLTALRLF-------------DGEEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
              F+   + WS +FG    G PELH  +A  LY+   E +      H + G    + ++
Sbjct: 105 -KKFITEIVAWSSKFGEYPAGDPELH-HVAGSLYAA--EHETYEAEKHLILGT---RDSA 157

Query: 198 TIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ 250
            ++  M   +  E D     +  ARA+L YL +GN++ AN  Y +     + +N  L ++
Sbjct: 158 EVLFKMEYAWYKESDAHTAPMYAARAVLPYLLIGNVRAANTCYQLFTSALRADNPALGVE 217

Query: 251 -----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
                 +D+  F        +  +L  ++R A  ++  L + Y A I+    + E L+ I
Sbjct: 218 DVSASGADVRVFPSLPLLNFLGLLLLAVQRGAPEVYRGLVSKYAAHIDEVGGWGEALEMI 277

Query: 298 AEKFFGVKR---RNPMQGIFGDIF 318
           AE +FG+++    NP+  +   +F
Sbjct: 278 AEMYFGIQKPRQSNPLMDMMSGLF 301


>gi|425773730|gb|EKV12064.1| hypothetical protein PDIG_46790 [Penicillium digitatum PHI26]
 gi|425782291|gb|EKV20210.1| hypothetical protein PDIP_18710 [Penicillium digitatum Pd1]
          Length = 346

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/326 (25%), Positives = 133/326 (40%), Gaps = 58/326 (17%)

Query: 18  IDKL-----EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NN 68
           IDK+     EKI + G +Y A Q  + I+ RY+    Y  A ++L  GA   LK      
Sbjct: 5   IDKVIARQREKIAD-GAYYEAHQQLRVIAVRYIKQSNYDAAAEILAGGAKELLKAGSQQG 63

Query: 69  QLTCGAELAVSFV-------EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDL 121
               G +LA+  V       E  + G  A D E   R +++ EL  + P       D+  
Sbjct: 64  ASASGGDLAIMLVLEVYNKAEWEISGG-ADDAEGRSRKKRLIELLREFP------SDEPT 116

Query: 122 HELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMAR 181
            +                 F++  I WS +FG  + G P+LH  +   +Y+E  E   A 
Sbjct: 117 RK----------------RFIQEMIGWSAKFGPLERGDPDLHHAVGS-VYAEDNEPYDAE 159

Query: 182 VSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVK 239
                    + E  A     +     P    +  +RA+  YL  GN++ AN   ++   +
Sbjct: 160 KHLTLGTFESAETLAKLEYEWYTSDEPHTAAIYASRAVFPYLLTGNIRSANKALLIFTSR 219

Query: 240 KQVENKQLQLQ-----PSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIER 286
               N QL +Q      SD        L+ F+  +L  ++R +  LF  L A+Y A I  
Sbjct: 220 LAAANPQLAVQDVSSASSDARVFPALPLVNFISMLLLAIQRGSADLFRQLTAHYAAQIRE 279

Query: 287 EPAFNEMLDDIAEKFFGVKRRNPMQG 312
              +++ L  I E++F +K   P QG
Sbjct: 280 VGIWDDALAQIGEQYFAIK--IPRQG 303


>gi|380493217|emb|CCF34039.1| hypothetical protein CH063_06110, partial [Colletotrichum
           higginsianum]
          Length = 249

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 107/246 (43%), Gaps = 45/246 (18%)

Query: 19  DKLEKIV-------NEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DK+EKI+        EG FY AQQ  + ++ARY+ A  +  A D+L+  A   LK  Q  
Sbjct: 6   DKIEKIIARLQARIGEGQFYEAQQQTRVVAARYIKAANWPAATDILYNVAQSLLKAGQGG 65

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LAV  V+   + ++  D  +  ++     LFP               E T      
Sbjct: 66  SGGDLAVMLVDVFRQAELKPDAASKGKLLTCLRLFPA-------------QEPTR----- 107

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F+   I WS +FG    G  ELH  +A  LY+E  E D      H V G  
Sbjct: 108 -------KKFVGEMIAWSAKFGDYPAGDAELH-HVAGSLYAE--EHDAYEAERHLVLGT- 156

Query: 192 PEKFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
             K ++ ++  M   +  ED+     L  ARAIL YL +GN++ AN  Y +       +N
Sbjct: 157 --KDSAEVLAKMEYVWYKEDESHTAPLYAARAILPYLLIGNVRAANTCYRLFASSLSDDN 214

Query: 245 KQLQLQ 250
           K L +Q
Sbjct: 215 KGLGVQ 220


>gi|195497065|ref|XP_002095943.1| GE25354 [Drosophila yakuba]
 gi|194182044|gb|EDW95655.1| GE25354 [Drosophila yakuba]
          Length = 339

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 142/315 (45%), Gaps = 38/315 (12%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL + +  G FY A  MY+++  RY A +RY + L+LL  GA   +   Q +  A+L 
Sbjct: 26  LAKLSQSLAGGEFYEAHMMYRTLYFRYTAQKRYQDCLNLLFDGAQQLIAKEQESSAADLC 85

Query: 78  VSFVEALVK-GKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  ++ L K G  A D +N             + VP        L  L   L A+ +  E
Sbjct: 86  LLLLDTLEKRGPQAEDTDNF------------LWVPR-------LGALIRGLNASTVERE 126

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
              + ++  IKWS      + G P LH ++A   ++E   ++ AR   H++   +     
Sbjct: 127 ---TLIQRTIKWSTALHG-QYGHPVLHKLIAHVFWTEG-NIESAR--HHYLLCQDGSLCG 179

Query: 197 STIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ---PS 252
             ++     + + GE DL + +A+L  LSL + K A     E  +    K L+ +     
Sbjct: 180 RVLIEISQSRGFQGEVDLFLVQAVLQQLSLKDRKTAEDTFTEYTR-YHGKLLKHEYPFQE 238

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK----RRN 308
            L+ F++++ + ++   +  F  LR  Y  S++R+ +F + +  I   +F  +       
Sbjct: 239 PLVNFLYFLFRLIDAKLVAGFRALRKLYDPSLKRDTSFLKYVAKIGVLYFDEEPEPAHSG 298

Query: 309 P--MQGIFGDIFKMM 321
           P  + G+FGDIF  +
Sbjct: 299 PPGLGGMFGDIFNRL 313


>gi|358375661|dbj|GAA92240.1| DUF410 domain protein [Aspergillus kawachii IFO 4308]
          Length = 347

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 85/326 (26%), Positives = 135/326 (41%), Gaps = 54/326 (16%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQL----KNNQLTCGAELA 77
           EKI + G +Y A Q  + I+ARY+    Y  A ++L  GA   L    +      G +LA
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQANYDAAAEILAGGAAALLKAGSQQGASASGGDLA 72

Query: 78  VSFVEALVKGKVAY-------DNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGA 130
           +  V   V  K  +       D E   R +++ EL  + P            E T     
Sbjct: 73  IMLV-VEVYTKAGWEITGGNDDAEGRARKKRLIELLHEFPP----------EEPTR---- 117

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN 190
                     F++  I WS  FG  + G PELH   A  +Y+E  E   A     F    
Sbjct: 118 --------KRFIQEMIGWSGRFGPLERGDPELH-HAAGSVYAEDHEPYDAEKHLIFGTSE 168

Query: 191 NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQ 248
           + E  A     +     P    L  +RA+  YL +GN++ AN  +++   K    N  L 
Sbjct: 169 SAETLAKLEFEWYTHDEPHTAALYASRAVFPYLLIGNLRSANKAFLIFTSKLSSTNPSLG 228

Query: 249 LQ-----PSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLD 295
           +Q      SD        L+ F+  +L T++R +  L+  L A+Y + I     +++ L 
Sbjct: 229 VQDVSSASSDVRVFPALPLLNFISMLLLTIQRGSSDLYKQLTAHYASHIRDVGIWDDALS 288

Query: 296 DIAEKFFGVK---RRNPMQGIFGDIF 318
            I E++F +K   + NP+  + G +F
Sbjct: 289 QIGEQYFAIKIPRQSNPLLDMMGSMF 314


>gi|325095004|gb|EGC48314.1| DUF410 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 339

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 77/315 (24%), Positives = 125/315 (39%), Gaps = 50/315 (15%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC-G 73
           ++ I + ++ +  G +Y + Q  + I+ARY+  Q Y  A D+L  GA   L+    +  G
Sbjct: 6   EKTIARQQEKIAAGAYYESHQQLRVIAARYLKQQNYDAAADILAGGAKALLRVPAASASG 65

Query: 74  AELAVSFVEALVKGKVAYD--NENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALG 129
            +LA+  +   V  KV +D   + + R RK  + EL  + P P        ++EL     
Sbjct: 66  GDLAIMLI-IDVYNKVEWDVGGDEVGRARKKRLIELLREFP-PEEPTRKRYMNEL----- 118

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                           I WS  FG  + G PE+H  +      E+   D  R   H   G
Sbjct: 119 ----------------IAWSARFGKLERGDPEIHHEIGVLFAQENEPYDAER---HLAIG 159

Query: 190 N--NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---------YIMDEV 238
              + E  A    ++     P       +RA+  YL  GN++DAN          +  E 
Sbjct: 160 TSASAEILADLEYDWYTNDEPHTAGTYASRAVFPYLLTGNLRDANKAFSVFTSRLLSSES 219

Query: 239 KKQVENKQLQLQPSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
            K +  +Q+  Q +D        ++ F   +L  ++R    LF  L   Y   I+     
Sbjct: 220 SKVLNIQQVSSQSADVRVFPSLPILNFTSLLLLAIQRGTTDLFRQLTRQYAIHIKEVDGL 279

Query: 291 NEMLDDIAEKFFGVK 305
              L  I E +FG+K
Sbjct: 280 ESALAHIGELYFGIK 294


>gi|225554297|gb|EEH02597.1| DUF410 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 339

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 79/323 (24%), Positives = 128/323 (39%), Gaps = 53/323 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC-G 73
           ++ I + ++ +  G +Y + Q  + I+ARY+  Q Y  A D+L  GA   L+    +  G
Sbjct: 6   EKTIARQQEKIAAGAYYESHQQLRVIAARYLKQQNYDAAADILAGGAKALLRVPAASASG 65

Query: 74  AELAVSFVEALVKGKVAYD--NENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALG 129
            +LA   +   V  KV +D   + + R RK  + EL  + P P        ++EL     
Sbjct: 66  GDLATMLI-IDVYNKVEWDVGGDEVGRARKKRLIELLREFP-PEEPTRKRYMNEL----- 118

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                           I WS  FG  + G PE+H  +      E+   D  R   H   G
Sbjct: 119 ----------------IAWSARFGKLERGDPEIHHEIGVLFAQENEPYDAER---HLAIG 159

Query: 190 N--NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---------YIMDEV 238
              + E  A    ++     P       +RA+  YL  GN++DAN          +  E 
Sbjct: 160 TSASAEILADLEYDWYTNDEPHTAGTYASRAVFPYLLTGNLRDANKAFSVFTSRLLSSES 219

Query: 239 KKQVENKQLQLQPSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
            K +  +Q+  Q +D        ++ F   +L  ++R    LF  L   Y   I+     
Sbjct: 220 SKVLNVQQVSSQSADVRVFPSLPILNFTSLLLLAIQRGTTDLFRQLTRQYAIHIKEVDGL 279

Query: 291 NEMLDDIAEKFFGV---KRRNPM 310
              L  I E +FG+   K+ NP+
Sbjct: 280 ESALAHIGELYFGIKIPKQSNPL 302


>gi|302854459|ref|XP_002958737.1| hypothetical protein VOLCADRAFT_121754 [Volvox carteri f.
           nagariensis]
 gi|300255912|gb|EFJ40193.1| hypothetical protein VOLCADRAFT_121754 [Volvox carteri f.
           nagariensis]
          Length = 380

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 45/148 (30%), Positives = 75/148 (50%), Gaps = 8/148 (5%)

Query: 11  LPPAQE-NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQ 69
           +PP+ E  + ++ + V  G +Y AQQMYK+   R  A Q++++A+ +L  GA  QL + Q
Sbjct: 1   MPPSVEKTLARIRESVESGAYYEAQQMYKTSYHRSKARQQFADAVAILQEGAVTQLAHGQ 60

Query: 70  LTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEAL- 128
            TCG EL +  +EA V      D + + RV  I   F   P+      D +  + T +  
Sbjct: 61  PTCGVELGMLLIEAYVAASTPADADTVSRVLAIIRAF---PLTARTSADANQGQATSSSS 117

Query: 129 ---GAAKLRVEGCSSFLKAAIKWSIEFG 153
                A   V+  +  + AA+KW+ ++G
Sbjct: 118 SSQAVASPVVDEYARLVAAAVKWANKYG 145



 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 34/153 (22%), Positives = 64/153 (41%), Gaps = 8/153 (5%)

Query: 162 LHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILM 221
           +H   A YL+       + R   HF+RG + E+FA  + +   +   GE DL + RA+L 
Sbjct: 208 IHSAFASYLWRSFGTDGIGRALPHFIRGEDAEEFAEALYSCARQGPSGEADLWLLRAVLQ 267

Query: 222 YLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALP-------LFN 274
            L+  +   +   M   +   +    +  P D     F  L  L  +  P       +F 
Sbjct: 268 VLAAASGSSSTAQMSYARALYDAFIAKDSPLDTPTGHFTELVLLAVEHTPISPRAHDMFA 327

Query: 275 MLRANYKASI-EREPAFNEMLDDIAEKFFGVKR 306
           ++R  Y   +  R+ +   +++ +   FF ++R
Sbjct: 328 LVRERYSDGVLRRDESLGALMERVEGLFFNMRR 360


>gi|413947456|gb|AFW80105.1| hypothetical protein ZEAMMB73_498554 [Zea mays]
          Length = 78

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 29/56 (51%), Positives = 43/56 (76%)

Query: 265 LERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKM 320
           LERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++R+NP++G+F ++FK+
Sbjct: 23  LERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYNIQRQNPLEGLFSEMFKI 78


>gi|414876208|tpg|DAA53339.1| TPA: hypothetical protein ZEAMMB73_364896 [Zea mays]
          Length = 129

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/72 (44%), Positives = 48/72 (66%)

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
           L PS L       +  LERDA PLF  LR  YK S +R+  F E+LD+IA KF+ ++R+N
Sbjct: 58  LVPSTLAFVTCDTVYRLERDAYPLFRTLRQKYKTSTDRDAVFQELLDEIAAKFYNIQRQN 117

Query: 309 PMQGIFGDIFKM 320
           P++G+F ++F++
Sbjct: 118 PLEGLFSEMFRI 129


>gi|145250101|ref|XP_001396564.1| hypothetical protein ANI_1_48134 [Aspergillus niger CBS 513.88]
 gi|134082077|emb|CAK42194.1| unnamed protein product [Aspergillus niger]
 gi|350636054|gb|EHA24414.1| hypothetical protein ASPNIDRAFT_40314 [Aspergillus niger ATCC 1015]
          Length = 349

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 84/330 (25%), Positives = 135/330 (40%), Gaps = 62/330 (18%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALD--------LLHAGACLQLKNNQLTCG 73
           EKI + G +Y A Q  + I+ARY+    Y  A +        LL AG+    +      G
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQANYDAAAELLAGGATALLKAGS----QQGASASG 68

Query: 74  AELAVSFVEALVKGKVAY-------DNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTE 126
            +LA+  V   V  K  +       D E   R +++ EL  + P            E T 
Sbjct: 69  GDLAIMLV-VEVYTKAGWEITGGNDDAEGRARKKRLIELLHEFPP----------EEPTR 117

Query: 127 ALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHF 186
                         F++  I WS  FG  + G PELH   A  +Y+E  E   A     F
Sbjct: 118 ------------RRFIQEIIGWSGRFGPLERGDPELH-HAAGSVYAEDHEPYDAEKHLIF 164

Query: 187 VRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
               + E  A     +     P    +  +RA+  YL +GN++ AN  +++   K    N
Sbjct: 165 GTSESAETLAKLEFEWYTHDEPHTAAIYASRAVFPYLLIGNLRSANKAFLIFTSKLSSTN 224

Query: 245 KQLQLQ-----PSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
             L +Q      SD        L+ F+  +L T++R +  LF  L A+Y + I+    ++
Sbjct: 225 PSLGVQDVSSASSDVRVFPALPLLNFISMLLLTIQRGSSDLFKQLTAHYASHIKDVGIWD 284

Query: 292 EMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
           + L  I E++F +K   + NP+  + G + 
Sbjct: 285 DALSQIGEQYFAIKVPRQSNPLLDMMGSML 314


>gi|346319942|gb|EGX89543.1| protein family UPF0363 [Cordyceps militaris CM01]
          Length = 354

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 139/324 (42%), Gaps = 30/324 (9%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +L++ + +G++Y AQQ  +  ++R++  Q +  A+D+L + A   LK  Q   G +L 
Sbjct: 9   VARLQQKIADGDYYEAQQQTRVAASRHIKTQNWPAAIDILSSVAQSLLKAGQGGSGGDLC 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALG-AAKLRV- 135
              V+   + ++  D     R+     LF       + E+      +T  L    KLRV 
Sbjct: 69  NMLVDVYKQAELKPDATTKGRLLTALRLF-------DGEEPTRKKFITGMLAYVVKLRVF 121

Query: 136 --EGCSSFL-KAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNP 192
              G  S L K   +WS +FG    G PELH +      +E    +  +      R +  
Sbjct: 122 KRIGVFSMLIKYTSRWSAKFGEFPAGDPELHHVAGSLYAAEHETYEAEKHLLLGTRDSAE 181

Query: 193 EKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ 250
             F      +     P    L  ARA+L YL +GN++ AN  Y +       ++  L +Q
Sbjct: 182 VLFKMEYAWYKESDAPHTAPLFAARAVLPYLLVGNVRAANACYRLFASALSADHPGLGVQ 241

Query: 251 -----PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
                 SD+  F        +  +L  ++R A  ++  L   Y A I+    + E L+ +
Sbjct: 242 DVATASSDVRVFPSLPLLNFLGLLLLAVQRGAPEVYRGLVHKYAAQIDEAGTWAEALEMV 301

Query: 298 AEKFFGVKR---RNPMQGIFGDIF 318
           AE +FG+++    NP+  +   +F
Sbjct: 302 AEMYFGIQKPRQSNPLMDMMSGLF 325


>gi|56754415|gb|AAW25395.1| SJCHGC06259 protein [Schistosoma japonicum]
 gi|226489474|emb|CAX75881.1| hypothetical protein [Schistosoma japonicum]
          Length = 326

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 78/319 (24%), Positives = 140/319 (43%), Gaps = 40/319 (12%)

Query: 9   TALPPAQENID-KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           + + P    +D +L+  VNE  FY A Q++K+I  R V  + Y EAL  L +G+   L N
Sbjct: 2   STVEPKLSKLDQRLKAAVNERRFYEAHQLFKTIHFRCVMKKNYQEALKYLKSGSKFLLDN 61

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
            Q   G +LA   ++   + +      +LD V ++      I +    E    +H+  + 
Sbjct: 62  EQTESGTDLACLVLDVYSQSETVLTESHLDEVCELL-----IRMCPGEERTRFIHKTLQL 116

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
           L   K  + G + +L A + W             L +ML+   Y+    L    V+ H  
Sbjct: 117 LQKQKDLLAGFNGYL-ARVLWQ----EGSFSEARLRIMLSSNGYAAGIFL----VALHQR 167

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQL 247
            G                    E DL +A+A+L +L +     A  I+        + +L
Sbjct: 168 YG-----------------LRSEIDLFIAQAVLQFLCMKQTSVA--ILTFYTYTRRHPRL 208

Query: 248 QLQP--SD--LIQFVFYVLQTLERD-ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
           +  P  +D  L+ F+++++  +E+  +L +F++L   Y   + R+ ++ + LD I + +F
Sbjct: 209 EPGPPFTDFPLLNFLWFLMLAIEKKCSLAVFSVLCEKYSPQLNRDSSYLKYLDKIGQLYF 268

Query: 303 GVK-RRNPMQGIFGDIFKM 320
           GVK   + M G F  I +M
Sbjct: 269 GVKPPTSEMNGFFSRIMQM 287


>gi|367037697|ref|XP_003649229.1| hypothetical protein THITE_2107678 [Thielavia terrestris NRRL 8126]
 gi|346996490|gb|AEO62893.1| hypothetical protein THITE_2107678 [Thielavia terrestris NRRL 8126]
          Length = 333

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 86/329 (26%), Positives = 137/329 (41%), Gaps = 60/329 (18%)

Query: 18  IDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           I +L+K + EG F   Y A Q  + ++ARY     ++ A+D+L   +   L+  Q   G 
Sbjct: 10  IARLQKRIAEGAFEEQYEAAQETRLVAARYAKQANWAAAIDILSRVSQALLRAGQGGSGG 69

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKL 133
           +LAV  V+   +     D  +  R+     LF P  P+                      
Sbjct: 70  DLAVLLVDVYRQAAQRPDGGSRGRLLTCLRLFEPGEPMRKK------------------- 110

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
                  F+K  I WS +FG    G PELH ++      E    +  R   H + G    
Sbjct: 111 -------FIKEMIDWSKKFGDYPAGDPELHHVIGSLYAEEHQAEEAER---HLILGT--- 157

Query: 194 KFASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQ 246
           K ++ +++ M   +  EDD     L  ARA+L YL + N++ AN  Y       + +NK 
Sbjct: 158 KDSTEVLSRMEYEWYKEDDSHTAPLYCARAVLPYLLVANVRAANASYRAFVSALEADNKN 217

Query: 247 LQLQ------------PSDLIQFVFYVLQTLERDALP-LFNMLRANYKASI-EREPAFNE 292
           L +Q            PS  +     +L    +   P LF  L+  Y A+I E    ++ 
Sbjct: 218 LAVQDVSSQSADIHVFPSLPLLNFLGLLLLAVQKGTPDLFRQLKTKYAANISELGGVWDT 277

Query: 293 MLDDIAEKFFGVKR---RNPMQGIFGDIF 318
            L+ IAE +FG++R    NP+  + G +F
Sbjct: 278 PLEMIAEMYFGIQRPRQSNPLFDMMGSLF 306


>gi|255931517|ref|XP_002557315.1| Pc12g04460 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211581934|emb|CAP80073.1| Pc12g04460 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 342

 Score = 70.9 bits (172), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 79/329 (24%), Positives = 139/329 (42%), Gaps = 64/329 (19%)

Query: 18  IDKL-----EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NN 68
           IDK+     EKI + G +Y A Q  + I+ RY+    Y  A ++L  GA   LK      
Sbjct: 5   IDKVIARQREKIAD-GAYYEAHQQLRVIAVRYIKQSNYDAAAEILAGGAKELLKAGAQQG 63

Query: 69  QLTCGAELAVSFVEAL---VKGKVAYDNENLD---RVRKIYELFPQIPVPHNLEDDDDLH 122
               G +LA+  V  +    + ++A  N++ +   R +++ EL  + P       D+   
Sbjct: 64  ASASGGDLAIMLVLEVYNKAEWEIAGGNDDAEGRARKKRLIELLREFP------SDEPTR 117

Query: 123 ELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARV 182
           +                 F++  + WS +FG  + G P+LH  +      ++   D  + 
Sbjct: 118 K----------------RFIQEMVGWSAKFGPLERGDPDLHHAVGSVYAEDNEPYDAEK- 160

Query: 183 SFHFVRGNNPEKFASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDAN--YIMD 236
             H + G +  + A T+     + Y    P    +  +RA+  YL  GN++ AN   ++ 
Sbjct: 161 --HLILGTS--ESAETLAKLEYEWYTNDEPHTAAIYASRAVFPYLLTGNIRSANKALLVF 216

Query: 237 EVKKQVENKQLQLQ-----PSD--------LIQFVFYVLQTLERDALPLFNMLRANYKAS 283
             +    N QL +Q      SD        L+ F+  +L  ++R +  LF  L  +Y A 
Sbjct: 217 TSRLSAANPQLGVQDVSSASSDARVFPALPLMNFITMLLLAIQRGSADLFRQLTGHYAAQ 276

Query: 284 IEREPAFNEMLDDIAEKFFGVKRRNPMQG 312
           I     +++ L  I E++F +K   P QG
Sbjct: 277 IREVGIWDDALAQIGEQYFAIK--IPRQG 303


>gi|194743644|ref|XP_001954310.1| GF16811 [Drosophila ananassae]
 gi|190627347|gb|EDV42871.1| GF16811 [Drosophila ananassae]
          Length = 337

 Score = 70.9 bits (172), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 80/328 (24%), Positives = 142/328 (43%), Gaps = 48/328 (14%)

Query: 9   TALPPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNN 68
           TA       + KL + +  G FY A  MY+++  RY A +RY + LDLL+ GA   +   
Sbjct: 17  TAQRGVSRVLAKLAQSLAGGEFYEAHMMYRTLYFRYTAQKRYQDCLDLLYDGAQQLMAKG 76

Query: 69  QLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLH-----E 123
           Q +  A+L +  ++ L K                     + P P    +D DL       
Sbjct: 77  QESSAADLCLLLLDTLEK---------------------RGPQP----EDSDLWIARWGG 111

Query: 124 LTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVS 183
           L   L AA +  E   + ++  IKWS      + G P LH ++    ++E   ++ AR  
Sbjct: 112 LIGGLSAATVERE---TVIQRTIKWSTALHG-QYGHPVLHKLIGHVFWTEG-NVEAAR-- 164

Query: 184 FHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV 242
            H++   +       ++     + +  E DL +A+A+L  LSL + K A     E  +  
Sbjct: 165 HHYLLCQDGSLSGRVLIEISQSRGFQSEVDLFLAQAVLQQLSLKDRKTAEDTFTEYTRS- 223

Query: 243 ENKQLQLQ---PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAE 299
            +K L+ +      LI F++++ + ++   +  F  LR  Y  S++R+ +F + +  I  
Sbjct: 224 HSKLLRHEFPYKEPLINFLYFLFRLIDAKRVAGFRALRKLYDPSLKRDTSFLKYVAKIGV 283

Query: 300 KFFGVKRR------NPMQGIFGDIFKMM 321
            +F  +          + G+FGDIF  +
Sbjct: 284 IYFDEQPEAANVGPTGLGGMFGDIFNRL 311


>gi|307103670|gb|EFN51928.1| hypothetical protein CHLNCDRAFT_139579 [Chlorella variabilis]
          Length = 309

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 44/131 (33%), Positives = 62/131 (47%), Gaps = 16/131 (12%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + KL   +  G FY A +M+K++  RY    ++ E+  L   GA LQL+  QL CG EL 
Sbjct: 4   LRKLAASLAAGQFYEAHEMFKTVYYRYRTRNQHEESYQLCQEGARLQLRQGQLNCGVELC 63

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  VEA      A   E + RV  + + FP+   P   ED                 V+ 
Sbjct: 64  MLLVEAYAADGAAPSEEAVARVAALIDAFPRQGRP-PAEDPP---------------VQE 107

Query: 138 CSSFLKAAIKW 148
           C+SF+ AAIKW
Sbjct: 108 CASFVSAAIKW 118


>gi|335283962|ref|XP_003124291.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Sus scrofa]
          Length = 223

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 71/125 (56%), Gaps = 6/125 (4%)

Query: 184 FHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ- 241
           +HF+  ++ E  A+ +V +   + +  E D+ VA+A+L +L L N   A+ +     ++ 
Sbjct: 61  YHFLHSSDGEGCANMLVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKH 120

Query: 242 --VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAE 299
             +EN    +QP  L+ F++++L  L+   L +F +L   Y+ S+ R+P +NE LD I +
Sbjct: 121 PSIENGPPFVQP--LLNFIWFLLLALDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQ 178

Query: 300 KFFGV 304
            FFGV
Sbjct: 179 LFFGV 183


>gi|336259482|ref|XP_003344542.1| hypothetical protein SMAC_07550 [Sordaria macrospora k-hell]
 gi|380093256|emb|CCC08914.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 342

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/332 (27%), Positives = 141/332 (42%), Gaps = 58/332 (17%)

Query: 15  QENIDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           Q+ I +L+K + EG F   Y A Q  + ++ARY  A+ +  A+D+L + +   LK     
Sbjct: 10  QKIIARLQKRIAEGQFEEQYEAGQETRLVAARYTKAENWPAAIDILSSVSQALLKAGAGG 69

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LA+  V+   +GKV  D  +  R+     LF                E T      
Sbjct: 70  AGGDLAILLVDVYKQGKVKPDAASRGRLLTCLRLFA-------------FEEPTR----- 111

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F+K    WS +FG    G PELH ++   L +ES E D A    H + G  
Sbjct: 112 -------KKFVKEVFDWSRKFGDYPAGDPELHHVVGT-LAAESHETDEAEK--HLILGT- 160

Query: 192 PEKFASTIVNFMGKCY-----PGEDDLAVARAILMYLSLGNMKDANY---------IMDE 237
            ++ A  +     + Y     P    L  ARAIL YL L N++ AN            D 
Sbjct: 161 -KESAEVLTRMEYEWYKDSEQPHTAALFAARAILPYLLLANVRAANTSYRAFVSALTADN 219

Query: 238 VKKQVEN------KQLQLQPSDLIQFVFYVLQTLERDALP-LFNMLRANYKASI-EREPA 289
               V+N        +++ PS  +     +L    +   P LF  L+  Y +++ E   A
Sbjct: 220 TSLGVQNVESAHGSDIRIFPSLPLLNFLGLLLLAVQKGTPDLFRQLKTKYASNLAELGGA 279

Query: 290 FNEMLDDIAEKFFGVKR---RNPMQGIFGDIF 318
           ++  L+ IAE +FG++R    NP+  + G +F
Sbjct: 280 WDGALEMIAEMYFGIQRPRQSNPLLDMMGSLF 311


>gi|336472728|gb|EGO60888.1| hypothetical protein NEUTE1DRAFT_127669 [Neurospora tetrasperma
           FGSC 2508]
 gi|350294029|gb|EGZ75114.1| DUF410-domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 343

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 143/332 (43%), Gaps = 58/332 (17%)

Query: 15  QENIDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           Q+ I +L+K + EG F   Y A Q  + ++ARY  A+ +  A+D+L + +   LK     
Sbjct: 10  QKIIARLQKRIAEGQFEEQYEAGQETRLVAARYTKAENWPAAIDILSSVSQALLKAGAGG 69

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LA+  V+   +GKV  D  +  R+     LF             +  E T      
Sbjct: 70  AGGDLAILLVDVYKQGKVKPDAASRGRLLTCLRLF-------------EFEEPTR----- 111

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F+K    WS +FG    G PELH ++   L +E  E D A    H V G  
Sbjct: 112 -------KKFVKEVFDWSRKFGDYPAGDPELHHVVGT-LAAEKHETDEAEK--HLVLGT- 160

Query: 192 PEKFASTIVNFMGKCY-----PGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
            ++ A  +     + Y     P    L  ARAIL YL L N++ AN  Y         EN
Sbjct: 161 -KESAEVLTRMEYEWYKDSEQPHTAALFAARAILPYLLLANVRAANTSYRAFVSALTAEN 219

Query: 245 KQLQL------QPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASI-EREPA 289
             L +      Q SD+  F        +  +L  +++    LF  L+  Y +++ E   A
Sbjct: 220 SSLGVQNVESSQGSDIRIFPSLPLLNFLGLLLLAVQKGTPDLFRQLKTKYASNLAELGGA 279

Query: 290 FNEMLDDIAEKFFGVKR---RNPMQGIFGDIF 318
           ++  L+ IAE +FG++R    NP+  + G +F
Sbjct: 280 WDGALEMIAEMYFGIQRPRQSNPLLDMMGSLF 311


>gi|85103607|ref|XP_961556.1| hypothetical protein NCU01190 [Neurospora crassa OR74A]
 gi|18376278|emb|CAD21391.1| conserved hypothetical protein [Neurospora crassa]
 gi|28923103|gb|EAA32320.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 343

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 143/332 (43%), Gaps = 58/332 (17%)

Query: 15  QENIDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           Q+ I +L+K + EG F   Y A Q  + ++ARY  A+ +  A+D+L + +   LK     
Sbjct: 10  QKIIARLQKRIAEGQFEEQYEAGQETRLVAARYTKAENWPAAIDILSSVSQALLKAGAGG 69

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LA+  V+   +GKV  D  +  R+     LF             +  E T      
Sbjct: 70  AGGDLAILLVDVYKQGKVKPDAASRGRLLTCLRLF-------------EFEEPTR----- 111

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                    F+K    WS +FG    G PELH ++   L +E  E D A    H V G  
Sbjct: 112 -------KKFVKEVFDWSRKFGDYPAGDPELHHVVGT-LAAEKHETDEAEK--HLVLGT- 160

Query: 192 PEKFASTIVNFMGKCY-----PGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
            ++ A  +     + Y     P    L  ARAIL YL L N++ AN  Y         EN
Sbjct: 161 -KESAEVLTRMEYEWYKDSEQPHTAALFAARAILPYLLLANVRAANTSYRAFVSALTAEN 219

Query: 245 KQLQL------QPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASI-EREPA 289
             L +      Q SD+  F        +  +L  +++    LF  L+  Y +++ E   A
Sbjct: 220 SSLGVQNVESSQGSDIRIFPSLPLLNLLGLLLLAVQKGTPDLFRQLKTKYASNLAELGGA 279

Query: 290 FNEMLDDIAEKFFGVKR---RNPMQGIFGDIF 318
           ++  L+ IAE +FG++R    NP+  + G +F
Sbjct: 280 WDGALEMIAEMYFGIQRPRQSNPLLDMMGSLF 311


>gi|281340883|gb|EFB16467.1| hypothetical protein PANDA_013407 [Ailuropoda melanoleuca]
          Length = 144

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 42/146 (28%), Positives = 77/146 (52%), Gaps = 8/146 (5%)

Query: 176 ELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYI 234
           E +     +HF+   + E  A  +V +   + +  E D+ VA+A+L +L L N   A+ +
Sbjct: 1   EQNYCESRYHFLHSADGEGCAHMLVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVV 60

Query: 235 MDEVKKQ---VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
                ++   +EN    +QP  L+ F++++L  ++   L +F +L   Y+ S+ R+P +N
Sbjct: 61  FTTYTQKHPSIENGPPFVQP--LLNFIWFLLLAVDGGKLTVFTVLCEQYQPSLRRDPMYN 118

Query: 292 EMLDDIAEKFFGV--KRRNPMQGIFG 315
           E LD I + FFGV  K+ +   G+ G
Sbjct: 119 EYLDRIGQLFFGVPPKQTSSYGGLLG 144


>gi|430812801|emb|CCJ29811.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 359

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 141/329 (42%), Gaps = 62/329 (18%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           +NI K +K +NE  +Y A Q+ ++I  R+V A  Y +A+DLL++ +   L+  Q   G +
Sbjct: 7   KNILKFQKALNEQQYYEAHQLMRTIVNRHVKACEYDKAVDLLYSSSKSLLEIKQFASGGD 66

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLR 134
           LA+  ++     K+  D     R+  +  LF P  P    L  D                
Sbjct: 67  LALYMIKVYDTMKLRPDGPT--RIIDLLRLFCPDEPTRKRLIHD---------------- 108

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                     ++ WS ++G    G PELH  + + L  ES      +   H + G   + 
Sbjct: 109 ----------SLMWSGKYGDTAIGDPELHHEVGKILAYESSFFSAEK---HLILG-TADS 154

Query: 195 FASTIVNFMGKCYPGEDDLAVA-----RAILMYLSLGNMKDANYIMDEVKKQVENKQ--- 246
           F   +    G  Y   D L +A     RA+  YL + N+  A    + + + + +++   
Sbjct: 155 FGLFLSLLRG--YFDRDTLDMACFYTSRAVFPYLMIYNIACALRAYNTMARHIADRRSTS 212

Query: 247 ----------LQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLD 295
                     + L PS  L+ F+ +++ T +R +  LF  L+A+Y+ ++++  ++   LD
Sbjct: 213 TLVLSSSSADIILFPSLPLMNFLCFLILTCQRASADLFCTLKAHYEDALKQASSWKPSLD 272

Query: 296 DIAEKFFGVKRRNP--------MQGIFGD 316
            IA  +F +    P        M  +FG+
Sbjct: 273 KIASIYFNIHTGPPPNNTWATLMNQLFGN 301


>gi|347832294|emb|CCD47991.1| similar to DUF410 domain protein [Botryotinia fuckeliana]
          Length = 331

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 74/317 (23%), Positives = 130/317 (41%), Gaps = 56/317 (17%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L++   EG +Y A Q  + +++RY+ +Q Y  A D+L++ +   L+ +Q   G +LA
Sbjct: 9   IARLQEKTAEGTYYEAHQQTRVVASRYIKSQNYPAATDILYSVSLSLLQASQGGSGGDLA 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  +E      +  D+ +  R+  +   F P+ P                          
Sbjct: 69  LFLLEVYNTSALTCDSASKGRLFTLLRAFSPEEPSRKR---------------------- 106

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
               F+   I WS +  A   G PELH ++   L  E    D  R   H + G    K +
Sbjct: 107 ----FITETIAWSSKTSAYPAGDPELHHVIGSLLAEEGEAYDAER---HLILGT---KDS 156

Query: 197 STIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDANYIMDEVKKQVE-------- 243
           + ++  M   +  EDD     L  +RA+L YL +GN++DA   +     ++         
Sbjct: 157 AEVLAKMEYEWYLEDDSHTAALYASRAVLPYLLVGNVRDARKSLGIFTTRLREANAGLTV 216

Query: 244 ------NKQLQLQPSDLIQFVFYVLQTLERDALPL-FNMLRANYKASI-EREPAFNEMLD 295
                 N+ +++ PS  +     +L    +    L F  L   Y   + E    + E L 
Sbjct: 217 QELGGANEDIKIYPSIPLLNFLGLLLVAVQKGNSLDFKQLMKKYAGQMKEAGELWEEALT 276

Query: 296 DIAEKFFGVKRRNPMQG 312
            + E +FG++R  P QG
Sbjct: 277 SVGEMYFGIQR--PRQG 291


>gi|50550689|ref|XP_502817.1| YALI0D14344p [Yarrowia lipolytica]
 gi|49648685|emb|CAG81005.1| YALI0D14344p [Yarrowia lipolytica CLIB122]
          Length = 304

 Score = 68.6 bits (166), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 70/334 (20%), Positives = 134/334 (40%), Gaps = 64/334 (19%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           +  I + ++ +  G FY A Q  ++++ RYV  + Y++A+++L+ GA   LK  Q    +
Sbjct: 6   ERTIARNDEKIAAGEFYDAHQGIRTVANRYVKQKAYNDAIEVLYKGAESLLKAKQYASAS 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L +  +E     ++  D+ + +R+  + +LF             D+HE T         
Sbjct: 66  DLVLYILEVYTVAEIPVDSVSRNRIVSLIQLF-------------DVHEPT--------- 103

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                     A  WS +FG   +G   LH +    +++++ E   A  +       +   
Sbjct: 104 ---LVKISNDAETWSGKFGPVSSGDAILHNLFGT-VFAKADEGYEAEKNLLLGTKESAPI 159

Query: 195 FASTIVNFMG--KCYPGEDDLAVARAILMYLSLGNMKDANYIMD------EVKKQVENKQ 246
               + ++    K  P +  L ++R +L YL   N++DA           +VK   E   
Sbjct: 160 LGQLLFDWSAESKDAPQDAPLFLSRGVLGYLGTENIRDATVCAQTFLNAFDVKHNPETNN 219

Query: 247 LQLQPSDLIQFVFYVLQTLERDALPLFNMLR------------------ANYKASIEREP 288
           ++     +I            DA PLFN L+                    Y   + + P
Sbjct: 220 VEAAGDTVIT---------TYDAYPLFNFLQLLILACRSKNPETFKRLHTRYAQQLSQLP 270

Query: 289 AFNEMLDDIAEKFFGV---KRRNPMQGIFGDIFK 319
           A+N  L+ I   +FG+   ++ N ++ + G  FK
Sbjct: 271 AWNPALEKIGHVYFGIVPKRQGNMLEDLMGSFFK 304


>gi|441663155|ref|XP_003278748.2| PREDICTED: Golgi to ER traffic protein 4 homolog [Nomascus
           leucogenys]
          Length = 227

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 63/262 (24%), Positives = 109/262 (41%), Gaps = 78/262 (29%)

Query: 45  VAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYE 104
           ++  +++EA +L+++GA L   + Q    A+L++  +E+L K +V   +E L+ + K++ 
Sbjct: 1   MSQSKHAEARELMYSGALLFFSHGQQNSAADLSMLVLESLEKAEVEVADELLENLAKVFS 60

Query: 105 LF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH 163
           L  P  P                       RV    +F+  A+KWS   G+ K G P LH
Sbjct: 61  LMDPNSPE----------------------RV----AFVSRALKWS-SGGSGKLGHPRLH 93

Query: 164 VMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNF-MGKCYPGEDDLAVARAILMY 222
            +LA  L+ E    +     +HF+   + E  A+ +V +   + +  E D+ VA+A+L  
Sbjct: 94  QLLALTLWKEQ---NYCESRYHFLHSADGEGCANMLVEYSTSRGFRSEVDMFVAQAVL-- 148

Query: 223 LSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKA 282
                                                          L +F +L   Y+ 
Sbjct: 149 --------------------------------------------HGKLTVFTVLCEQYQP 164

Query: 283 SIEREPAFNEMLDDIAEKFFGV 304
           S+ R+P +NE LD I + FFGV
Sbjct: 165 SLRRDPMYNEYLDRIGQLFFGV 186


>gi|406606675|emb|CCH41899.1| hypothetical protein BN7_1438 [Wickerhamomyces ciferrii]
          Length = 315

 Score = 67.8 bits (164), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 70/326 (21%), Positives = 147/326 (45%), Gaps = 55/326 (16%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           P  Q  + + +  + EG++Y A Q  ++I+ R V A+ Y++A++LL+  A + LK++Q  
Sbjct: 11  PKLQRTLQRFKAKIAEGDYYEAHQTLRTIANRNVKAKAYTDAIELLYIAAQILLKSSQPA 70

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G++L    +E   + +   D+ +  ++ ++  LFP                 ++     
Sbjct: 71  TGSDLTSYLLEVYTESQTPVDSNSKSKIIQLISLFP-----------------SQEPTLK 113

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELH-VMLAEYLYSESPELDMAR-VSFHFVRG 189
           ++ +E  +        WSI+FG    G P LH V+ A++L  E+P  D+A     H + G
Sbjct: 114 QVSIEASN--------WSIKFGEYPFGDPFLHDVIGAKFL--ENPS-DLAYDAERHLLLG 162

Query: 190 NNPEKFASTIVNFMGKCY---PGEDD--LAVARAILMYLSLGNMKDANYIMDE-VKKQVE 243
            +     +   N + + Y   P  ++  L  +R +L YL + N+K A   + + +   + 
Sbjct: 163 TSAS--VTLYFNLIWEWYLEDPSLENAGLYASRIVLNYLFIENVKIAKEALTKFINNLIN 220

Query: 244 NKQLQLQPSDLIQFVFYVLQ-------------TLERDALPLFNMLRANYKASIEREPAF 290
           + +LQ +  + +   F++ +             T++      ++ L+ +Y   I  +   
Sbjct: 221 HGKLQHESIEELNEKFHIFESVPLLNLLQLTLITIQTKNSQYYSKLKNHYNQVI-TQSGL 279

Query: 291 NEMLDDIAEKFFGV---KRRNPMQGI 313
           N+  D + E +  +   K  NP+Q +
Sbjct: 280 NDSFDYLGELYLDIRVPKNINPLQSL 305


>gi|240277064|gb|EER40574.1| DUF410 domain-containing protein [Ajellomyces capsulatus H143]
          Length = 339

 Score = 67.4 bits (163), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 74/315 (23%), Positives = 122/315 (38%), Gaps = 50/315 (15%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC-G 73
           ++ I + ++ +  G +Y + Q  + I+ARY+  Q Y  A D+L  GA   L+    +  G
Sbjct: 6   EKTIARQQEKIAAGAYYESHQQLRVIAARYLKQQNYDAAADILAGGAKALLRVPAASASG 65

Query: 74  AELAVSFVEALVKGKVAYD--NENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALG 129
            +LA+  +   V  KV +D   + + R RK  + EL  + P            E T  + 
Sbjct: 66  GDLAIMLI-IDVYNKVEWDVGGDEVGRARKKRLIELLREFPP----------EEPTRKI- 113

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
                      ++   I WS        G P++H  +      E+   D  R   H   G
Sbjct: 114 -----------YMNELIAWSARIAILDRGYPDIHHEIGVLFAQENEPYDAER---HLAIG 159

Query: 190 N--NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---------YIMDEV 238
              + E  A    N+     P       +RA+  YL  GN++DAN          +  E 
Sbjct: 160 TSASAEILADLEYNWYTNDEPHTAGTYASRAVFPYLLTGNLRDANKAFSVFTSRLLSSES 219

Query: 239 KKQVENKQLQLQPSD--------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
            K +  +Q+  Q +D        ++ F   +L  ++R    LF  L   Y   I+     
Sbjct: 220 SKVLNIQQVSSQSADVRVFPSLPILNFTSLLLLAIQRGTTDLFRQLTRQYAIHIKEVDGL 279

Query: 291 NEMLDDIAEKFFGVK 305
              L  I E +FG+K
Sbjct: 280 ESALAHIGELYFGIK 294


>gi|169776921|ref|XP_001822926.1| hypothetical protein AOR_1_62114 [Aspergillus oryzae RIB40]
 gi|83771663|dbj|BAE61793.1| unnamed protein product [Aspergillus oryzae RIB40]
 gi|391874269|gb|EIT83179.1| hypothetical protein Ao3042_11584 [Aspergillus oryzae 3.042]
          Length = 346

 Score = 67.4 bits (163), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 84/334 (25%), Positives = 139/334 (41%), Gaps = 70/334 (20%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALD--------LLHAGACLQLKNNQLTCG 73
           EKI + G +Y A Q  + I+ARY+    Y  A +        LL AG+    +      G
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQANYDAAAELLAGGATALLRAGS----QQGASASG 68

Query: 74  AELAVSFV-EALVKGKVAY-----DNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
            +LA+  V E   K          D E   R +++ EL               LHE    
Sbjct: 69  GDLAIMLVIEVYTKAGWEITGNDDDTEGRARKKRLIEL---------------LHEFPSE 113

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPE-LDMARVSFHF 186
               K        F++  I WS  FG  + G PELH   A  +Y+E  E  D  +   H 
Sbjct: 114 EPTRK-------RFIQEMIGWSGRFGPLERGDPELH-HAAGSVYAEDHEPYDAEK---HL 162

Query: 187 VRGNNPEKFASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKK 240
           + G +  + A T+     + Y    P    +  +RA+  YL  GN++ AN  +++   + 
Sbjct: 163 ILGTS--ESAETLAKLEHEWYTNDEPHTAAIYASRAVFPYLLNGNLRSANKAFLIFTSRL 220

Query: 241 QVENKQLQLQ------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIERE 287
              N  + LQ            P+  L+ F+  +L T++R +  LF  L ++Y + I+  
Sbjct: 221 SSSNPSIPLQDVSSSSIDARVFPALPLLNFISMLLLTIQRGSADLFKQLTSHYASQIQEV 280

Query: 288 PAFNEMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
             +++ L  I E++F +K   + NP+  + G + 
Sbjct: 281 GIWDDFLAQIGEQYFSIKIPRQGNPLLDMMGSML 314


>gi|238494002|ref|XP_002378237.1| DUF410 domain protein [Aspergillus flavus NRRL3357]
 gi|220694887|gb|EED51230.1| DUF410 domain protein [Aspergillus flavus NRRL3357]
          Length = 363

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 81/331 (24%), Positives = 136/331 (41%), Gaps = 69/331 (20%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALD--------LLHAGACLQLKNNQLTCGAEL 76
           +  G +Y A Q  + I+ARY+    Y  A +        LL AG+    +      G +L
Sbjct: 33  IASGAYYEAHQQLRVIAARYIKQANYDAAAELLAGGATALLRAGS----QQGASASGGDL 88

Query: 77  AVSFV-EALVKGKVAY-----DNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGA 130
           A+  V E   K          D E   R +++ EL               LHE       
Sbjct: 89  AIMLVIEVYTKAGWEITGNDDDTEGRARKKRLIEL---------------LHEFPSEEPT 133

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPE-LDMARVSFHFVRG 189
            K        F++  I WS  FG  + G PELH   A  +Y+E  E  D  +   H + G
Sbjct: 134 RK-------RFIQEMIGWSGRFGPLERGDPELH-HAAGSVYAEDHEPYDAEK---HLILG 182

Query: 190 NNPEKFASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVE 243
            +  + A T+     + Y    P    +  +RA+  YL  GN++ AN  +++   +    
Sbjct: 183 TS--ESAETLAKLEHEWYTNDEPHTAAIYASRAVFPYLLNGNLRSANKAFLIFTSRLSSS 240

Query: 244 NKQLQLQ------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAF 290
           N  + LQ            P+  L+ F+  +L T++R +  LF  L ++Y + I+    +
Sbjct: 241 NPSIPLQDVSSSSIDARVFPALPLLNFISMLLLTIQRGSADLFKQLTSHYASQIQEVGIW 300

Query: 291 NEMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
           ++ L  I E++F +K   + NP+  + G + 
Sbjct: 301 DDFLAQIGEQYFSIKIPRQGNPLLDMMGSML 331


>gi|47212611|emb|CAF92817.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 150

 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/152 (30%), Positives = 76/152 (50%), Gaps = 26/152 (17%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           KL   V +G++Y A QMY+++  RY++  ++SEA +L++ GA L    NQ    A+L++ 
Sbjct: 24  KLRASVEKGDYYEAHQMYRTLYFRYMSQAKHSEARELMYNGALLFFSYNQQNSAADLSML 83

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
            +E L K K   ++E L+ + K++ L  Q   P  +                        
Sbjct: 84  VLEVLEKSKGKVEDEILECLVKLFSLMDQ-NSPERV------------------------ 118

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLY 171
           +F+  A+KWS   G+ K G P LH +LA  L+
Sbjct: 119 AFVSRALKWSTG-GSGKLGHPRLHQLLAVTLW 149


>gi|448532336|ref|XP_003870410.1| Get4 protein [Candida orthopsilosis Co 90-125]
 gi|380354765|emb|CCG24280.1| Get4 protein [Candida orthopsilosis]
          Length = 312

 Score = 66.2 bits (160), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 62/234 (26%), Positives = 100/234 (42%), Gaps = 52/234 (22%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + I + +  ++ G+FY A Q  ++I+ RYV A++Y EA DLL+ GA + ++N +    ++
Sbjct: 7   KTIQRFQSKIDSGSFYEAHQTLRTITNRYVKAKQYPEARDLLYQGANILIRNKEYASASD 66

Query: 76  LAVSFVEALVKGKVAYDNENL-----DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGA 130
           L +  ++        Y+ E L     D   K+ EL   +P      +D  L++L +    
Sbjct: 67  LIIYLIQ-------IYEEEGLKVTDKDAKLKLIELISNLP-----NNDPSLNDLAKTSVT 114

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDM---------AR 181
              + EGC +F                G  ELH +    L   + E ++         A 
Sbjct: 115 WSQKSEGCKNF----------------GDNELHHLFGSKLIQAAEEENIDHDQKVKLFAI 158

Query: 182 VSFHFVRGNNPEKFAS--TIVNFM---GKCYPGEDDLA-VARAILMYLSLGNMK 229
              H V G     F S    VNF+    K  P  D  A +AR +L Y  L N+K
Sbjct: 159 AELHLVLGT----FESVPVYVNFLVETAKANPDADSGALLARPVLNYAYLKNIK 208


>gi|358338767|dbj|GAA57316.1| Golgi to ER traffic protein 4 homolog [Clonorchis sinensis]
          Length = 560

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 72/314 (22%), Positives = 127/314 (40%), Gaps = 50/314 (15%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           +L+  + E  FY A Q+YK++  R +  + Y EAL  L++GA   L N Q   G ELA  
Sbjct: 244 RLQSAIEERRFYEAHQIYKTVYFRCITRRNYVEALKYLYSGADFLLTNKQWESGIELASM 303

Query: 80  FVEALVKGKV----AYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
            +E   K K+     + N+  D +RK+           N  +D                 
Sbjct: 304 ILEVYTKSKMELTEVHLNQMCDLLRKM-----------NPSEDR---------------- 336

Query: 136 EGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKF 195
              S+F++ A        +   G  E    LA   + E          F  +   N    
Sbjct: 337 ---SNFIQKATGLLNNHKSLLCGFNE---QLARIFWQEG---SFTEARFRIMLSFNGHAV 387

Query: 196 ASTIVNFMGK-CYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS-- 252
            S ++    +     E DL + +A+L +L +     A  ++        + +L+  P   
Sbjct: 388 GSFLIALHQRYGLRSEIDLFITQAVLQFLCMRQAPAA--VLTFYTYTRGHPRLEPGPPFT 445

Query: 253 --DLIQFVFYVLQTLERDA-LPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK--RR 307
              L+ FV++++  +ER   + +F++L   Y   I R+  +   LD I + +FG+K    
Sbjct: 446 RFPLLNFVWFLMLAIERKCTVSVFSLLCEKYSPQIRRDSTYLAYLDKIGQLYFGIKPASS 505

Query: 308 NPMQGIFGDIFKMM 321
             M  +F ++ KM+
Sbjct: 506 GGMNNLFSNLLKML 519


>gi|116199123|ref|XP_001225373.1| hypothetical protein CHGG_07717 [Chaetomium globosum CBS 148.51]
 gi|88178996|gb|EAQ86464.1| hypothetical protein CHGG_07717 [Chaetomium globosum CBS 148.51]
          Length = 331

 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 82/328 (25%), Positives = 138/328 (42%), Gaps = 58/328 (17%)

Query: 18  IDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           I +L+K + EG F   Y A Q  + ++ARY     +  A+D+L   +   L+  Q   G 
Sbjct: 9   IARLQKRIAEGAFEEQYEAAQETRLVAARYTKQANWPAAVDILSNVSQALLRAGQGGSGG 68

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +LA+  V+   +     D  +  ++     LF             +  E T         
Sbjct: 69  DLAILLVDVYRQAAQKPDGASRGKLLTCLRLF-------------EREEPTR-------- 107

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+K  I+WS +FG    G PELH ++      E    +  R   H + G    K
Sbjct: 108 ----KKFIKEMIEWSKKFGEFPAGDPELHHVVGALFAEEHEAEEAER---HLILGT---K 157

Query: 195 FASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQL 247
            ++ +++ M   +  ED+     L  ARA+L YL + N++ AN  Y         +NK L
Sbjct: 158 ESAEVLSRMEYDWYKEDESHTAPLYCARAVLPYLLVANVRAANASYRTFVSALAADNKGL 217

Query: 248 QL-----QPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASI-EREPAFNEM 293
            +     Q SD+  F        +  +L  +++    LF  L+  Y  +I E    ++  
Sbjct: 218 AVQDVSSQGSDVRIFPSLPLLNFLGLLLLAVQKGTPDLFRQLKTKYGPNINELGDTWDTA 277

Query: 294 LDDIAEKFFGVKR---RNPMQGIFGDIF 318
           L+ IAE +FG++R    NP+  + G +F
Sbjct: 278 LEMIAEMYFGIQRPRQSNPLFDMMGSLF 305


>gi|261187606|ref|XP_002620222.1| DUF410 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239594113|gb|EEQ76694.1| DUF410 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239608907|gb|EEQ85894.1| DUF410 domain-containing protein [Ajellomyces dermatitidis ER-3]
 gi|327354086|gb|EGE82943.1| DUF410 domain-containing protein [Ajellomyces dermatitidis ATCC
           18188]
          Length = 342

 Score = 64.3 bits (155), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 73/322 (22%), Positives = 126/322 (39%), Gaps = 51/322 (15%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC-G 73
           ++ I + ++ +  G +Y + Q  + I+ARY+  Q Y  A D+L  GA   L+    +  G
Sbjct: 6   EKTIARQQEKIAAGAYYESHQQLRVIAARYLKQQNYDAAADILAGGAKALLRAPGASASG 65

Query: 74  AELAVSFVEAL---VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGA 130
            +LA+  +  +    + +V  D     R +++ EL    P P        ++EL      
Sbjct: 66  GDLAIMLIIDVYNKAEWEVGRDEVGRGRKKRLIELLRDFP-PEEPTRKRYMNEL------ 118

Query: 131 AKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG- 189
                          + WS  FG  + G PE+H  +      E+   D  R   H   G 
Sbjct: 119 ---------------VAWSARFGELERGDPEIHHEIGSLFAQENEPYDAER---HLAIGT 160

Query: 190 -NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--------YIMDEVKK 240
            ++ E  A    ++     P    +  +RA+  YL  GN++DAN         +      
Sbjct: 161 ADSAEVLARLEYDWYTHDEPHTAGIYASRAVFPYLLTGNLRDANKAFLIFTSRLSSSGTS 220

Query: 241 QVEN-KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
           Q  N +Q+  Q +D+  +           +L  ++R    LF  L   Y   I     + 
Sbjct: 221 QALNIQQVSSQSADVRVYPSLPLLNFTSLLLLAIQRGTADLFRQLTRQYGMHIREVDGWE 280

Query: 292 EMLDDIAEKFFGV---KRRNPM 310
             L  I E +FG+   K+ NP+
Sbjct: 281 RALAYIGELYFGIKIPKQSNPL 302


>gi|444320481|ref|XP_004180897.1| hypothetical protein TBLA_0E03240 [Tetrapisispora blattae CBS 6284]
 gi|387513940|emb|CCH61378.1| hypothetical protein TBLA_0E03240 [Tetrapisispora blattae CBS 6284]
          Length = 309

 Score = 63.9 bits (154), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 78/328 (23%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFV 81
           EKI NE  FY A QM ++I+ RYV +++YS+A++L+  G+    K NQ   G +L +  +
Sbjct: 13  EKIQNE-EFYEAHQMLRTIANRYVHSKKYSDAIELIFKGSLSLFKANQSGSGVDLTLYLL 71

Query: 82  EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSF 141
           E     ++  D+E++ R+ +I ++ P          D DL ++   +             
Sbjct: 72  EVYKLAEIKVDDESVGRLIQILQVLPLT--------DPDLKDVATGMN------------ 111

Query: 142 LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVN 201
                 WS   G  K G   LH  L   L++ +   +  R      R ++ +++   + +
Sbjct: 112 -----NWSCNNGDSKFGDARLHDALGRALWNGNYVYEAERYWTLGTR-DSRDRYVKNLWD 165

Query: 202 F------MGKCYPGED-DLAVARAILMYLSLGNMKDA---------NYI----MDEVKKQ 241
           +             ED  L + R +  YL + N+  A         NYI    +D V K 
Sbjct: 166 WYLQSQEDSSSSDKEDVSLYLGRLVFNYLFVSNISYAIDSTKVFLNNYIDYKKIDNV-KH 224

Query: 242 VENKQLQL-------QPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEML 294
           ++    QL          + +Q +    QT E      FN L+  Y A   +   F   L
Sbjct: 225 IDKTDFQLILFDSNDSSLNFLQLLILACQTKEA---SYFNSLKNQYPA---QSNQFKNEL 278

Query: 295 DDIAEKFFGV---KRRNPMQ----GIFG 315
           + + +++FG+   K+ N +Q    G+ G
Sbjct: 279 EFLGQEYFGIVAPKQSNFLQDMMSGLLG 306


>gi|406861836|gb|EKD14889.1| hypothetical protein MBM_07100 [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 327

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 75/312 (24%), Positives = 129/312 (41%), Gaps = 47/312 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L+  + EG +Y AQQ  + ++ARYV ++ +  A+D+L   +   LK  Q   G +L 
Sbjct: 9   IVRLQGKITEGAYYEAQQQTRVVAARYVKSESWPAAIDILFNVSQSLLKAGQGGSGGDLG 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
           +  ++   +  +  D+ +  ++  +  LF             D  E T            
Sbjct: 69  LFLIDVYKQAGLNPDSSSKGKLLTLLRLF-------------DSEEPTR----------- 104

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG--NNPEKF 195
              F+   I WS + G    G PELH +       E    +  R   H   G  ++PE F
Sbjct: 105 -KKFIAEMIGWSAKCGEYPAGEPELHHVAGSLYAEEHEAYEAER---HLTLGTKDSPEVF 160

Query: 196 ASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV--ENKQLQLQ--- 250
           A     +  +       L  AR +  YL  GN++DA   +     ++   NK L +Q   
Sbjct: 161 AKLEYEWYTQDDSHTAALYAARVVFPYLLTGNVRDATKSLRLFTSRLVEANKGLAVQDVS 220

Query: 251 --PSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEK 300
              SD+  +        +  +L  +++ +  LF  L++ Y   I     ++E L  +AE 
Sbjct: 221 SSASDMRIYPSLPLLNFLGLLLLAVQKGSAGLFTQLKSKYAVHIREAGIWDEALTSVAEM 280

Query: 301 FFGVKRRNPMQG 312
           +FG+ R  P QG
Sbjct: 281 YFGIAR--PRQG 290


>gi|367024851|ref|XP_003661710.1| hypothetical protein MYCTH_2301453 [Myceliophthora thermophila ATCC
           42464]
 gi|347008978|gb|AEO56465.1| hypothetical protein MYCTH_2301453 [Myceliophthora thermophila ATCC
           42464]
          Length = 331

 Score = 63.9 bits (154), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 82/328 (25%), Positives = 140/328 (42%), Gaps = 58/328 (17%)

Query: 18  IDKLEKIVNEGNF---YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           I +L+K + EG F   Y A Q  + ++ARY     +S A+D+L   +   L+  Q   G 
Sbjct: 9   IARLQKRIAEGAFEEQYEAAQETRLVAARYTKQSNWSAAVDILSNVSQTLLRAGQGGSGG 68

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +LA+  V+   +     D  +  ++     LF         E  + + +           
Sbjct: 69  DLAILLVDVYRQAGQKPDGSSRGKLLTCLRLF---------ERGEPMRK----------- 108

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                 F+K  + WS + G    G PELH ++      E    +  R   H + G    K
Sbjct: 109 -----KFIKEMMDWSKKLGDFPAGDPELHHVVGAMYAEEHEAEEAER---HLILGT---K 157

Query: 195 FASTIVNFMGKCYPGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQL 247
            ++ +++ M   +  ED+     L  ARA+L YL + N++ AN  Y         +NK L
Sbjct: 158 ESAEVLSRMEYEWYKEDESHTAPLYCARAVLPYLLVANVRAANASYRAFVSALVADNKGL 217

Query: 248 QL-----QPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASI-EREPAFNEM 293
            +     Q SD+  F        +  +L  +++    LF  L+  Y A+I E   A++  
Sbjct: 218 AVQDVSSQGSDVRIFPSLPLLNFLGLLLLAVQKGHPDLFRQLKTKYAANINELGGAWDTA 277

Query: 294 LDDIAEKFFGVKR---RNPMQGIFGDIF 318
           L+ IAE +FG++R    NP+  + G +F
Sbjct: 278 LEMIAEMYFGIQRPRQSNPLFDMMGSLF 305


>gi|171693867|ref|XP_001911858.1| hypothetical protein [Podospora anserina S mat+]
 gi|170946882|emb|CAP73686.1| unnamed protein product [Podospora anserina S mat+]
          Length = 349

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/333 (23%), Positives = 137/333 (41%), Gaps = 53/333 (15%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQ----QMYKSISARYVAAQRYSEALDLLHA--GACLQL 65
           P   + I +L+  +  G  + AQ    Q  + ++ARY  +Q Y+ A+D+L +   + L+ 
Sbjct: 16  PKIAKIITRLQSKITPGMPFEAQYEAAQETRLVAARYTKSQNYTAAIDILSSVSQSLLKT 75

Query: 66  KNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELT 125
                  G +LA+  V+   +  +  D  +  RV     LF   P   N +         
Sbjct: 76  GPTGGGSGGDLAILLVDVYKQAGLKVDATSKGRVLTCLRLFD--PAEPNRK--------- 124

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFH 185
                          F+K  ++WS ++     G PELH ++   L  E   LD A    H
Sbjct: 125 --------------KFVKDVVEWSKKYSDYPAGDPELHHVIGSMLAEEKESLDEAE--RH 168

Query: 186 FVRG---NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV 242
            + G   ++P   A     +  +       L  ARA+L YL + N+K A         Q+
Sbjct: 169 LILGTTKDSPPVLAQLEYEWYKQDETHTAPLYCARAVLPYLLMANLKAATTCYKVFTSQL 228

Query: 243 --ENKQLQLQ--------PS-DLIQFVFYVLQTLERDALP-LFNMLRANYKASIER--EP 288
             EN  L +Q        PS  L+ F+  ++  +++   P +F  L+  Y  +I+   + 
Sbjct: 229 AAENPSLAVQDVGETKIFPSLPLMNFLGLLIVAVQKGNNPEVFRQLKGKYMPTIKEATDG 288

Query: 289 AFNEMLDDIAEKFFGVKR---RNPMQGIFGDIF 318
            ++  L+ I E +FG++R    NP+  + G   
Sbjct: 289 VWDTALELIGEMYFGIQRPRQSNPLFDMMGSFL 321


>gi|242823976|ref|XP_002488168.1| DUF410 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218713089|gb|EED12514.1| DUF410 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 341

 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 74/334 (22%), Positives = 132/334 (39%), Gaps = 59/334 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLT 71
           + I + ++ +  G++Y A Q  + I+ARY+    Y  A ++L  GA   L+         
Sbjct: 7   KTIARQQEKIAAGSYYEAHQQLRVIAARYLKQSNYDAAAEILTGGAKALLRAGSQQGASA 66

Query: 72  CGAELAVSFVEALVKGKVAY-----DNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTE 126
            G +LA+  V   V  K  +     D E+  R +++ EL  + P                
Sbjct: 67  SGGDLAIMLV-VEVYNKAEWEIGLDDAESRGRKKRLIELLREFPPDE------------- 112

Query: 127 ALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHF 186
                         ++   I WS +FG  + G PELH         E+   D  R   H 
Sbjct: 113 ---------PSRKRYITEIISWSSKFGPLERGDPELHHAAGSVYAEENEPYDAER---HL 160

Query: 187 VRGNNPEKFASTIVNFMGKCYPGED----DLAVARAILMYLSLGNMKDANYIM------- 235
           + G      A T+     + Y  +D     +  +RA++ YL   N+++AN  +       
Sbjct: 161 ILGTTDS--AETLAKLEFEWYTADDAHTAGIYASRAVIPYLLTANLRNANKALLIFTSRL 218

Query: 236 -----DEVKKQVENKQ--LQLQPSDLIQFVFYVLQ-TLERDALPLFNMLRANYKASIERE 287
                    +QV +    +++ PS  +     +L   ++R +  LF  L  +Y   IE  
Sbjct: 219 SAAYPSLASQQVSSTSADVRIYPSLPLLNFLSLLLIAVQRGSADLFRDLTRHYTPHIEEV 278

Query: 288 PAFNEMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
             +++ L  I E +F +K   + NP+  + G +F
Sbjct: 279 GIWDDALATIGEMYFNIKIPRQGNPLLDMMGSMF 312


>gi|367016985|ref|XP_003682991.1| hypothetical protein TDEL_0G04130 [Torulaspora delbrueckii]
 gi|359750654|emb|CCE93780.1| hypothetical protein TDEL_0G04130 [Torulaspora delbrueckii]
          Length = 306

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 69/310 (22%), Positives = 130/310 (41%), Gaps = 54/310 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q+ + + +  +  G++Y A Q  ++I+ RYV ++ Y +A+DL+  GA   LK  Q   G 
Sbjct: 9   QKTLQRFQDRIEAGDYYEAHQTLRTIANRYVRSKSYQDAIDLISHGAQSFLKAKQGGSGT 68

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKL 133
           +L    +E     +V  D+ ++ R+ ++  +  PQ P   NL+D                
Sbjct: 69  DLVFYLLEVYEVAEVKVDDVSVSRLVQLLVIIDPQEP---NLKD---------------- 109

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
                   + A   WSI+FG  K G P LH  +   L       +  R   + + G  P 
Sbjct: 110 -------VVTAMNNWSIKFGDYKFGDPYLHSAIGSVLIEGGFAYEAER---YLMLG--PH 157

Query: 194 KFASTIVNFMGKCYPGEDDLAV-----ARAILMYLSLGNMKDANYIMDE-VKKQVENKQL 247
           +  +  ++ +   +  EDD+       +R I  +L + N+  A    D+ +   +E    
Sbjct: 158 EAVNKYISLIWDWFYQEDDVETVGDFFSRPIFNFLFISNISYAYEARDKLLNLLIEKHSP 217

Query: 248 QLQPSDLIQFVFY-------------VLQTLERDALPLFNMLRANYKASIEREPAFNEML 294
           + Q ++   F  Y             ++ T +   + LF+ L+ +Y  S  +  A  E L
Sbjct: 218 KFQITEKNGFKMYYFDNFADLNFLQLIILTCQTKNVDLFHNLKEHYGGSSSKYAAELEFL 277

Query: 295 DDIAEKFFGV 304
               +++FG+
Sbjct: 278 ---GQEYFGI 284


>gi|260941229|ref|XP_002614781.1| hypothetical protein CLUG_05559 [Clavispora lusitaniae ATCC 42720]
 gi|238851967|gb|EEQ41431.1| hypothetical protein CLUG_05559 [Clavispora lusitaniae ATCC 42720]
          Length = 312

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 62/289 (21%), Positives = 120/289 (41%), Gaps = 46/289 (15%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           +  + + ++ ++  +FY A Q  ++I+ RYV +  YSEA+DLL  G+ +   N +    +
Sbjct: 6   KRTVARFQQRIDNKDFYEAHQTLRTIANRYVKSSHYSEAIDLLTQGSSILATNKEYGSAS 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    ++   +  +  DN   D   K+ EL   +P     + D  L +L          
Sbjct: 66  DLISYLIDVYSEAGIRCDNTAKDYKLKLVELISLLP-----DSDPSLGDLA--------- 111

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDM--------ARVSFHF 186
                   K A+ WS   G  K G   LH +    + + +  L +        A    H 
Sbjct: 112 --------KQALAWSKGMGGSKFGDSSLHHLFGTKMLAGAQTLSIEEDRYKMFALAELHL 163

Query: 187 VRGNNPEKFASTIVNFMGKCY-----PGEDDLAVARAILMYLSLGNMKDANYIMD----E 237
           + G          V+F+   +       +  +  +RA++ Y  L N+K A   +D    +
Sbjct: 164 ILGTYES--LPVYVDFLFSWWAESGKSADPGIFASRAVINYAYLKNIKFARLALDRFVAK 221

Query: 238 VKKQVENKQLQ----LQPSDLIQFVFYVLQTLER-DALPLFNMLRANYK 281
            K+   ++ ++       ++L+ F+  ++ TL + DA   F  L ++YK
Sbjct: 222 YKETASHETIEGLDFFADAELLNFLSLLVVTLGKFDASDKFMKLYSHYK 270


>gi|380478499|emb|CCF43563.1| hypothetical protein CH063_13229 [Colletotrichum higginsianum]
          Length = 223

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 93/194 (47%), Gaps = 29/194 (14%)

Query: 148 WSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCY 207
           WS +FG    G  ELH  +A  LY+E  E D      H V G    K ++ ++  M   +
Sbjct: 8   WSAKFGDYPAGDAELH-HVAGSLYAE--EHDAYEAERHLVLGT---KDSAEVLAKMEYVW 61

Query: 208 PGEDD-----LAVARAILMYLSLGNMKDAN--YIMDEVKKQVENKQLQLQ-----PSDLI 255
             ED+     L  ARAIL YL +GN++ AN  Y +       +NK L +Q      +DL 
Sbjct: 62  YKEDESHTAPLYAARAILPYLLIGNVRAANTCYRLFASSLSDDNKGLGVQDVSSNSADLR 121

Query: 256 QF--------VFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKR- 306
            F        +  +L  ++R A   +  L A Y ++I    A+NE L+ IAE +FG++R 
Sbjct: 122 IFPGLPLLNFLGLLLLAVQRGAPDAYKQLLAKYSSNIAEVGAWNEALEIIAEMYFGIQRP 181

Query: 307 --RNPMQGIFGDIF 318
              NP+  + G +F
Sbjct: 182 RQSNPLMDMMGSLF 195


>gi|45201361|ref|NP_986931.1| AGR265Wp [Ashbya gossypii ATCC 10895]
 gi|44986295|gb|AAS54755.1| AGR265Wp [Ashbya gossypii ATCC 10895]
 gi|374110181|gb|AEY99086.1| FAGR265Wp [Ashbya gossypii FDAG1]
          Length = 303

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 57/219 (26%), Positives = 93/219 (42%), Gaps = 35/219 (15%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           +D+  + V EG +Y A Q  ++++ RYV  +++ +A+DL+  G    +   Q +   +L 
Sbjct: 11  MDRFTEKVKEGEYYEAHQTLRTVANRYVRGKKWPQAIDLITHGIHTLMGVGQSSSVVDLT 70

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              +E   +G V    EN+ R+ KI        V   LED D             LR + 
Sbjct: 71  RYLLEVYEQGNVECTEENVGRLVKIL-------VGLELEDPD-------------LR-DV 109

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
           C+        WS+  G    G P LH ++ + L      ++  R   +FV GN       
Sbjct: 110 CTGM----NNWSVRHGRYTYGDPYLHSVIGKRLLEAGYLVEAER---YFVLGNRDS--LE 160

Query: 198 TIVNFMGKCYPGEDDLAV-----ARAILMYLSLGNMKDA 231
             VN++   Y      AV      R +  YL +GN+K+A
Sbjct: 161 EYVNWLWDWYQQLQQPAVVGEFLGRLVFGYLGVGNLKNA 199


>gi|256080092|ref|XP_002576317.1| hypothetical protein [Schistosoma mansoni]
 gi|350646018|emb|CCD59295.1| hypothetical protein Smp_050130 [Schistosoma mansoni]
          Length = 326

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 78/317 (24%), Positives = 132/317 (41%), Gaps = 59/317 (18%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           +L+  V+E  FY A Q+YK+I+ R +  + Y EAL  L  G+   L N Q   G ELA  
Sbjct: 14  RLKVAVSERRFYEAHQLYKTINFRCMMRKNYQEALKYLKLGSEFLLDNEQWESGTELACL 73

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIP----------VPHNLEDDDDLHELTEALG 129
            ++   + +      +LD + ++  L    P            H L+   DL  +     
Sbjct: 74  VLDVYSQSETVLTESHLDEICEL--LIRMCPGEERTRFVHKALHLLQKQKDLLAVFNGYL 131

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG 189
           A  L  EG  SF +A                 L +ML+   Y+    L    V+ H   G
Sbjct: 132 ARVLWQEG--SFSEA----------------RLRIMLSSNGYAAGVFL----VALHQRYG 169

Query: 190 NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQL 249
                               E DL +A+A+L +L +  +  A  I+        + +L+ 
Sbjct: 170 -----------------LRSEIDLFLAQAVLQFLCMKQISVA--ILTFYTYTRRHPRLEP 210

Query: 250 QPS----DLIQFVFYVLQTLERD-ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            P      L+ F+++++  +E+  +L +F++L   Y   + R+ ++ + LD I + +FGV
Sbjct: 211 GPPFADFPLLNFLWFLMLAIEKKCSLAVFSVLCEKYSPQLNRDSSYLKYLDKIGQLYFGV 270

Query: 305 K-RRNPMQGIFGDIFKM 320
           K   N M   F  I +M
Sbjct: 271 KPPANEMNNFFSRIMQM 287


>gi|344233754|gb|EGV65624.1| DUF410-domain-containing protein [Candida tenuis ATCC 10573]
          Length = 314

 Score = 62.8 bits (151), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 82/335 (24%), Positives = 146/335 (43%), Gaps = 60/335 (17%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAEL 76
            I + +  ++ G+FY A Q  ++I+ RY+ A++Y EA++LL+ G+ +  KN + T   +L
Sbjct: 8   TIQRFQAKIDSGSFYEAHQTLRTIANRYIKAKQYDEAINLLYQGSVILAKNKEYTSACDL 67

Query: 77  AVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
            V  +E   +     D++  D   KI E+   IP                          
Sbjct: 68  IVYLLEVYHESSKTIDDK--DAKDKIIEVISYIPSSD----------------------P 103

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLA-EYLYS--ESPELDMARVS------FHFV 187
           G +   K A++WS +  + K G  +LH +   ++L S  ES   D+ +        FH +
Sbjct: 104 GLADLSKRAVEWS-KNESNKFGDHDLHNLFGTKFLNSLGESKADDVEKRKIFSVGEFHSI 162

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDD---LAVARAILMYLSLGNMKDA----NYIMDEVKK 240
            GN+     +T  +F+ +     D+   + + RA++ Y  L N++ A    +  M ++ K
Sbjct: 163 LGNSSS--LNTYTDFLYQWAKSSDEDEGIFITRAVVNYAYLKNIEFAKECLSKFMAKLIK 220

Query: 241 QVENKQLQLQP---------SDLIQFVFYVLQTL-ERDALPLFNMLRANYKASIEREPAF 290
              N +   Q            +I FV  ++ TL ++DA   F  L   YKA +      
Sbjct: 221 DSPNYETLTQDHHTVYYYPQKSVINFVQLLVITLCKQDAGHKFLKLFDLYKADLTHYQLV 280

Query: 291 NEMLDDIAEKFFGVKRRNPM--QGIF----GDIFK 319
             + + +   +F +K  NP   Q +F    GD+FK
Sbjct: 281 GPV-EYLGRFYFNLKLGNPAANQNMFANLMGDLFK 314


>gi|326472471|gb|EGD96480.1| hypothetical protein TESG_03922 [Trichophyton tonsurans CBS 112818]
          Length = 341

 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 136/335 (40%), Gaps = 58/335 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D     R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDEIGKKRKQRLIELLREFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                             I WS +FG  + G P+LH  +      E+   D  +   H  
Sbjct: 122 ------------------ISWSSKFGGLERGDPDLHHAIGSTYAQENEPYDAEK---HLA 160

Query: 188 RGNNPEKFASTIVNFMGKCYP--GEDDLAV--ARAILMYLSLGNMKDANYI--------- 234
            G    + A T+ N   K Y   G+    +   RA+  YL  GN+ +AN           
Sbjct: 161 FGT--VESADTLANLEYKWYTYDGQHTAGIYCCRAVFPYLLTGNILNANKAFVVFTQLLS 218

Query: 235 MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIER 286
             E  K +  +++  Q +D+  +           +L T++R    LF  L   Y++ I+ 
Sbjct: 219 TSESGKSLGVQEVSSQQNDVRVYPSLPLLNFTNLLLLTIQRGTPDLFRQLLKQYQSHIQE 278

Query: 287 EPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
              ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 279 AGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 313


>gi|393229694|gb|EJD37312.1| DUF410-domain-containing protein [Auricularia delicata TFB-10046
           SS5]
          Length = 334

 Score = 61.6 bits (148), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 69/328 (21%), Positives = 123/328 (37%), Gaps = 52/328 (15%)

Query: 11  LPPAQENIDKLEKI---VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           +PPA      L  I   + EG+ Y A Q  ++ ++RY  + +++ A+++L   A   LK 
Sbjct: 1   MPPAPSANKALSAILPQIAEGHAYEAHQKARTFASRYTKSGQHAVAIEVLFQSARELLKT 60

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
            Q   G +LAV  ++A     V  D+E+  RV ++  L                      
Sbjct: 61  GQQGSGTDLAVMMLDAYDAADVGVDDESRGRVTQLIAL---------------------- 98

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
           +G  +   +   + +  A+ WS ++G    G P+LH  + E LY E       +   H +
Sbjct: 99  VGPGQTWRK---TVVDKAVAWSAKYGECPAGDPDLHHYIGELLYKEG---AFVQAEPHLL 152

Query: 188 RGN--NPEKFASTIVNFMGKCYPGED-DLAVARAILMYLSLGNMKDANYIMDEVKKQV-- 242
                +  +    ++    K     D  L  AR  L YL L N+  A   +         
Sbjct: 153 SAGKRDSARLLGQLLAEWSKAGSSSDLGLFAARGTLPYLELENILAARAFLRSFLATASL 212

Query: 243 ----------ENKQLQLQPSDLIQFVFYVLQTLERDALP----LFNMLRANY--KASIER 286
                        ++ L   +L+ F+   + TL+R   P     +  L   Y  K     
Sbjct: 213 PGTTALPVGDSGDEIVLTSDNLLNFLQLAVPTLQRAPSPRAREAWTRLVGTYSTKGGAIT 272

Query: 287 EPAFNEMLDDIAEKFFGVKRRNPMQGIF 314
            P   + + D++  +F ++   P    F
Sbjct: 273 HPEIRKAIQDLSATYFNIQPPRPAGNPF 300


>gi|115398518|ref|XP_001214848.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114191731|gb|EAU33431.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 344

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 78/330 (23%), Positives = 132/330 (40%), Gaps = 63/330 (19%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALD--------LLHAGACLQLKNNQLTCG 73
           EKI + G +Y A Q  + I+ARY+    Y  A +        LL AG+    +      G
Sbjct: 14  EKIAS-GAYYEAHQQLRVIAARYIKQANYDAAAELLAGGATALLRAGS----QQGASASG 68

Query: 74  AELAVSFVEAL-------VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTE 126
            +LA+  V  +       + GK   D E+  R +++ EL  + P            E T 
Sbjct: 69  GDLAIMLVLEVYNKSGWEISGKD--DAEDRSRKKRLIELLHEFPP----------EEPTR 116

Query: 127 ALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHF 186
                         +++  I WS   G  + G P+LH   A  +Y+E  E   A      
Sbjct: 117 ------------RRYIQEMIAWSGRCGPLERGDPDLH-HAAGTVYAEDHEPYDAEKHLIL 163

Query: 187 VRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIMDEVKKQVEN 244
               + E  A     +     P    +  +RA+  YL  GN++ AN  +++   +    N
Sbjct: 164 GTSESAETLAKLEYEWYTNDEPHTAAIYASRAVFPYLLTGNIRSANKAFLIFTSRLSSSN 223

Query: 245 KQLQLQ------------PS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
             L  Q            P+  L+ F+  +L T++R +  LF  L A Y + I+    ++
Sbjct: 224 PSLGSQVVSSASADVRVFPALPLLNFISMLLLTIQRGSADLFKQLTAQYASQIQEVGLWD 283

Query: 292 EMLDDIAEKFFGVK---RRNPMQGIFGDIF 318
           + L  I E++F +K   + NP+  + G + 
Sbjct: 284 DALAQIGEQYFAIKIPRQGNPLLDMMGSML 313


>gi|315044719|ref|XP_003171735.1| DUF410 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311344078|gb|EFR03281.1| DUF410 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 341

 Score = 61.2 bits (147), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 83/335 (24%), Positives = 139/335 (41%), Gaps = 58/335 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D    +R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDETGKERKQRLVELLQEFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                             I WS +FG  + G PELH  +      ++   D  +   H  
Sbjct: 122 ------------------ISWSSKFGDLERGDPELHHAIGSVYAQDNEPYDAEK---HLA 160

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYI--------- 234
            G    + A T+ N   K Y  +      +   RA+  YL  GN+ +AN           
Sbjct: 161 FGTV--ESAETLANLEYKWYTYDAQHTAGIYCCRAVFPYLLTGNILNANKAFLVFTRLLS 218

Query: 235 MDEVKK-----QVENKQ--LQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIER 286
             E  K     +V +KQ  +++ PS  L+ F   +L  ++R    LF  L   Y++ I+ 
Sbjct: 219 TSESGKALGVQEVSSKQNDVRVYPSLPLLNFTNLLLLAIQRGTPDLFRQLLKQYQSHIQE 278

Query: 287 EPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
              ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 279 AGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 313


>gi|156048198|ref|XP_001590066.1| hypothetical protein SS1G_08830 [Sclerotinia sclerotiorum 1980]
 gi|154693227|gb|EDN92965.1| hypothetical protein SS1G_08830 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 359

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 75/311 (24%), Positives = 130/311 (41%), Gaps = 57/311 (18%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
             EGN+Y A Q  + +++RY+ +Q Y  A D+L+  +   L+ +Q   G +LA+  +E  
Sbjct: 43  TTEGNYYEAHQQTRVVASRYIKSQNYPAATDILYNVSHSLLQASQGGSGGDLALFLLEVY 102

Query: 85  VKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLK 143
               +  D+ +  R+  +   F P+ P                    ++ R      F+ 
Sbjct: 103 NLSSLTCDSASKGRLFTLLRAFSPEEP--------------------SRKR------FIT 136

Query: 144 AAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFM 203
             + WS +      G PELH ++   L  E    D  R   H V G    K ++ ++  M
Sbjct: 137 EMVAWSSKTSLYPAGDPELHHVVGGLLAEEGEVYDAER---HLVLGT---KDSAEVLAKM 190

Query: 204 GKCYPGEDD-----LAVARAILMYLSLGNMKDANYIM---------DEVKKQVE------ 243
              +  ED+     L  +RA+L YL +GN++DA   +         D     V+      
Sbjct: 191 EYEWYREDESHTAALYASRAVLPYLLVGNVRDARKSLGIFGERLRGDNTGLTVQELGEGG 250

Query: 244 NKQLQLQPSDLIQFVFYVLQTLERDALPL-FNMLRANYKASI-EREPAFNEMLDDIAEKF 301
           N+ +++ PS  +     +L    +    L F  L   Y   + +    ++E L  IAE +
Sbjct: 251 NQDIKIYPSIPLLNFLGLLLVAVQKGNSLDFKQLTKKYAGHMKDTGELWDEALLGIAEMY 310

Query: 302 FGVKRRNPMQG 312
           FG++R  P QG
Sbjct: 311 FGIQR--PRQG 319


>gi|354543284|emb|CCE40002.1| hypothetical protein CPAR2_100410 [Candida parapsilosis]
          Length = 312

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 67/233 (28%), Positives = 102/233 (43%), Gaps = 50/233 (21%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + I + +  ++ G+FY A Q  ++I+ RYV A++YSEA DLL+ GA + +KN +    ++
Sbjct: 7   KTIQRFQSKIDAGSFYEAHQTLRTITNRYVKAKQYSEARDLLYQGADILIKNKEHASASD 66

Query: 76  LAVSFVEALVKGKVAYDNENL---DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
           L +  ++        Y+ E L   DR  K+  +     +P+N   D  L++L        
Sbjct: 67  LILYLIQ-------VYEEEGLKVTDREAKLKLIDLISSLPNN---DPSLNDLA------- 109

Query: 133 LRVEGCSSFLKAAIKWSIEF-GAPKTGSPELHVMLAEYLYS--ESPELD-------MARV 182
                     KA+I WS +  G  K G  ELH +    L    E   +D        A  
Sbjct: 110 ----------KASITWSQKSQGCEKFGDNELHHLFGSKLIQVVEEDNIDYEQKAKIFAVA 159

Query: 183 SFHFVRGNNPEKFAST--IVNFM---GKCYPGEDDLA-VARAILMYLSLGNMK 229
             H V G     F S    VNF+    K     D  A +AR +L Y  L N+K
Sbjct: 160 ELHLVLGT----FESVPIYVNFLVATAKANSNTDAGALLARPVLNYAYLKNIK 208


>gi|403218469|emb|CCK72959.1| hypothetical protein KNAG_0M01060 [Kazachstania naganishii CBS
           8797]
          Length = 320

 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 136/334 (40%), Gaps = 70/334 (20%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           P  + ++++  + V++G FY A Q  +++  RYV    Y+EA++L   GA L LK  Q  
Sbjct: 14  PQGKRSLERFSQRVSKGEFYEAHQALRAVINRYVNRAEYAEAVELAAGGAELLLKAGQGG 73

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            GA+L    +E      VA D+ ++ R+ ++  L            D  L ++   +   
Sbjct: 74  SGADLVFYLLEVYDVAGVAVDDASVARLVQLLVLLDAT--------DPTLKDVATGMN-- 123

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN- 190
                           WS++ G    G P LH  +A  L       +  R   + + G  
Sbjct: 124 ---------------NWSVKHGECPFGDPRLHNAVAAKLLEAGLVYESER---YMLLGTE 165

Query: 191 -NPEKFASTIVNFMGKCYPGEDDLA---VARAILMYLSLGNMKDA---------NYI--- 234
            + EK+   +  +      G D LA   V+R +  YL +GN+K A          Y+   
Sbjct: 166 ESLEKYVDLLYGWFED--AGSDALAGDFVSRLVFNYLFIGNVKFALRSSQQFLTRYVDSG 223

Query: 235 ----MDEVKKQV-----ENKQLQLQPSD-----LIQFVFYVLQTLERDALPLFNMLRANY 280
               +D + K +     E+    L  +D      +Q V     T  RD   LF  L+  Y
Sbjct: 224 AAGTVDTITKDIGGDSGESTFTMLYFADHMELNFLQLVLLTCMTKNRD---LFATLKQQY 280

Query: 281 KASIEREPAFNEMLDDIAEKFFGV---KRRNPMQ 311
            A   +E  ++  L  + +++FG+   K+ N +Q
Sbjct: 281 -ADCAKE--YSTQLQVLGQEYFGITAPKQANFLQ 311


>gi|326481693|gb|EGE05703.1| DUF410 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 341

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/335 (23%), Positives = 135/335 (40%), Gaps = 58/335 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D     R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDEIGKKRKQRLIELLREFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                             I WS +FG  + G P+LH  +      E+   D  +   H  
Sbjct: 122 ------------------ISWSSKFGGLERGDPDLHHAIGSTYAQENEPYDAEK---HLA 160

Query: 188 RGNNPEKFASTIVNFMGKCYP--GEDDLAV--ARAILMYLSLGNMKDANYI--------- 234
            G    + A T+ N   K Y   G+    +   RA+  YL  GN+ +AN           
Sbjct: 161 FGT--VESADTLANLEYKWYTYDGQHTAGIYCCRAVFPYLLTGNILNANKAFVVFTQLLS 218

Query: 235 MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIER 286
             E  K +  +++  Q +D+  +           +L T++R    LF  L   Y++ I+ 
Sbjct: 219 TSESGKSLGVQEVSSQQNDVRVYPSLPLLNFTNLLLLTIQRGTPDLFRQLLKQYQSHIQE 278

Query: 287 EPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
              ++  L  I E +FG+   K+  P+  + G  F
Sbjct: 279 AGEWDHALAHIGELYFGIKIPKQSYPLMDMMGMFF 313


>gi|150865772|ref|XP_001385119.2| hypothetical protein PICST_67811 [Scheffersomyces stipitis CBS
           6054]
 gi|149387028|gb|ABN67090.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 326

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 61/246 (24%), Positives = 106/246 (43%), Gaps = 43/246 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q ++ + +  ++ G FY A Q  ++I+ RYV A++Y EA+DLL+ G+ +  KN +    +
Sbjct: 7   QRSLQRFQSKIDGGAFYEAHQTLRTITNRYVKAKQYQEAIDLLYQGSSILSKNKEYASSS 66

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    ++  V+  V    +N D   K+ EL   +P     + D  L++L          
Sbjct: 67  DLISYLIQVFVESGVTV-ADNKDLKLKLIELITHLP-----DSDPSLNDLA--------- 111

Query: 135 VEGCSSFLKAAIKWSI-EFGAPKTGSPELHVMLA----EYLYSESPELD---------MA 180
                   K A+KWS  +    K G  +LH +       YL +++ E            A
Sbjct: 112 --------KQALKWSQGKDEKDKFGDVDLHHIFGAKYLNYLKNDNGETIKNDDERQKVFA 163

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYIMD 236
               H + G          VN++ +    +      L ++RA++ Y  L N+K AN   D
Sbjct: 164 VTELHLILGTYES--LPLYVNYLYQWAQVDTSIDAGLFLSRAVINYAYLKNIKFANEAKD 221

Query: 237 EVKKQV 242
              KQ+
Sbjct: 222 LFLKQL 227


>gi|255712761|ref|XP_002552663.1| KLTH0C10208p [Lachancea thermotolerans]
 gi|238934042|emb|CAR22225.1| KLTH0C10208p [Lachancea thermotolerans CBS 6340]
          Length = 303

 Score = 58.5 bits (140), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 71/308 (23%), Positives = 128/308 (41%), Gaps = 58/308 (18%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           +  G++Y A Q  ++I+ RYV ++ Y EA+DL+  GA   L   Q     +L    +E  
Sbjct: 18  IEAGDYYEAHQTLRTIANRYVRSKNYREAIDLITEGAQALLGAKQGGSATDLIFYLLEVY 77

Query: 85  VKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLK 143
              +V  ++ ++ R+ ++  L   + P   N++D                        + 
Sbjct: 78  NVAEVKVEDVSVSRLVRLLALLDAEEP---NMKD-----------------------VIT 111

Query: 144 AAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFM 203
               WS++FG  K G P LH ++   L  E   +  A         ++ +K+   I  + 
Sbjct: 112 GMNNWSVKFGNCKFGDPYLHSVIGSKL-VEGGHVYEAERYLTLGTHDSLDKYLDLIWEWY 170

Query: 204 GKCYPGEDDLAV----ARAILMYLSLGNMKDANYIMDE-VKKQVENKQLQLQPSD----- 253
            +     D  AV    +R I  YL +GN+++A     + + K +EN   + +  D     
Sbjct: 171 LQS---GDREAVGDFFSRLIFNYLFIGNVQNAEQAKQKFLAKFIENFNPKYELIDRGEFK 227

Query: 254 -----------LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
                       +Q + Y  QT   D   LF+ L+ +Y  S ER   +   LD + +++F
Sbjct: 228 TYCFEDYPELNFLQLLPYSCQTKNAD---LFDFLKKHYAGSSER---YGSELDYLGQEYF 281

Query: 303 GVKRRNPM 310
           G+  + PM
Sbjct: 282 GIVVQKPM 289


>gi|403165131|ref|XP_003325167.2| hypothetical protein PGTG_06704 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
 gi|375165540|gb|EFP80748.2| hypothetical protein PGTG_06704 [Puccinia graminis f. sp. tritici
           CRL 75-36-700-3]
          Length = 444

 Score = 58.2 bits (139), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 75/359 (20%), Positives = 143/359 (39%), Gaps = 78/359 (21%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYS---EALDLLHAGACLQLKNN 68
           P     +  ++ +V++  +Y A Q Y++ +AR + +   +   +A+ LL  G+ L L+  
Sbjct: 80  PSLVRTLKSIQNLVDQQEWYAAHQKYRTSAARLLKSNEQASIQDAISLLFEGSRLLLERG 139

Query: 69  QLTCGAELAVSFV-EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
           Q   G +L +  + + LV  KV    E  +++  +  L                   T +
Sbjct: 140 QTGSGTDLGLMLIRDVLVAKKVGLGVEERNKLLTLITL-------------------TGS 180

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
            G+ +       + + AA+ W+ + G   TG P LH+ + E L+ E+ +   A +     
Sbjct: 181 SGSWR------KTLIDAAVSWTCDAGDCPTGDPALHLAVGERLFKEN-QFQSAVIHLLSA 233

Query: 188 RGNNPEKFASTIVNFMGK--------CYPGEDDLAVARAILMYLSLGNMKDA-NYIMDEV 238
              +  K  S ++    K        C  G      A  +L YL LG++  A +++ D +
Sbjct: 234 CNRDAAKVLSNVLWAWSKEDKSNPDGCLSGR---YAAVGVLGYLELGSIISARSFLEDFL 290

Query: 239 KKQVEN--KQLQLQ-PSD---------------LIQFVFYVLQ----TLERDALPLFN-- 274
           +K +    K L  + P+D                +Q +    Q    +L   + P  N  
Sbjct: 291 QKTISTYPKLLNCELPADCNNSTIHVFVVPSLNFLQLLILTCQVGPGSLPNGSTPPPNHT 350

Query: 275 ---MLRANYKASIER---------EPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
              + +A Y+A I R          PA  E    +AE +FG++++     I  D+   +
Sbjct: 351 GPTIGKAVYQAMIGRYSRFDGWVSSPAIQESFATLAEMYFGIRQQRQGGNILSDLMGSL 409


>gi|432100478|gb|ELK29097.1| SUN domain-containing protein 1 [Myotis davidii]
          Length = 1007

 Score = 57.8 bits (138), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 53/208 (25%), Positives = 100/208 (48%), Gaps = 35/208 (16%)

Query: 73  GAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
            A+L++  +E+L K +V   +E L+ + K++ L      P++ E                
Sbjct: 651 AADLSMLVLESLEKAEVEVADELLENLAKLFSLMD----PNSPE---------------- 690

Query: 133 LRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNP 192
            RV    +F+  A+KWS   G+ + G P LH +LA  L+ E    +     +HF+   + 
Sbjct: 691 -RV----AFVSRALKWS-SGGSGRLGHPRLHQLLALTLWREQ---NYCESRYHFLHSADG 741

Query: 193 EKFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQ---VENKQLQ 248
           E  A  +V +   + +  E D+ VA+A+L +L L N   A+ +     ++   +E+    
Sbjct: 742 EGCAHMLVEYSTSRGFRSEVDMFVAQAVLQFLCLKNKSSASVVFTTYTQKHPSIEDGPPF 801

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNML 276
           +QP  L+ F++++L  L+   L +F +L
Sbjct: 802 VQP--LLNFIWFLLLALDGGKLTVFTVL 827


>gi|410081688|ref|XP_003958423.1| hypothetical protein KAFR_0G02560 [Kazachstania africana CBS 2517]
 gi|372465011|emb|CCF59288.1| hypothetical protein KAFR_0G02560 [Kazachstania africana CBS 2517]
          Length = 306

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 73/333 (21%), Positives = 133/333 (39%), Gaps = 65/333 (19%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + +  +  G++Y A Q  ++I+ RYV ++ Y +A++L+   A   L+  Q +  ++
Sbjct: 10  KTLQRFQSKIKAGDYYEAHQTLRTIANRYVRSKHYDDAIELIIHAATSFLEVKQFSIASD 69

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           L    ++     ++  ++   D V  + +L   +       D+  L E+  A+       
Sbjct: 70  LIFYLLDVYDTAQIQVND---DSVANLIQLLVLVDA-----DEPQLKEIATAMN------ 115

Query: 136 EGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKF 195
                       WSI+F   K G P LH ++A  L       +  R   +F+ G N    
Sbjct: 116 -----------NWSIKFSDFKFGEPRLHNIIASKLLEAGLVYEAER---YFILGTNES-- 159

Query: 196 ASTIVNFMGKCYPGEDDLA------VARAILMYLSLGNMKDANYIM---DEVKKQVENK- 245
               V+ + + Y   DD A      ++R I  YL + N    NY++   D+   Q+ NK 
Sbjct: 160 LDKYVDLIWEWYLRSDDDAYGAGEPLSRLIFNYLFISNF---NYVIKSRDKFLNQLLNKF 216

Query: 246 -QLQLQPSDLIQFVFYVLQ---------------TLERDALPLFNMLRANYKASIEREPA 289
             L+ +  D  QF  Y                  T +     LF  L+  Y    E    
Sbjct: 217 NDLKYEIIDKNQFKLYYFHDEKLKDLNFLQLMIITCQTKNKDLFTNLKNQYP---ECTTK 273

Query: 290 FNEMLDDIAEKFFG---VKRRNPMQGIFGDIFK 319
           F   L+ + +++F    VK+ N +Q +    FK
Sbjct: 274 FKTQLEFLGQEYFDIVPVKQPNFLQDMMSGFFK 306


>gi|255727967|ref|XP_002548909.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
 gi|240133225|gb|EER32781.1| conserved hypothetical protein [Candida tropicalis MYA-3404]
          Length = 330

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 41/156 (26%), Positives = 70/156 (44%), Gaps = 25/156 (16%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + I + +  ++ G+FY A Q  ++I+ RYV A+++SEA DLL+ GA + + N +    ++
Sbjct: 7   KTIQRFQSRIDNGSFYEAHQTLRTITNRYVKAKQFSEAKDLLYQGATILVNNKEYASASD 66

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           L    V+   +  V   +   D   ++ +L   IP      DD  L +L+          
Sbjct: 67  LITYLVQIYTEEGVEVSDR--DAKNRLIDLVSGIP-----NDDPSLVDLS---------- 109

Query: 136 EGCSSFLKAAIKWSIEFG-APKTGSPELHVMLAEYL 170
                  K  I WS +     K G P+LH +    L
Sbjct: 110 -------KVCISWSKQSSECSKFGDPDLHHLFGSKL 138


>gi|296810072|ref|XP_002845374.1| DUF410 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238842762|gb|EEQ32424.1| DUF410 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 351

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 76/341 (22%), Positives = 134/341 (39%), Gaps = 60/341 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y A Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYEAHQQLRVIAARYIKQSNYDAAANILSGGAKALLQAGAQQGVS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D    +R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDETGKERKQRLIELLQEFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARV----- 182
                             I WS +FG  + G P+LH  +   +Y++    D  ++     
Sbjct: 122 ------------------ISWSSKFGDLERGDPDLHHSIGS-IYAQGLVSDQGQILENEP 162

Query: 183 ---SFHFVRGN--NPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYI--- 234
                H   G   + E       N+          +   RA+  YL  GN+ +AN     
Sbjct: 163 YDAERHLAFGTVESAETLGKLEYNWYTYDAQHTAGIYCCRAVFPYLLTGNILNANKAFLV 222

Query: 235 ------MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANY 280
                   E  K +  +++  Q +D+  +           +L  ++R    LF  L   Y
Sbjct: 223 FTRLLSTSESGKALGVQEVSSQQNDVRVYPSLPLLNFTNLLLLAIQRGTPDLFRQLLKQY 282

Query: 281 KASIEREPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
           ++ I+    ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 283 QSHIKEAGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 323


>gi|448102593|ref|XP_004199842.1| Piso0_002389 [Millerozyma farinosa CBS 7064]
 gi|359381264|emb|CCE81723.1| Piso0_002389 [Millerozyma farinosa CBS 7064]
          Length = 324

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 56/233 (24%), Positives = 94/233 (40%), Gaps = 40/233 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  I + +  ++   +Y A Q  ++I+ RYV +++Y EA DLL+ GA +  KN +     
Sbjct: 7   QRTITRFKSKIDSKEYYEAHQTLRTITNRYVNSKQYKEAADLLYQGASILAKNGEYASAG 66

Query: 75  ELAVSFVEALVKGKVAYDN--ENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
           +L    +E   +  +  DN  EN +   K  E+   +P       D  L +L+       
Sbjct: 67  DLLSYMIEVYSEAGIKCDNDGENKEYKLKFIEVVRLLP-----NSDSVLVDLS------- 114

Query: 133 LRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYS-----------ESPELDMAR 181
                     K +I WS +    K G  ELH +  E               E  +   A 
Sbjct: 115 ----------KQSIAWSSKSTGSKFGDNELHHVFGEKFLKSLDDSSSSLTEEERQKVFAL 164

Query: 182 VSFHFVRGNNPEKFASTIVNFMGKCYPGEDD---LAVARAILMYLSLGNMKDA 231
              H + G +        V+F+ +      D   L ++RA++ YL L N++ A
Sbjct: 165 AELHLILGVHSS--LPLYVDFLYRWSESSSDDKGLFLSRAVINYLYLKNVRFA 215


>gi|66475290|ref|XP_627461.1| hypothetical protein [Cryptosporidium parvum Iowa II]
 gi|32398675|emb|CAD98635.1| hypothetical predicted protein, unknown function [Cryptosporidium
           parvum]
 gi|46228926|gb|EAK89775.1| possible apicomplexan conserved protein [Cryptosporidium parvum
           Iowa II]
          Length = 297

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 38/179 (21%), Positives = 83/179 (46%), Gaps = 22/179 (12%)

Query: 161 ELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAIL 220
           +LHV  + Y +      + ++   + +   NPE FA+ ++ +  K Y  E D+ + RA+L
Sbjct: 119 KLHVWASNYYFVHQ---NFSKAQLYVILTENPELFANILIEWSKKGYKSEKDIFLLRAVL 175

Query: 221 MYLSLGNMKDANYIMDEVKKQVENKQLQLQ--------------PSDLIQFVFYVLQTLE 266
           + LSL   + A+ ++D+  + +                      PS  IQ  F+++   E
Sbjct: 176 VLLSLNKTQFAHDLLDKYCQLLNESNDTSLSLSSSCNSNIDSNIPSAPIQMAFFIISACE 235

Query: 267 ----RDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKMM 321
               + A+  F++++  Y   I R+ +F+ ++D I + +F  +++N    +  ++  M 
Sbjct: 236 LSDKKTAMHFFDIIKNKYALLIRRDSSFSRIIDKIDQIYFN-RQKNASFNMLSNLLSMF 293


>gi|341896362|gb|EGT52297.1| CBN-CEE-1 protein [Caenorhabditis brenneri]
          Length = 362

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 67/313 (21%), Positives = 131/313 (41%), Gaps = 31/313 (9%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSE-ALDLLHAGACLQLKNNQLTCGAE 75
            + +LE+   +  FY A Q Y++   R    +  +E A+ L+        +  Q  C  +
Sbjct: 5   TLSRLEEFEKDKKFYDALQFYRTKVTRSFRLKSANEVAISLVTHALNFFFREKQYQCAID 64

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           +A  + E L K  V  ++   + +      F        +E++    +  + L  A+ + 
Sbjct: 65  IATQYAECLSKNNVDLNHAAFELLADSVAKFANFA---EIENEAGNTQPEQLLNTARCKC 121

Query: 136 EGCSSFLKAAIKWSIEFGAP----KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                 +  AI+W+ +        K GS   H +LA+   +    L++A+   HF+  ++
Sbjct: 122 ------VDLAIQWTKQKSTTPQEKKYGSAAFHTVLAKKFIAVD-HLELAKN--HFLLSDD 172

Query: 192 PEKFASTIVNFMGKCYPGED------DLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
            + FA     F+ + Y  ++      D+ +  A+L  L L     A  +  +  K   NK
Sbjct: 173 SKSFA----KFLHQEYEKDNIKKVDSDIIIVEAVLQTLCLDRFPFAVALFKDYVK--PNK 226

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
               +P  L  F   +   +E +  P ++ L A+Y+  ++R  AF   L  I + +FG++
Sbjct: 227 YPFEKP--LFNFQHILFDVIETENQPQYSELTASYQTELKRCYAFIGYLTRIGKLYFGIR 284

Query: 306 RRNPMQGIFGDIF 318
               M G  G++F
Sbjct: 285 DNQCMSGGLGNLF 297


>gi|149238403|ref|XP_001525078.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146451675|gb|EDK45931.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 322

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/130 (29%), Positives = 62/130 (47%), Gaps = 17/130 (13%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAEL 76
            I + +  ++ G+FY A Q  ++I+ RYV A++Y+EA DLL+ GA +  KN +    ++L
Sbjct: 8   TIQRFQTKIDSGSFYEAHQTIRTITNRYVKAKQYTEARDLLYQGATILTKNKEYASASDL 67

Query: 77  AVSFVEALVKGKVAYDNENL-----DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
               ++        YD E L     D   K+ EL   +P      DD  L++L  A  + 
Sbjct: 68  INYLIQ-------VYDEEELKITDKDAKYKLIELISNLP-----NDDPLLNDLANASISW 115

Query: 132 KLRVEGCSSF 141
             +   C  F
Sbjct: 116 SQKSSVCEKF 125


>gi|302657586|ref|XP_003020512.1| hypothetical protein TRV_05406 [Trichophyton verrucosum HKI 0517]
 gi|291184352|gb|EFE39894.1| hypothetical protein TRV_05406 [Trichophyton verrucosum HKI 0517]
          Length = 345

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 77/336 (22%), Positives = 133/336 (39%), Gaps = 56/336 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D     R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDEVGKKRKQRLIELLREFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAE-YLYSESPELDMARVSFHF 186
                             I WS +FG  + G P+LH  +   Y      E +      H 
Sbjct: 122 ------------------ISWSSKFGGLERGDPDLHHAIGSAYAQGRMLENEPYDAEKHL 163

Query: 187 VRGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYI-------- 234
             G    + A T+     K Y  ++     +   RA+  YL  GN+ +AN          
Sbjct: 164 AFGTV--ESADTLAKLEYKWYTYDEQHTAGIYCCRAVFPYLLTGNVLNANKAFVVFTQLL 221

Query: 235 -MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIE 285
              E  K +  +++  Q +D+  +           +L  ++R    LF  L   Y++ I+
Sbjct: 222 STSESGKSLGVQEVSSQQNDVRVYPSLPLLNFTNLLLLAIQRGTPDLFRQLLKQYQSHIQ 281

Query: 286 REPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
               ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 282 EAGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 317


>gi|323346567|gb|EGA80854.1| YOR164C-like protein [Saccharomyces cerevisiae Lalvin QA23]
          Length = 243

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFENKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVDDISVARLVRLIAELDPSEP---NLKD----------------V 115

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
           + G ++       WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 116 ITGMNN-------WSIKFSEYKFGDPYLHNTIGSKLLEGDFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R +  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDDIEDSTVAEFFSRLVFNYLFISNISFAH 208


>gi|67597006|ref|XP_666115.1| hypothetical protein [Cryptosporidium hominis TU502]
 gi|54657038|gb|EAL35882.1| hypothetical protein Chro.60112 [Cryptosporidium hominis]
          Length = 235

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/178 (21%), Positives = 83/178 (46%), Gaps = 22/178 (12%)

Query: 161 ELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAIL 220
           +LHV  + Y +      + ++   + +   NPE FA+ ++ +  K Y  E D+ + RA+L
Sbjct: 57  KLHVWASNYYFIHQ---NFSKAQLYVILTENPELFANILIEWSKKGYQSEKDIFLLRAVL 113

Query: 221 MYLSLGNMKDANYIMDEVKKQVENKQLQLQ--------------PSDLIQFVFYVLQTLE 266
           + LSL   + A+ ++D+  + +                      PS  IQ  F+++   E
Sbjct: 114 VLLSLNKTQFAHDLLDKYCQLLNESNDTSLSLSSSFNSNIDSNIPSAPIQMAFFIISACE 173

Query: 267 ----RDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIFKM 320
               + A+  F++++  Y   I R+ +F+ ++D I + +F  +++N    +  ++  M
Sbjct: 174 LSDKKTAMHFFDIIKNKYALLIRRDSSFSRIIDKIDQIYFN-RQKNASFNMLSNLLSM 230


>gi|398365507|ref|NP_014807.3| Get4p [Saccharomyces cerevisiae S288c]
 gi|74676531|sp|Q12125.1|GET4_YEAST RecName: Full=Golgi to ER traffic protein 4; AltName: Full=Guided
           entry of tail-anchored proteins 4
 gi|284793798|pdb|2WPV|A Chain A, Crystal Structure Of S. Cerevisiae Get4-Get5 Complex
 gi|284793799|pdb|2WPV|C Chain C, Crystal Structure Of S. Cerevisiae Get4-Get5 Complex
 gi|284793800|pdb|2WPV|E Chain E, Crystal Structure Of S. Cerevisiae Get4-Get5 Complex
 gi|284793801|pdb|2WPV|G Chain G, Crystal Structure Of S. Cerevisiae Get4-Get5 Complex
 gi|1293725|gb|AAB47411.1| O3580p [Saccharomyces cerevisiae]
 gi|1420405|emb|CAA99370.1| unnamed protein product [Saccharomyces cerevisiae]
 gi|45269962|gb|AAS56362.1| YOR164C [Saccharomyces cerevisiae]
 gi|151945783|gb|EDN64024.1| conserved protein [Saccharomyces cerevisiae YJM789]
 gi|190407483|gb|EDV10750.1| hypothetical protein SCRG_01557 [Saccharomyces cerevisiae RM11-1a]
 gi|256272793|gb|EEU07764.1| YOR164C-like protein [Saccharomyces cerevisiae JAY291]
 gi|259149650|emb|CAY86454.1| EC1118_1O4_3829p [Saccharomyces cerevisiae EC1118]
 gi|285815045|tpg|DAA10938.1| TPA: Get4p [Saccharomyces cerevisiae S288c]
 gi|323331517|gb|EGA72932.1| YOR164C-like protein [Saccharomyces cerevisiae AWRI796]
 gi|323335551|gb|EGA76836.1| YOR164C-like protein [Saccharomyces cerevisiae Vin13]
 gi|349581322|dbj|GAA26480.1| K7_Yor164cp [Saccharomyces cerevisiae Kyokai no. 7]
 gi|365763104|gb|EHN04635.1| YOR164C-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
 gi|392296494|gb|EIW07596.1| Get4p [Saccharomyces cerevisiae CEN.PK113-7D]
          Length = 312

 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFENKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVDDISVARLVRLIAELDPSEP---NLKD----------------V 115

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
           + G ++       WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 116 ITGMNN-------WSIKFSEYKFGDPYLHNTIGSKLLEGDFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R +  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDDIEDSTVAEFFSRLVFNYLFISNISFAH 208


>gi|212546233|ref|XP_002153270.1| DUF410 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064790|gb|EEA18885.1| DUF410 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 342

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 73/324 (22%), Positives = 124/324 (38%), Gaps = 56/324 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLT 71
           + I + ++ +  G++Y A Q  + I+ARY+    Y  A ++L  GA   L+         
Sbjct: 7   KTIARQQEKIAAGSYYEAHQQLRVIAARYLKQSNYDAAAEILTGGAKALLRAGSQQGASA 66

Query: 72  CGAELAVSF-VEALVKGK---VAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
            G +LA+   VE   K +   +  D E+  R +++ EL  + P P               
Sbjct: 67  SGGDLAIMLVVEVYNKAEWEIIPEDAESRGRKKRLIELLREFP-PDE------------- 112

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                        ++   I WS +FG  + G PELH  +      E+   D  R   H V
Sbjct: 113 --------PSRKRYITEMISWSSKFGPLERGDPELHHAVGSVYAEENEPYDAER---HLV 161

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDDLAVA----RAILMYLSLGNMKDANYIMDEVKKQVE 243
            G      A T+     + Y  ++    A    RA+L +L   N++ AN  +     ++ 
Sbjct: 162 LGTIDS--AETLAKLEFEWYTSDEAHTAAIYASRAVLPFLLTANLRSANKALLIFTSRLS 219

Query: 244 NKQLQLQPSDLIQFVFYV---------------LQTLERDALPLFNMLRANYKASIEREP 288
              L L   ++      V               L  ++R +  LF  L  +Y   I+   
Sbjct: 220 AANLSLASQEVSSTSSDVRIYPSLPLLNFLSLLLIAVQRGSADLFRDLTRHYTPHIKEVS 279

Query: 289 AFNEMLDDIAEKFFGVKRRNPMQG 312
            +++ L  I E +F +K   P QG
Sbjct: 280 IWDDALATIGEMYFNIK--IPRQG 301


>gi|323302916|gb|EGA56720.1| YOR164C-like protein [Saccharomyces cerevisiae FostersB]
 gi|323307200|gb|EGA60483.1| YOR164C-like protein [Saccharomyces cerevisiae FostersO]
          Length = 219

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFENKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVDDISVARLVRLIAELDPSEP---NLKD----------------V 115

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
           + G ++       WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 116 ITGMNN-------WSIKFSEYKFGDPYLHNTIGSKLLEGDFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R +  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDDIEDSTVAEFFSRLVFNYLFISNISFAH 208


>gi|323352121|gb|EGA84658.1| YOR164C-like protein [Saccharomyces cerevisiae VL3]
          Length = 219

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 96/223 (43%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFENKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVDDISVARLVRLIAELDPSEP---NLKD----------------V 115

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
           + G ++       WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 116 ITGMNN-------WSIKFSEYKFGDPYLHNTIGSKLLEGDFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R +  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDDIEDSTVAEFFSRLVFNYLFISNISFAH 208


>gi|258569625|ref|XP_002543616.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237903886|gb|EEP78287.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 344

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 72/330 (21%), Positives = 130/330 (39%), Gaps = 50/330 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK----NNQLT 71
           + I +  + +  G +Y + Q  + I+ARY+    Y  A D+L  GA   LK         
Sbjct: 7   KTIARQREKIAAGAYYESHQQLRVIAARYLKQSNYDAAADILAGGAIALLKAGAQQGASA 66

Query: 72  CGAELAVSFVEAL---VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEAL 128
            G +LA+  V  +    + +++ D     R +++ E+  + P            E T   
Sbjct: 67  SGGDLAIMLVIDVYNKAEWELSDDEVGKRRRQRLIEILREFPA----------EEPTR-- 114

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVR 188
                       ++   I WS ++G  +TG PE+H  +   +Y+E  E+  A        
Sbjct: 115 ----------KRYVSEMISWSSKYGDIETGDPEIHHAIGS-IYAEENEVYDAERHLALGT 163

Query: 189 GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQV------ 242
             + E  A     +          +  ARA+  YL  G+ ++AN      + Q+      
Sbjct: 164 VESAEVLARLEYRWYTHDAVHTAGIYAARAVFPYLLTGSPRNANKAFLIFRNQLSSSPTS 223

Query: 243 ----------ENKQLQLQPS-DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFN 291
                     +N  +++ PS  L+ F   +L  ++R    LF  L   Y   I+    ++
Sbjct: 224 QALGVQEVSSQNNDVRVYPSLPLLNFTNLLLLAIQRGHPDLFKQLLRQYATYIKEAGDWD 283

Query: 292 EMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
           + L  I E +FG+   K+ NP+  + G  F
Sbjct: 284 QALAHIGEVYFGIRIPKQSNPLMDMMGMFF 313


>gi|195999346|ref|XP_002109541.1| hypothetical protein TRIADDRAFT_53676 [Trichoplax adhaerens]
 gi|190587665|gb|EDV27707.1| hypothetical protein TRIADDRAFT_53676 [Trichoplax adhaerens]
          Length = 308

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 65/290 (22%), Positives = 115/290 (39%), Gaps = 36/290 (12%)

Query: 14  AQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCG 73
           A + + K E   NEG+FY A Q+Y+   AR+ A + Y++A++ +     L  +   +   
Sbjct: 21  AMKMLKKAEMKYNEGDFYEAHQIYRIAYARFRAKELYNDAMEAIRPAILLFTRCKCIDLA 80

Query: 74  AELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKL 133
            +LA+  +E L   ++      L+ +  I  +F         + D D             
Sbjct: 81  KDLALLIIETLTDRRIPVTTAILENLSSICGMF---------DSDSD------------A 119

Query: 134 RVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE 193
           RV    + ++ AI ++   G P+ G   L+ +LAE +Y +     MA     +    N E
Sbjct: 120 RV----TIIEKAILYTKAVGNPEHGDNRLYKLLAE-MYWKEKNFGMAIKCLMY---GNAE 171

Query: 194 KFASTIVNF-MGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS 252
             A  +V+  +   Y  E DL +   IL  L       A   +    K          P 
Sbjct: 172 TLAGALVDLSITFGYNDEADLIITNCILQLLCSNRKNIAAATLQHYVKTHPQLSGSRAPL 231

Query: 253 DLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
            LI F+ YV++ +E   L          K   ++  +FN  ++D  + + 
Sbjct: 232 PLINFLNYVIRAVETYYLTQIG------KVYFKKTMSFNTFIEDTDKSYI 275


>gi|327297466|ref|XP_003233427.1| hypothetical protein TERG_06416 [Trichophyton rubrum CBS 118892]
 gi|326464733|gb|EGD90186.1| hypothetical protein TERG_06416 [Trichophyton rubrum CBS 118892]
          Length = 341

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 76/335 (22%), Positives = 133/335 (39%), Gaps = 58/335 (17%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA+  V E   K   +VA D     R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLAIMLVLEVYNKAGWEVADDEVGKKRKQRLIELLREFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                             I WS +FG  + G  +LH  +      E+   D  +   H  
Sbjct: 122 ------------------ISWSSKFGELERGDSDLHHSIGSAYAQENEPYDAEK---HLA 160

Query: 188 RGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYI--------- 234
            G    + A T+     K Y  ++     +   RA+  YL  GN+ +AN           
Sbjct: 161 FGTV--ESADTLAKLEYKWYTYDEQHTAGIYCCRAVFPYLLTGNILNANKAFVVFTQLLS 218

Query: 235 MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIER 286
             E  K +  +++  Q +D+  +           +L  ++R    LF  L   Y++ I+ 
Sbjct: 219 TSEPGKSLGVQEVSSQQNDVRVYPSLPLLNFTNLLLLAIQRGTPDLFRQLLKQYQSHIQE 278

Query: 287 EPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
              ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 279 AGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 313


>gi|213408445|ref|XP_002174993.1| UPF0363 protein c [Schizosaccharomyces japonicus yFS275]
 gi|212003040|gb|EEB08700.1| UPF0363 protein c [Schizosaccharomyces japonicus yFS275]
          Length = 309

 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 64/312 (20%), Positives = 122/312 (39%), Gaps = 53/312 (16%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVA 90
           Y   QM +++  R V  +++  A++LL++GA      N+     +LA   ++   K    
Sbjct: 20  YEGHQMLRTLVNRQVKNKQFDTAVELLYSGASTLFDMNETASAGDLAQYIMDTYTKAGFE 79

Query: 91  YDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSI 150
               +  R  ++  +FP+            +H+L                     I WS 
Sbjct: 80  TTPAHKARAIEVLAMFPK----EESTRKQYVHDL---------------------IAWSK 114

Query: 151 EFGAPKTGSPELHVMLAE-YLYSESPELDMARVSFHFVRGNN--PEKFASTIVN-FMGKC 206
           +  +  +G  ++   +A  ++  E PE    +   HFV GN+  P  +   +   + G  
Sbjct: 115 K--STPSGDEDVQDAVANAFVKWEEPE----KAESHFVLGNSQSPLSYGHMLYQWYTGAE 168

Query: 207 YPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK-QLQLQPSD------------ 253
              +  +   RA++ YL   N+  A   + E  K +  K Q  +Q +             
Sbjct: 169 TKTDAGVFAGRAVINYLLAENLNGAKTALSEFSKLLSEKGQETMQKASIDDVSVSVFPNH 228

Query: 254 -LIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK----RRN 308
            L+ F + +L    R+A P F  L+  Y   ++  P + E  + I EK+F +K    + N
Sbjct: 229 PLVNFFWLLLVAASREAQPSFLNLKETYAEQLKAMPGWQEASEKIGEKYFHIKPAFSQGN 288

Query: 309 PMQGIFGDIFKM 320
            + G+    F +
Sbjct: 289 LLSGLMNSFFGL 300


>gi|299689114|pdb|3LKU|A Chain A, Crystal Structure Of S. Cerevisiae Get4 In Complex With An
           N-Terminal Fragment Of Get5
 gi|299689116|pdb|3LKU|C Chain C, Crystal Structure Of S. Cerevisiae Get4 In Complex With An
           N-Terminal Fragment Of Get5
 gi|299689118|pdb|3LKU|E Chain E, Crystal Structure Of S. Cerevisiae Get4 In Complex With An
           N-Terminal Fragment Of Get5
          Length = 292

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 95/223 (42%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 7   KTLQRFENKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 66

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R VR I EL P  P   NL+D                 
Sbjct: 67  LIFYLLEVYDLAEVKVDDISVARLVRLIAELDPSEP---NLKD----------------V 107

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
           + G ++       WSI+F   K G P LH  +   L       +  R   +F  G +   
Sbjct: 108 ITGXNN-------WSIKFSEYKFGDPYLHNTIGSKLLEGDFVYEAER---YFXLGTHDSX 157

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R +  YL + N+  A+
Sbjct: 158 IKYVDLLWDWLCQVDDIEDSTVAEFFSRLVFNYLFISNISFAH 200


>gi|50311919|ref|XP_455991.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|49645127|emb|CAG98699.1| KLLA0F20295p [Kluyveromyces lactis]
          Length = 302

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 79/338 (23%), Positives = 132/338 (39%), Gaps = 81/338 (23%)

Query: 19  DKLEKI-------VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           DKL K        +  G++Y A Q  ++I+ RYV A++Y E++DL+   A   L++ Q  
Sbjct: 3   DKLTKTLQRFNARIEAGDYYEAHQTLRTIANRYVRAKKYDESIDLISHAAGAFLRSKQGG 62

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G++L    +E                   +Y+L     VP N   D  +  L + L   
Sbjct: 63  SGSDLIFYLLE-------------------VYDL---ASVPCN---DSSVSRLIQLL--- 94

Query: 132 KLRVEGCSSFLKAAIK----WSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
            + V+G    LK  +     WS++    K G P LH  +A  L       +  R   + V
Sbjct: 95  -VLVDGQEPNLKDVVTGMNNWSVKHSEYKFGDPYLHSAIATKLLEGGYTYEAER---YLV 150

Query: 188 RGNNPEKFASTIVNFMGKCY--PGEDDL---AVARAILMYLSLGNMKDANYIMDEVKKQV 242
            G +        V+ +   Y   GED      ++R I  YL + N++ A     E+ K+ 
Sbjct: 151 LGTHDS--MKRYVDLLWDWYQQSGEDSSFPDFISRMIFNYLFIFNIEYALEGSKELLKRF 208

Query: 243 ENKQLQL------------------QPSDLIQFVFYVLQTLERDALPLFNMLRANYKASI 284
            N +  +                  +  + +Q +    QT E     LF  L++ Y    
Sbjct: 209 TNHKTDVKFELTEKGPYKIDYFSSWESLNFLQLLIIACQTQET---ILFQNLKSQYSTLA 265

Query: 285 EREPAFNEMLDDIAEKFFGVKRRNP-------MQGIFG 315
            +   +N  L  + +++FG+K   P       M G FG
Sbjct: 266 NK---YNSELQFLGQEYFGIKVEKPINFLQDMMSGFFG 300


>gi|365991775|ref|XP_003672716.1| hypothetical protein NDAI_0K02820 [Naumovozyma dairenensis CBS 421]
 gi|343771492|emb|CCD27473.1| hypothetical protein NDAI_0K02820 [Naumovozyma dairenensis CBS 421]
          Length = 325

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 69/325 (21%), Positives = 136/325 (41%), Gaps = 72/325 (22%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L++ GA   LK  Q   G +
Sbjct: 16  KTLQRFETKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELVYQGATSLLKAKQGGSGTD 75

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           L    +E      VA   ++ D + ++ +L   +        + +L ++  A+       
Sbjct: 76  LIFYLLEIY---DVAEIRQSKDSISRLIQLLVLVDAT-----EPNLKDVATAMN------ 121

Query: 136 EGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN--NPE 193
                       WSI+FG  K G P +H ++A  L +     +  R    FV G   + +
Sbjct: 122 -----------NWSIKFGEYKFGDPYIHNVIALKLLANDHVYEAERF---FVLGTQESVD 167

Query: 194 KFASTIVNFMGKCYPGEDDLA-------------VARAILMYLSLGNMKDA--------N 232
           K+ + I ++  +    ED  A              +R +  YL + N+K A        N
Sbjct: 168 KYVTLIWDWFLQA--TEDANANENTSEHVDVCDFFSRLVFNYLFILNIKFARLASSTFLN 225

Query: 233 YIMDEVK-------------KQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRAN 279
             M++ K             K V  + +Q Q  + +Q +    Q+ ++    LF  L+ +
Sbjct: 226 LFMEKFKPSHEIIEKGNADNKIVFFRDIQYQDLNFLQLILLTCQSRDK---SLFLNLKDH 282

Query: 280 YKASIEREPAFNEMLDDIAEKFFGV 304
           Y  S  +   ++  L+ + +++FG+
Sbjct: 283 YTESASK---YHSQLEFLGQEYFGI 304


>gi|448098766|ref|XP_004198991.1| Piso0_002389 [Millerozyma farinosa CBS 7064]
 gi|359380413|emb|CCE82654.1| Piso0_002389 [Millerozyma farinosa CBS 7064]
          Length = 324

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 64/154 (41%), Gaps = 20/154 (12%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  I + +  ++   +Y A Q  ++I+ RYV +++Y EA DLL+ GA +  KN +     
Sbjct: 7   QRTITRFKSKIDSKEYYEAHQTLRTITNRYVNSKQYKEAADLLYQGASILAKNGEYASAG 66

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    +E   +  +  DNE  ++  K+      I V   L + D +             
Sbjct: 67  DLLSYMIEVYSEAGIKCDNEGENKEYKL----KFIEVVRLLPNSDSV------------- 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAE 168
                   K +I WS      K G  ELH +  E
Sbjct: 110 ---LVDLSKQSIAWSSNTTGSKFGDNELHHVFGE 140


>gi|401841378|gb|EJT43778.1| GET4-like protein [Saccharomyces kudriavzevii IFO 1802]
          Length = 312

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 93/223 (41%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFESKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V   + ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVGDVSVARLVRLIAELDPSEP---NLKD----------------- 114

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
                  + A   WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 115 ------VITAMNNWSIKFSEYKFGDPYLHNTIGTKLLEGGFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R I  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDKVEDSTVAEFFSRLIFNYLFICNISFAH 208


>gi|365758332|gb|EHN00181.1| YOR164C-like protein [Saccharomyces cerevisiae x Saccharomyces
           kudriavzevii VIN7]
          Length = 302

 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 55/223 (24%), Positives = 93/223 (41%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFESKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V   + ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVGDVSVARLVRLIAELDPSEP---NLKD----------------- 114

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
                  + A   WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 115 ------VITAMNNWSIKFSEYKFGDPYLHNTIGTKLLEGGFVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   + +++ +    ED       +R I  YL + N+  A+
Sbjct: 166 IKYVDLLWDWLCQVDKVEDSTVAEFFSRLIFNYLFICNISFAH 208


>gi|302510937|ref|XP_003017420.1| hypothetical protein ARB_04301 [Arthroderma benhamiae CBS 112371]
 gi|291180991|gb|EFE36775.1| hypothetical protein ARB_04301 [Arthroderma benhamiae CBS 112371]
          Length = 345

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 77/336 (22%), Positives = 132/336 (39%), Gaps = 56/336 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA--CLQLKNNQ--L 70
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A ++L  GA   LQ    Q   
Sbjct: 6   EKTIARQQEKIASGAYYESHQQLRVIAARYIKQSNYDAAANILSEGAKALLQAGAQQGAS 65

Query: 71  TCGAELAVSFV-EALVKG--KVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
             G +LA   V E   K   +VA D     R +++ EL  + P P        ++E+   
Sbjct: 66  ASGGDLATMLVLEVYNKAGWEVADDEVGKKRKQQLIELLREFP-PEEPTRKRYINEI--- 121

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAE-YLYSESPELDMARVSFHF 186
                             I WS +FG  + G P+LH  +   Y      E +      H 
Sbjct: 122 ------------------ISWSSKFGGLERGDPDLHHAIGSAYAQVRMLENEPYDAEKHL 163

Query: 187 VRGNNPEKFASTIVNFMGKCYPGEDD----LAVARAILMYLSLGNMKDANYI-------- 234
             G    + A T+     K Y  ++     +   RA+  YL  GN+ +AN          
Sbjct: 164 AFGTV--ESADTLAKLEYKWYTYDEQHTAGIYCCRAVFPYLLTGNILNANKAFVVFTQLL 221

Query: 235 -MDEVKKQVENKQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIE 285
              E  K +  +++  Q +D+  +           +L  ++R    LF  L   Y++ I+
Sbjct: 222 STSESGKSLGVQEVSSQQNDVRVYPSLPLLNFTNLLLLAIQRGTPDLFRQLLKQYQSHIQ 281

Query: 286 REPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
               ++  L  I E +FG+   K+ NP+  + G  F
Sbjct: 282 EAGEWDHALAHIGELYFGIKIPKQSNPLMDMMGMFF 317


>gi|238882620|gb|EEQ46258.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 327

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 72/278 (25%), Positives = 114/278 (41%), Gaps = 72/278 (25%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I + +  ++ G+FY A Q  ++I+ RYV A++Y EA DLL+ G+ + LKN +    ++L 
Sbjct: 9   IQRFQAKIDSGSFYEAHQTLRTITNRYVKAKQYKEARDLLYQGSTILLKNKEFASASDLI 68

Query: 78  VSFVEALVKGKVAYDNENL-----DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
              ++        YD E +     D   K+ +L   +P      +D  L +L        
Sbjct: 69  NYLIQI-------YDEEGILVSDKDAKLKLIDLISNLP-----NNDPSLSDLA------- 109

Query: 133 LRVEGCSSFLKAAIKWSIEF-GAPKTGSPELHVMLA-------EYLYSE----------- 173
                     K+++ WS +  G  K G  ELH +         EY  S+           
Sbjct: 110 ----------KSSLNWSKKSPGCEKFGDCELHHLFGSKFLKFVEYGTSQHDIESENGKVV 159

Query: 174 -SPELDMARV----SFHFVRGNNPEKFAST--IVNFM---GKCYPGED-DLAVARAILMY 222
             PE + A+V      H V G     F S    +N++    K  P  D  + + RAI+ Y
Sbjct: 160 LDPE-ERAKVFAIAELHLVLGT----FESVPLYINYLVKYAKANPEVDPGVFLGRAIVNY 214

Query: 223 LSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFY 260
             L N+K      D   K++E    +   SD   F FY
Sbjct: 215 SYLKNIKFVKEAQDIFLKEIEVTNKETLKSD---FQFY 249


>gi|68470322|ref|XP_720665.1| hypothetical protein CaO19.11703 [Candida albicans SC5314]
 gi|68470585|ref|XP_720538.1| hypothetical protein CaO19.4228 [Candida albicans SC5314]
 gi|46442411|gb|EAL01700.1| hypothetical protein CaO19.4228 [Candida albicans SC5314]
 gi|46442545|gb|EAL01833.1| hypothetical protein CaO19.11703 [Candida albicans SC5314]
          Length = 327

 Score = 54.7 bits (130), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 72/278 (25%), Positives = 114/278 (41%), Gaps = 72/278 (25%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I + +  ++ G+FY A Q  ++I+ RYV A++Y EA DLL+ G+ + LKN +    ++L 
Sbjct: 9   IQRFQAKIDSGSFYEAHQTLRTITNRYVKAKQYKEARDLLYQGSTILLKNKEFASASDLI 68

Query: 78  VSFVEALVKGKVAYDNENL-----DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
              ++        YD E +     D   K+ +L   +P      +D  L +L        
Sbjct: 69  NYLIQI-------YDEEGILVSDKDAKLKLIDLISNLP-----NNDPSLSDLA------- 109

Query: 133 LRVEGCSSFLKAAIKWSIEF-GAPKTGSPELHVMLA-------EYLYSE----------- 173
                     K+++ WS +  G  K G  ELH +         EY  S+           
Sbjct: 110 ----------KSSLNWSKKSPGCEKFGDCELHHLFGSKFLKFVEYGTSQHDIESENGKVV 159

Query: 174 -SPELDMARV----SFHFVRGNNPEKFAST--IVNFM---GKCYPGED-DLAVARAILMY 222
             PE + A+V      H V G     F S    +N++    K  P  D  + + RAI+ Y
Sbjct: 160 LDPE-ERAKVFAIAELHLVLGT----FESVPLYINYLVQYAKANPEVDPGVFLGRAIVNY 214

Query: 223 LSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFY 260
             L N+K      D   K++E    +   SD   F FY
Sbjct: 215 SYLKNIKFVKEAQDIFLKEIEVTNKETLKSD---FQFY 249


>gi|402587323|gb|EJW81258.1| hypothetical protein WUBG_07834 [Wuchereria bancrofti]
          Length = 146

 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 43/152 (28%), Positives = 66/152 (43%), Gaps = 18/152 (11%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
           +KL      GN Y A Q+Y+++  R     ++ E  D+L++G    L   +     +LA 
Sbjct: 6   EKLRLCFENGNCYEAHQIYRTLYNRLSNQGKWQELQDMLYSGILRLLAEREAASAIDLAE 65

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            FVEAL K K    +  LDR  ++  L P       LE D         L A   R +  
Sbjct: 66  LFVEALEKSKTPVSSVVLDRFDELLNLLPA-----QLEKD---------LEANSEREDRR 111

Query: 139 SSFLKAAIKWSIEFGAPK----TGSPELHVML 166
             ++   +KWS+  G  K     G P LH+++
Sbjct: 112 LQYISLGVKWSMAVGDRKRYRRRGHPGLHLLV 143


>gi|154276848|ref|XP_001539269.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150414342|gb|EDN09707.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 251

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/214 (25%), Positives = 85/214 (39%), Gaps = 31/214 (14%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC-GAELAVSF-VE 82
           +  G +Y A Q  + I+ARY+  Q Y  A D+L  GA   L+    +  G +LA+   ++
Sbjct: 20  IAAGAYYEAHQQLRVIAARYLKQQNYDAAADILAGGAKALLRVPAASASGGDLAIMLIID 79

Query: 83  ALVKGKVAYDNENLDRVRK--IYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
              K +     + + R RK  + EL  + P P        ++EL                
Sbjct: 80  VYNKAEWDVGGDEVGRARKKRLIELLREFP-PEEPTRRRYMNEL---------------- 122

Query: 141 FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGN--NPEKFAST 198
                I WS  FG  + G PE+H  +      E+   D  R   H   G   + E  A  
Sbjct: 123 -----ISWSARFGKLERGDPEIHHEIGVLFAQENEPYDAER---HLAIGTPASAEILADL 174

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN 232
             ++     P       +RA+  +L  GN++DAN
Sbjct: 175 EYDWYTNDEPHTAGTYASRAVFPFLLTGNLRDAN 208


>gi|154302957|ref|XP_001551887.1| hypothetical protein BC1G_09222 [Botryotinia fuckeliana B05.10]
          Length = 211

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/175 (24%), Positives = 74/175 (42%), Gaps = 30/175 (17%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I +L++   EG +Y A Q  + +++RY+ +Q Y  A D+L++ +   L+ +Q   G +LA
Sbjct: 9   IARLQEKTAEGTYYEAHQQTRVVASRYIKSQNYPAATDILYSVSLSLLQASQGGSGGDLA 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
           +  +E      +  D+ +  R+  +   F P+ P                    ++ R  
Sbjct: 69  LFLLEVYNTSALTCDSASKGRLFTLLRAFSPEEP--------------------SRKR-- 106

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
               F+   I WS +  A   G PELH ++   L  E    D  R   H + G  
Sbjct: 107 ----FITETIAWSSKTSAYPAGDPELHHVIGSLLAEEGEAYDAER---HLILGTK 154


>gi|254579929|ref|XP_002495950.1| ZYRO0C06886p [Zygosaccharomyces rouxii]
 gi|238938841|emb|CAR27017.1| ZYRO0C06886p [Zygosaccharomyces rouxii]
          Length = 307

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 69/157 (43%), Gaps = 26/157 (16%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  + + +  +  G+FY A Q  ++I+ RYV ++ Y +A+DL+  GA   LK +Q   G 
Sbjct: 9   QRTLQRFQIKIEAGDFYEAHQTLRTIANRYVRSKSYQDAIDLITQGALSFLKADQGGSGT 68

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L    +E                   +Y+L  QI V     D+  +  L + L A    
Sbjct: 69  DLIFYLLE-------------------VYDL-AQIKV-----DETSVSRLVQLLVAVDAS 103

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELH-VMLAEYL 170
                  +     WSI++   K G P LH V+ ++Y+
Sbjct: 104 EPNLKDVITGMNNWSIKYSEFKFGDPSLHNVIGSKYI 140


>gi|241955575|ref|XP_002420508.1| uncharacterized protein [Candida dubliniensis CD36]
 gi|223643850|emb|CAX41587.1| uncharacterized protein [Candida dubliniensis CD36]
          Length = 327

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 62/129 (48%), Gaps = 17/129 (13%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           I + +  ++ G+FY A Q  ++I+ RYV A++Y EA DLL+ G+ + L N +    ++L 
Sbjct: 9   IQRFQAKIDSGSFYEAHQTLRTITNRYVKAKQYKEARDLLYQGSTILLNNKEYASASDLI 68

Query: 78  VSFVEALVKGKVAYDNENL-----DRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAK 132
              ++        YD E +     D   K+ +L   +P      +D  L++L ++     
Sbjct: 69  NYLIQ-------IYDEEGILVSDKDAKLKLIDLISNLP-----NNDPSLNDLAKSSLNWS 116

Query: 133 LRVEGCSSF 141
            +  GC  F
Sbjct: 117 KKSSGCEKF 125


>gi|401623566|gb|EJS41661.1| YOR164C [Saccharomyces arboricola H-6]
          Length = 312

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/223 (24%), Positives = 92/223 (41%), Gaps = 35/223 (15%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L+  GA   LK  Q   G +
Sbjct: 15  KTLQRFESKIKAGDYYEAHQTLRTIANRYVRSKSYEHAIELISQGALSFLKAKQGGSGTD 74

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  ++ ++ R VR I EL P  P   NL+D                 
Sbjct: 75  LIFYLLEVYDLAEVKVNDISVARLVRLIAELDPSEP---NLKD----------------- 114

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPE- 193
                  +     WSI+F   K G P LH  +   L       +  R   +F+ G +   
Sbjct: 115 ------IITGMNNWSIKFSEYKFGDPYLHNTIGTKLLEGGLVYEAER---YFMLGTHDSM 165

Query: 194 -KFASTIVNFMGKCYPGEDDLAV---ARAILMYLSLGNMKDAN 232
            K+   +  ++ +    ED       +R I  YL + N+  A+
Sbjct: 166 VKYVDLLWGWLCQVDELEDGTVAEFFSRLIFNYLFICNISFAH 208


>gi|156844483|ref|XP_001645304.1| hypothetical protein Kpol_1037p43 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156115964|gb|EDO17446.1| hypothetical protein Kpol_1037p43 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 309

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 72/316 (22%), Positives = 132/316 (41%), Gaps = 67/316 (21%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y +A+DL+  G+   LK  Q   G +
Sbjct: 13  KTLQRFESRIAAGDYYEAHQTLRTIANRYVRSKSYPDAIDLISQGSLSFLKVKQGGSGTD 72

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPH-NLEDDDDLHELTEALGAAKLR 134
           L    +E     +V  D+ ++ R   + +L   + V   NL+D                 
Sbjct: 73  LIFYLLEVYDIAEVQVDDVSVSR---LVQLLVNVDVNEPNLKD----------------- 112

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEK 194
                  + A   WS++F   K G P LH ++   L      +  A         ++ +K
Sbjct: 113 ------VIVAMNNWSVKFSDYKFGDPYLHNVIGSKLLERGEYVYEAERYLLLGTEDSVDK 166

Query: 195 FASTIVNFMGKCYPGEDDLA-----VARAILMYLSLGNM------KDA--NYIMDEVKKQ 241
           +   I N+  +    + DL       +R IL YL + N+      KD      +D+ + +
Sbjct: 167 YIELIWNWFNQ----DGDLQEVGSFFSRLILNYLFISNIAFAYKAKDVFLQSFVDKYQPK 222

Query: 242 VENKQLQLQPSDLIQFVFY-----------VLQTLERDALPLFNMLRANYKASIEREPA- 289
           V+N  ++     + QF  Y             QT  +D   LF  L+  Y     +EPA 
Sbjct: 223 VQN--IEKNGFKMFQFEEYPELNFLQLLLLTCQTKNKD---LFINLKTYY-----QEPAK 272

Query: 290 -FNEMLDDIAEKFFGV 304
            FN+ ++ + +++FG+
Sbjct: 273 KFNKEMEFLGQEYFGI 288


>gi|452821130|gb|EME28164.1| hypothetical protein Gasu_43300 [Galdieria sulphuraria]
          Length = 260

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 67/277 (24%), Positives = 111/277 (40%), Gaps = 33/277 (11%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           + + ++ +G+FY A+QM  ++  R   + +  EA DLL  GA    K  Q  C   L   
Sbjct: 8   RFDSLIEQGSFYEAEQMCTALYHRLQRSGKKQEAFDLLLYGARQLAKGEQFQCAFVLLDK 67

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
             E +   K  +  E+  +V  +         P++L+    L    E L + K     CS
Sbjct: 68  AQEYM--KKQYFSEESFSQVLNL----LGTTFPNHLKPKVTLLNKLEKLASEK----HCS 117

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
             +   +    +F A      E                D  R   ++   N+ + + S +
Sbjct: 118 EDIVHKLH---KFQAETCIGME----------------DYRRALSYYSFLNDTQLYTSIL 158

Query: 200 VNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVF 259
             +M +    E+DL V R IL  L+ G    A     EV K    K   L  S LI F++
Sbjct: 159 EQWMSEGLYSEEDLFVTRQILCLLAQGKRTMAK----EVFKLCLQKHDNLNSSPLIGFIY 214

Query: 260 YVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDD 296
            +L+ LER+       L A YK+  +R+    ++L +
Sbjct: 215 LLLECLERNLFSSLESLIAIYKSVFDRDANLWKLLHE 251


>gi|219114807|ref|XP_002178199.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409934|gb|EEC49864.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 386

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 75/325 (23%), Positives = 138/325 (42%), Gaps = 52/325 (16%)

Query: 4   QRPKRTALPPAQENID----KLEKIV------NEGNFYGAQQMYKSISARYVAAQRYSEA 53
           +R K+ A   + EN D    +LEKI+      +E   Y A Q+ +S   + V    ++ A
Sbjct: 7   RRRKQLAARASAENQDLVAQQLEKILAQEADMDEATAYEALQLAQSQVRKKVNRAEFASA 66

Query: 54  LDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPH 113
            DL ++ +   LK N+++  ++L    V+ L +         + R+ ++ E   Q     
Sbjct: 67  CDLAYSTSLNLLKKNRVSVASQLLALLVQVLRETHTEETETWIARLVELQEAHSQA---- 122

Query: 114 NLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYS- 172
            +E         EA    +L+ +    +L+A   WS + G  K G  +L  ML E  +  
Sbjct: 123 -MEASSGSMPDQEANRLHRLQCD----WLRACASWSSDLGTVKYGHNQLQQMLGEQCWKL 177

Query: 173 ---ESPELDMARVSF----HFVRGNNPEKFASTIVNFMG----------KCYPGEDDLAV 215
              E+ E ++  +      H V    P    + +                C P   D  +
Sbjct: 178 SLMETDEEEVMDLKCDAVQHMVCAEQPNMIVTWLETLPAPTDEETAQGHTCPPALRDALL 237

Query: 216 ARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ------------LQPSDLIQFVFYVLQ 263
            RA+L+  +L N++DAN ++    ++VEN+ ++              PS +I F   +L+
Sbjct: 238 TRALLLCCALENLRDANILIKAFIEKVENRDVKELSASYTNKEDGKAPSHVI-FGSMLLR 296

Query: 264 TLERDAL--PLFNMLRANYKASIER 286
             E+D+   PLF+ L  ++K  ++R
Sbjct: 297 VCEKDSRTGPLFSWLLRSFKRELDR 321


>gi|363750466|ref|XP_003645450.1| hypothetical protein Ecym_3129 [Eremothecium cymbalariae
           DBVPG#7215]
 gi|356889084|gb|AET38633.1| Hypothetical protein Ecym_3129 [Eremothecium cymbalariae
           DBVPG#7215]
          Length = 305

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 93/223 (41%), Gaps = 31/223 (13%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           P   + + +    V+ G++Y AQQ  ++++ RYV  +++ EA+D +  G    L+  Q+ 
Sbjct: 7   PKLDKIMTRFNAKVDGGHYYEAQQTLRTVANRYVRGKKWPEAIDFITHGIHSFLEAGQMV 66

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            GA+L +  +E         +  N+ R+        QI V  + E+            + 
Sbjct: 67  LGADLTMYLLEVYNMSGAKCNEGNVARL-------VQILVGLDAEEP-----------SL 108

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
           K+   G ++       WSI  G  K G P LH ++ + L       D  R   + + G  
Sbjct: 109 KVICTGMNN-------WSINNGTYKFGDPYLHCVIGKRLLEAGYVYDAER---YLLLGTQ 158

Query: 192 PEKFASTIVNFMGKCYPGEDDLA---VARAILMYLSLGNMKDA 231
                 + + +      G+D      ++R I  YLS+ N+K A
Sbjct: 159 DTMIKYSELMWDWYVQSGDDKAVGSFLSRLIFSYLSIYNIKYA 201


>gi|159474440|ref|XP_001695333.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158275816|gb|EDP01591.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 71

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 25/63 (39%), Positives = 39/63 (61%)

Query: 16 ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
          + + +++  V EG +Y AQQMYK+   R  A  + ++AL +L  GA  QL + Q+TCG E
Sbjct: 7  KTLARIKGTVAEGAYYEAQQMYKTAFHRSKARNQLADALQILQEGAITQLAHGQVTCGVE 66

Query: 76 LAV 78
          L +
Sbjct: 67 LGM 69


>gi|367005170|ref|XP_003687317.1| hypothetical protein TPHA_0J00600 [Tetrapisispora phaffii CBS 4417]
 gi|357525621|emb|CCE64883.1| hypothetical protein TPHA_0J00600 [Tetrapisispora phaffii CBS 4417]
          Length = 311

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 72/156 (46%), Gaps = 27/156 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ R+V A+ Y +A+DL+  GA   LK  Q   G +
Sbjct: 9   KTLQRFETRIAAGDYYEAHQTLRTIANRFVRAKAYGDAIDLITEGALSFLKVKQGGSGTD 68

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPH-NLEDDDDLHELTEALGAAKLR 134
           L    +E     ++  D+ ++ R+ +I      + V   N++D                 
Sbjct: 69  LIFYLLEVYDVAEIKADDTSVSRLVQI---LVNVDVSEPNIKD----------------V 109

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYL 170
           + G ++       WS++FG  K G P LH ++ + L
Sbjct: 110 IVGMNN-------WSVKFGDCKFGDPYLHSVMGKKL 138


>gi|281351386|gb|EFB26970.1| hypothetical protein PANDA_021491 [Ailuropoda melanoleuca]
          Length = 107

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 66/133 (49%), Gaps = 28/133 (21%)

Query: 40  ISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENLDRV 99
           +S+RY++  +++EA +L+ +GA L   + Q    A+L++  +EAL K +V   +E L+ +
Sbjct: 1   LSSRYMSQSKHTEARELMCSGALLFFSHGQQNSAADLSMLVLEALEKAEVEVADELLENL 60

Query: 100 RKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTG 158
            K++ L  P  P                       RV    +F+  A+KWS   G+ K G
Sbjct: 61  AKLFSLMDPNSPE----------------------RV----AFVSRALKWS-SGGSGKLG 93

Query: 159 SPELHVMLAEYLY 171
            P LH +LA  L+
Sbjct: 94  HPRLHQLLALTLW 106


>gi|156100555|ref|XP_001616005.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148804879|gb|EDL46278.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 277

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 56/294 (19%), Positives = 116/294 (39%), Gaps = 33/294 (11%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + +G  Y A Q+ KS+  R +   + S  + ++   +    K  Q     +L       L
Sbjct: 16  IEKGQHYEAHQLIKSLINRKLQKNQVSSCMSIIFYFSQKMAKAEQYVLLCDLMYQCCTIL 75

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
            +  + +  E   ++ +I+ L P    P++ E+                       F+  
Sbjct: 76  TQHNIKFSEEYAKKIIEIFNLCP----PNSTEEK--------------------YKFMNK 111

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG 204
            I WS        G  + H  +A Y Y E  +  +A    HF+   + E     I  +  
Sbjct: 112 CILWSTN-DDEIFGYLDFHKAIA-YAYFEDKKYSLAHN--HFIYLEDNEILYKIICLWQK 167

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQT 264
           + YP E D  + R  L  L L   + A  ++++ +  ++ K + L     +Q  + +  +
Sbjct: 168 EGYPSESDFFILRTTLCLLVLDKFEQALDLINKFEPNLDRKDVPLP----VQIAYLITCS 223

Query: 265 LERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIF 318
               + PL+  ++  Y+  +  +P F + +  + +  F  KR++ +  +F +IF
Sbjct: 224 CIYRSAPLYEDVKYKYRLILNYDPDFQKYIGILDDVLFN-KRKHDLFSMFQNIF 276


>gi|403417617|emb|CCM04317.1| predicted protein [Fibroporia radiculosa]
          Length = 350

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/242 (22%), Positives = 97/242 (40%), Gaps = 35/242 (14%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLT 71
           P +   +D +  ++  G  Y A Q  ++ ++RY  AQ+Y  A+D+L   A   LK  Q  
Sbjct: 7   PSSSRALDAVLPLIASGQAYEAHQKARTFASRYAKAQQYDTAIDVLFQSARELLKAGQQG 66

Query: 72  CGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAA 131
            G +LA   ++         + E+  RV +                   L  LT + GA 
Sbjct: 67  SGIDLASFMLDVYDSKGEEVNEESRGRVTQ-------------------LIALTGSSGAW 107

Query: 132 KLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG-- 189
           +       + +  A+ WS +  A   G P+LH  + E LY E   +  A    +   G  
Sbjct: 108 R------KTIIDKAVGWSGKHCANPAGDPDLHHYIGELLYKEG--VFEAAEPHYLTSGKR 159

Query: 190 NNPEKFASTIVNFMGKCYPGEDDLA--VARAILMYLSLGNMKDANYIMDEVKKQVENKQL 247
           ++    A   ++++G       +LA    R  + YL  GN+  A   + +   Q+ + + 
Sbjct: 160 DSARSLAQLYIDWLGD----RGNLAAFALRGTIPYLQNGNILAARTFVTQYVSQLTSTRP 215

Query: 248 QL 249
            L
Sbjct: 216 AL 217


>gi|146423343|ref|XP_001487601.1| hypothetical protein PGUG_00978 [Meyerozyma guilliermondii ATCC
           6260]
 gi|146388722|gb|EDK36880.1| hypothetical protein PGUG_00978 [Meyerozyma guilliermondii ATCC
           6260]
          Length = 310

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 65/153 (42%), Gaps = 22/153 (14%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  I +    ++ G++Y A Q  + I+ RYV A++YS+A+DLL+  A +    ++     
Sbjct: 6   QRTITRFRAKIDAGSYYEAHQTVRIITNRYVKAKQYSDAIDLLYQAASILAAKSEYASAT 65

Query: 75  ELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           +L +  +    +  +  D  + +   K+ EL  Q+P     + D  L +L          
Sbjct: 66  DLIIYLLAVYSEAGIICDAAHKEYKMKLIELVNQLP-----QTDPALADLA--------- 111

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLA 167
                   K +  WS E    K G  +LH +  
Sbjct: 112 --------KKSTNWSQENTENKFGDADLHHLFG 136


>gi|350581360|ref|XP_003481020.1| PREDICTED: hypothetical protein LOC100738092 [Sus scrofa]
          Length = 238

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 39/63 (61%), Gaps = 2/63 (3%)

Query: 242 VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKF 301
           +EN    +QP  L+ F++++L  L+   L +F +L   Y+ S+ R+P +NE LD I + F
Sbjct: 128 IENGPPFVQP--LLNFIWFLLLALDGGKLAVFTVLCEQYQPSLRRDPMYNEYLDRIGQLF 185

Query: 302 FGV 304
           FGV
Sbjct: 186 FGV 188


>gi|295669089|ref|XP_002795093.1| DUF410 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226285786|gb|EEH41352.1| DUF410 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 350

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 72/325 (22%), Positives = 120/325 (36%), Gaps = 63/325 (19%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA-CLQLKNNQLTCG 73
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A D+L  GA  L         G
Sbjct: 6   EKTIIRQQEKIAAGAYYESHQQLRVIAARYLKQFNYDAAADILAGGAKALLHAPGASASG 65

Query: 74  AELAVSFV---------EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHEL 124
            +LA+  +         EA+ +G+   D     R +++ EL  + P            E 
Sbjct: 66  GDLAIMLIVDVYNKAEWEAVPEGQG--DLVGRGRKKRLIELLREFPP----------EEP 113

Query: 125 TEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSF 184
           T               ++   + WS  FG  + G  E+H  +      E+   D  R   
Sbjct: 114 TR------------RRYINEMVAWSARFGELERGDREIHHEVGALFAQENEPYDAER--- 158

Query: 185 HFVRGNNPEKFASTIVNFMGKCY----PGEDDLAVARAILMYLSLGNMKDANYI------ 234
           H   G      A T+     + Y    P    +   RA++ YL  GN+++AN        
Sbjct: 159 HLAVGTTDS--AETLARLEYEWYTYDEPHTAGMYALRAVIPYLLTGNLRNANKAFLIFTS 216

Query: 235 -----MDEVKKQVEN-KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANY 280
                +D  K Q  N +Q+  Q SD+  +           +L  ++R +  LF  L   Y
Sbjct: 217 RLSSPVDPGKAQALNVQQVSSQSSDVNVYPSLPLLNFTSLLLLAIQRGSADLFRQLTRQY 276

Query: 281 KASIEREPAFNEMLDDIAEKFFGVK 305
                        L  I E +FG+K
Sbjct: 277 AHHTREVDGLERTLAYIGELYFGIK 301


>gi|389585469|dbj|GAB68200.1| hypothetical protein PCYB_127650 [Plasmodium cynomolgi strain B]
          Length = 277

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/297 (19%), Positives = 117/297 (39%), Gaps = 39/297 (13%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + +G  Y A Q+ KS+  R +   + S  + ++   +    K  Q     +L       L
Sbjct: 16  IEKGQHYEAHQLIKSLINRKLQKNQVSSCMSIIFYFSQKMAKAEQYVLLCDLMYQCCTIL 75

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
            +  + +  E   ++ +I+ L P    P++ E+                       F+  
Sbjct: 76  TEHNIKFSEEYAKKIIEIFNLCP----PNSTEEK--------------------YKFMNK 111

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG 204
            I WS        G  + H  +A Y Y E  +  +A    HF+   + E        +  
Sbjct: 112 CIMWSTN-DDEIFGYLDFHKAIA-YAYFEDKKYSLAHN--HFIYLEDNEILYKIFCLWQK 167

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQT 264
           + YP E +  + R  L  L L   + A  ++++ +  ++ K + L     IQ  + +  +
Sbjct: 168 EAYPSESNFFILRTTLCLLVLDKFEQALDLINKFEPNLDRKDVPLP----IQIAYLITCS 223

Query: 265 LERDALPLFNMLRANYKASIEREPAFNE---MLDDIAEKFFGVKRRNPMQGIFGDIF 318
               ++PL+  ++  Y+  +  +P F +   +LDD+       KR++ +  +F +IF
Sbjct: 224 CLYRSVPLYEDVKYKYRLILNYDPDFQKYIGILDDVVFN----KRKHDLFSMFQNIF 276


>gi|392576802|gb|EIW69932.1| hypothetical protein TREMEDRAFT_38601 [Tremella mesenterica DSM
           1558]
          Length = 345

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 39/145 (26%), Positives = 58/145 (40%), Gaps = 25/145 (17%)

Query: 29  NFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGK 88
           N Y A Q   + +AR V +   + ALD+L A A   LK  +   G EL V   + + +G 
Sbjct: 28  NPYSAHQKALTTTARLVHSDHVNVALDILFATARELLKLGEAASGVELGVRMTQIMTEGD 87

Query: 89  VAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKW 148
           VA D + L  + ++  L P                      +   R       + AAI+W
Sbjct: 88  VAVDQKTLASLTQLLALTPP---------------------SGPWR----KKLVDAAIRW 122

Query: 149 SIEFGAPKTGSPELHVMLAEYLYSE 173
           +   G   TG P LH  + E  Y +
Sbjct: 123 TQVKGTCPTGDPGLHQYVGEMNYKD 147


>gi|221059685|ref|XP_002260488.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
 gi|193810561|emb|CAQ41755.1| hypothetical protein, conserved in Plasmodium species [Plasmodium
           knowlesi strain H]
          Length = 277

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/294 (18%), Positives = 116/294 (39%), Gaps = 33/294 (11%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + +G  Y A Q+ KS+  R +   + +  + ++   +    K  Q     +L       L
Sbjct: 16  IEKGQHYEAHQLIKSLINRKLQKNQVNSCMSIIFYFSQKMAKAKQYVLLCDLMYQCCTIL 75

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
            +  + +  E   ++ +I+ L P    P++ E+                       F+  
Sbjct: 76  TQHNIEFSEEYAKKIIEIFNLCP----PNSTEEK--------------------YKFMNK 111

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFMG 204
            I WS        G  + H  +A Y Y E  +  +A    HF+  ++ E     I  +  
Sbjct: 112 CITWSTN-DDEIFGYLDFHKAIA-YAYFEDKKYSLAHN--HFIYLDDNEILYKIICLWQK 167

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQT 264
           + YP E    + R  L  L L   + A  ++++ +  ++ K + L     IQ  + +  +
Sbjct: 168 EAYPSESHFFILRTTLCLLVLDKFEQALDLINKFEPNLDRKDVPLP----IQIAYLITSS 223

Query: 265 LERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIF 318
               + PL+  ++  Y+  +  +P F + +  + +  F  KR++ +  +F ++F
Sbjct: 224 CIYKSAPLYEDVKYKYRLILNYDPDFQKYIGILDDVVFN-KRKHDLFSMFQNMF 276


>gi|366994210|ref|XP_003676869.1| hypothetical protein NCAS_0F00290 [Naumovozyma castellii CBS 4309]
 gi|342302737|emb|CCC70513.1| hypothetical protein NCAS_0F00290 [Naumovozyma castellii CBS 4309]
          Length = 316

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 73/157 (46%), Gaps = 27/157 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L++ GA   L   Q   G +
Sbjct: 13  KTLHRFETRIKAGDYYEAHQTLRTIANRYVNSKSYDHAIELIYQGAQSFLTAKQGGSGTD 72

Query: 76  LAVSFVEALVKGKVAYDNENLDR-VRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E     +   +++++ R +R +  L PQ P   N++D                 
Sbjct: 73  LIFYLLEVYELAQTEMNDDSMRRLIRLMILLDPQEP---NVKD----------------V 113

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLY 171
             G ++       W+I+ G+ K G P LH ++A  L+
Sbjct: 114 ATGMNN-------WTIKAGSYKFGDPWLHDIIALKLW 143


>gi|403222957|dbj|BAM41088.1| uncharacterized protein TOT_030000351 [Theileria orientalis strain
           Shintoku]
          Length = 302

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/222 (23%), Positives = 97/222 (43%), Gaps = 16/222 (7%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           + +++  + E  FY + Q   +   R    ++Y+E+L+ L+  A + L+NNQ     EL 
Sbjct: 9   LTRIDTEIKENKFYESMQHILTFVKRNALRKKYNESLETLYKYALVYLENNQYVLSGELM 68

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
             +V+    G VA   + +     ++E  P    PH+     D ++  E      L    
Sbjct: 69  DEYVKVAESGSVALTPQMVTNFVSVFET-PWKTSPHS----SDFNKSKEYQSTMSL---- 119

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAS 197
            + F+  AI +S     P  G+PELH  L  + Y +  +   A+   + V   + E    
Sbjct: 120 FTKFMNRAILYSKSVEFP-NGNPELHKSLGSF-YKQDKQYIKAQA--NLVYSQDVELLFE 175

Query: 198 TIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN---YIMD 236
           +I  +  +    + D    RA+LM L+  ++ +A    Y++D
Sbjct: 176 SIKEWKEEENRLDGDYYYLRAVLMLLASRDVLNAKCFVYMLD 217


>gi|313237149|emb|CBY12369.1| unnamed protein product [Oikopleura dioica]
          Length = 372

 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 70/319 (21%), Positives = 132/319 (41%), Gaps = 46/319 (14%)

Query: 19  DKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAV 78
           ++L   + +G FY A QM +++++RY  ++++ +A+ L+  GA    +  Q    ++L +
Sbjct: 10  ERLSSKIADGEFYEAHQMIRTLASRYRTSKKHQDAMKLITKGALQLYELKQCGSASDLGI 69

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            +               LD + KI E         N E    +H+ T  +   +      
Sbjct: 70  LYCTI------------LDDLEKITET--------NEEQISKIHKAT--IKYLEYEQPER 107

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAE-YLYSE-----SPELDMARVSFHFVR---- 188
             F +  +K + E      GSP+  +  A+ YL +      +  +D+  VS    R    
Sbjct: 108 EQFERRILKRTGEICGNLMGSPKTRLNFAQNYLEASKAARTAKAMDIFTVSISSARQHFQ 167

Query: 189 ----GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK---Q 241
               G    KF +    F  +    E +L   + +L  L    +  AN  +        Q
Sbjct: 168 YSENGREIGKFLAHFTEFSSRNM-SEIELICTQMVLQLLHQQTVSVANEFLQAFSTFHPQ 226

Query: 242 VENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASI-EREPAFNEMLDDIAEK 300
           +E++     P  L+ + F++L+T++   +  F  L   Y+  + + +P   E +  I E 
Sbjct: 227 LESRFPSNYP--LLNYCFFLLKTIDSKDVKAFKFLNFAYQPFLADPDPILMEYVVRIGEI 284

Query: 301 FFGV--KRRNPMQGIFGDI 317
           +FG+  K+  P QG F DI
Sbjct: 285 YFGIVEKKARP-QGGFMDI 302


>gi|124809295|ref|XP_001348539.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
 gi|23497435|gb|AAN36978.1| conserved Plasmodium protein, unknown function [Plasmodium
           falciparum 3D7]
          Length = 277

 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 64/297 (21%), Positives = 118/297 (39%), Gaps = 34/297 (11%)

Query: 22  EKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFV 81
           EKI N G  Y A Q+ KSI  R +   + S  + ++   +       Q     +L     
Sbjct: 14  EKIEN-GLHYEAHQLIKSIINRKLQKNQISNCISVIFYFSLKFATAEQYVLLCDLMYQCC 72

Query: 82  EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSF 141
             L +  + +   NL+   KI E+F + P P++ E+                       F
Sbjct: 73  LILTQHDIEF---NLEYANKIIEIFQKCP-PNSTEEK--------------------YKF 108

Query: 142 LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVN 201
           +  +I WS +      G  E H ++  Y Y E  +  +A    HF+  ++ +     I  
Sbjct: 109 MNKSILWS-KSEENVYGYLEFHKVIG-YAYFEDKQYSLAHN--HFIYLDDNDILYEIICL 164

Query: 202 FMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFVFYV 261
           +  + YP E    V R IL  L L     A  +++  +  +  + + L     IQ  + +
Sbjct: 165 WRKEAYPSESHFFVLRTILCLLVLDKFDQALNLINMFETNLNRQDVPLP----IQLAYLI 220

Query: 262 LQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIF 318
             +    +L L+  ++  Y+  +  +P F + ++ I    F  K+ N +  +F +IF
Sbjct: 221 TSSCIYKSLALYEDVKYKYRLILNYDPDFQKYINIIDAAVFNKKKHN-LFSMFQNIF 276


>gi|303322354|ref|XP_003071170.1| hypothetical protein CPC735_037310 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240110869|gb|EER29025.1| hypothetical protein CPC735_037310 [Coccidioides posadasii C735
           delta SOWgp]
 gi|320040641|gb|EFW22574.1| hypothetical protein CPSG_00473 [Coccidioides posadasii str.
           Silveira]
          Length = 345

 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 67/334 (20%), Positives = 131/334 (39%), Gaps = 58/334 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEA--------LDLLHAGACLQLKN 67
           + I + ++ +  G +Y + Q  + I+ARY+    Y  A        + LL AGA    + 
Sbjct: 7   KTIARQQEKIAAGAYYESHQQLRVIAARYLKQSNYDAAAELLAGGAIALLKAGA----QQ 62

Query: 68  NQLTCGAELAVSFVEAL---VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHEL 124
                G +LA+  V  +    + +++ D     R +++ ++  + P            E 
Sbjct: 63  GASASGGDLAIMLVVDVYNKAEWELSDDEVGNKRRQRLIDILQEFPP----------EEP 112

Query: 125 TEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSF 184
           T               ++   I WS +FG  +TG PE+H  +   +Y+E  E+  A    
Sbjct: 113 TR------------KRYINEMIAWSAKFGYIETGDPEMHHAVGS-IYAEENEVYDAERHL 159

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN 244
                ++ E  A     +          +  ARA+  YL  G+ ++AN      + Q+ +
Sbjct: 160 ALGTADSAEILARLEYTWYTHDAVHTAGIYAARAVFPYLLTGSFRNANKAFLIFRNQLSS 219

Query: 245 ---------KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIERE 287
                    +++  Q +D+  +           +L  ++R     F  L  +Y   I+  
Sbjct: 220 SPTSHALNVQEVSSQSNDVRVYPSLPLLNFTSLLLLAIQRGNADSFKQLLRHYATYIQEA 279

Query: 288 PAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
             +++ L  I E +FG+   K+ NP+  + G  F
Sbjct: 280 GEWDQALAHIGEVYFGIRIPKQSNPLMDMMGMFF 313


>gi|328854284|gb|EGG03417.1| hypothetical protein MELLADRAFT_117311 [Melampsora larici-populina
           98AG31]
          Length = 369

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 73/363 (20%), Positives = 140/363 (38%), Gaps = 90/363 (24%)

Query: 12  PPAQENIDKLEKIVNEGNFYGAQQMYKSISARYVAA---QRYSEALDLLHAGACLQLKNN 68
           P     +  +  +V++  +Y A Q Y++ +AR + +   Q   +A+ LL  G+ L L+ +
Sbjct: 4   PSLARTLKSILNLVDQEEWYAAHQKYRTSAARLLKSNEPQSIQDAISLLFEGSKLLLQRH 63

Query: 69  QLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEAL 128
           QL  G +L +  +  +              V + Y        P+  E+   L  L   +
Sbjct: 64  QLGSGTDLGLMLIRDVY-------------VSRSY--------PYTEEERHKLLNLIALI 102

Query: 129 GAAKLRVEGCSSFLKAAIKWSIEFGAPK----TGSPELHVMLAEYLYSESPELDMARVSF 184
           GA         ++ K  I  SI +        +G P LH  + E +Y++    D+A +  
Sbjct: 103 GAKD-------NWRKMIIDSSISYSTTSPDFPSGDPALHFSIGE-IYAKEKRWDLAEI-- 152

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLA-------VARAILMYLSLGNMKDA------ 231
           H +   + +  A ++ N +      E DL          R ++ YL +G++  A      
Sbjct: 153 HLLSAGDQDS-AKSLSNVLWSWSILEKDLQEKSLGRYAGRGVIGYLEIGSITSAKSFLED 211

Query: 232 ----------NYIMDEVKKQVENKQLQ------LQPSDLIQFVFYVLQTLERDALP---- 271
                     N ++D ++ ++++   Q      L   + +Q +    Q +   +LP    
Sbjct: 212 FLNKSIKRFPNLLVDSIEIKLKSDNFQPIQIFSLPSLNFLQLLILTCQ-IGPGSLPSGLP 270

Query: 272 -------------LFNMLRANYKASIEREPAFN---EMLDDIAEKFFGVKRRNPMQGIFG 315
                        +F  +   Y + +E+    N   E LD IAE +FG+K +     I  
Sbjct: 271 APIGHTGPTIGKAVFQAMVGRY-SRVEKWIGSNQIKESLDLIAEMYFGIKPQRQSGNILN 329

Query: 316 DIF 318
           D+ 
Sbjct: 330 DLM 332


>gi|50424787|ref|XP_460983.1| DEHA2F14234p [Debaryomyces hansenii CBS767]
 gi|49656652|emb|CAG89341.1| DEHA2F14234p [Debaryomyces hansenii CBS767]
          Length = 325

 Score = 47.4 bits (111), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 48/98 (48%), Gaps = 2/98 (2%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGA 74
           Q  I + +  +  G +Y A Q  ++I+ RYV + +Y++A+DLL+ G+ +   N +    +
Sbjct: 7   QRTILRFKSKIESGAYYEAHQTLRTITNRYVKSNQYNDAIDLLYQGSSILSSNKEYASAS 66

Query: 75  ELAVSFVEALVKGKVAYDNE--NLDRVRKIYELFPQIP 110
           +L    +    +      NE  N D   K+ EL   +P
Sbjct: 67  DLISYLITVYEESDTKCTNEGSNKDYKLKLIELISLLP 104


>gi|209877266|ref|XP_002140075.1| hypothetical protein [Cryptosporidium muris RN66]
 gi|209555681|gb|EEA05726.1| hypothetical protein, conserved [Cryptosporidium muris RN66]
          Length = 298

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 43/199 (21%), Positives = 83/199 (41%), Gaps = 22/199 (11%)

Query: 139 SSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
           S  L  A+ W  +      G   LH   +   Y E       +   H +  ++ + +   
Sbjct: 96  SKLLNEALLWC-KVSNMNEGEDILHKWASNIYYKEGI---YHKAQGHVIFIDDADLYVDI 151

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDE-------------VKKQVENK 245
           ++ +    Y  E +L V RA+L+ LSL  +  A  ++D+             +++   NK
Sbjct: 152 LLKWQEMGYKSEKELFVLRAVLILLSLPKLNYARELLDKYCLINGYLININNLEQNNINK 211

Query: 246 QLQLQPSDLIQFVFYVLQTLERD----ALPLFNMLRANYKASIEREPAFNEMLDDIAEKF 301
            +  +P   IQ  F ++ T E       L   ++++  Y   + R+P FN++++ I +  
Sbjct: 212 IINKKPPSPIQLAFLIVSTCELSDKYLGLDFMDIIKRKYALILRRDPLFNKLIEKIEQNI 271

Query: 302 FGVKRRNPMQGIFGDIFKM 320
            G K  N  +GI  ++  M
Sbjct: 272 LG-KVSNQTRGILTNLLSM 289


>gi|392594269|gb|EIW83593.1| cytoplasmic protein [Coniophora puteana RWD-64-598 SS2]
          Length = 351

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 40/167 (23%), Positives = 72/167 (43%), Gaps = 29/167 (17%)

Query: 11  LPPAQEN----IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLK 66
           +PPA  +    ++ +  +V  G  Y A Q  ++ +ARYV + +++ A+D+L   A    K
Sbjct: 1   MPPASTSSTRALNAILPLVAAGQPYEAHQKARTFAARYVKSGQHATAIDVLFQSARELFK 60

Query: 67  NNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTE 126
             Q   G +L V  ++    G+V  D+E+  R+ +                   L  LT 
Sbjct: 61  IGQQGSGTDLTVFLLDTYETGQVGVDDESRGRLTQ-------------------LIALTG 101

Query: 127 ALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSE 173
           + G+ + +V      +  +I WS + G    G  +L     E LY +
Sbjct: 102 SSGSWRKQV------IDRSIAWSAKAGPCPAGDQDLQHYAGEVLYKD 142


>gi|17553188|ref|NP_498034.1| Protein CEE-1 [Caenorhabditis elegans]
 gi|351058306|emb|CCD65749.1| Protein CEE-1 [Caenorhabditis elegans]
          Length = 362

 Score = 46.6 bits (109), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 63/305 (20%), Positives = 122/305 (40%), Gaps = 31/305 (10%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSE-ALDLLHAGACLQLKNNQLTCGAE 75
            + +L +   E  FY A Q Y++   R    +  +E A+ L+        +  Q  C  +
Sbjct: 5   TLSRLSEFEKEKKFYDALQFYRTKVTRSFRLKNANEVAIPLVKHALEFFFREKQYQCAID 64

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           +   + E L K  V    E  + +      F        +E++    E  + L  A+ + 
Sbjct: 65  ITTQYAECLSKNNVDLGEEAFNILADSVSKFANFA---EIENEAGNSEPEQLLNTARCKC 121

Query: 136 EGCSSFLKAAIKWSIEFGAP----KTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNN 191
                 +  AI+W+ +        K GS   H +LA+ L +    +++A+   HF+  ++
Sbjct: 122 ------VDLAIQWTKQKATTPIEKKYGSAAFHTVLAKKLVA-VDHVELAKN--HFLLSDD 172

Query: 192 PEKFASTIVNFMGKCYPG------EDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENK 245
            + FA    NF+ + Y        E ++ +  A+   L L     A  +  E       K
Sbjct: 173 FKSFA----NFLHQEYESFKQKESEPEVIIVEAVFQSLCLDRFPFAVSLFTEYVAPA--K 226

Query: 246 QLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVK 305
               +P  L+ F   +   +E +  P ++ L ++Y+  ++R  AF   L  I + +FG++
Sbjct: 227 YPFTKP--LLNFQHILFDVIETENQPQYSELTSSYQTELKRSLAFIGYLTRIGKLYFGIR 284

Query: 306 RRNPM 310
               M
Sbjct: 285 DNQCM 289


>gi|428671768|gb|EKX72683.1| conserved hypothetical protein [Babesia equi]
          Length = 302

 Score = 46.2 bits (108), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 56/287 (19%), Positives = 118/287 (41%), Gaps = 21/287 (7%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           +++  ++    Y + Q   S+  R +   +Y E+L++L+  A + ++  +     EL   
Sbjct: 11  RIDSEISGNKCYESMQHILSLVKRNLLRNKYKESLEILYNYASIFIEKKEFILAGELMDE 70

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDD---DLHELTEALGAAKLRVE 136
           + +   KGK+  + + ++ +  I+E  P      +   +D   D + +T  L        
Sbjct: 71  YAKVADKGKLFCNIKVINNIISIFE-SPWTSAAFSSTSNDQKMDDYSVTMLLYI------ 123

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
               F+  AI +S     P  G+P  H  +  Y   E    D  +   + V   + E   
Sbjct: 124 ---KFMNRAISFSKSETHPH-GNPLFHKSIGRYYIHEK---DYVKAQGNLVYSQDVELLL 176

Query: 197 STIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQ 256
           S I  +       E D    RAILM ++ G++ +A   +  +   +E+ ++ L     IQ
Sbjct: 177 SMINEWKKHASEDEHDFFYLRAILMLVACGDICNAKCFIYLLDVNLEDPEVPLP----IQ 232

Query: 257 FVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFG 303
               + +  +     LF  +   Y+  ++ +P +  ++  I + +FG
Sbjct: 233 LAHLLTEACDPPDEKLFEKICHVYQPILDLDPEYKALVKTIRDSYFG 279


>gi|119196501|ref|XP_001248854.1| hypothetical protein CIMG_02625 [Coccidioides immitis RS]
 gi|392861945|gb|EJB10431.1| hypothetical protein CIMG_02625 [Coccidioides immitis RS]
          Length = 345

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 67/334 (20%), Positives = 130/334 (38%), Gaps = 58/334 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEA--------LDLLHAGACLQLKN 67
           + I + ++ +  G +Y + Q  + I+ARY+    Y  A        + LL AGA    + 
Sbjct: 7   KTIARQQEKIAAGAYYESHQQLRVIAARYLKQSNYDAAAELLAGGAIALLKAGA----QQ 62

Query: 68  NQLTCGAELAVSFVEAL---VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHEL 124
                G +LA+  V  +    + +++ D     R +++ ++  + P            E 
Sbjct: 63  GASASGGDLAIMLVVDVYNKAEWELSDDEVGNKRRQRLIDILQEFPP----------EEP 112

Query: 125 TEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSF 184
           T               ++   I WS +FG  +TG PE+H  +   +Y+E  E+  A    
Sbjct: 113 TR------------KRYINEMIAWSAKFGDIETGDPEMHHAVGS-IYAEENEVYDAERHL 159

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVEN 244
                 + E  A     +          +  ARA+  YL  G+ ++AN      + Q+ +
Sbjct: 160 ALGTAVSAEILARLEYTWYTHDAVHTAGIYAARAVFPYLLTGSFRNANKAFLIFRNQLSS 219

Query: 245 ---------KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIERE 287
                    +++  Q +D+  +           +L  ++R     F  L  +Y   I+  
Sbjct: 220 SPTSHALNVQEVSSQSNDVRVYPSLPLLNFTGLLLLAIQRGNADSFKQLLRHYATYIQEA 279

Query: 288 PAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
             +++ L  I E +FG+   K+ NP+  + G  F
Sbjct: 280 GEWDQALAHIGEVYFGIRIPKQSNPLMDMMGMFF 313


>gi|167538216|ref|XP_001750773.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163770690|gb|EDQ84372.1| predicted protein [Monosiga brevicollis MX1]
          Length = 337

 Score = 45.8 bits (107), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/52 (46%), Positives = 31/52 (59%)

Query: 251 PSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
           P+ L +FV  VL TL RDAL LF  LR  Y   +E+  A N +LD IA  ++
Sbjct: 256 PTPLAEFVDRVLATLTRDALSLFEDLRVAYADVLEQHAATNPLLDRIAAYYY 307


>gi|226294694|gb|EEH50114.1| DUF410 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 350

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 74/339 (21%), Positives = 128/339 (37%), Gaps = 62/339 (18%)

Query: 15  QENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA-CLQLKNNQLTCG 73
           ++ I + ++ +  G +Y + Q  + I+ARY+    Y  A D+L  GA  L         G
Sbjct: 6   EKTIIRQQEKIAAGAYYESHQQLRVIAARYLKQLNYDAAADILAGGAKALLHAPGASASG 65

Query: 74  AELAVSFV---------EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHEL 124
            +LA+  +         EA+ +G+   D     R +++ EL  + P            E 
Sbjct: 66  GDLAIMLIVDVYNKAEWEAVPEGQG--DLVGRGRKKRLIELLHEFPP----------EEP 113

Query: 125 TEALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSF 184
           T               ++   I WS  FG  + G  E+H  +      E+   D  R   
Sbjct: 114 TR------------RRYINEMIAWSARFGELERGDREIHHEVGALFAQENEPYDAER--- 158

Query: 185 HFVRG--NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIM----- 235
           H   G  ++ E  A     +     P    +   RA++ YL  GN+++AN  +I+     
Sbjct: 159 HLAVGTTDSAEILARLEYEWYTYDEPHTAGIYALRAVIPYLLTGNLRNANKAFIIFTSRL 218

Query: 236 ----DEVKKQVEN-KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKA 282
               D  K Q  N +Q+  Q SD+  +           +L  ++  +  LF  L   Y  
Sbjct: 219 SSPADPGKAQALNAQQVSSQSSDVNVYPSLPLLNFTSLLLLAIQSGSADLFRQLTRQYAH 278

Query: 283 SIEREPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
                  +      I E +FG+   K+ N +  + G +F
Sbjct: 279 HTREVDGWERASAYIGELYFGIKIPKQSNALMDMMGMMF 317


>gi|225678591|gb|EEH16875.1| DUF410 domain-containing protein [Paracoccidioides brasiliensis
           Pb03]
          Length = 350

 Score = 45.8 bits (107), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 77/336 (22%), Positives = 126/336 (37%), Gaps = 63/336 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGA-CLQLKNNQLTCGAEL 76
           I + EKI   G +Y + Q  + I+ARY+    Y  A D+L  GA  L         G +L
Sbjct: 10  IRQQEKIA-AGAYYESHQQLRVIAARYLKQLNYDAAADILAGGAKALLHAPGASASGGDL 68

Query: 77  AVSFV---------EALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
           A+  +         EA+ +G+   D     R +++ EL  + P            E T  
Sbjct: 69  AIMLIVDVYNKAEWEAVPEGQG--DLVGRGRKKRLIELLHEFPP----------EEPTR- 115

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
                        ++   I WS  FG  + G  E+H  +      E+   D  R   H  
Sbjct: 116 -----------RRYINEMIAWSARFGELERGDREIHHEVGALFAQENEPYDAER---HLA 161

Query: 188 RG--NNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDAN--YIM-------- 235
            G  ++ E  A     +     P    +   RA++ YL  GN+++AN  +I+        
Sbjct: 162 VGTTDSAEILARLEYEWYTYDEPHTAGIYALRAVIPYLLTGNLRNANKAFIIFTSRLSSP 221

Query: 236 -DEVKKQVEN-KQLQLQPSDLIQF--------VFYVLQTLERDALPLFNMLRANYKASIE 285
            D  K Q  N +Q+  Q SD+  +           +L  ++  +  LF  L   Y     
Sbjct: 222 ADPGKAQALNAQQVSSQSSDVNVYPSLPLLNFTSLLLLAIQSGSADLFRQLTRQYAHHTR 281

Query: 286 REPAFNEMLDDIAEKFFGV---KRRNPMQGIFGDIF 318
               +      I E +FG+   K+ N +  + G +F
Sbjct: 282 EVDGWERASAYIGELYFGIKIPKQSNALMDMMGMMF 317


>gi|85001071|ref|XP_955254.1| hypothetical protein [Theileria annulata strain Ankara]
 gi|65303400|emb|CAI75778.1| hypothetical protein, conserved [Theileria annulata]
          Length = 321

 Score = 45.1 bits (105), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 57/293 (19%), Positives = 123/293 (41%), Gaps = 16/293 (5%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           +++  + E  FY + Q   ++  R    ++Y E+L++L+  +   L   +     EL   
Sbjct: 10  RIDSEIKEKKFYESMQHILTLIKRLAMRKKYEESLEILYKYSLTYLDEKEYILAGELMDE 69

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
           + +      +  ++  L+++  +++ FP   +P N  ++ +     +     ++ V    
Sbjct: 70  YTKICESYNIPLNSNILNQLITVFQ-FPYSHIPLNTTNNTNTANTIDKSKLEEIMVLYV- 127

Query: 140 SFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTI 199
            ++  AI +S   G P  GSP LH +L EY Y    + + A+   + +   + E     I
Sbjct: 128 KYMNRAILYSKSEGEP-NGSPALHRVLGEY-YFHVKQYNKAQS--NLIYSQDVELLLKVI 183

Query: 200 VNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ----------L 249
             +  +    + D    RA+LM L+ G++ +A   +  +   +E+ ++            
Sbjct: 184 NEWKAESDEDDGDFFYLRAVLMLLAAGDVCNAKCFLYLLDYDLEDAEVTGYSGGFPLLPS 243

Query: 250 QPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFF 302
                IQF   + ++ E     LF      YK  IE +  + ++L+ I   +F
Sbjct: 244 LLPLPIQFGNLLSESCENGNFELFKDACETYKDLIELDKEYTQLLNRIKSNYF 296


>gi|302684013|ref|XP_003031687.1| hypothetical protein SCHCODRAFT_67905 [Schizophyllum commune H4-8]
 gi|300105380|gb|EFI96784.1| hypothetical protein SCHCODRAFT_67905 [Schizophyllum commune H4-8]
          Length = 336

 Score = 44.7 bits (104), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 60/236 (25%), Positives = 93/236 (39%), Gaps = 39/236 (16%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL-VKGKV 89
           Y A Q  ++ +ARYV + +   A+D+L   A   LK  Q   G +LA   +E    KG+ 
Sbjct: 20  YEAHQKARTFAARYVKSNQLDTAIDVLFQSARELLKTGQPGSGVDLAGFMLEVYETKGEK 79

Query: 90  AYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWS 149
             D E+  RV +                   L  LT + G  +       S +  A+ WS
Sbjct: 80  VTD-ESRGRVTQ-------------------LIALTGSEGTWR------KSLIDKAVAWS 113

Query: 150 IEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRG---NNPEKFASTIVNFMGKC 206
            + GA  +G P+LH  +   L  E    D+A    H +     ++    A + + +M   
Sbjct: 114 AKHGAYPSGDPDLHHYIGSLLVKEG-AYDVAE--HHLLAAGLRDSARLLAESFIRWMEAG 170

Query: 207 YPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQL----QLQPSDLIQFV 258
             GE      +  L YL  GN+  A   +     Q+   +L     LQ  D +Q V
Sbjct: 171 --GEAGNYALKGTLPYLLNGNILAARTFITTFVAQLPQSELVSAPPLQAGDSVQLV 224


>gi|403353649|gb|EJY76367.1| hypothetical protein OXYTRI_02126 [Oxytricha trifallax]
          Length = 283

 Score = 44.3 bits (103), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 44/185 (23%), Positives = 83/185 (44%), Gaps = 14/185 (7%)

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFA 196
           G  S ++  I+  IEF   +    E   +LA+   SE      +    +F++  + ++  
Sbjct: 106 GDHSRMENLIERVIEF--TRNEKKEFSRLLAQRYKSEKK---YSLAYSYFLKSRDTDEVI 160

Query: 197 STIVNFMGKCYPGEDDLAVARAILMYLS-LGNMKDANYIMDEVKKQVENKQLQLQPSDLI 255
             +   M   Y  E DL +AR  L +++     +DA     +VK+     Q Q+ P  ++
Sbjct: 161 ECLEEVMKHGYKNEMDLFIARIQLDFIARYEGFEDAK----KVKRHFIKSQEQMTP--IL 214

Query: 256 QFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFG--VKRRNPMQGI 313
            F   + + ++     LF  ++  Y   + R+P F E LD I+ K FG  +K+ + +  +
Sbjct: 215 NFTEMLPELIQVGDFGLFKEIKEKYDQELNRDPTFIEYLDKISVKHFGQKLKQESGLAKL 274

Query: 314 FGDIF 318
            G +F
Sbjct: 275 LGSLF 279


>gi|71027887|ref|XP_763587.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350540|gb|EAN31304.1| hypothetical protein TP03_0559 [Theileria parva]
          Length = 324

 Score = 43.9 bits (102), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 53/306 (17%), Positives = 128/306 (41%), Gaps = 28/306 (9%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVS 79
           +++  + E  FY + Q   ++  R    ++Y E+L++L+  + + L   +     EL   
Sbjct: 10  RIDSEIKEKKFYESMQHILTLIKRLAMRKKYDESLEILYKYSLIYLDEKEYILAGELMDE 69

Query: 80  FVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCS 139
           + +      +  +   ++++  +++ FP   + ++  ++   + +     + K   E  +
Sbjct: 70  YTKLCETYNIPLNTNIVNQLVTVFQ-FPYSHITYSNTNNSGANNVDTNTASPKSTNENTN 128

Query: 140 S----------------FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVS 183
           +                ++  AI +S     P  G+P LH +L  Y +  +   + ++  
Sbjct: 129 NIIDKNKLEEIMLLYVKYMNRAILYSKSVEEP-NGNPILHRVLGVYYFHVN---EYSKAQ 184

Query: 184 FHFVRGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVE 243
            + +   + E     +  +  +    + D    RA+LM L++G++ +A   +  +   +E
Sbjct: 185 GNLIYSQDVELLLKVVNEWKAESDEDDGDFFYLRAVLMLLAVGDVCNAKCFLYLLDYNLE 244

Query: 244 NKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFG 303
           + +L +     IQF   + ++ E     LF      YK  IE +  +  +L+ I E +F 
Sbjct: 245 DHELPIP----IQFGNVLSESCEIRNFELFKEACETYKDLIELDSEYPHLLNRIKEIYF- 299

Query: 304 VKRRNP 309
             + NP
Sbjct: 300 --KDNP 303


>gi|390594549|gb|EIN03959.1| hypothetical protein PUNSTDRAFT_108788 [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 356

 Score = 43.9 bits (102), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 56/258 (21%), Positives = 99/258 (38%), Gaps = 61/258 (23%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++ +  ++  G  Y A Q  ++ ++RY  ++++  A+D+L   A   LK  Q   G +LA
Sbjct: 11  LNAILPLITAGQPYEAHQKARTFASRYQKSRQFDTAIDVLFQSARELLKAGQPGSGTDLA 70

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
              +E           E+  R+ +                   L  LT + G+ +     
Sbjct: 71  SFLLEVYENKGETVTEESRGRLTQ-------------------LIALTGSNGSWR----- 106

Query: 138 CSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSES------PEL------DMARV-SF 184
             + +  A+ WS + G    G P+L   + E LY E       P L      D ARV + 
Sbjct: 107 -KTLIDKAVAWSAKHGPCPAGDPDLQHYVGELLYKEGAFEAAEPHLLAAGKRDSARVLAQ 165

Query: 185 HFV----RGNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKK 240
            FV     G +P  FA                    R +L YL  GN+  A   ++   +
Sbjct: 166 MFVDWSAAGGSPGTFA-------------------LRGVLPYLLNGNILAARTFINSFTR 206

Query: 241 QVENKQLQLQPSDLIQFV 258
           Q+ + +  L  S++ + +
Sbjct: 207 QIISSKPSLASSNVSETI 224


>gi|268574472|ref|XP_002642213.1| Hypothetical protein CBG18187 [Caenorhabditis briggsae]
          Length = 362

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 62/302 (20%), Positives = 118/302 (39%), Gaps = 25/302 (8%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSE-ALDLLHAGACLQLKNNQLTCGAE 75
            + +LE+      FY A Q Y++   R    +  +E AL L+        +  Q  C  +
Sbjct: 5   TLSRLEEFEKTNKFYDALQFYRTKVTRSFRLKSANEVALALVKHALEFFFREKQYQCAID 64

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRV 135
           +   + E L K         L+ +      F        +E +    +  + L  A+ + 
Sbjct: 65  ITTQYAECLSKNNADLATPTLELLADSVAKFANFA---EIEKEAGNTQQEQLLSTARCKC 121

Query: 136 EGCSSFLKAAIKW----SIEFGAPKTGSPELHVMLAE-YLYSESPELDMARVSFHFVRGN 190
                 +  AI+W    S      K GS   H +LA+ +L  +  EL       HF+  +
Sbjct: 122 ------VDLAIQWTKQKSTSADEKKYGSAAFHTILAKKFLAVDHFEL----AKNHFLLSD 171

Query: 191 NPEKFASTIVNFMGKCYP--GEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQ 248
           + + F+  +   + K      E DL +   +L  L L     A  + ++  K    K   
Sbjct: 172 DSKSFSQFLKEKLVKELNRYQESDLIIVETVLQILCLDRFPFAVVVFNDFVK--PEKYPF 229

Query: 249 LQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRN 308
            +P  ++ F   +   +E +  P ++ L A+Y++ ++R  +F   L  I + +FG++   
Sbjct: 230 SKP--ILNFQHILFDVIETENQPQYSELTASYQSELKRSYSFVGYLTRIGKMYFGIRDNQ 287

Query: 309 PM 310
            M
Sbjct: 288 CM 289


>gi|397602274|gb|EJK58132.1| hypothetical protein THAOC_21763 [Thalassiosira oceanica]
          Length = 471

 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 68/304 (22%), Positives = 125/304 (41%), Gaps = 62/304 (20%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGA-CLQLKNNQLTCGAELAVSFVEALVKGKV 89
           Y A Q+ +S   R V    ++ A+D+ +  +  L  K+ ++   ++L    V+ L +   
Sbjct: 87  YDALQLAQSAVRRNVKIGNFAGAVDVAYETSLALLTKSGRVAVSSQLLAELVKVLGETHT 146

Query: 90  AYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWS 149
           A D E + R  ++ E +        L+ D ++    E     +L ++    FLK  +KWS
Sbjct: 147 ACDAEWVARFTRLDEAYRAA-----LDGDGNMSP-EERGRLQRLHLQ----FLKKGLKWS 196

Query: 150 IEFGAPKTGSPELHVMLAEYLYS----------ESP------ELDMARVSFHFVRGNNPE 193
            + G+ + G   +H +L ++ ++          E P      E     V     RG+   
Sbjct: 197 NDLGSVRYGHQGMHSLLGDHCWNMSCDASVVDKEKPLGGDDDESTAEEVEIEL-RGDAVG 255

Query: 194 KF--ASTIVNFMGK-----------------CYPGEDDLAVARAILMYLSLGNMKDA--- 231
            +  A  I   +GK                 C P + D  + RA+L+ L++ N+  A   
Sbjct: 256 HYAMAERIDAILGKLRGLPSPTDEETAAGHACPPAQRDELLTRAVLVLLAVENLSGARSL 315

Query: 232 --NYIMD-------EVKKQVENKQLQLQPSDLIQFVFYVLQTLERDA--LPLFNMLRANY 280
             +++ D       E+KK   +K     PS  + F   +++  E+DA   PLF  L  N+
Sbjct: 316 AGSFLSDVSERPPAELKKSYLDKTDGRSPSHAM-FCSMLVRICEKDAKTAPLFQWLVKNF 374

Query: 281 KASI 284
            A +
Sbjct: 375 GAEL 378


>gi|68076687|ref|XP_680263.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56501172|emb|CAH99757.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 297

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 63/297 (21%), Positives = 123/297 (41%), Gaps = 39/297 (13%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN-NQLTCGAELAVSFVEA 83
           + +G +Y A Q+ KSI  R +   + +  L+++   + L+  N  Q     +L       
Sbjct: 35  IGKGLYYEAHQLIKSIINRKLQKDQLASCLNIIFYFS-LKFANVKQYVLLCDLMYQCCVI 93

Query: 84  LVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLK 143
           L +  + +D   L    KI E+F + P P++ E+                       F+ 
Sbjct: 94  LTENNIEFD---LYYADKIIEIFKKCP-PNSTEEK--------------------YKFMN 129

Query: 144 AAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFASTIVNFM 203
             I WS +      G  E H  +    Y E  +  +A    H++  ++ E   + I  + 
Sbjct: 130 KCILWSTD-DDHIFGYLEFHRAMGN-SYFEDKKYSLAHN--HYIYLDDNEMLYNIISFWR 185

Query: 204 GKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQ--LQLQPSDLIQFVFYV 261
            + YP E +  + R  L  L L  ++ A  +++  +K ++ K   L +Q + LI      
Sbjct: 186 KEAYPSEYNFFILRTTLCLLVLYKVEQALDLIENFEKNLDKKDAPLPVQVAYLI-----T 240

Query: 262 LQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGIFGDIF 318
              L ++ L L+  ++  Y+  +  +P F + +  I +  F  K++N    IF ++F
Sbjct: 241 CSCLYKNEL-LYEDIKYKYRLILNYDPKFQKYIQMIDDDIFK-KKKNNFLSIFQNMF 295


>gi|308487600|ref|XP_003105995.1| CRE-CEE-1 protein [Caenorhabditis remanei]
 gi|308254569|gb|EFO98521.1| CRE-CEE-1 protein [Caenorhabditis remanei]
          Length = 377

 Score = 43.5 bits (101), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 64/313 (20%), Positives = 118/313 (37%), Gaps = 38/313 (12%)

Query: 20  KLEKIVNEGNFYGAQQMYKSISARYVAAQRYSE-ALDLLHAGACLQLKNNQLTCGAELAV 78
           +LE+      +Y A Q Y++   R    +  +E AL L+        +  Q  C  ++A 
Sbjct: 8   RLEEFEKNKKYYDALQFYRTKVTRSFRLKSANEVALALVKHALDFFFREKQYQCAIDIAT 67

Query: 79  SFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGC 138
            + E L K  V  +    + + +    F       N+  +    +L   L  A+ R    
Sbjct: 68  QYAECLSKNDVELEASTFELLAESVAKFANFAEIENVAGNIQQEQL---LSTARCRC--- 121

Query: 139 SSFLKAAIKWSIEFGAP-------------------KTGSPELHVMLAEYLYSESPELDM 179
              +  AI+W     +                    K GS   H +LA+ L +     D+
Sbjct: 122 ---VDLAIQWYFSLLSSYEHFCDFRTKQKSTNQYEKKYGSAAFHALLAKKLVA-VDHFDL 177

Query: 180 ARVSFHFVRGNNPEKFASTIVNFMGKCY--PGEDDLAVARAILMYLSLGNMKDANYIMDE 237
           A+   HF+  ++ + FA  + N   K +    E D+ +   +L  + L     A  + +E
Sbjct: 178 AKN--HFLLSDDSKSFALFLHNDFEKAHNKEAESDIIIVETVLQVICLDRFPFAVALFNE 235

Query: 238 VKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDI 297
             K   NK    +P  L+ F   +   +E +     + L  +Y+  ++R  A    L  I
Sbjct: 236 YVK--PNKYPFAKP--LLNFQHILFDVIETENQQQLSELTTSYQTELKRSYALIGYLTKI 291

Query: 298 AEKFFGVKRRNPM 310
            + +FG++    M
Sbjct: 292 GKLYFGLRDSQCM 304


>gi|71006032|ref|XP_757682.1| hypothetical protein UM01535.1 [Ustilago maydis 521]
 gi|46097357|gb|EAK82590.1| hypothetical protein UM01535.1 [Ustilago maydis 521]
          Length = 392

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 43/158 (27%), Positives = 70/158 (44%), Gaps = 31/158 (19%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL-VKGKV 89
           Y   Q  ++ + R V  ++Y EA+  LH GA   L  ++   G +LAV  ++   +K +V
Sbjct: 5   YELHQKLRTKTVRQVKKKQYDEAISTLHQGALELLSQSEQGSGCDLAVYMIDVYGIKNQV 64

Query: 90  AYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWS 149
             D+E  DR+  I                     L++A  AA    +     + AA+KWS
Sbjct: 65  -VDSEARDRITDI---------------------LSKA--AADFWRK---KVIDAAVKWS 97

Query: 150 IEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
           ++     TG P L + +A+ LY+   E D      HF+
Sbjct: 98  VKASGASTGDPLLRLFIAK-LYAH--EADFEHAEPHFL 132


>gi|300123560|emb|CBK24832.2| unnamed protein product [Blastocystis hominis]
          Length = 81

 Score = 43.1 bits (100), Expect = 0.18,   Method: Composition-based stats.
 Identities = 21/50 (42%), Positives = 33/50 (66%)

Query: 21 LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQL 70
          L+  + EGN+Y A Q+YKS+ +RY  A++Y E+ +LL  G     K+NQ+
Sbjct: 12 LDMQLEEGNYYEALQLYKSLQSRYETAEKYKESDELLFHGIENMNKHNQV 61


>gi|223999265|ref|XP_002289305.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974513|gb|EED92842.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 415

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 43/198 (21%), Positives = 79/198 (39%), Gaps = 59/198 (29%)

Query: 141 FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNP-------- 192
           FL+  +KWS + G+ K GS  +H +L ++ ++ S +  + +        NN         
Sbjct: 145 FLRRCLKWSNDLGSVKYGSLGVHALLGDHCWAMSCDEAVVKTGEEKEVANNKASGEGGDD 204

Query: 193 ----EKFA------STIVNF-------------------------MG-KCYPGEDDLAVA 216
               +++A        + ++                         MG  C P + D  + 
Sbjct: 205 EEYDDEYAEIGLRNEAVTHYAMAEKVDVILEKLKSLPPPTDEESKMGHTCLPAQRDALLT 264

Query: 217 RAILMYLSLGNMKDANYIMDEVKKQVENKQLQL------------QPSDLIQFVFYVLQT 264
           R+IL+ L++ N++DA  +     K +E +  ++             PS  I FV  +L+ 
Sbjct: 265 RSILVLLAVENLRDATKLASSFLKDIETRTEEVLKKSYMDKNDGKAPSH-IMFVGMLLRI 323

Query: 265 LERD--ALPLFNMLRANY 280
            E+D    PLFN L  N+
Sbjct: 324 CEKDVKTAPLFNWLVRNF 341


>gi|388852764|emb|CCF53682.1| uncharacterized protein [Ustilago hordei]
          Length = 407

 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 39/157 (24%), Positives = 64/157 (40%), Gaps = 29/157 (18%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVA 90
           Y   Q  ++ + R V  ++Y EA+  LH GA   L   +   G +LA+  ++       A
Sbjct: 5   YELHQKLRTKTVRQVKKKQYDEAISTLHQGALDLLAQKEQGSGCDLAIYMIDVYGIKNQA 64

Query: 91  YDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKWSI 150
            D+E+ DR+  I  L    P         D                     + AA+KWS+
Sbjct: 65  VDSESRDRITDI--LTAAAP---------DFWR---------------KKVVDAAVKWSV 98

Query: 151 EFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFV 187
           +     TG P L + +A  +Y++  E +  R   HF+
Sbjct: 99  KASEAPTGDPLLRLFVAN-MYAKDGEFE--RAEQHFL 132


>gi|339236205|ref|XP_003379657.1| conserved hypothetical protein [Trichinella spiralis]
 gi|316977656|gb|EFV60728.1| conserved hypothetical protein [Trichinella spiralis]
          Length = 247

 Score = 42.0 bits (97), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 44/244 (18%), Positives = 95/244 (38%), Gaps = 46/244 (18%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           +++L+  ++   FY A Q Y++++ R+   +++S  L +L+       +  Q     ++A
Sbjct: 16  LNRLDVYLSSNKFYDAHQTYRALATRWTQQRKFSSLLPILYESVGKFSEKGQNLIAEDMA 75

Query: 78  VSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEG 137
             +V AL   K   +   +  ++ I++                             + E 
Sbjct: 76  GLYVAALKASKSPINCSVISNLKSIFQWM-----------------------ILNCKSES 112

Query: 138 CSSFLKAAIKWSIEFGA--PKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKF 195
              FL   + WS+E     P+ G P+LH M  E  +      +     FH +  ++ +  
Sbjct: 113 QERFLSQILGWSLEVSPEYPR-GHPDLHRMYVEVYHGAG---NFHSSIFHIMLIDDGQ-- 166

Query: 196 ASTIVNFMGKCY-----PGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQ 250
              I  ++ +CY     P E D+ + + IL        +D N        Q+ ++ +   
Sbjct: 167 --YIGKYLAECYSKYAFPSELDIFITQFIL--------RDGNDATSSTVGQLASRDVNAN 216

Query: 251 PSDL 254
           P+++
Sbjct: 217 PNNI 220


>gi|194334680|ref|YP_002016540.1| mandelate racemase/muconate lactonizing protein [Prosthecochloris
           aestuarii DSM 271]
 gi|194312498|gb|ACF46893.1| Mandelate racemase/muconate lactonizing protein [Prosthecochloris
           aestuarii DSM 271]
          Length = 365

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 45/134 (33%), Positives = 68/134 (50%), Gaps = 18/134 (13%)

Query: 29  NFYGAQQMYKSISARYVA-----AQRYSEALD-LLHAGACLQLKNNQLTCGAELAVSFVE 82
           +FY  QQ Y+++  +  A     A      LD +  +   L+L +NQ +   E A+ F+E
Sbjct: 149 HFY--QQGYRTLKLKVNALNKADATGQIRTLDRVFGSSITLRLDSNQ-SFDLEAAIRFLE 205

Query: 83  ALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDD-----DDLHELT-EALGAAKL--- 133
           AL K ++AY  E L   +KI ELF +  VP  L++      D  H L  E LGA  L   
Sbjct: 206 ALPKKRIAYIEEPLSDPQKIAELFRRSGVPIALDESLWQTPDLRHALPDECLGAYILKPS 265

Query: 134 RVEGCSSFLKAAIK 147
           R+ G ++ +K AI+
Sbjct: 266 RIGGITATMKLAIE 279


>gi|429462969|ref|YP_007184432.1| glutamate-1-semialdehyde-2,1-aminomutase [Candidatus
           Kinetoplastibacterium crithidii (ex Angomonas deanei
           ATCC 30255)]
 gi|451811750|ref|YP_007448205.1| glutamate-1-semialdehyde 2,1-aminomutase GSA [Candidatus
           Kinetoplastibacterium crithidii TCC036E]
 gi|343433341|gb|AEM25280.1| glutamate-1-semialdehyde 2,1-aminomutase [Candidatus
           Kinetoplastibacterium crithidii]
 gi|429338483|gb|AFZ82906.1| glutamate-1-semialdehyde-2,1-aminomutase [Candidatus
           Kinetoplastibacterium crithidii (ex Angomonas deanei
           ATCC 30255)]
 gi|451776908|gb|AGF47907.1| glutamate-1-semialdehyde 2,1-aminomutase GSA [Candidatus
           Kinetoplastibacterium crithidii TCC036E]
          Length = 429

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 54/125 (43%), Gaps = 21/125 (16%)

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARV----------SFHFVRGNNPEK 194
           A ++ + FGAP  G     V+LAE L S  P +D  R+          +    RG     
Sbjct: 78  AAEYGLSFGAPTKG----EVILAEMLVSRIPSIDKIRLVNSGTEATMTAIRLARGYTGR- 132

Query: 195 FASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDL 254
             + IV F G CY G  D  + +A    L+ GN   A    + VK  +    L+   SD+
Sbjct: 133 --TKIVKFEG-CYHGHSDSLLVKAGSGLLTFGNPSSAGVPDNFVKDTI---SLEYNNSDI 186

Query: 255 IQFVF 259
           ++ VF
Sbjct: 187 VKEVF 191


>gi|19075383|ref|NP_587883.1| ER membrane insertion protein Get4 (predicted) [Schizosaccharomyces
           pombe 972h-]
 gi|74582446|sp|O74432.1|GET4_SCHPO RecName: Full=Golgi to ER traffic protein 4
 gi|3451313|emb|CAA20450.1| ER membrane insertion protein Get4 (predicted) [Schizosaccharomyces
           pombe]
          Length = 303

 Score = 41.2 bits (95), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 51/208 (24%), Positives = 85/208 (40%), Gaps = 35/208 (16%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVA 90
           Y   QM +++  R V A+++ +A+ LL++GA    +  Q    A+LA+  ++   K   A
Sbjct: 17  YEGHQMLRTLVNRQVKAKKHDDAVALLYSGAKTLFEIEQTGSAADLAIYMLDVYEKASYA 76

Query: 91  YDNENLDRVRKIYELFP--------------QIPVPHNLEDDDDLH--------ELTEAL 128
              +N  R   +  LFP              +       + D D+H        +  E  
Sbjct: 77  ASLDNKARALTLLGLFPAEEGARKQYVKRLLEWSKSAGPQGDKDVHFAVATMFVKWKEPA 136

Query: 129 GAAKLRVEG----CSSFLKAAIKW--SIEFGAPKT--GSPELHVMLAEYLYSESPELDMA 180
            A K  V G      ++ +    W  S    +P T  G P L+ +LAE L S    L+  
Sbjct: 137 SAEKHFVLGNEKSARAYGETMYYWFTSDSSISPDTFAGRPVLNYLLAENLISAWNSLET- 195

Query: 181 RVSFHFVRGNNPEKFASTIVNFMGKCYP 208
             + HF + N P+      ++F GK +P
Sbjct: 196 -FTKHFTKSNAPD---VENMSFDGKDFP 219


>gi|50293087|ref|XP_448968.1| hypothetical protein [Candida glabrata CBS 138]
 gi|49528281|emb|CAG61938.1| unnamed protein product [Candida glabrata]
          Length = 307

 Score = 41.2 bits (95), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 37/156 (23%), Positives = 67/156 (42%), Gaps = 27/156 (17%)

Query: 16  ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAE 75
           + + + E  +  G++Y A Q  ++I+ RYV ++ Y  A++L++ GA   LK  Q    ++
Sbjct: 11  KTLQRFESRIAAGDYYEAHQTLRTIANRYVRSKSYDHAIELINQGALSFLKAKQGGSASD 70

Query: 76  LAVSFVEALVKGKVAYDNENLDRVRKIYELF-PQIPVPHNLEDDDDLHELTEALGAAKLR 134
           L    +E      +     ++ R+ ++  L   Q P   NL+D                 
Sbjct: 71  LIFYLLEVYDVADIKVSETSVGRLAQLLVLVDDQEP---NLKD----------------I 111

Query: 135 VEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYL 170
           V G ++       WSI+    K G P LH ++   L
Sbjct: 112 VTGMNN-------WSIKHSEFKFGDPTLHNVIGNKL 140


>gi|326432243|gb|EGD77813.1| hypothetical protein PTSG_12807 [Salpingoeca sp. ATCC 50818]
          Length = 291

 Score = 41.2 bits (95), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 64/282 (22%), Positives = 122/282 (43%), Gaps = 43/282 (15%)

Query: 29  NFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGK 88
           +FY AQ++  S++ R  A Q+++ A   L  G    L+N++    A+L    ++ L +  
Sbjct: 23  DFYEAQEVLISLTNRLRAQQQFARAKAALEDGMDKLLQNHKDNAVAQLVKVHIDVLSESP 82

Query: 89  VAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKAAIKW 148
              D++ L+      +L        N    DD+  L               SFL+  I  
Sbjct: 83  ---DHDPLNAGDYYVQL--------NERFSDDVSLL---------------SFLRHTIAA 116

Query: 149 SIEFGAPKTGSPELHVMLAE-YLYSESPELDMAR---VSFHFVRGNNPEKFASTIVNFMG 204
           +   G    G  +LH   A+ +L ++ P++  A    V   +  G       +T V+ + 
Sbjct: 117 TTPEGGSH-GRCQLHFSAAQSFLRADPPKVKEALEQLVLSGYEPGQAKGDVVTTTVDALL 175

Query: 205 KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPS--DLIQFVFYVL 262
                E  L   + ++        + A Y   ++ + V +  ++  P+    ++F  YVL
Sbjct: 176 AAPVEEQALLACKHVI--------RVAAYTSRDLGQAVYDAYVERAPAMGRPLEFARYVL 227

Query: 263 QTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            TL RDA PL   LR  Y  +I+ +   +++L+ +A++++GV
Sbjct: 228 LTLTRDAAPLLTDLRKAYADTIDDD--LDKVLNKVAKRYYGV 267


>gi|221502983|gb|EEE28693.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 313

 Score = 40.8 bits (94), Expect = 0.77,   Method: Compositional matrix adjust.
 Identities = 61/295 (20%), Positives = 123/295 (41%), Gaps = 29/295 (9%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           +++ ++ G+ Y A Q+ +++  R++A +   +A++L           +Q     +L ++ 
Sbjct: 29  VDRKIDNGDLYDAHQLVRTLFFRFMAKRETLQAVELCRLYGLRFAGLDQEALAVDLGMNM 88

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
           + AL       +           E  P  P        D + EL  A  AA  + +G   
Sbjct: 89  LTALEASGTEEE-----------ETAPSEP------QLDQIIELFNACAAAGDK-KGVDK 130

Query: 141 --FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
             F+  A+KWS    AP  G   LH   A+  + E           H +   +PE  +  
Sbjct: 131 YKFINRALKWSRTPQAP-FGHVRLHRAAADAYWKER---RYGLCQGHLIYCRDPEALSQM 186

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFV 258
           +  +    Y  E      R +L+ L L + + A  ++D   +   + ++   P+ L Q  
Sbjct: 187 LKEWQATGYLSERPFFWLRLVLILLCLRDTETAEKLLDSSGENWTSCEV---PAPL-QLA 242

Query: 259 FYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGI 313
           + ++   +  +  LF++L+  Y   + R+P F + +D+I ++  G  +R P  G+
Sbjct: 243 YLLVCACKYKSGKLFDLLKQKYHLVLRRDPTFAKYMDEIEKRAIGRIQR-PSTGL 296


>gi|401412586|ref|XP_003885740.1| conserved hypothetical protein [Neospora caninum Liverpool]
 gi|325120160|emb|CBZ55714.1| conserved hypothetical protein [Neospora caninum Liverpool]
          Length = 315

 Score = 40.8 bits (94), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 59/295 (20%), Positives = 124/295 (42%), Gaps = 29/295 (9%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           +++ ++ G+ Y A Q+ +++  R++A +  ++A++L           +Q     +L ++ 
Sbjct: 31  VDRKIDSGDLYDAHQLVRTLFFRFMAKRESTQAVELCRVYGLRFAGLDQEALAVDLGMNM 90

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
           + AL   + A D                   P + E    + EL  A  AA  + +G   
Sbjct: 91  LTALEASRQAEDG-----------------APPSDEQLGQVIELFNACAAAGGK-KGVDK 132

Query: 141 F--LKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
           +  +  A+KWS    +P  G   LH   AE  + E           H +   +PE  +  
Sbjct: 133 YKLINRALKWSRSPQSP-FGHVRLHRAAAEAYWKEQ---RYGLCQGHLIYCRDPEALSQM 188

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFV 258
           +  +    Y  E      R IL+ L L + + A  ++    +  ++ ++   P+ L Q  
Sbjct: 189 VKEWQAAGYLSERPFFWLRLILILLCLRDTETAEKVLGGSGENWDSSEV---PAPL-QLA 244

Query: 259 FYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGI 313
           + ++   +  +  LF++L+  Y   + R+  F + +D+I ++  G  +R P  G+
Sbjct: 245 YLLVCACKYKSGKLFDLLKQKYHLVLRRDETFAKYMDEIEKRTIGRIQR-PSVGL 298


>gi|237842749|ref|XP_002370672.1| hypothetical protein TGME49_019080 [Toxoplasma gondii ME49]
 gi|211968336|gb|EEB03532.1| hypothetical protein TGME49_019080 [Toxoplasma gondii ME49]
          Length = 313

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 61/295 (20%), Positives = 123/295 (41%), Gaps = 29/295 (9%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           +++ ++ G+ Y A Q+ +++  R++A +   +A++L           +Q     +L ++ 
Sbjct: 29  VDRKIDNGDLYDAHQLVRTLFFRFMAKRETLQAVELCRLYGLRFAGLDQEALAVDLGMNM 88

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
           + AL       +           E  P  P        D + EL  A  AA  + +G   
Sbjct: 89  LTALEASGTEEE-----------ETAPSEP------QLDQIIELFNACAAAGDK-KGVDK 130

Query: 141 --FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
             F+  A+KWS    AP  G   LH   A+  + E           H +   +PE  +  
Sbjct: 131 YKFINRALKWSRTPQAP-FGHVRLHRAAADAYWKER---RYGLCQGHLIYCRDPEALSQM 186

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFV 258
           +  +    Y  E      R +L+ L L + + A  ++D   +   + ++   P+ L Q  
Sbjct: 187 LKEWQATGYLSERPFFWLRLVLILLCLRDTETAEKLLDSSGENWTSFEV---PAPL-QLA 242

Query: 259 FYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGI 313
           + ++   +  +  LF++L+  Y   + R+P F + +D+I ++  G  +R P  G+
Sbjct: 243 YLLVCACKYKSGKLFDLLKQKYHLVLRRDPTFAKYMDEIEKRTIGRIQR-PSTGL 296


>gi|221485643|gb|EEE23924.1| conserved hypothetical protein [Toxoplasma gondii GT1]
          Length = 313

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 61/295 (20%), Positives = 123/295 (41%), Gaps = 29/295 (9%)

Query: 21  LEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSF 80
           +++ ++ G+ Y A Q+ +++  R++A +   +A++L           +Q     +L ++ 
Sbjct: 29  VDRKIDNGDLYDAHQLVRTLFFRFMAKRETLQAVELCRLYGLRFAGLDQEALAVDLGMNM 88

Query: 81  VEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSS 140
           + AL       +           E  P  P        D + EL  A  AA  + +G   
Sbjct: 89  LTALEASGTEEE-----------ETAPSEP------QLDQIIELFNACAAAGDK-KGVDK 130

Query: 141 --FLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
             F+  A+KWS    AP  G   LH   A+  + E           H +   +PE  +  
Sbjct: 131 YKFINRALKWSRTPQAP-FGHVRLHRAAADAYWKER---RYGLCQGHLIYCRDPEALSQM 186

Query: 199 IVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQLQLQPSDLIQFV 258
           +  +    Y  E      R +L+ L L + + A  ++D   +   + ++   P+ L Q  
Sbjct: 187 LKEWQATGYLSERPFFWLRLVLILLCLRDTETAEKLLDSSGENWTSFEV---PAPL-QLA 242

Query: 259 FYVLQTLERDALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGVKRRNPMQGI 313
           + ++   +  +  LF++L+  Y   + R+P F + +D+I ++  G  +R P  G+
Sbjct: 243 YLLVCACKYKSGKLFDLLKQKYHLVLRRDPTFAKYMDEIEKRAIGRIQR-PSTGL 296


>gi|393221266|gb|EJD06751.1| cytoplasmic protein [Fomitiporia mediterranea MF3/22]
          Length = 341

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 30/149 (20%), Positives = 62/149 (41%), Gaps = 25/149 (16%)

Query: 25  VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEAL 84
           + +G+ Y A Q  ++ ++RY+ + +Y  A+D+L   +   L   Q   G +LA+  +E  
Sbjct: 20  IADGHAYEAHQKARTFASRYIKSAQYDTAIDVLFQSSRALLTAGQPGSGTDLALFLIE-- 77

Query: 85  VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVEGCSSFLKA 144
                            +YE        + +  ++   ELT+ +           + +  
Sbjct: 78  -----------------VYE------AKNEVVSEESRGELTQLIALTGSNGTWRKTLIDK 114

Query: 145 AIKWSIEFGAPKTGSPELHVMLAEYLYSE 173
           A+ W+ + G    G P+L+  + E LY +
Sbjct: 115 AVTWTAKAGPCPAGDPDLNHYVGELLYKD 143


>gi|409048323|gb|EKM57801.1| hypothetical protein PHACADRAFT_171018 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 350

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 37/168 (22%), Positives = 65/168 (38%), Gaps = 30/168 (17%)

Query: 11  LPPAQ-----ENIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQL 65
           +PPA      + ++ +  ++  G  Y A Q  ++ ++RY  + +Y  A+D+L   A   L
Sbjct: 1   MPPAAPASSGKALNAILPLIEAGQPYEAHQKARTFASRYQKSGQYDTAIDVLFQSARELL 60

Query: 66  KNNQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELT 125
           K  Q   G +L +  +E        YD +  D V                  D+    LT
Sbjct: 61  KAGQQGSGTDLTLFLLE-------VYDTKG-DEV-----------------SDETRGRLT 95

Query: 126 EALGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSE 173
           + +           + +  +  WS + G   +G P L   + E LY E
Sbjct: 96  QLIALTGSSGSWRKTIIDRSTAWSAKHGPCPSGDPSLQHYIGELLYKE 143


>gi|298241103|ref|ZP_06964910.1| FAD linked oxidase domain protein [Ktedonobacter racemifer DSM
           44963]
 gi|297554157|gb|EFH88021.1| FAD linked oxidase domain protein [Ktedonobacter racemifer DSM
           44963]
          Length = 458

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 63/154 (40%), Gaps = 27/154 (17%)

Query: 148 WSIEFGAPKTG---------SPELHVMLAEYLYSESPELDMARVSFHFVRGNNPEKFAST 198
           W+I  G    G          P   V+    +Y +  E++    ++       PE+  ST
Sbjct: 180 WAIRGGGGNFGVVTSFDFRVQPHTTVVGGSVIY-DMAEVETVLAAWTSSMREAPEELNST 238

Query: 199 IVNFMG------------KCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVKKQVENKQ 246
           IV F G             CY G+D++A  +AI  +L LG +K  N         +EN  
Sbjct: 239 IVLFPGFGPQAVPQVMVLLCYAGDDEVAANKAIAPFLQLGTVKVQNIQKKPCYAMLENAA 298

Query: 247 LQLQPSDLIQFVFYVLQTLERDALPLFNMLRANY 280
           L      + Q  F  L+TL ++AL    ++ ANY
Sbjct: 299 LPPGLQMIAQDGF--LKTLSKEAL---GVIAANY 327


>gi|449540297|gb|EMD31290.1| hypothetical protein CERSUDRAFT_119842 [Ceriporiopsis subvermispora
           B]
          Length = 351

 Score = 39.3 bits (90), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 56/250 (22%), Positives = 92/250 (36%), Gaps = 63/250 (25%)

Query: 18  IDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELA 77
           ++ +  ++  G  Y A Q  ++ ++RYV + ++  A+D+L   A   LK  Q   G +LA
Sbjct: 14  LNAILPLIAAGQPYEAHQKARTFASRYVKSNQFDTAIDVLFQSARELLKAGQQGSGTDLA 73

Query: 78  VSFVEAL-VKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
              ++    KG+V  + E+  R+ ++  L                        +A  R  
Sbjct: 74  SFLLDVYESKGEV-VNEESRGRITQLLALVGS---------------------SASWR-- 109

Query: 137 GCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSES------PEL------DMAR-VS 183
              + +  A+ WS   G    G P+L   + E LY +       P L      D AR ++
Sbjct: 110 --KTIIDKAVAWSSRNGPCPAGDPDLQHYIGELLYKDGLFDAAEPHLLAAGKRDSARLLA 167

Query: 184 FHFVR----GNNPEKFASTIVNFMGKCYPGEDDLAVARAILMYLSLGNMKDANYIMDEVK 239
             FV+    G  P  FA                    R  + YL  GN+  A     E  
Sbjct: 168 QMFVKWATTGGTPGTFA-------------------LRGTVPYLLNGNILAARAFTSEFV 208

Query: 240 KQVENKQLQL 249
            Q+   Q  L
Sbjct: 209 SQITASQPSL 218


>gi|47183431|emb|CAG13855.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 61

 Score = 39.3 bits (90), Expect = 2.2,   Method: Composition-based stats.
 Identities = 19/44 (43%), Positives = 29/44 (65%), Gaps = 1/44 (2%)

Query: 270 LPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV-KRRNPMQG 312
           L +F +L   YK S++R+P +NE LD I + FFGV  +++P  G
Sbjct: 3   LTVFTVLCEQYKPSLKRDPMYNEYLDRIGQLFFGVPPKQSPSYG 46


>gi|344301800|gb|EGW32105.1| hypothetical protein SPAPADRAFT_61181 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 311

 Score = 39.3 bits (90), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 67/332 (20%), Positives = 123/332 (37%), Gaps = 56/332 (16%)

Query: 17  NIDKLEKIVNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAEL 76
            ID+ +  +  G++Y A Q  ++I+ R++   +   A+ LL+ GA +   + +    ++L
Sbjct: 7   TIDRFQAKIESGSYYEAHQTLRTIANRFIKGDQPILAIQLLYQGASILTTHKEYASASDL 66

Query: 77  AVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE 136
               V+  V     +  +      K+ +L   +P       +  L EL+           
Sbjct: 67  ISYIVQ--VYSDFGFSVDEKSAKMKLIDLISGLP-----PSEPSLVELS----------- 108

Query: 137 GCSSFLKAAIKWSIEFG--APKTGSPELHVMLAEYLYSESPELD----------MARVSF 184
                    +KWS   G    K G  +LH +          E +           A    
Sbjct: 109 ------TQCLKWSQSSGEVPHKFGDSQLHHVFGSMFLKRVEEEEEEVEENRSKYFALAEI 162

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLA----VARAILMYLSLGNMKDANYIMDEVKK 240
           H V G           N++ +C     D+     ++RAI+ YL L N+K A    D   K
Sbjct: 163 HLVLGTVDS--VPLYTNYLSQCSKANPDVDPGVFLSRAIINYLYLNNVKFAQDAHDLFMK 220

Query: 241 QVE-----NKQLQLQPSDLIQFVFYVLQTLERDAL---PLFNMLRANYKASIEREPAFNE 292
            VE     N  +      L+ FV  +L   +++       F  L  +YK  +      + 
Sbjct: 221 NVEGGVPSNGIVYFDEYPLLNFVQLLLVLFKKEHASNSSQFMGLYTHYKPLLTNYGILSA 280

Query: 293 MLDDIAEKFFGVK-----RRNPMQGIFGDIFK 319
            L+ +   +F ++     + N +  + GD+FK
Sbjct: 281 -LEYLGSTYFQLRLGQKPQGNMLSSLMGDLFK 311


>gi|398791125|ref|ZP_10551904.1| putative hydrolase or acyltransferase of alpha/beta superfamily
           [Pantoea sp. YR343]
 gi|398216314|gb|EJN02866.1| putative hydrolase or acyltransferase of alpha/beta superfamily
           [Pantoea sp. YR343]
          Length = 303

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 27/119 (22%), Positives = 53/119 (44%), Gaps = 8/119 (6%)

Query: 91  YDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALGAAKLRVE--GCSSFLKAAIKW 148
           +   ++ R+ K   + P +    N  D   + +    L    LR E     + ++    +
Sbjct: 189 FQKTSITRLAKSEAVRPALVTASNNSDRKTVTDAVYELMITDLRPELARIKTPIEVVYAY 248

Query: 149 SIEFGAPKTGSPELHVMLAEYLYSESPELDMARV--SFHFVRGNNPEKFASTIVNFMGK 205
            + FG P TG   ++       Y+ +P++   R+  SFHF+  + PE+F+S +  F+ K
Sbjct: 249 DVLFGVPATGVDAMY----RQAYASAPDIHFTRIDNSFHFIMLDQPERFSSAVEAFLNK 303


>gi|164662180|ref|XP_001732212.1| hypothetical protein MGL_0805 [Malassezia globosa CBS 7966]
 gi|159106114|gb|EDP44998.1| hypothetical protein MGL_0805 [Malassezia globosa CBS 7966]
          Length = 398

 Score = 38.9 bits (89), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 20/74 (27%), Positives = 38/74 (51%)

Query: 31  YGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVA 90
           Y   Q  ++ + R +   +++EA++ LH+GA   L  N+     +LA   +E  ++  V 
Sbjct: 13  YELHQKLRTKTVRLMRKNKHNEAVETLHSGAIQLLDMNEEGSACDLAEYLLEVYMQAHVP 72

Query: 91  YDNENLDRVRKIYE 104
            DN + +R+  I E
Sbjct: 73  MDNTSRERINSILE 86


>gi|349804015|gb|AEQ17480.1| putative golgi to er traffic protein 4 [Hymenochirus curtipes]
          Length = 115

 Score = 38.5 bits (88), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 269 ALPLFNMLRANYKASIEREPAFNEMLDDIAEKFFGV 304
            L +F +L   Y+ S++R+P +NE LD I + FFGV
Sbjct: 43  VLTVFTVLCEQYQPSLKRDPMYNEYLDRIGQLFFGV 78


>gi|353238471|emb|CCA70416.1| hypothetical protein PIIN_04355 [Piriformospora indica DSM 11827]
          Length = 340

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 67/326 (20%), Positives = 120/326 (36%), Gaps = 68/326 (20%)

Query: 11  LPPAQENIDKLEKI---VNEGNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKN 67
           +PP       L  I   ++ G  Y A Q  ++ ++RY  A  +  A+ +L+  A    K 
Sbjct: 1   MPPPLSADKALAAILPQISAGQAYEAHQKARTFASRYSKAGAHDVAISVLYESATALFKA 60

Query: 68  NQLTCGAELAVSFVEALVKGKVAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEA 127
            QL  G +L+V  +E        Y+N+  +                   D+     LT+ 
Sbjct: 61  GQLGSGTDLSVLMLEM-------YENKEQE------------------VDEASRARLTQL 95

Query: 128 LGAAKLRVEGCSSFLKAAIKWSIEFGAPKTGSPELHVMLAEYLYSE------SPEL---- 177
           +       +     +  A+ WS   G+  +G P LH  +   L  E       P L    
Sbjct: 96  ISLVGGEGQWRKMIVDRAVAWSARHGSYPSGDPSLHGFIGALLAKEHDFVAAEPHLLASG 155

Query: 178 ------DMARVSFHFVR-GNNPEKFAST-IVNFM--GKCYPGEDDLAVARAILMYLSLGN 227
                  +A++ F + R G  P  +AS  ++ F+  G        ++   ++L+      
Sbjct: 156 TRESAKTLAQMLFAWSRDGAEPGTYASKGVIPFLLTGNVLAARTFMSTFLSLLITARPNI 215

Query: 228 M---------KDANYIMDEVKKQVENKQLQLQPSDLIQFVFYVLQTLERDALPLFNMLRA 278
           +           AN I  +VK Q E  +     S+L Q          + A  ++  L  
Sbjct: 216 LATPSNAIAGSTANPIRFDVKGQSERDERAKSSSNLQQ---------AKQAQEMWVRLCG 266

Query: 279 NYKA--SIEREPAFNEMLDDIAEKFF 302
           +Y++   +  +P + E L +I   FF
Sbjct: 267 SYQSRRGLVAQPVYREALGEIGTLFF 292


>gi|392396942|ref|YP_006433543.1| hypothetical protein Fleli_1318 [Flexibacter litoralis DSM 6794]
 gi|390528020|gb|AFM03750.1| tetratricopeptide repeat protein [Flexibacter litoralis DSM 6794]
          Length = 439

 Score = 37.7 bits (86), Expect = 8.0,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 43/73 (58%), Gaps = 8/73 (10%)

Query: 37  YKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTCGAELAVSFVEALVKGKVAYDNENL 96
           Y  ++  YV  +RY+EA+ +  A +CL+L+ N+        V F+EAL   K A D E +
Sbjct: 325 YADLTRLYVHEKRYNEAIKV--ADSCLRLEPNRHD------VYFLEALSNYKTARDKEGI 376

Query: 97  DRVRKIYELFPQI 109
           + +++++  FP++
Sbjct: 377 NVLKQLFIAFPKL 389


>gi|254570777|ref|XP_002492498.1| Protein of unknown function [Komagataella pastoris GS115]
 gi|238032296|emb|CAY70319.1| Protein of unknown function [Komagataella pastoris GS115]
 gi|328353490|emb|CCA39888.1| UPF0363 protein YOR164C [Komagataella pastoris CBS 7435]
          Length = 334

 Score = 37.4 bits (85), Expect = 9.9,   Method: Compositional matrix adjust.
 Identities = 52/239 (21%), Positives = 104/239 (43%), Gaps = 52/239 (21%)

Query: 20  KLEKIVNE-------GNFYGAQQMYKSISARYVAAQRYSEALDLLHAGACLQLKNNQLTC 72
           KLEK +N        G++Y AQQ  ++I  RY+ A+ +  A +LL+  + +  +N +   
Sbjct: 7   KLEKTINRFKARIDAGSYYEAQQTIRTIVNRYIRAKDHDAASELLYHSSLIFSENGRFDE 66

Query: 73  GAELAVSFVEALVKGK---VAYDNENLDRVRKIYELFPQIPVPHNLEDDDDLHELTEALG 129
           G +L    ++  +  +    +Y    L R+ +I   +P        ++D ++  L E L 
Sbjct: 67  GTDLLNYLLDVYLLQEDDISSYKEIQLPRIIQILASYP--------DNDSNIPTLGEKLT 118

Query: 130 AAKLRVEGCSSFLKAAIKWSIEFGAPKTGSP----ELHVMLAEYL-YSESPELDMARVSF 184
           +                     F   KTGS     +L++++ E L YS +P  D    S 
Sbjct: 119 S---------------------FANKKTGSTLGDSKLNLIIGEKLYYSGNP--DSVSNSE 155

Query: 185 HFVRGNNPEKFASTIVNFMGKCYPGEDDLA------VARAILMYLSLGNMKDANYIMDE 237
            ++   + +K    +V+   + Y  +++ +      + R I+ YL + N+K   + ++E
Sbjct: 156 KYLLLTDSDKANELLVDLCLQEYLDQENNSHNLHEYLERLIIPYLKINNIKLVKHTVEE 214


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.320    0.136    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,863,494,193
Number of Sequences: 23463169
Number of extensions: 196465035
Number of successful extensions: 493058
Number of sequences better than 100.0: 350
Number of HSP's better than 100.0 without gapping: 258
Number of HSP's successfully gapped in prelim test: 92
Number of HSP's that attempted gapping in prelim test: 492328
Number of HSP's gapped (non-prelim): 430
length of query: 321
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 179
effective length of database: 9,027,425,369
effective search space: 1615909141051
effective search space used: 1615909141051
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)